AIPD-DC3-FH

Towards Improving Generalizability of AI Models – Prediction of Depression and Cognitive Decline of PD Patients

  • Host Institution: Fraunhofer SCAI
  • PhD Enrolment: University of Bonn
  • Start Date: October 2025
  • Duration: 36 months
  • Official PhD Supervisor: Holger Fröhlich

Research Objectives

Data in medicine is often not representative of the entire disease population. This is because the selection of patients for specific studies or the choice of specific hospitals induces a statistical selection bias, a situation that FH has also investigated empirically in prior work. Consequently, the assumption that training data has been sampled independent and identically distributed (i.i.d.) from the overall population, which is a basis of machine learning theory, is typically violated. This in turn has negative impacts on the generalization ability of AI/ML models and imposes a significant challenge for trustworthiness and for transfer of such models into clinical practice. The objective of this project is to investigate whether the generalization ability of AI/ML models trained on multi-modal clinical and genomic data could be improved by leveraging modern concepts for domain adaptation of neural networks that have mostly been developed in the imaging and natural language processing fields. Given the breadth of possible domain adaptation strategies, we aim to conduct a systematic comparison of representation learning techniques and supervised model adaptation, as well as unsupervised domain adaption-based, including adversarial learning, domain translation, contrastive learning, and invariant feature learning (a causal machine learning technique) using e.g. the TLlib transfer learning library. The use case is supervised prediction of cognitive decline and depression in PD with a neural network, using a multimodal combination of genetic, clinical and demographic data in PPMI, ICEBERG and LuxPARK cohorts. We will formulate these prediction problems as time-to-event-based risk models with an appropriate loss function. XAI techniques such as (causal) SHAP will be used to understand the putative causal influence of genetic, demographic and clinical factors on cognitive decline and depression. DC3 will closely collaborate with DC6, who will tackle generalizability based on speech data.

Expected Results

  • Innovative AI/ML models predicting cognitive decline and depression on an individual patient level
  • Better understanding of the influence of genetic, demographic and clinical factors on these endpoints
  • Understanding of the potential benefit of different domain adaptation techniques for improving the generalizability of AI/ML algorithms
     

Planned Secondment(s)

  • Host: Centre Hospitalier de Luxembourg
    • Duration: 18 months
    • Purpose: Learning about PD and working with LuxPARK data

This project is part of the "Trustworthiness" work package.

References

  • Birkenbihl C, Salimi Y, Fröhlich H. Unraveling the heterogeneity in Alzheimer's disease progression across multiple cohorts and the implications for data-driven disease modeling. Alzheimer's Dement. 2022; 18: 251–261. https://doi.org/10.1002/alz.12387
  • Birkenbihl C, Salimi Y, Domingo-Fernándéz D, et al. Evaluating the Alzheimer's disease data landscape. Alzheimer's Dement. 2020; 6:e12102. https://doi.org/10.1002/trc2.12102.
  • Jiang, J., Shu, Y., Wang, J., & Long, M. (2022). Transferability in deep learning: A survey. arXiv preprint arXiv:2201.05867http://arxiv.org/abs/2201.05867