Biomedical AI & Data Science

Mission: Bringing better treatments to the right patients at the right time and dose

The Biomedical AI & Data Science Department develops, adapts, and applies AI/ML-based methods for robust, interpretable, and evidence-oriented analysis of biomedical data along a value chain, which largely aligns with the needs of the pharma and biotech industry as well as the public health sector:

  • Target prioritization (better targets):
    • Rational selection of molecular target structures for future therapies
  • Precision medicine (the right drug for the right patient at the right time and dose):
    • Prediction of disease risk, molecular subtype, disease progression, or treatment response at the individual patient level
  • Clinical trials (better trials):
    • Simulation of (counterfactual) synthetic disease trajectories
    • Estimating causal intervention effects from observational data

To address these complex applications, state-of-the-art machine learning algorithms and artificial neural network architectures are adapted to the specific characteristics of biomedical data, including heterogeneity, sparsity, longitudinal structure, and multimodality. In addition, we develop problem-specific model architectures where existing approaches are insufficient. During the last years, our methodological work has specifically covered

  • Hybrid AI/knowledge infusion, e.g., via geometric deep learning or scientific machine learning
  • Generative AI and time series modeling
  • Multi-modal data integration
  • Causal machine learning

Work further explores under which assumptions and in which contexts AI-based predictions can be meaningfully applied to real-world clinical applications. Rather than focusing solely on predictive performance, we thus study explainability, uncertainty, consistency with biomedical knowledge, robustness, and external validity using real-world and clinical data, thereby advancing the aim of making AI medically meaningful and trustworthy.

Long-standing experience spans a broad spectrum of relevant data types in biomedicine:

  • Multimodal longitudinal clinical studies, comprising, e.g.:
    • Questionnaire-based clinical assessments
    • Images
    • Molecular markers
  • Real-world data:
    • Electronic health records (EHRs) and claims data
    • Digital markers: data derived from digital device technologies (e.g., gait sensors) and smartphone applications

Scientific services

Our services cover relevant parts of the value chain in translational biomedical research within the biotech and pharmaceutical industry. Companies are offered

  • Development of tailor-made, cutting edge AI and data mining algorithms
  • Applied research studies on specified biomedical use cases, focusing on the interpretability, robustness and evidential limitations of AI-based prediction models
  • Systematic literature reviews
  • Trainings

Our services are offered, e.g., through contract research. This way, different customer needs can be addressed while providing temporary resources and expertise to support internal projects.

Research projects

The department is heavily involved in multiple national and international research projects. We collaborate closely with clinical, academic and industrial partners to address methodologically challenging and societally relevant research questions.

  • ACCESS-AD (Innovative Health Initiative)
    Development of (causal) AI to predict drug response and adverse drug reactions
  • AIPD (EU)
    Coordinator, Trustworth AI; AI for precision medicine and digital health in Parkinson's Disease
  • CERTAINTY (HORIZON Europe)
    Development of (generative) AI models for the prediction and simulation of therapy response and therapy side effects
  • COMMUTE (EU)
    Causal AI/ML models for predicting the risk of neurodegenerative diseases in dependency of a COVID-19 infection
  • IDERHA (Innovative Health Initiative)
    AI based risk models using real-world data
  • NFDI4Health (DFG)
    AI methods for synthetic patient data
  • PREDICTOM (Innovative Health Initiative / EU)
    AI-screening-platform for identifying dementia risk
  • PsychSTRATA (HORIZON Europe)
    AI/ML models for precision psychiatry
  • Real4Reg (HORIZON Europe)
    AI models for real-world data
  • SYNTHIA (Innovative Health Initiative)
    Development and evaluation of generative AI methods for clinical trial data

  • ACCESS-AD
    Development of (causal) AI to predict drug response and adverse drug reactions
  • Graph Machine Learning
    Development of a modern Graph Neural Network approach to support drug target selection.
  • IDERHA
    AI based risk models using real-world data
  • PREDICTOM
    AI-screening-platform for identifying dementia risk
  • Scientific Machine Learning
    Development of hybrid machine learning approaches, which combine mechanistic differential equation models with neural networks.
  • SYNTHIA
    Development and evaluation of generative AI methods for clinical trial data

  • ADIS
    Coordinator, AI methods for biological system modeling and disease diagnosis
  • AETIONOMY
  • AIOLOS
    AI based models for early detection and monitoring of pandemic situations as well as decision support
  • CePPH
    AI/ML models for precision medicine
  • COPERIMOplus
    AI based risk models for COVID-19
  • DIGIPD
    Coordinator, AI methods for patient stratification
  • IDSN
    Text Mining
  • ParKInsonPredict
    AI for modeling of Parkinson's Disease progression
  • Prediction of Protein-Protein Interactions
    Development of a novel deep learning architecture to predict virus-host protein-protein interactions.
  • RADAR-AD
    AI-based progression models
  • The Virtual Brain Cloud
    AI-based progression models

Selected publications

  • Colin Birkenbihl, Johann de Jong, Ilya Yalchyk, Holger Fröhlich (2024), Deep learning-based patient stratification for prognostic enrichment of clinical dementia trials, Brain Communications, 6(6): fcae445
  • Philipp Wendland, Colin Birkenbihl, Marc Gomez-Freixa, Meemansa Sood, Maik Kschischo, Holger Fröhlich (2022), Generation of realistic synthetic data using Multimodal Neural Ordinary Differential Equations. npj Digital Medicine, 5(1): 122
  • Tom Hähnel, Tamara Raschka, Stefano Sapienza, Jochen Klucken, Enrico Glaab, Jean-Christophe Corvol, Björn Falkenburger, Holger Fröhlich (2024), Progression subtypes in Parkinson’s disease identified by a data driven multi cohort analysis, npj Parkinson’s Disease, 10(1): 95
  • Manuel Lentzen, Thomas Linden, Sai Veeranki, Sumit Madan, Diether Kramer, Werner Leodolter, Holger Fröhlich (2023), A Transformer-Based Model Trained on Large Scale Claims Data for Prediction of Severe COVID-19 Disease Progression, IEEE Journal of Biomedical and Health Informatics, 27(9): 4548-4558
  • Colin Birkenbihl, Yasamin Salimi, Holger Fröhlich (2022), Unraveling the heterogeneity in Alzheimer's disease progression across multiple cohorts and the implications for data-driven disease modeling. Alzheimer's & Dementia, 18(2): 251– 261

Videos


How AI detects Parkinson's earlier

The new film from BMFTR, “How AI recognizes what the human eye cannot see,” provides answers.

Scientific publications

Research software

 

  • FoundationEHR: A foundation AI model for structured electronic health records 
  • MultiGML: Multimodal graph machine learning for drug target prioritization

Examples

 

Examples of current research work in the group "AI & Data Science"

Team