Abstract

Deep learning tools that predict peptide binding by major histocompatibility complex (MHC) proteins play an essential role in developing personalized cancer immunotherapies and vaccines. In order to ensure equitable health outcomes from their application, MHC binding prediction methods must work well across the vast landscape of MHC alleles observed across human populations. Here, we show that there are alarming disparities across individuals in different racial and ethnic groups in how much binding data are associated with their MHC alleles. We introduce a machine learning framework to assess the impact of this data imbalance for predicting binding for any given MHC allele, and apply it to develop a state-of-the-art MHC binding prediction model that additionally provides per-allele performance estimates. We demonstrate that our MHC binding model successfully mitigates much of the data disparities observed across racial groups. To address remaining inequities, we devise an algorithmic strategy for targeted data collection. Our work lays the foundation for further development of equitable MHC binding models for use in personalized immunotherapies.

Original languageAmerican English
Article numbere2405106122
JournalProceedings of the National Academy of Sciences of the United States of America
Volume122
Issue number8
DOIs
StatePublished - Feb 25 2025

ASJC Scopus subject areas

  • General

Keywords

  • cancer immunotherapies
  • deep learning
  • health equity in precision oncology
  • neoantigen prediction
  • predicting major histocompatibility complex (MHC) binding

Fingerprint

Dive into the research topics of 'Toward equitable major histocompatibility complex binding predictions'. Together they form a unique fingerprint.

Cite this