Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

Machine learning analysis of the T cell receptor repertoire identifies sequence features that predict self-reactivity

View ORCID ProfileJohannes Textor, Franka Buytenhuijs, View ORCID ProfileDakota Rogers, Ève Mallet Gauthier, View ORCID ProfileShabaz Sultan, View ORCID ProfileInge M. N. Wortel, View ORCID ProfileKathrin Kalies, Anke Fähnrich, René Pagel, View ORCID ProfileHeather J. Melichar, View ORCID ProfileJürgen Westermann, View ORCID ProfileJudith N. Mandl
doi: https://doi.org/10.1101/2022.11.23.517563
Johannes Textor
1Data Science group, Institute for Computing and Information Sciences, Radboud University, Nijmegen, The Netherlands
2Department of Tumor Immunology, Radboud Institute for Molecular Life Sciences, Radboudumc, Nijmegen, The Netherlands
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Johannes Textor
  • For correspondence: johannes.textor@ru.nl judith.mandl@mcgill.ca
Franka Buytenhuijs
1Data Science group, Institute for Computing and Information Sciences, Radboud University, Nijmegen, The Netherlands
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Dakota Rogers
3Department of Physiology and McGill Research Centre on Complex Traits, McGill University, Montreal, Canada
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Dakota Rogers
Ève Mallet Gauthier
4Immunology-Oncology Unit, Maisonneuve-Rosemont Hospital Research Center, Montreal, Canada
5Department of Microbiology, Infectious Diseases, and Immunology, Université de Montréal, Montreal, Canada
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Shabaz Sultan
1Data Science group, Institute for Computing and Information Sciences, Radboud University, Nijmegen, The Netherlands
2Department of Tumor Immunology, Radboud Institute for Molecular Life Sciences, Radboudumc, Nijmegen, The Netherlands
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Shabaz Sultan
Inge M. N. Wortel
1Data Science group, Institute for Computing and Information Sciences, Radboud University, Nijmegen, The Netherlands
2Department of Tumor Immunology, Radboud Institute for Molecular Life Sciences, Radboudumc, Nijmegen, The Netherlands
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Inge M. N. Wortel
Kathrin Kalies
6Institut für Anatomie, Universität zu Lübeck, Lübeck, Germany
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Kathrin Kalies
Anke Fähnrich
6Institut für Anatomie, Universität zu Lübeck, Lübeck, Germany
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
René Pagel
6Institut für Anatomie, Universität zu Lübeck, Lübeck, Germany
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Heather J. Melichar
4Immunology-Oncology Unit, Maisonneuve-Rosemont Hospital Research Center, Montreal, Canada
7Department of Medicine, Université de Montréal, Montreal, Canada
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Heather J. Melichar
Jürgen Westermann
6Institut für Anatomie, Universität zu Lübeck, Lübeck, Germany
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Jürgen Westermann
Judith N. Mandl
3Department of Physiology and McGill Research Centre on Complex Traits, McGill University, Montreal, Canada
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Judith N. Mandl
  • For correspondence: johannes.textor@ru.nl judith.mandl@mcgill.ca
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Data/Code
  • Preview PDF
Loading

Summary

The T cell receptor (TCR) determines the specificity and affinity for both foreign and self-peptides presented by MHC. It is established that self-pMHC reactivity impacts T cell function, but it has been challenging to identify TCR sequence features that predict T cell fate. To discern patterns distinguishing TCRs from naïve CD4+ T cells with low versus high self-pMHC reactivity, we used data from 42 mice to train a machine learning (ML) algorithm that predicts self-reactivity directly from TCRβ sequences. This approach revealed that n-nucleotide additions and acidic amino acids weaken self-reactivity. We tested our ML predictions of TCRβ sequence self-reactivity using retrogenic mice. Extrapolating our analyses to independent datasets, we found high predicted self-reactivity for regulatory CD4+ T cells and low predicted self-reactivity for T cells responding to chronic infection. Our analyses suggest a potential trade-off between repertoire diversity and self-reactivity intrinsic to the architecture of a TCR repertoire.

Competing Interest Statement

The authors have declared no competing interest.

Footnotes

  • We have performed an additional validation analysis according to a previously published data analysis plan (https://zenodo.org/record/7459701#.ZBrQ5y2iF69) and included the results (new Figure 6).

  • https://github.com/jtextor/tcr-self-reactivity

Copyright 
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY 4.0 International license.
Back to top
PreviousNext
Posted March 22, 2023.
Download PDF
Data/Code
Email

Thank you for your interest in spreading the word about bioRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Machine learning analysis of the T cell receptor repertoire identifies sequence features that predict self-reactivity
(Your Name) has forwarded a page to you from bioRxiv
(Your Name) thought you would like to see this page from the bioRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Machine learning analysis of the T cell receptor repertoire identifies sequence features that predict self-reactivity
Johannes Textor, Franka Buytenhuijs, Dakota Rogers, Ève Mallet Gauthier, Shabaz Sultan, Inge M. N. Wortel, Kathrin Kalies, Anke Fähnrich, René Pagel, Heather J. Melichar, Jürgen Westermann, Judith N. Mandl
bioRxiv 2022.11.23.517563; doi: https://doi.org/10.1101/2022.11.23.517563
Reddit logo Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
Machine learning analysis of the T cell receptor repertoire identifies sequence features that predict self-reactivity
Johannes Textor, Franka Buytenhuijs, Dakota Rogers, Ève Mallet Gauthier, Shabaz Sultan, Inge M. N. Wortel, Kathrin Kalies, Anke Fähnrich, René Pagel, Heather J. Melichar, Jürgen Westermann, Judith N. Mandl
bioRxiv 2022.11.23.517563; doi: https://doi.org/10.1101/2022.11.23.517563

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Immunology
Subject Areas
All Articles
  • Animal Behavior and Cognition (4369)
  • Biochemistry (9546)
  • Bioengineering (7068)
  • Bioinformatics (24768)
  • Biophysics (12562)
  • Cancer Biology (9924)
  • Cell Biology (14297)
  • Clinical Trials (138)
  • Developmental Biology (7930)
  • Ecology (12074)
  • Epidemiology (2067)
  • Evolutionary Biology (15954)
  • Genetics (10904)
  • Genomics (14707)
  • Immunology (9844)
  • Microbiology (23582)
  • Molecular Biology (9454)
  • Neuroscience (50692)
  • Paleontology (369)
  • Pathology (1535)
  • Pharmacology and Toxicology (2674)
  • Physiology (3998)
  • Plant Biology (8639)
  • Scientific Communication and Education (1505)
  • Synthetic Biology (2388)
  • Systems Biology (6415)
  • Zoology (1344)