Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

Linking Gene Expression to Clinical Outcomes in Pediatric Crohn’s Disease Using Machine Learning

Kevin A Chen, Nina Nishiyama, Meaghan M Kennedy Ng, Alexandra Shumway, Chinmaya U Joisa, Matthew R Schaner, Grace Lian, Caroline Beasley, Lee-Ching Zhu, Surekha Bantumilli, Muneera R Kapadia, Shawn M Gomez, Terrence S Furey, Shehzad Z Sheikh
doi: https://doi.org/10.1101/2022.11.07.515480
Kevin A Chen
aCenter for Gastrointestinal Biology and Disease, University of North Carolina at Chapel Hill, Chapel Hill, USA
bDepartment of Surgery, University of North Carolina at Chapel Hill, Chapel Hill, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Nina Nishiyama
aCenter for Gastrointestinal Biology and Disease, University of North Carolina at Chapel Hill, Chapel Hill, USA
cDepartment of Genetics, Curriculum in Bioinformatics and Computational Biology, University of North Carolina at Chapel Hill, Chapel Hill, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Meaghan M Kennedy Ng
aCenter for Gastrointestinal Biology and Disease, University of North Carolina at Chapel Hill, Chapel Hill, USA
cDepartment of Genetics, Curriculum in Bioinformatics and Computational Biology, University of North Carolina at Chapel Hill, Chapel Hill, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Alexandra Shumway
dDepartment of Biomedical Sciences, College of Veterinary Medicine, Cornell University, Ithaca, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Chinmaya U Joisa
eJoint Department of Biomedical Engineering, University of North Carolina at Chapel Hill, Chapel Hill, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Matthew R Schaner
aCenter for Gastrointestinal Biology and Disease, University of North Carolina at Chapel Hill, Chapel Hill, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Grace Lian
aCenter for Gastrointestinal Biology and Disease, University of North Carolina at Chapel Hill, Chapel Hill, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Caroline Beasley
aCenter for Gastrointestinal Biology and Disease, University of North Carolina at Chapel Hill, Chapel Hill, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Lee-Ching Zhu
fDepartment of Pathology and Laboratory Medicine, University of North Carolina at Chapel Hill, Chapel Hill, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Surekha Bantumilli
fDepartment of Pathology and Laboratory Medicine, University of North Carolina at Chapel Hill, Chapel Hill, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Muneera R Kapadia
bDepartment of Surgery, University of North Carolina at Chapel Hill, Chapel Hill, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Shawn M Gomez
eJoint Department of Biomedical Engineering, University of North Carolina at Chapel Hill, Chapel Hill, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Terrence S Furey
aCenter for Gastrointestinal Biology and Disease, University of North Carolina at Chapel Hill, Chapel Hill, USA
cDepartment of Genetics, Curriculum in Bioinformatics and Computational Biology, University of North Carolina at Chapel Hill, Chapel Hill, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: tsfurey@email.unc.edu shehzad_sheikh@med.unc.edu
Shehzad Z Sheikh
aCenter for Gastrointestinal Biology and Disease, University of North Carolina at Chapel Hill, Chapel Hill, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: tsfurey@email.unc.edu shehzad_sheikh@med.unc.edu
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Preview PDF
Loading

Abstract

Introduction Pediatric Crohn’s disease (CD) is the fastest growing age group and is characterized by frequent disease complications. We sought to analyze both ileal and colonic gene expression in a cohort of pediatric CD patients and apply machine learning-based models to predict risk of developing future complications.

Methods RNA-seq was generated from matched ileal and colonic biopsies from formalin-fixed, paraffin-embedded (FFPE) tissue obtained from patients with non-stricturing/non-penetrating, treatment-naïve CD and from controls. Clinical outcomes including development of strictures or fistulas, progression to surgery, and remission were analyzed first using differential expression. Machine learning models were then developed for each outcome, combining gene expression and clinical factors. Models were assessed using area under the receiver operating characteristic curve (AUROC).

Results 56 patients with CD and 46 controls were included. Differential expression analysis revealed a distinct colonic transcriptome for patients who developed strictures, with downregulation of pathways related to inflammation and extra-cellular matrix production. In contrast, there were few differentially expressed genes for other outcomes and for ileal tissue. Despite this, machine learning-based models were able to incorporate colonic gene expression and clinical characteristics to predict outcomes with high accuracy. Models showed an AUROC of 0.84 for strictures, 0.83 for remission, and 0.75 for surgery. Certain genes with potential prognostic importance for strictures (REG1A, MMP3, and DUOX2) were not identified in single gene differential analysis but were found to have strong contributions to predictive models.

Conclusions Our findings in FFPE tissue support the importance of colonic gene expression and the potential for machine learning-based models in predicting outcomes for pediatric CD.

Competing Interest Statement

The authors have declared no competing interest.

Footnotes

  • Updated abstract

Copyright 
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC-ND 4.0 International license.
Back to top
PreviousNext
Posted November 14, 2022.
Download PDF
Email

Thank you for your interest in spreading the word about bioRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Linking Gene Expression to Clinical Outcomes in Pediatric Crohn’s Disease Using Machine Learning
(Your Name) has forwarded a page to you from bioRxiv
(Your Name) thought you would like to see this page from the bioRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Linking Gene Expression to Clinical Outcomes in Pediatric Crohn’s Disease Using Machine Learning
Kevin A Chen, Nina Nishiyama, Meaghan M Kennedy Ng, Alexandra Shumway, Chinmaya U Joisa, Matthew R Schaner, Grace Lian, Caroline Beasley, Lee-Ching Zhu, Surekha Bantumilli, Muneera R Kapadia, Shawn M Gomez, Terrence S Furey, Shehzad Z Sheikh
bioRxiv 2022.11.07.515480; doi: https://doi.org/10.1101/2022.11.07.515480
Reddit logo Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
Linking Gene Expression to Clinical Outcomes in Pediatric Crohn’s Disease Using Machine Learning
Kevin A Chen, Nina Nishiyama, Meaghan M Kennedy Ng, Alexandra Shumway, Chinmaya U Joisa, Matthew R Schaner, Grace Lian, Caroline Beasley, Lee-Ching Zhu, Surekha Bantumilli, Muneera R Kapadia, Shawn M Gomez, Terrence S Furey, Shehzad Z Sheikh
bioRxiv 2022.11.07.515480; doi: https://doi.org/10.1101/2022.11.07.515480

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Bioinformatics
Subject Areas
All Articles
  • Animal Behavior and Cognition (4227)
  • Biochemistry (9107)
  • Bioengineering (6751)
  • Bioinformatics (23944)
  • Biophysics (12089)
  • Cancer Biology (9495)
  • Cell Biology (13740)
  • Clinical Trials (138)
  • Developmental Biology (7616)
  • Ecology (11661)
  • Epidemiology (2066)
  • Evolutionary Biology (15479)
  • Genetics (10617)
  • Genomics (14296)
  • Immunology (9462)
  • Microbiology (22792)
  • Molecular Biology (9078)
  • Neuroscience (48888)
  • Paleontology (355)
  • Pathology (1479)
  • Pharmacology and Toxicology (2565)
  • Physiology (3823)
  • Plant Biology (8308)
  • Scientific Communication and Education (1467)
  • Synthetic Biology (2290)
  • Systems Biology (6172)
  • Zoology (1297)