ABSTRACT
Tumour immunity is key for the prognosis and treatment of colon adenocarcinoma, but its characterisation remains cumbersome and expensive, requiring sequencing or other complex assays. Detecting tumour-infiltrating lymphocytes in haematoxylin and eosin (H&E) slides of cancer tissue would provide a cost-effective alternative to support clinicians in treatment decisions, but inter- and intra-observer variability can arise even amongst experienced pathologists. Furthermore, the compounded effect of other cells in the tumour microenvironment is challenging to quantify but could yield useful additional biomarkers. We combined RNA sequencing, digital pathology and deep learning through the InceptionV3 architecture to develop a fully automated computer vision model that detects prognostic tumour immunity levels in H&E slides of colon adenocarcinoma with an area under the curve (AUC) of 82%. Amongst tumour infiltrating T cell subsets, we demonstrate that CD8+ effector memory T cell patterns are most recognisable algorithmically with an average AUC of 83%. We subsequently applied nuclear segmentation and classification via HoVer-Net to derive complex cell-cell interaction graphs, which we queried efficiently through a bespoke Neo4J graph database. This uncovered stromal barriers and lymphocyte triplets that could act as structural hallmarks of low immunity tumours with poor prognosis. Our integrated deep learning and graph-based workflow provides evidence for the feasibility of automated detection of complex immune cytotoxicity patterns within H&E-stained colon cancer slides, which could inform new cellular biomarkers and support treatment management of this disease in the future.
Competing Interest Statement
The authors have declared no competing interest.
Footnotes
Financial support: MPC was supported by an Academy of Medical Science Springboard award (SBF004\1042). GMT was supported by a Wellcome Seed Award in Science (215296/Z/19/Z). EW acknowledges the receipt of a studentship award from the Health Data Research UK-The Alan Turing Institute Wellcome PhD Programme in Health Data Science (Grant Ref: 218529/Z/19/Z). MS and SP were supported by a UKRI Future Leaders Fellowship (MR/T042184/1). Work in MS’s lab was supported by a BBSRC equipment grant (BB/R01356X/1) and a Wellcome Institutional Strategic Support Fund (204841/Z/16/Z). For the purpose of Open Access, the author has applied a CC BY public copyright licence to any Author Accepted Manuscript version arising from this submission.
Conflict of interest: The authors declare no potential conflicts of interest.