Multistain deep learning for prediction of prognosis and therapy response in colorectal cancer

Foersch, Sebastian; Glasner, Christina; Woerl, Ann-Christin; Eckstein, Markus; Wagner, Daniel-Christoph; Schulz, Stefan; Kellers, Franziska; Fernandez, Aurélie; Tserea, Konstantina; Kloth, Michael; Hartmann, Arndt; Heintz, Achim; Weichert, Wilko; Roth, Wilfried; Geppert, Carol; Kather, Jakob Nikolas; Jesinghaus, Moritz

doi:10.1038/s41591-022-02134-1

Article
Published: 09 January 2023

Multistain deep learning for prediction of prognosis and therapy response in colorectal cancer

Sebastian Foersch ORCID: orcid.org/0000-0002-4740-6900¹,
Christina Glasner¹,
Ann-Christin Woerl^1,2,
Markus Eckstein³,
Daniel-Christoph Wagner¹,
Stefan Schulz¹,
Franziska Kellers ORCID: orcid.org/0000-0003-1770-3610^1,4,
Aurélie Fernandez¹,
Konstantina Tserea¹,
Michael Kloth¹,
Arndt Hartmann³,
Achim Heintz⁵,
Wilko Weichert⁶,
Wilfried Roth¹,
Carol Geppert³,
Jakob Nikolas Kather ORCID: orcid.org/0000-0002-3730-5348^7,8,9 &
…
Moritz Jesinghaus ORCID: orcid.org/0000-0002-0018-5661^6,10

Nature Medicine volume 29, pages 430–439 (2023)Cite this article

12k Accesses
49 Citations
91 Altmetric
Metrics details

Subjects

Abstract

Although it has long been known that the immune cell composition has a strong prognostic and predictive value in colorectal cancer (CRC), scoring systems such as the immunoscore (IS) or quantification of intraepithelial lymphocytes are only slowly being adopted into clinical routine use and have their limitations. To address this we established and evaluated a multistain deep learning model (MSDLM) utilizing artificial intelligence (AI) to determine the AImmunoscore (AIS) in more than 1,000 patients with CRC. Our model had high prognostic capabilities and outperformed other clinical, molecular and immune cell-based parameters. It could also be used to predict the response to neoadjuvant therapy in patients with rectal cancer. Using an explainable AI approach, we confirmed that the MSDLM’s decisions were based on established cellular patterns of anti-tumor immunity. Hence, the AIS could provide clinicians with a valuable decision-making tool based on the tumor immune microenvironment.

Access through your institution

Buy or subscribe

This is a preview of subscription content, access via your institution

Access options

Access through your institution

Buy this article

Purchase on Springer Link
Instant access to full article PDF

Buy now

Prices may be subject to local taxes which are calculated during checkout

**Fig. 1: Clinical characteristics and CONSORT diagrams for the prognostic cohorts.**

**Fig. 2: Clinical characteristics and CONSORT diagrams for the neoadjuvant cohort.**

**Fig. 3: Training and cross-validation of the MSDLM.**

**Fig. 4: Determination and performance of the AIS.**

**Fig. 5: Assessment of the MSDLM using xAI.**

**Fig. 6: Predictive performance of the MSDLM in rectal cancer.**

Biology-guided deep learning predicts prognosis and cancer immunotherapy response

Article Open access 23 August 2023

Yuming Jiang, Zhicheng Zhang, … Ruijiang Li

Technological advances in cancer immunity: from immunogenomics to single-cell analysis and artificial intelligence

Article Open access 20 August 2021

Ying Xu, Guan-Hua Su, … Yi-Zhou Jiang

How can artificial intelligence models assist PD-L1 expression scoring in breast cancer: results of multi-institutional ring studies

Article Open access 26 May 2021

Xinran Wang, Liang Wang, … Yueping Liu

Data availability

No publicly available datasets were used in this study. The datasets and models were generated from institutional cohorts (Technical University Munich cohort, Comprehensive Cancer Centre Erlangen-EMN cohort, Mainz cohort, and the neoadjuvant cohort consisting of FFPE material in the form of TMAs) and cannot be made publicly available due to general data protection regulations and institutional guidelines. Example data from another use case (for trying out our approach) is available at https://zenodo.org/record/6791937.

Code availability

An open source version of the code base is available at https://github.com/AGFoersch/MultiStainDeepLearning. For more information please contact the corresponding author at sebastian.foersch@unimedizin-mainz.de.

References

Galon, J. et al. Type, density, and location of immune cells within human colorectal tumors predict clinical outcome. Science 313, 1960–1964 (2006).
Article CAS PubMed Google Scholar
Bindea, G. et al. Spatiotemporal dynamics of intratumoral immune cells reveal the immune landscape in human cancer. Immunity 39, 782–795 (2013).
Article CAS PubMed Google Scholar
Pages, F. et al. International validation of the consensus Immunoscore for the classification of colon cancer: a prognostic and accuracy study. Lancet 391, 2128–2139 (2018).
Article PubMed Google Scholar
Nagtegaal, I. D. et al. The 2019 WHO classification of tumours of the digestive system. Histopathology 76, 182–188 (2020).
Article PubMed Google Scholar
Argiles, G. et al. Localised colon cancer: ESMO Clinical Practice Guidelines for diagnosis, treatment and follow-up. Ann. Oncol. 31, 1291–1305 (2020).
Article CAS PubMed Google Scholar
Denkert, C. et al. Standardized evaluation of tumor-infiltrating lymphocytes in breast cancer: results of the ring studies of the International Immuno-oncology Biomarker Working Group. Mod. Pathol. 29, 1155–1164 (2016).
Article CAS PubMed Google Scholar
Angell, H. K., Bruni, D., Barrett, J. C., Herbst, R. & Galon, J. The immunoscore: colon cancer and beyond. Clin. Cancer Res. 26, 332–339 (2020).
Article CAS PubMed Google Scholar
Sinicrope, F. A. & Graham, R. P. Tumor-infiltrating lymphocytes for prognostic stratification in nonmetastatic colon cancer: are we there yet? JAMA Oncol. 7, 969–970 (2021).
Article PubMed Google Scholar
Schurch, C. M. et al. Coordinated cellular neighborhoods orchestrate antitumoral immunity at the colorectal cancer invasive front. Cell 182, 1341–1359 (2020).
Article CAS PubMed PubMed Central Google Scholar
Nearchou, I. P. et al. Automated analysis of lymphocytic infiltration, tumor budding, and their spatial relationship improves prognostic accuracy in colorectal cancer. Cancer Immunol. Res. 7, 609–620 (2019).
Article CAS PubMed Google Scholar
Vayrynen, J. P. et al. Spatial organization and prognostic significance of NK and NKT-like cells via multimarker analysis of the colorectal cancer microenvironment. Cancer Immunol. Res. 10, 215–227 (2022).
Article CAS PubMed Google Scholar
Ribeiro, A. H. et al. Automatic diagnosis of the 12-lead ECG using a deep neural network. Nat. Commun. 11, 1760 (2020).
Article CAS PubMed PubMed Central Google Scholar
De Fauw, J. et al. Clinically applicable deep learning for diagnosis and referral in retinal disease. Nat. Med. 24, 1342–1350 (2018).
Article PubMed Google Scholar
Ardila, D. et al. End-to-end lung cancer screening with three-dimensional deep learning on low-dose chest computed tomography. Nat. Med. 25, 954–961 (2019).
Article CAS PubMed Google Scholar
Esteva, A. et al. Dermatologist-level classification of skin cancer with deep neural networks. Nature 542, 115–118 (2017).
Article CAS PubMed PubMed Central Google Scholar
Saldanha, O. L. et al. Swarm learning for decentralized artificial intelligence in cancer histopathology. Nat. Med. 28, 1232–1239 (2022).
Article CAS PubMed PubMed Central Google Scholar
Woerl, A. C. et al. Deep learning predicts molecular subtype of muscle-invasive bladder cancer from conventional histopathological slides. Eur. Urol. 78, 256–264 (2020).
Article CAS PubMed Google Scholar
Cifci, D., Foersch, S. & Kather, J. N. Artificial intelligence to identify genetic alterations in conventional histopathology. J. Pathol. 257, 430–444 (2022).
Article PubMed Google Scholar
Foersch, S. et al. Deep learning for diagnosis and survival prediction in soft tissue sarcoma. Ann. Oncol. 32, 1178–1187 (2021).
Article CAS PubMed Google Scholar
Schulz, S. et al. Multimodal deep learning for prognosis prediction in renal cancer. Front. Oncol. 11, 788740 (2021).
Article PubMed PubMed Central Google Scholar
Huang, S. C., Pareek, A., Seyyedi, S., Banerjee, I. & Lungren, M. P. Fusion of medical imaging and electronic health records using deep learning: a systematic review and implementation guidelines. NPJ Digit. Med. 3, 136 (2020).
Article PubMed PubMed Central Google Scholar
Chen, R. J. et al. Pathomic fusion: an integrated framework for fusing histopathology and genomic features for cancer diagnosis and prognosis. IEEE Trans. Med. Imaging 41, 757–770 (2022).
Article PubMed Google Scholar
Chen, R. J. et al. Pan-cancer integrative histology-genomic analysis via multimodal deep learning. Cancer Cell 40, 865–878 (2022).
Article CAS PubMed Google Scholar
Bubendorf, L., Nocito, A., Moch, H. & Sauter, G. Tissue microarray (TMA) technology: miniaturized pathology archives for high-throughput in situ studies. J. Pathol. 195, 72–79 (2001).
Article CAS PubMed Google Scholar
Skrede, O. J. et al. Deep learning for prediction of colorectal cancer outcome: a discovery and validation study. Lancet 395, 350–360 (2020).
Article CAS PubMed Google Scholar
El Sissy, C. et al. A diagnostic biopsy-adapted immunoscore predicts response to neoadjuvant treatment and selects patients with rectal cancer eligible for a watch-and-wait strategy. Clin. Cancer Res. 26, 5198–5207 (2020).
Article CAS PubMed Google Scholar
Galon, J. et al. Towards the introduction of the ‘Immunoscore’ in the classification of malignant tumours. J. Pathol. 232, 199–209 (2014).
Article CAS PubMed Google Scholar
Kather, J. N. et al. Topography of cancer-associated immune cells in human solid tumors. Elife 7, e36967 (2018).
Article PubMed PubMed Central Google Scholar
Saltz, J. et al. Spatial organization and molecular correlation of tumor-infiltrating lymphocytes using deep learning on pathology images. Cell Rep. 23, 181–193 (2018).
Article CAS PubMed PubMed Central Google Scholar
Reichling, C. et al. Artificial intelligence-guided tissue analysis combined with immune infiltrate assessment predicts stage III colon cancer outcomes in PETACC08 study. Gut 69, 681–690 (2020).
Article CAS PubMed Google Scholar
Boehm, K. M. et al. Multimodal data integration using machine learning improves risk stratification of high-grade serous ovarian cancer. Nat. Cancer 3, 723–733 (2022).
Article CAS PubMed PubMed Central Google Scholar
Fridman, W. H. et al. B cells and cancer: to B or not to B? J. Exp. Med. 218, e20200851 (2021).
Article CAS PubMed Google Scholar
Aras, S. & Zaidi, M. R. TAMeless traitors: macrophages in cancer progression and metastasis. Br. J. Cancer 117, 1583–1591 (2017).
Article CAS PubMed PubMed Central Google Scholar
Wulczyn, E. et al. Interpretable survival prediction for colorectal cancer using deep learning. NPJ Digit. Med. 4, 71 (2021).
Article PubMed PubMed Central Google Scholar
Brockmoeller, S. et al. Deep learning identifies inflamed fat as a risk factor for lymph node metastasis in early colorectal cancer. J. Pathol. 256, 269–281 (2022).
Article CAS PubMed Google Scholar
Kleppe, A. et al. Designing deep learning studies in cancer diagnostics. Nat. Rev. Cancer 21, 199–211 (2021).
Article CAS PubMed Google Scholar
Dworak, O., Keilholz, L. & Hoffmann, A. Pathological features of rectal cancer after preoperative radiochemotherapy. Int. J. Colorectal Dis. 12, 19–23 (1997).
Article CAS PubMed Google Scholar
Bankhead, P. et al. QuPath: open source software for digital pathology image analysis. Sci. Rep. 7, 16878 (2017).
Article PubMed PubMed Central Google Scholar
Reinhard, E., Adhikhmin, M., Gooch, B. & Shirley, P. Color transfer between images. IEEE Comput. Graph. Appl. 21, 34–41 (2001).
Article Google Scholar
He, K., Zhang, X., Ren, S. & Sun, J. Deep residual learning for image recognition. In 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 770–778 (IEEE, 2016).
Selvaraju, R.R., et al. Grad-CAM: visual explanations from deep networks via gradient-based localization. In 2017 IEEE International Conference on Computer Vision (ICCV) 618–626 (IEEE, 2020).

Download references

Acknowledgements

S.F. was supported by the Federal Ministry of Education and Research (16SV8167), the Stage-I-Program of the University Medical Center Mainz, the Mainz Research School of Translational Biomedicine (TransMed) and the Manfred-Stolte-Foundation. J.N.K. is supported by the German Federal Ministry of Health (DEEP LIVER, ZMVI1-2520DAT111) and the Max-Eder-Program of the German Cancer Aid (grant 70113864). Aspects of this work are part of the medical doctoral theses of S.S., F.K. and K.T.

Author information

Authors and Affiliations

Institute of Pathology, University Medical Center Mainz, Mainz, Germany
Sebastian Foersch, Christina Glasner, Ann-Christin Woerl, Daniel-Christoph Wagner, Stefan Schulz, Franziska Kellers, Aurélie Fernandez, Konstantina Tserea, Michael Kloth & Wilfried Roth
Institute of Computer Science, Johannes Gutenberg University Mainz, Mainz, Germany
Ann-Christin Woerl
Institute of Pathology and Comprehensive Cancer Center EMN, University Hospital Erlangen, Friedrich-Alexander-Universität Erlangen-Nürnberg, Erlangen, Germany
Markus Eckstein, Arndt Hartmann & Carol Geppert
Department of Pathology, University Hospital Schleswig-Holstein, Kiel, Germany
Franziska Kellers
Department of General Visceral and Vascular Surgery, Marien Hospital Mainz, Mainz, Germany
Achim Heintz
Institute of Pathology, Technical University Munich, Munich, Germany
Wilko Weichert & Moritz Jesinghaus
Department of Medicine III, University Hospital RWTH Aachen, Aachen, Germany
Jakob Nikolas Kather
Pathology and Data Analytics, Leeds Institute of Medical Research at St James’s, University of Leeds, Leeds, UK
Jakob Nikolas Kather
Else Kroener Fresenius Center for Digital Health, Medical Faculty Carl Gustav Carus, Technical University Dresden, Dresden, Germany
Jakob Nikolas Kather
Institute of Pathology, University Hospital Marburg, Marburg, Germany
Moritz Jesinghaus

Authors

Sebastian Foersch
View author publications
You can also search for this author in PubMed Google Scholar
Christina Glasner
View author publications
You can also search for this author in PubMed Google Scholar
Ann-Christin Woerl
View author publications
You can also search for this author in PubMed Google Scholar
Markus Eckstein
View author publications
You can also search for this author in PubMed Google Scholar
Daniel-Christoph Wagner
View author publications
You can also search for this author in PubMed Google Scholar
Stefan Schulz
View author publications
You can also search for this author in PubMed Google Scholar
Franziska Kellers
View author publications
You can also search for this author in PubMed Google Scholar
Aurélie Fernandez
View author publications
You can also search for this author in PubMed Google Scholar
Konstantina Tserea
View author publications
You can also search for this author in PubMed Google Scholar
Michael Kloth
View author publications
You can also search for this author in PubMed Google Scholar
Arndt Hartmann
View author publications
You can also search for this author in PubMed Google Scholar
Achim Heintz
View author publications
You can also search for this author in PubMed Google Scholar
Wilko Weichert
View author publications
You can also search for this author in PubMed Google Scholar
Wilfried Roth
View author publications
You can also search for this author in PubMed Google Scholar
Carol Geppert
View author publications
You can also search for this author in PubMed Google Scholar
Jakob Nikolas Kather
View author publications
You can also search for this author in PubMed Google Scholar
Moritz Jesinghaus
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Conception and design: S.F., C. Glasner, A.-C.W., D.-C.W.; acquisition of data: S.F., M.E., D.-C.W., S.S., F.K., K.T., C. Geppert, M.J.; analysis and interpretation of data: S.F., C. Glasner, D.-C.W., M.E., M.J.; drafting of the manuscript: S.F.; critical revision of the manuscript for important intellectual content: C. Glasner, M.E., D.-C.W., J.N.K., M.J.; statistical analysis: S.F., C. Glasner, A.-C.W.; obtaining funding: S.F., W.R.; administrative, technical or material support: A.F., A. Hartmann, A. Heintz, W.W., M.K., C. Geppert, M.J.; supervision: S.F., W.R., M.J.

Corresponding author

Correspondence to Sebastian Foersch.

Ethics declarations

Competing interests

J.N.K. reports consulting services for Owkin (France), Panakeia (UK), and DoMore Diagnostics (Norway) and has received honoraria for lectures by MSD, Eisai, and Fresenius. The other authors declare no competing interests.

Peer review

Peer review information

Nature Medicine thanks Tae Hyun Hwang, Timothy Maughan and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Primary Handling Editors: Ulrike Harjes and Saheli Sadanand, in collaboration with the Nature Medicine team.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Extended data

Extended Data Fig. 1

a: Scatter plots of CD3, CD8, CD4, CD20, and CD68 positive immune cell counts derived from whole slide images and tissue microarrays. Spearman’s correlation coefficients and p-values from two-sided testing are given. Note that for all markers but CD3 the scale is logarithmic. The line indicates a linear regression model with indication of the 95% confidence interval. b: Confusion matrices of the IS 2 and IS 3 derived from both TMA and WSI. Results of Fisher’s exact and Chi2 test are provided. c: Scatter plots of MLH1, MSH2, MSH6, and PMS2 expression derived from whole slide images and tissue microarrays. Spearman’s correlation coefficients and p-values from two-sided testing are given. The line indicates a linear regression model with indication of the 95% confidence interval. N = 51 for all stainings (except PMS2 where n = 45). (WSI: Whole slide image. TMA: Tissue microarray).

Extended Data Fig. 2

a: Procedure of the TMA generation for the TUM–CCC-EMN cohort. For the Erlangen Cohort three cores from the invasive margin and three cores form the tumor center were used. For the Munich cohort, one core from the invasive margin and one core from the tumor center was used. b: Procedure of the TMA generation for the Mainz cohort. Three TMA cores were taken according to the ratio of invasive margin to tumor center. c, d: Further preprocessing included serial sectioning, immunohistochemistry, and preprocessing of the TMA cores and the tiles. Scale bars on the left subpanel represent ca. 200 µm. Scale bars on the right represent ca. 100 µm. e: Examples of excluded cores. Scale bars represent ca. 200 µm. (TMA: Tissue microarray, NAT: Normal adjacent tissue (not used in this study)). Some illustrations were generated with BioRender.com.

Extended Data Fig. 3 Overview of the SSDLM (Single-stain deep learning model).

Arrow labels indicate each component’s output dimensions. Dim, mmhid, num_classes, etc. are defined model parameters. Fc(x, y) represents a fully connected layer with the input dimension x and the output dimension y.

Extended Data Fig. 4 Overview of the MSDLM (Multistain deep learning model).

Arrow labels indicate each component’s output dimensions. Dim, mmhid, num_classes, etc. are defined model parameters. Fc(x, y) represents a fully connected layer with the input dimension x and the output dimension y. Subblocks 1, … n-1 are structured identically to subblock 0.

Extended Data Fig. 5

a-d: Accuracy (A), AUPRC (B), AUROC (C), and F1-Score (D) of the MSDLM and different classical machine learning techniques after training validation on the Mainz cohort. N = 11 models trained during 11-fold cross validation per group. One-way ANOVA with Dunnett Test to correct for multiple testing was used. Statistical significance is indicated by asterisks as described in the M&M section for the comparison to the MSDLM (p > 0.05:ns, p ≤ 0.05:*, p ≤ 0.01:**, p ≤ 0.001:***, p ≤ 0.0001:****). The 10th, 50th (Median), and 90th quantile as well as the minimum and maximum are depicted. E, F: Precision-recall and receiver operator characteristics curves of the MSDLM and the classical machine learning techniques. The mean of the 11-fold cross validation is shown. Shaded area indicates 1 standard deviation (std. dev.). (AUPRC: Area under the precision recall curve. AUROC: Area under the receiver operator characteristic. MSDLM: multistain deep learning model. SVM: Support vector machine classifier. RF: Random forest classifier. LR: Logistic regression classifier. GB: Gradient boosted (decision tree) classifier).

Extended Data Fig. 6

a: Kaplan–Meier curves of the test cohort for each UICC stage stratified by AImmunoscore (AIS). N = 61 for UICC stage I, n = 128 for UICC stage II, n = 114 for UICC stage III, and n = 36 for UICC stage IV. Censors are indicated with a ‘+’. Log-rank test was used.

Supplementary information

Reporting Summary

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Foersch, S., Glasner, C., Woerl, AC. et al. Multistain deep learning for prediction of prognosis and therapy response in colorectal cancer. Nat Med 29, 430–439 (2023). https://doi.org/10.1038/s41591-022-02134-1

Download citation

Received: 18 May 2022
Accepted: 17 November 2022
Published: 09 January 2023
Issue Date: February 2023
DOI: https://doi.org/10.1038/s41591-022-02134-1

This article is cited by

SARIFA as a new histopathological biomarker is associated with adverse clinicopathological characteristics, tumor-promoting fatty-acid metabolism, and might predict a metastatic pattern in pT3a prostate cancer
- Johanna S. Enke
- Matthias Groß
- Nic G. Reitsam
BMC Cancer (2024)
Novel biomarker SARIFA in colorectal cancer: highly prognostic, not genetically driven and histologic indicator of a distinct tumor biology
- Nic G. Reitsam
- Veselin Grozdanov
- Bruno Märkl
Cancer Gene Therapy (2024)
Future direction of total neoadjuvant therapy for locally advanced rectal cancer
- Yoshinori Kagawa
- J. Joshua Smith
- Takayuki Yoshino
Nature Reviews Gastroenterology & Hepatology (2024)
Artificial intelligence in liver cancer — new tools for research and patient management
- Julien Calderaro
- Laura Žigutytė
- Jakob Nikolas Kather
Nature Reviews Gastroenterology & Hepatology (2024)
Towards a general-purpose foundation model for computational pathology
- Richard J. Chen
- Tong Ding
- Faisal Mahmood
Nature Medicine (2024)