Development and validation of deep learning classifiers to detect Epstein-Barr virus and microsatellite instability status in gastric cancer: a retrospective multicentre cohort study

H.S. Muti; L.R. Heij; G. Keller; M. Kohlruss; R. Langer; B. Dislich; J.H. Cheong; Y.W. Kim; H. Kim; M.C. Kook; D. Cunningham; W.H. Allum; R.E. Langley; M.G. Nankivell; P. Quirke; J.D. Hayden; N.P. West; A.J. Irvine; T. Yoshikawa; T. Oshima; R. Huss; B. Grosser; F. Roviello; A. d'Ignazio; A. Quaas; H. Alakus; X.X. Tan; A.T. Pearson; T. Luedde; M.P. Ebert; D. Jager; C. Trautwein; N.T. Gaisa; H.I. Grabsch; J.N. Kather

doi:10.1016/S2589-7500(21)00133-3

Development and validation of deep learning classifiers to detect Epstein-Barr virus and microsatellite instability status in gastric cancer: a retrospective multicentre cohort study

H.S. Muti, L.R. Heij, G. Keller, M. Kohlruss, R. Langer, B. Dislich, J.H. Cheong, Y.W. Kim, H. Kim, M.C. Kook, D. Cunningham, W.H. Allum, R.E. Langley, M.G. Nankivell, P. Quirke, J.D. Hayden, N.P. West, A.J. Irvine, T. Yoshikawa, T. OshimaR. Huss, B. Grosser, F. Roviello, A. d'Ignazio, A. Quaas, H. Alakus, X.X. Tan, A.T. Pearson, T. Luedde, M.P. Ebert, D. Jager, C. Trautwein, N.T. Gaisa, H.I. Grabsch, J.N. Kather^*

^*Corresponding author for this work

Research output: Contribution to journal › Article › Academic › peer-review

Abstract

Background Response to immunotherapy in gastric cancer is associated with microsatellite instability (or mismatch repair deficiency) and Epstein-Barr virus (EBV) positivity. We therefore aimed to develop and validate deep learning based classifiers to detect microsatellite instability and EBV status from routine histology slides. Methods In this retrospective, multicentre study, we collected tissue samples from ten cohorts of patients with gastric cancer from seven countries (South Korea, Switzerland, Japan, Italy, Germany, the UK and the USA). We trained a deep learning-based classifier to detect microsatellite instability and EBV positivity from digitised, haematoxylin and eosin stained resection slides without annotating tumour containing regions. The performance of the classifier was assessed by within-cohort cross-validation in all ten cohorts and by external validation, for which we split the cohorts into a five-cohort training dataset and a five-cohort test dataset. We measured the area under the receiver operating curve (AUROC) for detection of microsatellite instability and EBV status. Microsatellite instability and EBV status were determined to be detectable if the lower bound of the 95% CI for the AUROC was above 0middot5. Findings Across the ten cohorts, our analysis included 2823 patients with known microsatellite instability status and 2685 patients with known EBV status. In the within-cohort cross-validation, the deep learning-based classifier could detect microsatellite instability status in nine of ten cohorts, with AUROCs ranging from 0middot597 (95% CI 0middot522-0middot737) to 0middot836 (0middot795-0middot880) and EBV status in five of eight cohorts, with AUROCs ranging from 0middot819 (0middot752-0middot841) to 0middot897 (0middot513-0middot966). Training a classifier on the pooled training dataset and testing it on the five remaining cohorts resulted in high classification performance with AUROCs ranging from 0middot723 (95% CI 0middot676-0middot794) to 0middot863 (0middot747-0middot969) for detection of microsatellite instability and from 0middot672 (0middot403-0middot989) to 0middot859 (0middot823-0middot919) for detection of EBV status. Interpretation Classifiers became increasingly robust when trained on pooled cohorts. After prospective validation, this deep learning-based tissue classification system could be used as an inexpensive predictive biomarker for immunotherapy in gastric cancer. Funding German Cancer Aid and German Federal Ministry of Health. Copyright (c) 2021 The Author(s). Published by Elsevier Ltd. This is an Open Access article under the CC BY 4.0 license.

Original language	English
Pages (from-to)	E654-E664
Number of pages	11
Journal	The Lancet Digital Health
Volume	3
Issue number	10
DOIs	https://doi.org/10.1016/S2589-7500(21)00133-3
Publication status	Published - 1 Oct 2021

Keywords

MICROSATELLITE INSTABILITY
GASTRIC-CANCER

Access to Document

10.1016/S2589-7500(21)00133-3Licence: CC BY

1 Erratum / corrigendum / retractions

Correction to Lancet Digit Health 2021; published online Aug 17. https://doi.org/10.1016/S2589-7500(21)00133-3
Muti, H. S., Heij, L. R. & Keller, G., Oct 2021, In: The Lancet Digital Health. 3, 10, p. e622-e622 1 p.
Research output: Contribution to journal › Erratum / corrigendum / retractions › Academic

Open Access

Cite this

Muti, H. S., Heij, L. R., Keller, G., Kohlruss, M., Langer, R., Dislich, B., Cheong, J. H., Kim, Y. W., Kim, H., Kook, M. C., Cunningham, D., Allum, W. H., Langley, R. E., Nankivell, M. G., Quirke, P., Hayden, J. D., West, N. P., Irvine, A. J., Yoshikawa, T., ... Kather, J. N. (2021). Development and validation of deep learning classifiers to detect Epstein-Barr virus and microsatellite instability status in gastric cancer: a retrospective multicentre cohort study. The Lancet Digital Health, 3(10), E654-E664. https://doi.org/10.1016/S2589-7500(21)00133-3

@article{4fff03041f5a4709987eb04f6c22614c,

title = "Development and validation of deep learning classifiers to detect Epstein-Barr virus and microsatellite instability status in gastric cancer: a retrospective multicentre cohort study",

abstract = "Background Response to immunotherapy in gastric cancer is associated with microsatellite instability (or mismatch repair deficiency) and Epstein-Barr virus (EBV) positivity. We therefore aimed to develop and validate deep learning based classifiers to detect microsatellite instability and EBV status from routine histology slides. Methods In this retrospective, multicentre study, we collected tissue samples from ten cohorts of patients with gastric cancer from seven countries (South Korea, Switzerland, Japan, Italy, Germany, the UK and the USA). We trained a deep learning-based classifier to detect microsatellite instability and EBV positivity from digitised, haematoxylin and eosin stained resection slides without annotating tumour containing regions. The performance of the classifier was assessed by within-cohort cross-validation in all ten cohorts and by external validation, for which we split the cohorts into a five-cohort training dataset and a five-cohort test dataset. We measured the area under the receiver operating curve (AUROC) for detection of microsatellite instability and EBV status. Microsatellite instability and EBV status were determined to be detectable if the lower bound of the 95% CI for the AUROC was above 0middot5. Findings Across the ten cohorts, our analysis included 2823 patients with known microsatellite instability status and 2685 patients with known EBV status. In the within-cohort cross-validation, the deep learning-based classifier could detect microsatellite instability status in nine of ten cohorts, with AUROCs ranging from 0middot597 (95% CI 0middot522-0middot737) to 0middot836 (0middot795-0middot880) and EBV status in five of eight cohorts, with AUROCs ranging from 0middot819 (0middot752-0middot841) to 0middot897 (0middot513-0middot966). Training a classifier on the pooled training dataset and testing it on the five remaining cohorts resulted in high classification performance with AUROCs ranging from 0middot723 (95% CI 0middot676-0middot794) to 0middot863 (0middot747-0middot969) for detection of microsatellite instability and from 0middot672 (0middot403-0middot989) to 0middot859 (0middot823-0middot919) for detection of EBV status. Interpretation Classifiers became increasingly robust when trained on pooled cohorts. After prospective validation, this deep learning-based tissue classification system could be used as an inexpensive predictive biomarker for immunotherapy in gastric cancer. Funding German Cancer Aid and German Federal Ministry of Health. Copyright (c) 2021 The Author(s). Published by Elsevier Ltd. This is an Open Access article under the CC BY 4.0 license.",

keywords = "MICROSATELLITE INSTABILITY, GASTRIC-CANCER",

author = "H.S. Muti and L.R. Heij and G. Keller and M. Kohlruss and R. Langer and B. Dislich and J.H. Cheong and Y.W. Kim and H. Kim and M.C. Kook and D. Cunningham and W.H. Allum and R.E. Langley and M.G. Nankivell and P. Quirke and J.D. Hayden and N.P. West and A.J. Irvine and T. Yoshikawa and T. Oshima and R. Huss and B. Grosser and F. Roviello and A. d'Ignazio and A. Quaas and H. Alakus and X.X. Tan and A.T. Pearson and T. Luedde and M.P. Ebert and D. Jager and C. Trautwein and N.T. Gaisa and H.I. Grabsch and J.N. Kather",

note = "Funding Information: HIG was supported by Cancer Research UK. We thank all investigators and contributing pathologists from the TCGA study. PQ and NPW are supported by Yorkshire Cancer Research programme grants L386 and L394. PQ is a National Institute of Health Senior investigator. CT was supported by the German Research Foundation (SFB CRC1382, SFB-TRR57). MPE was supported by the German Research Foundation (GRK2727). TL was funded by Horizon 2020 through the European Research Council Consolidator Grant PhaseControl (771083) and the German Research Foundation (SFB-CRC1382 and LU 1360/3-2). JNK is funded by the Max-Eder-Programme of the German Cancer Aid (Bonn, Germany; grant #70113864) and the START Programme of the Medical Faculty Aachen (Aachen, Germany, grant #691906). JNK and TL are funded by the German Ministry of Health (funding based on a resolution of the German Bundestag by the federal government; grant DEEP LIVER, #ZMVI1-2520DAT111). Funding Information: JNK declares consulting roles for OWKIN France and Panakeia (UK) without any direct connection to this work; these roles started in April, 2021, after conducting the present study. JNK also declares honoraria from MSD and Eisai. DC declares grants from Medimmune/AstraZeneca, Clovis, Eli Lilly, 4SC, Bayer, Celgene, Leap, and Roche, and Scientific Board Membership for OVIBIO. DJ declares consulting services and advisory board participation for CureVac AG, Definiens, F Hoffmann-La Roche, Genmab A-S, Life Science Inkubator GmbH, VAXIMM AG, OncoOne Research & Development Research GmbH, and Oncolytics Biotech; payment or honoraria from SKK Kliniken Heilbronn, Georg Thieme Verlag, Terrapinn, Touch Medical Medica, BMS GmbH & Co KG, and MSD; reimbursements for expert opinion on medical questions from Wilhelm-Sander Foundation, Else-Kr{\"o}ner-Fresenius Foundation, Scherer Foundation, and NordForsk; meeting support (ie, for travel) from Amgen, Oryx GmbH, Roche Glycart AG, Parexel.com , IKTZ HD GmbH, and BMS; and leadership in the BMS Foundation Immunooncology. All other authors declare no competing interests. Publisher Copyright: {\textcopyright} 2021 The Author(s). Published by Elsevier Ltd. This is an Open Access article under the CC BY 4.0 license",

year = "2021",

month = oct,

day = "1",

doi = "10.1016/S2589-7500(21)00133-3",

language = "English",

volume = "3",

pages = "E654--E664",

journal = "The Lancet Digital Health",

issn = "2589-7500",

publisher = "Lancet Publishing Group",

number = "10",

}

Muti, HS, Heij, LR, Keller, G, Kohlruss, M, Langer, R, Dislich, B, Cheong, JH, Kim, YW, Kim, H, Kook, MC, Cunningham, D, Allum, WH, Langley, RE, Nankivell, MG, Quirke, P, Hayden, JD, West, NP, Irvine, AJ, Yoshikawa, T, Oshima, T, Huss, R, Grosser, B, Roviello, F, d'Ignazio, A, Quaas, A, Alakus, H, Tan, XX, Pearson, AT, Luedde, T, Ebert, MP, Jager, D, Trautwein, C, Gaisa, NT, Grabsch, HI & Kather, JN 2021, 'Development and validation of deep learning classifiers to detect Epstein-Barr virus and microsatellite instability status in gastric cancer: a retrospective multicentre cohort study', The Lancet Digital Health, vol. 3, no. 10, pp. E654-E664. https://doi.org/10.1016/S2589-7500(21)00133-3

Development and validation of deep learning classifiers to detect Epstein-Barr virus and microsatellite instability status in gastric cancer: a retrospective multicentre cohort study. / Muti, H.S.; Heij, L.R.; Keller, G. et al.
In: The Lancet Digital Health, Vol. 3, No. 10, 01.10.2021, p. E654-E664.

Research output: Contribution to journal › Article › Academic › peer-review

TY - JOUR

T1 - Development and validation of deep learning classifiers to detect Epstein-Barr virus and microsatellite instability status in gastric cancer: a retrospective multicentre cohort study

AU - Muti, H.S.

AU - Heij, L.R.

AU - Keller, G.

AU - Kohlruss, M.

AU - Langer, R.

AU - Dislich, B.

AU - Cheong, J.H.

AU - Kim, Y.W.

AU - Kim, H.

AU - Kook, M.C.

AU - Cunningham, D.

AU - Allum, W.H.

AU - Langley, R.E.

AU - Nankivell, M.G.

AU - Quirke, P.

AU - Hayden, J.D.

AU - West, N.P.

AU - Irvine, A.J.

AU - Yoshikawa, T.

AU - Oshima, T.

AU - Huss, R.

AU - Grosser, B.

AU - Roviello, F.

AU - d'Ignazio, A.

AU - Quaas, A.

AU - Alakus, H.

AU - Tan, X.X.

AU - Pearson, A.T.

AU - Luedde, T.

AU - Ebert, M.P.

AU - Jager, D.

AU - Trautwein, C.

AU - Gaisa, N.T.

AU - Grabsch, H.I.

AU - Kather, J.N.

N1 - Funding Information: HIG was supported by Cancer Research UK. We thank all investigators and contributing pathologists from the TCGA study. PQ and NPW are supported by Yorkshire Cancer Research programme grants L386 and L394. PQ is a National Institute of Health Senior investigator. CT was supported by the German Research Foundation (SFB CRC1382, SFB-TRR57). MPE was supported by the German Research Foundation (GRK2727). TL was funded by Horizon 2020 through the European Research Council Consolidator Grant PhaseControl (771083) and the German Research Foundation (SFB-CRC1382 and LU 1360/3-2). JNK is funded by the Max-Eder-Programme of the German Cancer Aid (Bonn, Germany; grant #70113864) and the START Programme of the Medical Faculty Aachen (Aachen, Germany, grant #691906). JNK and TL are funded by the German Ministry of Health (funding based on a resolution of the German Bundestag by the federal government; grant DEEP LIVER, #ZMVI1-2520DAT111). Funding Information: JNK declares consulting roles for OWKIN France and Panakeia (UK) without any direct connection to this work; these roles started in April, 2021, after conducting the present study. JNK also declares honoraria from MSD and Eisai. DC declares grants from Medimmune/AstraZeneca, Clovis, Eli Lilly, 4SC, Bayer, Celgene, Leap, and Roche, and Scientific Board Membership for OVIBIO. DJ declares consulting services and advisory board participation for CureVac AG, Definiens, F Hoffmann-La Roche, Genmab A-S, Life Science Inkubator GmbH, VAXIMM AG, OncoOne Research & Development Research GmbH, and Oncolytics Biotech; payment or honoraria from SKK Kliniken Heilbronn, Georg Thieme Verlag, Terrapinn, Touch Medical Medica, BMS GmbH & Co KG, and MSD; reimbursements for expert opinion on medical questions from Wilhelm-Sander Foundation, Else-Kröner-Fresenius Foundation, Scherer Foundation, and NordForsk; meeting support (ie, for travel) from Amgen, Oryx GmbH, Roche Glycart AG, Parexel.com , IKTZ HD GmbH, and BMS; and leadership in the BMS Foundation Immunooncology. All other authors declare no competing interests. Publisher Copyright: © 2021 The Author(s). Published by Elsevier Ltd. This is an Open Access article under the CC BY 4.0 license

PY - 2021/10/1

Y1 - 2021/10/1

N2 - Background Response to immunotherapy in gastric cancer is associated with microsatellite instability (or mismatch repair deficiency) and Epstein-Barr virus (EBV) positivity. We therefore aimed to develop and validate deep learning based classifiers to detect microsatellite instability and EBV status from routine histology slides. Methods In this retrospective, multicentre study, we collected tissue samples from ten cohorts of patients with gastric cancer from seven countries (South Korea, Switzerland, Japan, Italy, Germany, the UK and the USA). We trained a deep learning-based classifier to detect microsatellite instability and EBV positivity from digitised, haematoxylin and eosin stained resection slides without annotating tumour containing regions. The performance of the classifier was assessed by within-cohort cross-validation in all ten cohorts and by external validation, for which we split the cohorts into a five-cohort training dataset and a five-cohort test dataset. We measured the area under the receiver operating curve (AUROC) for detection of microsatellite instability and EBV status. Microsatellite instability and EBV status were determined to be detectable if the lower bound of the 95% CI for the AUROC was above 0middot5. Findings Across the ten cohorts, our analysis included 2823 patients with known microsatellite instability status and 2685 patients with known EBV status. In the within-cohort cross-validation, the deep learning-based classifier could detect microsatellite instability status in nine of ten cohorts, with AUROCs ranging from 0middot597 (95% CI 0middot522-0middot737) to 0middot836 (0middot795-0middot880) and EBV status in five of eight cohorts, with AUROCs ranging from 0middot819 (0middot752-0middot841) to 0middot897 (0middot513-0middot966). Training a classifier on the pooled training dataset and testing it on the five remaining cohorts resulted in high classification performance with AUROCs ranging from 0middot723 (95% CI 0middot676-0middot794) to 0middot863 (0middot747-0middot969) for detection of microsatellite instability and from 0middot672 (0middot403-0middot989) to 0middot859 (0middot823-0middot919) for detection of EBV status. Interpretation Classifiers became increasingly robust when trained on pooled cohorts. After prospective validation, this deep learning-based tissue classification system could be used as an inexpensive predictive biomarker for immunotherapy in gastric cancer. Funding German Cancer Aid and German Federal Ministry of Health. Copyright (c) 2021 The Author(s). Published by Elsevier Ltd. This is an Open Access article under the CC BY 4.0 license.

AB - Background Response to immunotherapy in gastric cancer is associated with microsatellite instability (or mismatch repair deficiency) and Epstein-Barr virus (EBV) positivity. We therefore aimed to develop and validate deep learning based classifiers to detect microsatellite instability and EBV status from routine histology slides. Methods In this retrospective, multicentre study, we collected tissue samples from ten cohorts of patients with gastric cancer from seven countries (South Korea, Switzerland, Japan, Italy, Germany, the UK and the USA). We trained a deep learning-based classifier to detect microsatellite instability and EBV positivity from digitised, haematoxylin and eosin stained resection slides without annotating tumour containing regions. The performance of the classifier was assessed by within-cohort cross-validation in all ten cohorts and by external validation, for which we split the cohorts into a five-cohort training dataset and a five-cohort test dataset. We measured the area under the receiver operating curve (AUROC) for detection of microsatellite instability and EBV status. Microsatellite instability and EBV status were determined to be detectable if the lower bound of the 95% CI for the AUROC was above 0middot5. Findings Across the ten cohorts, our analysis included 2823 patients with known microsatellite instability status and 2685 patients with known EBV status. In the within-cohort cross-validation, the deep learning-based classifier could detect microsatellite instability status in nine of ten cohorts, with AUROCs ranging from 0middot597 (95% CI 0middot522-0middot737) to 0middot836 (0middot795-0middot880) and EBV status in five of eight cohorts, with AUROCs ranging from 0middot819 (0middot752-0middot841) to 0middot897 (0middot513-0middot966). Training a classifier on the pooled training dataset and testing it on the five remaining cohorts resulted in high classification performance with AUROCs ranging from 0middot723 (95% CI 0middot676-0middot794) to 0middot863 (0middot747-0middot969) for detection of microsatellite instability and from 0middot672 (0middot403-0middot989) to 0middot859 (0middot823-0middot919) for detection of EBV status. Interpretation Classifiers became increasingly robust when trained on pooled cohorts. After prospective validation, this deep learning-based tissue classification system could be used as an inexpensive predictive biomarker for immunotherapy in gastric cancer. Funding German Cancer Aid and German Federal Ministry of Health. Copyright (c) 2021 The Author(s). Published by Elsevier Ltd. This is an Open Access article under the CC BY 4.0 license.

KW - MICROSATELLITE INSTABILITY

KW - GASTRIC-CANCER

U2 - 10.1016/S2589-7500(21)00133-3

DO - 10.1016/S2589-7500(21)00133-3

M3 - Article

SN - 2589-7500

VL - 3

SP - E654-E664

JO - The Lancet Digital Health

JF - The Lancet Digital Health

IS - 10

ER -

Development and validation of deep learning classifiers to detect Epstein-Barr virus and microsatellite instability status in gastric cancer: a retrospective multicentre cohort study

Abstract

Keywords

Access to Document

Research output

Correction to Lancet Digit Health 2021; published online Aug 17. https://doi.org/10.1016/S2589-7500(21)00133-3

Cite this