Automated curation of large-scale cancer histopathology image datasets using deep learning

Lars Hilgers, Narmin Ghaffari Laleh, Nicholas P West, Alice Westwood, Katherine J Hewitt, Philip Quirke, Heike I Grabsch, Zunamys I Carrero, Emylou Matthaei, Chiara M L Loeffler, Titus J Brinker, Tanwei Yuan, Hermann Brenner, Alexander Brobeil, Michael Hoffmeister, Jakob Nikolas Kather^*

^*Corresponding author for this work

doi:10.1111/his.15159

Research output: Contribution to journal › Article › Academic › peer-review

Abstract

BACKGROUND: Artificial intelligence (AI) has numerous applications in pathology, supporting diagnosis and prognostication in cancer. However, most AI models are trained on highly selected data, typically one tissue slide per patient. In reality, especially for large surgical resection specimens, dozens of slides can be available for each patient. Manually sorting and labelling whole-slide images (WSIs) is very time-consuming, which hinders the direct application of AI to tissue samples collected from large cohorts. In this study we addressed this issue by developing a deep-learning (DL)-based method for automatic curation of large pathology datasets with several slides per patient.

METHODS: We collected multiple large multicentric datasets of colorectal cancer histopathological slides from the United Kingdom (FOXTROT, N = 21,384 slides; CR07, N = 7985 slides) and Germany (DACHS, N = 3606 slides). These datasets contained multiple types of tissue slides, including bowel resection specimens, endoscopic biopsies, lymph node resections, immunohistochemistry-stained slides, and tissue microarrays. We developed, trained, and tested a deep convolutional neural network model to predict the type of slide from the slide overview (thumbnail) image. The primary statistical endpoint was the macro-averaged area under the receiver operating characteristic curve (AUROC) for detection of the type of slide.

RESULTS: In the primary dataset (FOXTROT), the algorithm achieved an AUROC of 0.995 (95% confidence interval [CI]: 0.994-0.996) and accurately predicted the type of slide from the thumbnail image alone. In the two external test cohorts (CR07, DACHS), AUROCs of 0.982 (95% CI: 0.979-0.985) and 0.875 (95% CI: 0.864-0.887) were observed, indicating that the trained model generalizes to unseen datasets. With a confidence threshold of 0.95, the model reached an accuracy of 94.6% (7331 classified cases) in CR07 and 85.1% (2752 classified cases) in DACHS.

CONCLUSION: Our findings show that the low-resolution thumbnail image is sufficient to accurately classify the type of slide in digital pathology. This can help researchers make the vast resource of existing pathology archives accessible to modern AI models with only minimal manual annotation.
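The two metrics reported in the abstract can be illustrated with a minimal sketch. This is not the authors' code: the function names, the toy probabilities, and the three-class setup are assumptions made for illustration only. It computes a macro-averaged one-vs-rest AUROC and the accuracy restricted to slides whose top predicted probability reaches a confidence threshold (0.95 in the abstract), using only the Python standard library.

```python
from typing import List, Sequence, Tuple


def binary_auroc(scores: Sequence[float], positives: Sequence[bool]) -> float:
    """AUROC as the probability that a positive outranks a negative (ties count 0.5)."""
    pos = [s for s, is_pos in zip(scores, positives) if is_pos]
    neg = [s for s, is_pos in zip(scores, positives) if not is_pos]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative example")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def macro_auroc(probs: List[List[float]], labels: List[int], n_classes: int) -> float:
    """Unweighted mean of the one-vs-rest AUROCs over all classes."""
    per_class = [
        binary_auroc([row[c] for row in probs], [y == c for y in labels])
        for c in range(n_classes)
    ]
    return sum(per_class) / n_classes


def thresholded_accuracy(
    probs: List[List[float]], labels: List[int], threshold: float = 0.95
) -> Tuple[float, int]:
    """Return (accuracy on confident slides, number of confident slides)."""
    kept = [
        (max(range(len(row)), key=row.__getitem__), y)  # (predicted class, true class)
        for row, y in zip(probs, labels)
        if max(row) >= threshold
    ]
    if not kept:
        return float("nan"), 0
    correct = sum(pred == y for pred, y in kept)
    return correct / len(kept), len(kept)


# Hypothetical toy data: 6 thumbnails, 3 slide types (class indices are illustrative).
toy_probs = [
    [0.97, 0.02, 0.01],
    [0.10, 0.85, 0.05],
    [0.02, 0.01, 0.97],
    [0.96, 0.03, 0.01],  # confident but wrong: true class is 1
    [0.30, 0.60, 0.10],
    [0.05, 0.15, 0.80],
]
toy_labels = [0, 1, 2, 1, 1, 2]

if __name__ == "__main__":
    print("macro AUROC:", macro_auroc(toy_probs, toy_labels, n_classes=3))
    print("accuracy, n kept:", thresholded_accuracy(toy_probs, toy_labels, threshold=0.95))
```

Raising the threshold trades coverage for accuracy: fewer slides are classified automatically, and the remainder would need manual review. The abstract reports the number of classified cases alongside each accuracy figure for this reason.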

Original language	English
Journal	Histopathology
DOIs	https://doi.org/10.1111/his.15159
Publication status	E-pub ahead of print - 26 Feb 2024

Keywords

colorectal cancer
deep learning
digital pathology
quality control

Access to Document

10.1111/his.15159
Licence: CC BY

Cite this

Hilgers, L., Ghaffari Laleh, N., West, N. P., Westwood, A., Hewitt, K. J., Quirke, P., Grabsch, H. I., Carrero, Z. I., Matthaei, E., Loeffler, C. M. L., Brinker, T. J., Yuan, T., Brenner, H., Brobeil, A., Hoffmeister, M., & Kather, J. N. (2024). Automated curation of large-scale cancer histopathology image datasets using deep learning. Histopathology. Advance online publication. https://doi.org/10.1111/his.15159

@article{cd63e68bd65441b8a0549c8230f0edc2,
  title = "Automated curation of large-scale cancer histopathology image datasets using deep learning",
  abstract = "BACKGROUND: Artificial intelligence (AI) has numerous applications in pathology, supporting diagnosis and prognostication in cancer. However, most AI models are trained on highly selected data, typically one tissue slide per patient. In reality, especially for large surgical resection specimens, dozens of slides can be available for each patient. Manually sorting and labelling whole-slide images (WSIs) is a very time-consuming process, hindering the direct application of AI on the collected tissue samples from large cohorts. In this study we addressed this issue by developing a deep-learning (DL)-based method for automatic curation of large pathology datasets with several slides per patient. METHODS: We collected multiple large multicentric datasets of colorectal cancer histopathological slides from the United Kingdom (FOXTROT, N = 21,384 slides; CR07, N = 7985 slides) and Germany (DACHS, N = 3606 slides). These datasets contained multiple types of tissue slides, including bowel resection specimens, endoscopic biopsies, lymph node resections, immunohistochemistry-stained slides, and tissue microarrays. We developed, trained, and tested a deep convolutional neural network model to predict the type of slide from the slide overview (thumbnail) image. The primary statistical endpoint was the macro-averaged area under the receiver operating curve (AUROCs) for detection of the type of slide. RESULTS: In the primary dataset (FOXTROT), with an AUROC of 0.995 [95% confidence interval [CI]: 0.994-0.996] the algorithm achieved a high classification performance and was able to accurately predict the type of slide from the thumbnail image alone. In the two external test cohorts (CR07, DACHS) AUROCs of 0.982 [95% CI: 0.979-0.985] and 0.875 [95% CI: 0.864-0.887] were observed, which indicates the generalizability of the trained model on unseen datasets. With a confidence threshold of 0.95, the model reached an accuracy of 94.6% (7331 classified cases) in CR07 and 85.1% (2752 classified cases) for the DACHS cohort. CONCLUSION: Our findings show that using the low-resolution thumbnail image is sufficient to accurately classify the type of slide in digital pathology. This can support researchers to make the vast resource of existing pathology archives accessible to modern AI models with only minimal manual annotations.",
  keywords = "colorectal cancer, deep learning, digital pathology, quality control",
  author = "Lars Hilgers and {Ghaffari Laleh}, Narmin and West, {Nicholas P} and Alice Westwood and Hewitt, {Katherine J} and Philip Quirke and Grabsch, {Heike I} and Carrero, {Zunamys I} and Emylou Matthaei and Loeffler, {Chiara M L} and Brinker, {Titus J} and Tanwei Yuan and Hermann Brenner and Alexander Brobeil and Michael Hoffmeister and Kather, {Jakob Nikolas}",
  year = "2024",
  month = feb,
  day = "26",
  doi = "10.1111/his.15159",
  language = "English",
  journal = "Histopathology",
  issn = "1365-2559",
  publisher = "Wiley",
}

TY - JOUR
T1 - Automated curation of large-scale cancer histopathology image datasets using deep learning
AU - Hilgers, Lars
AU - Ghaffari Laleh, Narmin
AU - West, Nicholas P
AU - Westwood, Alice
AU - Hewitt, Katherine J
AU - Quirke, Philip
AU - Grabsch, Heike I
AU - Carrero, Zunamys I
AU - Matthaei, Emylou
AU - Loeffler, Chiara M L
AU - Brinker, Titus J
AU - Yuan, Tanwei
AU - Brenner, Hermann
AU - Brobeil, Alexander
AU - Hoffmeister, Michael
AU - Kather, Jakob Nikolas
PY - 2024/2/26
Y1 - 2024/2/26
AB - BACKGROUND: Artificial intelligence (AI) has numerous applications in pathology, supporting diagnosis and prognostication in cancer. However, most AI models are trained on highly selected data, typically one tissue slide per patient. In reality, especially for large surgical resection specimens, dozens of slides can be available for each patient. Manually sorting and labelling whole-slide images (WSIs) is a very time-consuming process, hindering the direct application of AI on the collected tissue samples from large cohorts. In this study we addressed this issue by developing a deep-learning (DL)-based method for automatic curation of large pathology datasets with several slides per patient. METHODS: We collected multiple large multicentric datasets of colorectal cancer histopathological slides from the United Kingdom (FOXTROT, N = 21,384 slides; CR07, N = 7985 slides) and Germany (DACHS, N = 3606 slides). These datasets contained multiple types of tissue slides, including bowel resection specimens, endoscopic biopsies, lymph node resections, immunohistochemistry-stained slides, and tissue microarrays. We developed, trained, and tested a deep convolutional neural network model to predict the type of slide from the slide overview (thumbnail) image. The primary statistical endpoint was the macro-averaged area under the receiver operating curve (AUROCs) for detection of the type of slide. RESULTS: In the primary dataset (FOXTROT), with an AUROC of 0.995 [95% confidence interval [CI]: 0.994-0.996] the algorithm achieved a high classification performance and was able to accurately predict the type of slide from the thumbnail image alone. In the two external test cohorts (CR07, DACHS) AUROCs of 0.982 [95% CI: 0.979-0.985] and 0.875 [95% CI: 0.864-0.887] were observed, which indicates the generalizability of the trained model on unseen datasets. With a confidence threshold of 0.95, the model reached an accuracy of 94.6% (7331 classified cases) in CR07 and 85.1% (2752 classified cases) for the DACHS cohort. CONCLUSION: Our findings show that using the low-resolution thumbnail image is sufficient to accurately classify the type of slide in digital pathology. This can support researchers to make the vast resource of existing pathology archives accessible to modern AI models with only minimal manual annotations.
KW - colorectal cancer
KW - deep learning
KW - digital pathology
KW - quality control
U2 - 10.1111/his.15159
DO - 10.1111/his.15159
M3 - Article
SN - 1365-2559
JO - Histopathology
JF - Histopathology
ER -