National Cancer Institute Imaging Data Commons: Toward Transparency, Reproducibility, and Scalability in Imaging Artificial Intelligence

Andrey Fedorov*, William J R Longabaugh, David Pot, David A Clunie, Steven D Pieper, David L Gibbs, Christopher Bridge, Markus D Herrmann, André Homeyer, Rob Lewis, Hugo J W L Aerts, Deepa Krishnaswamy, Vamsi Krishna Thiriveedhi, Cosmin Ciausu, Daniela P Schacherer, Dennis Bontempi, Todd Pihl, Ulrike Wagner, Keyvan Farahani, Erika KimRon Kikinis

*Corresponding author for this work

Research output: Contribution to journalArticleAcademicpeer-review

Abstract

The remarkable advances of artificial intelligence (AI) technology are revolutionizing established approaches to the acquisition, interpretation, and analysis of biomedical imaging data. Development, validation, and continuous refinement of AI tools requires easy access to large high-quality annotated datasets, which are both representative and diverse. The National Cancer Institute (NCI) Imaging Data Commons (IDC) hosts large and diverse publicly available cancer image data collections. By harmonizing all data based on industry standards and colocalizing it with analysis and exploration resources, the IDC aims to facilitate the development, validation, and clinical translation of AI tools and address the well-documented challenges of establishing reproducible and transparent AI processing pipelines. Balanced use of established commercial products with open-source solutions, interconnected by standard interfaces, provides value and performance, while preserving sufficient agility to address the evolving needs of the research community. Emphasis on the development of tools, use cases to demonstrate the utility of uniform data representation, and cloud-based analysis aim to ease adoption and help define best practices. Integration with other data in the broader NCI Cancer Research Data Commons infrastructure opens opportunities for multiomics studies incorporating imaging data to further empower the research community to accelerate breakthroughs in cancer detection, diagnosis, and treatment. Published under a CC BY 4.0 license.
Original languageEnglish
Article numbere230180
Number of pages14
JournalRadiographics
Volume43
Issue number12
DOIs
Publication statusPublished - 1 Dec 2023

Keywords

  • United States
  • Humans
  • Artificial Intelligence
  • National Cancer Institute (U.S.)
  • Reproducibility of Results
  • Diagnostic Imaging
  • Multiomics
  • Neoplasms/diagnostic imaging

Cite this