TY - JOUR
T1 - Enrichment of lung cancer computed tomography collections with AI-derived annotations
AU - Krishnaswamy, Deepa
AU - Bontempi, Dennis
AU - Thiriveedhi, Vamsi Krishna
AU - Punzo, Davide
AU - Clunie, David
AU - Bridge, Christopher P.
AU - Aerts, Hugo J.W.L.
AU - Kikinis, Ron
AU - Fedorov, Andrey
N1 - Funding Information:
This work has been funded in whole or in part with Federal funds from the National Cancer Institute, National Institutes of Health, under Task Order No. HHSN26110071 under Contract No. HHSN261201500003l. This work has additionally been funded in part from the T32 Image Guidance, Precision Diagnosis and Therapy Research Fellowship T32EB025823-04.
Publisher Copyright:
© 2024, The Author(s).
PY - 2024/1/4
Y1 - 2024/1/4
N2 - Public imaging datasets are critical for the development and evaluation of automated tools in cancer imaging. Unfortunately, many do not include annotations or image-derived features, complicating downstream analysis. Artificial intelligence-based annotation tools have been shown to achieve acceptable performance and can be used to automatically annotate large datasets. As part of the effort to enrich public data available within NCI Imaging Data Commons (IDC), here we introduce AI-generated annotations for two collections containing computed tomography images of the chest, NSCLC-Radiomics, and a subset of the National Lung Screening Trial. Using publicly available AI algorithms, we derived volumetric annotations of thoracic organs-at-risk, their corresponding radiomics features, and slice-level annotations of anatomical landmarks and regions. The resulting annotations are publicly available within IDC, where the DICOM format is used to harmonize the data and achieve FAIR (Findable, Accessible, Interoperable, Reusable) data principles. The annotations are accompanied by cloud-enabled notebooks demonstrating their use. This study reinforces the need for large, publicly accessible curated datasets and demonstrates how AI can aid in cancer imaging.
AB - Public imaging datasets are critical for the development and evaluation of automated tools in cancer imaging. Unfortunately, many do not include annotations or image-derived features, complicating downstream analysis. Artificial intelligence-based annotation tools have been shown to achieve acceptable performance and can be used to automatically annotate large datasets. As part of the effort to enrich public data available within NCI Imaging Data Commons (IDC), here we introduce AI-generated annotations for two collections containing computed tomography images of the chest, NSCLC-Radiomics, and a subset of the National Lung Screening Trial. Using publicly available AI algorithms, we derived volumetric annotations of thoracic organs-at-risk, their corresponding radiomics features, and slice-level annotations of anatomical landmarks and regions. The resulting annotations are publicly available within IDC, where the DICOM format is used to harmonize the data and achieve FAIR (Findable, Accessible, Interoperable, Reusable) data principles. The annotations are accompanied by cloud-enabled notebooks demonstrating their use. This study reinforces the need for large, publicly accessible curated datasets and demonstrates how AI can aid in cancer imaging.
U2 - 10.1038/s41597-023-02864-y
DO - 10.1038/s41597-023-02864-y
M3 - Article
SN - 2052-4463
VL - 11
JO - Scientific data
JF - Scientific data
IS - 1
M1 - 25
ER -