Generation of synthetic ground glass nodules using generative adversarial networks (GANs)

Zhixiang Wang; Zhen Zhang; Ying Feng; Lizza E L Hendriks; Razvan L Miclea; Hester Gietema; Janna Schoenmaekers; Andre Dekker; Leonard Wee; Alberto Traverso

doi:10.1186/s41747-022-00311-y

Generation of synthetic ground glass nodules using generative adversarial networks (GANs)

Zhixiang Wang, Zhen Zhang, Ying Feng, Lizza E L Hendriks, Razvan L Miclea, Hester Gietema, Janna Schoenmaekers, Andre Dekker, Leonard Wee^*, Alberto Traverso^*

^*Corresponding author for this work

Research output: Contribution to journal › Article › Academic › peer-review

Abstract

BACKGROUND: Data shortage is a common challenge in developing computer-aided diagnosis systems. We developed a generative adversarial network (GAN) model to generate synthetic lung lesions mimicking ground glass nodules (GGNs).

METHODS: We used 216 computed tomography images with 340 GGNs from the Lung Image Database Consortium and Image Database Resource Initiative database. A GAN model retrieving information from the whole image and the GGN region was built. The generated samples were evaluated with visual Turing test performed by four experienced radiologists or pulmonologists. Radiomic features were compared between real and synthetic nodules. Performances were evaluated by area under the curve (AUC) at receiver operating characteristic analysis. In addition, we trained a classification model (ResNet) to investigate whether the synthetic GGNs can improve the performances algorithm and how performances changed as a function of labelled data used in training.

RESULTS: Of 51 synthetic GGNs, 19 (37%) were classified as real by clinicians. Of 93 radiomic features, 58 (62.4%) showed no significant difference between synthetic and real GGNs (p ≥ 0.052). The discrimination performances of physicians (AUC 0.68) and radiomics (AUC 0.66) were similar, with no-significantly different (p = 0.23), but clinicians achieved a better accuracy (AUC 0.74) than radiomics (AUC 0.62) (p < 0.001). The classification model trained on datasets with synthetic data performed better than models without the addition of synthetic data.

CONCLUSIONS: GAN has promising potential for generating GGNs. Through similar AUC, clinicians achieved better ability to diagnose whether the data is synthetic than radiomics.

Original language	English
Article number	59
Number of pages	12
Journal	European Radiology Experimental
Volume	6
Issue number	1
DOIs	https://doi.org/10.1186/s41747-022-00311-y
Publication status	Published - 30 Nov 2022

Keywords

Algorithms
Databases, Factual
Tomography, X-Ray Computed

Access to Document

10.1186/s41747-022-00311-yLicence: CC BY

Cite this

@article{5da77a55045141e1ae4730f6b0d8e200,

title = "Generation of synthetic ground glass nodules using generative adversarial networks (GANs)",

abstract = "BACKGROUND: Data shortage is a common challenge in developing computer-aided diagnosis systems. We developed a generative adversarial network (GAN) model to generate synthetic lung lesions mimicking ground glass nodules (GGNs).METHODS: We used 216 computed tomography images with 340 GGNs from the Lung Image Database Consortium and Image Database Resource Initiative database. A GAN model retrieving information from the whole image and the GGN region was built. The generated samples were evaluated with visual Turing test performed by four experienced radiologists or pulmonologists. Radiomic features were compared between real and synthetic nodules. Performances were evaluated by area under the curve (AUC) at receiver operating characteristic analysis. In addition, we trained a classification model (ResNet) to investigate whether the synthetic GGNs can improve the performances algorithm and how performances changed as a function of labelled data used in training.RESULTS: Of 51 synthetic GGNs, 19 (37%) were classified as real by clinicians. Of 93 radiomic features, 58 (62.4%) showed no significant difference between synthetic and real GGNs (p ≥ 0.052). The discrimination performances of physicians (AUC 0.68) and radiomics (AUC 0.66) were similar, with no-significantly different (p = 0.23), but clinicians achieved a better accuracy (AUC 0.74) than radiomics (AUC 0.62) (p < 0.001). The classification model trained on datasets with synthetic data performed better than models without the addition of synthetic data.CONCLUSIONS: GAN has promising potential for generating GGNs. Through similar AUC, clinicians achieved better ability to diagnose whether the data is synthetic than radiomics.",

keywords = "Algorithms, Databases, Factual, Tomography, X-Ray Computed",

author = "Zhixiang Wang and Zhen Zhang and Ying Feng and Hendriks, {Lizza E L} and Miclea, {Razvan L} and Hester Gietema and Janna Schoenmaekers and Andre Dekker and Leonard Wee and Alberto Traverso",

note = "{\textcopyright} 2022. The Author(s) under exclusive licence to European Society of Radiology.",

year = "2022",

month = nov,

day = "30",

doi = "10.1186/s41747-022-00311-y",

language = "English",

volume = "6",

journal = "European Radiology Experimental",

issn = "2509-9280",

publisher = "SpringerOpen",

number = "1",

}

TY - JOUR

T1 - Generation of synthetic ground glass nodules using generative adversarial networks (GANs)

AU - Wang, Zhixiang

AU - Zhang, Zhen

AU - Feng, Ying

AU - Hendriks, Lizza E L

AU - Miclea, Razvan L

AU - Gietema, Hester

AU - Schoenmaekers, Janna

AU - Dekker, Andre

AU - Wee, Leonard

AU - Traverso, Alberto

PY - 2022/11/30

Y1 - 2022/11/30

N2 - BACKGROUND: Data shortage is a common challenge in developing computer-aided diagnosis systems. We developed a generative adversarial network (GAN) model to generate synthetic lung lesions mimicking ground glass nodules (GGNs).METHODS: We used 216 computed tomography images with 340 GGNs from the Lung Image Database Consortium and Image Database Resource Initiative database. A GAN model retrieving information from the whole image and the GGN region was built. The generated samples were evaluated with visual Turing test performed by four experienced radiologists or pulmonologists. Radiomic features were compared between real and synthetic nodules. Performances were evaluated by area under the curve (AUC) at receiver operating characteristic analysis. In addition, we trained a classification model (ResNet) to investigate whether the synthetic GGNs can improve the performances algorithm and how performances changed as a function of labelled data used in training.RESULTS: Of 51 synthetic GGNs, 19 (37%) were classified as real by clinicians. Of 93 radiomic features, 58 (62.4%) showed no significant difference between synthetic and real GGNs (p ≥ 0.052). The discrimination performances of physicians (AUC 0.68) and radiomics (AUC 0.66) were similar, with no-significantly different (p = 0.23), but clinicians achieved a better accuracy (AUC 0.74) than radiomics (AUC 0.62) (p < 0.001). The classification model trained on datasets with synthetic data performed better than models without the addition of synthetic data.CONCLUSIONS: GAN has promising potential for generating GGNs. Through similar AUC, clinicians achieved better ability to diagnose whether the data is synthetic than radiomics.

AB - BACKGROUND: Data shortage is a common challenge in developing computer-aided diagnosis systems. We developed a generative adversarial network (GAN) model to generate synthetic lung lesions mimicking ground glass nodules (GGNs).METHODS: We used 216 computed tomography images with 340 GGNs from the Lung Image Database Consortium and Image Database Resource Initiative database. A GAN model retrieving information from the whole image and the GGN region was built. The generated samples were evaluated with visual Turing test performed by four experienced radiologists or pulmonologists. Radiomic features were compared between real and synthetic nodules. Performances were evaluated by area under the curve (AUC) at receiver operating characteristic analysis. In addition, we trained a classification model (ResNet) to investigate whether the synthetic GGNs can improve the performances algorithm and how performances changed as a function of labelled data used in training.RESULTS: Of 51 synthetic GGNs, 19 (37%) were classified as real by clinicians. Of 93 radiomic features, 58 (62.4%) showed no significant difference between synthetic and real GGNs (p ≥ 0.052). The discrimination performances of physicians (AUC 0.68) and radiomics (AUC 0.66) were similar, with no-significantly different (p = 0.23), but clinicians achieved a better accuracy (AUC 0.74) than radiomics (AUC 0.62) (p < 0.001). The classification model trained on datasets with synthetic data performed better than models without the addition of synthetic data.CONCLUSIONS: GAN has promising potential for generating GGNs. Through similar AUC, clinicians achieved better ability to diagnose whether the data is synthetic than radiomics.

KW - Algorithms

KW - Databases, Factual

KW - Tomography, X-Ray Computed

U2 - 10.1186/s41747-022-00311-y

DO - 10.1186/s41747-022-00311-y

M3 - Article

C2 - 36447082

SN - 2509-9280

VL - 6

JO - European Radiology Experimental

JF - European Radiology Experimental

IS - 1

M1 - 59

ER -