What Does DALL-E 2 Know About Radiology?

Lisa C. Adams; Felix Busch; Daniel Truhn; Marcus R. Makowski; Hugo J. W. L. Aerts; Keno K. Bressem

doi:10.2196/43110

What Does DALL-E 2 Know About Radiology?

Lisa C. Adams, Felix Busch, Daniel Truhn, Marcus R. Makowski, Hugo J. W. L. Aerts, Keno K. Bressem^*

^*Corresponding author for this work

Research output: Contribution to journal › Article › Academic › peer-review

Abstract

Generative models, such as DALL-E 2 (OpenAI), could represent promising future tools for image generation, augmentation, and manipulation for artificial intelligence research in radiology, provided that these models have sufficient medical domain knowledge. Herein, we show that DALL-E 2 has learned relevant representations of x-ray images, with promising capabilities in terms of zero-shot text-to-image generation of new images, the continuation of an image beyond its original boundaries, and the removal of elements; however, its capabilities for the generation of images with pathological abnormalities (eg, tumors, fractures, and inflammation) or computed tomography, magnetic resonance imaging, or ultrasound images are still limited. The use of generative models for augmenting and generating radiological data thus seems feasible, even if the further fine-tuning and adaptation of these models to their respective domains are required first.

Original language	English
Article number	e43110
Number of pages	8
Journal	Journal of Medical Internet Research
Volume	25
Issue number	1
DOIs	https://doi.org/10.2196/43110
Publication status	Published - 16 Mar 2023

Keywords

DALL-E
creating images from text
image creation
image generation
transformer language model
machine learning
generative model
radiology
x-ray
artificial intelligence
medical imaging
text-to-image
diagnostic imaging

Access to Document

10.2196/43110Licence: CC BY

Cite this

@article{f4e3d721ba804370b0c68852e6a034a7,

title = "What Does DALL-E 2 Know About Radiology?",

abstract = "Generative models, such as DALL-E 2 (OpenAI), could represent promising future tools for image generation, augmentation, and manipulation for artificial intelligence research in radiology, provided that these models have sufficient medical domain knowledge. Herein, we show that DALL-E 2 has learned relevant representations of x-ray images, with promising capabilities in terms of zero-shot text-to-image generation of new images, the continuation of an image beyond its original boundaries, and the removal of elements; however, its capabilities for the generation of images with pathological abnormalities (eg, tumors, fractures, and inflammation) or computed tomography, magnetic resonance imaging, or ultrasound images are still limited. The use of generative models for augmenting and generating radiological data thus seems feasible, even if the further fine-tuning and adaptation of these models to their respective domains are required first.",

keywords = "DALL-E, creating images from text, image creation, image generation, transformer language model, machine learning, generative model, radiology, x-ray, artificial intelligence, medical imaging, text-to-image, diagnostic imaging",

author = "Adams, {Lisa C.} and Felix Busch and Daniel Truhn and Makowski, {Marcus R.} and Aerts, {Hugo J. W. L.} and Bressem, {Keno K.}",

year = "2023",

month = mar,

day = "16",

doi = "10.2196/43110",

language = "English",

volume = "25",

journal = "Journal of Medical Internet Research",

issn = "1439-4456",