Abstract

Advancements in oncology and radiology are driving more specific, and thus improved, treatment and diagnostic opportunities. This creates challenges on the assessment of management options, as more information is needed to make an informed decision. One of the methods is to use machine-and deep learning techniques to develop predictive models. Although prediction models, embedded in clinical decision support systems (CDSSs), are the foreseen solution, developing/training such prediction models requires large amounts of detailed patient information to reach decisive power. The amount of patients needed to train a reliable prediction model rapidly outgrows the numbers available in a single institution, hence the need for multicentre machine learning. To be able to learn over multiple centres, several infrastructural prerequisites need to be addressed. First, data needs to be extracted from multiple source systems and represented using standardized terminologies, preferably including the semantics (the actual description) of the represented data. For research and model training purposes, this means that value representations (e.g. “m” or “f” indicating gender) need to be converted into standardized terms (the NCI Thesaurus codes C20197 or C16576, respectively), and that patient-identifiable information (e.g. name, institutional ID, address) needs to be removed or changed in a non-identifiable way. If datasets from different institutions use the same standardized terminology and data structure, data can be merged. Finally, after merging, prediction models can be learned on the complete dataset. In this chapter, we discuss centralized versus federated learning and their associated privacy, protection and bioethics issues.
Original languageEnglish
Title of host publicationMachine and Deep Learning in Oncology, Medical Physics and Radiology
EditorsIssam El Naqa, Martin J. Murphy
PublisherSpringer International Publishing
Pages135-172
Number of pages38
Edition2
ISBN (Electronic)9783030830472
ISBN (Print)9783030830465
DOIs
Publication statusPublished - 1 Jan 2022

Keywords

  • FAIR principles
  • Federated learning
  • Machine learning
  • Privacy
  • Protection and bioethics

Fingerprint

Dive into the research topics of 'Privacy-Preserving Federated Data Analysis: Data Sharing, Protection, and Bioethics in Healthcare'. Together they form a unique fingerprint.

Cite this