Abstract
Advancements in oncology and radiology are driving more specific, and thus improved, treatment and diagnostic opportunities. This creates challenges on the assessment of management options, as more information is needed to make an informed decision. One of the methods is to use machine-and deep learning techniques to develop predictive models. Although prediction models, embedded in clinical decision support systems (CDSSs), are the foreseen solution, developing/training such prediction models requires large amounts of detailed patient information to reach decisive power. The amount of patients needed to train a reliable prediction model rapidly outgrows the numbers available in a single institution, hence the need for multicentre machine learning. To be able to learn over multiple centres, several infrastructural prerequisites need to be addressed. First, data needs to be extracted from multiple source systems and represented using standardized terminologies, preferably including the semantics (the actual description) of the represented data. For research and model training purposes, this means that value representations (e.g. “m” or “f” indicating gender) need to be converted into standardized terms (the NCI Thesaurus codes C20197 or C16576, respectively), and that patient-identifiable information (e.g. name, institutional ID, address) needs to be removed or changed in a non-identifiable way. If datasets from different institutions use the same standardized terminology and data structure, data can be merged. Finally, after merging, prediction models can be learned on the complete dataset. In this chapter, we discuss centralized versus federated learning and their associated privacy, protection and bioethics issues.
Original language | English |
---|---|
Title of host publication | Machine and Deep Learning in Oncology, Medical Physics and Radiology |
Editors | Issam El Naqa, Martin J. Murphy |
Publisher | Springer International Publishing |
Pages | 135-172 |
Number of pages | 38 |
Edition | 2 |
ISBN (Electronic) | 9783030830472 |
ISBN (Print) | 9783030830465 |
DOIs | |
Publication status | Published - 1 Jan 2022 |
Keywords
- FAIR principles
- Federated learning
- Machine learning
- Privacy
- Protection and bioethics