Privacy-Preserving Dashboard for F.A.I.R Head and Neck Cancer data supporting multi-centered collaborations

Varsha Gouthamchand*, Ananya Choudhury, Frank Hoebers, Frederik Wesseling, Mattea Welch, Sejin Kim, Benjamin Haibe-Kains, Joanna Kazmierska, Andre Dekker, Johan van Soest, Leonard Wee

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference article in proceedingAcademicpeer-review

23 Downloads (Pure)

Abstract

Research in modern healthcare requires vast volumes of data from various healthcare centers across the globe. It is not always feasible to centralize clinical data without compromising privacy. A tool addressing these issues and facilitating reuse of clinical data is the need of the hour. The Federated Learning approach, governed in a set of agreements such as the Personal Health Train (PHT) manages to tackle these concerns by distributing models to the data centers instead of the traditional approach of centralizing datasets. One of the prerequisites of PHT is using semantically interoperable datasets for the models to be able to find them. FAIR (Findable, Accessible, Interoperable, Reusable) principles help in building interoperable and reusable data by adding knowledge representation and providing descriptive metadata. However, the process of making data FAIR is not always easy and straight-forward. Our main objective is to disentangle this process by using domain and technical expertise and get data prepared for federated learning. This paper introduces applications that are easily deployable as Docker containers, which will automate parts of the aforementioned process and significantly simplify the task of creating FAIR clinical data. Our method bypasses the need for clinical researchers to have a high degree of technical skills. We demonstrate the FAIR-ification process by applying it to five Head and Neck cancer datasets (four public and one private). The PHT paradigm is explored by building a distributed visualization dashboard from the aggregated summaries of the FAIR-ified datasets. Using the PHT infrastructure for exchanging only statistical summaries or model coefficients allows researchers to explore data from multiple centers without breaching privacy.
Original languageEnglish
Title of host publicationSemantic Web Applications and Tools for Health Care and Life Sciences
EditorsAtsuko Yamaguchi , Andrea Splendiani, M. Scott Marshall, Chris Baker, Jerven Bolleman
Place of PublicationBasel
Pages11-21
Number of pages11
Volume3415
Edition1
Publication statusPublished - 1 Jan 2023
Event14th International Conference on Semantic Web Applications and Tools for Health Care and Life Sciences - Basel, Switzerland
Duration: 13 Feb 202316 Feb 2023
Conference number: 14

Publication series

SeriesCEUR Workshop Proceedings
ISSN1613-0073

Conference

Conference14th International Conference on Semantic Web Applications and Tools for Health Care and Life Sciences
Abbreviated titleSWAT4HCLS 2023
Country/TerritorySwitzerland
CityBasel
Period13/02/2316/02/23

Keywords

  • FAIR
  • Federated Learning
  • Knowledge graphs
  • Linked Data
  • Ontologies
  • RDF
  • Semantic Web
  • SPARQL

Cite this