Privacy-Preserving Dashboard for F.A.I.R Head and Neck Cancer data supporting multi-centered collaborations

Varsha Gouthamchand^*, Ananya Choudhury, Frank Hoebers, Frederik Wesseling, Mattea Welch, Sejin Kim, Benjamin Haibe-Kains, Joanna Kazmierska, Andre Dekker, Johan van Soest, Leonard Wee

^*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding › Conference article in proceeding › Academic › peer-review

Abstract

Research in modern healthcare requires large volumes of data from healthcare centers across the globe, yet it is not always feasible to centralize clinical data without compromising privacy. A tool that addresses these issues and facilitates the reuse of clinical data is urgently needed. The Federated Learning approach, governed by a set of agreements such as the Personal Health Train (PHT), tackles these concerns by distributing models to the data centers instead of centralizing the datasets. One prerequisite of PHT is that datasets are semantically interoperable, so that the models can find them. The FAIR (Findable, Accessible, Interoperable, Reusable) principles help in building interoperable and reusable data by adding knowledge representation and providing descriptive metadata. However, making data FAIR is not always easy or straightforward. Our main objective is to simplify this process by using domain and technical expertise, and to prepare data for federated learning. This paper introduces applications, easily deployable as Docker containers, that automate parts of this process and significantly simplify the task of creating FAIR clinical data. Our method removes the need for clinical researchers to have a high degree of technical skill. We demonstrate the FAIR-ification process by applying it to five Head and Neck cancer datasets (four public and one private). We explore the PHT paradigm by building a distributed visualization dashboard from the aggregated summaries of the FAIR-ified datasets. Because the PHT infrastructure exchanges only statistical summaries or model coefficients, researchers can explore data from multiple centers without breaching privacy.
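
The idea of exchanging only statistical summaries can be illustrated with a minimal Python sketch. It is not the authors' implementation: the names `SiteSummary`, `summarise_locally`, and `combine`, the record fields `age` and `stage`, and the sample records are all hypothetical, chosen only to show how per-centre aggregates might be pooled without patient-level rows leaving a centre.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass
class SiteSummary:
    """Aggregate statistics that one centre shares (hypothetical structure)."""
    n: int                 # number of patients at the centre
    age_sum: float         # sum of ages, so a pooled mean can be derived
    stage_counts: Counter  # number of patients per tumour stage


def summarise_locally(records):
    """Runs inside a centre: reduces patient-level rows to aggregates."""
    return SiteSummary(
        n=len(records),
        age_sum=sum(r["age"] for r in records),
        stage_counts=Counter(r["stage"] for r in records),
    )


def combine(summaries):
    """Runs at the coordinator: it only ever sees SiteSummary objects."""
    total_n = sum(s.n for s in summaries)
    pooled_counts = Counter()
    for s in summaries:
        pooled_counts.update(s.stage_counts)
    mean_age = sum(s.age_sum for s in summaries) / total_n if total_n else None
    return {
        "n": total_n,
        "mean_age": mean_age,
        "stage_counts": dict(pooled_counts),
    }


# Made-up records, for illustration only.
site_a = [{"age": 60, "stage": "II"}, {"age": 70, "stage": "III"}]
site_b = [{"age": 50, "stage": "II"}]

pooled = combine([summarise_locally(site_a), summarise_locally(site_b)])
print(pooled)
```

A dashboard could plot `pooled["stage_counts"]` directly. Note that aggregates over very small groups can still reveal information about individuals, so a real deployment would likely need additional safeguards such as minimum cell sizes; that is outside the scope of this sketch.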

Original language	English
Title of host publication	Semantic Web Applications and Tools for Health Care and Life Sciences
Editors	Atsuko Yamaguchi, Andrea Splendiani, M. Scott Marshall, Chris Baker, Jerven Bolleman
Place of Publication	Basel
Pages	11-21
Number of pages	11
Volume	3415
Edition	1
Publication status	Published - 1 Jan 2023
Event	14th International Conference on Semantic Web Applications and Tools for Health Care and Life Sciences, Basel, Switzerland
Duration	13 Feb 2023 → 16 Feb 2023
Conference number	14

Publication series

Series	CEUR Workshop Proceedings
ISSN	1613-0073

Conference

Conference	14th International Conference on Semantic Web Applications and Tools for Health Care and Life Sciences
Abbreviated title	SWAT4HCLS 2023
Country/Territory	Switzerland
City	Basel
Period	13/02/23 → 16/02/23

Keywords

FAIR
Federated Learning
Knowledge graphs
Linked Data
Ontologies
RDF
Semantic Web
SPARQL

Access to Document

Full text: final published version, 1.3 MB. Licence: Taverne

https://ceur-ws.org/Vol-3415/paper-2.pdf

Cite this

Gouthamchand, V., Choudhury, A., Hoebers, F., Wesseling, F., Welch, M., Kim, S., Haibe-Kains, B., Kazmierska, J., Dekker, A., van Soest, J., & Wee, L. (2023). Privacy-Preserving Dashboard for F.A.I.R Head and Neck Cancer data supporting multi-centered collaborations. In A. Yamaguchi, A. Splendiani, M. S. Marshall, C. Baker, & J. Bolleman (Eds.), Semantic Web Applications and Tools for Health Care and Life Sciences (1 ed., Vol. 3415, pp. 11-21). https://ceur-ws.org/Vol-3415/paper-2.pdf

Gouthamchand, Varsha ; Choudhury, Ananya ; Hoebers, Frank et al. / Privacy-Preserving Dashboard for F.A.I.R Head and Neck Cancer data supporting multi-centered collaborations. Semantic Web Applications and Tools for Health Care and Life Sciences. editor / Atsuko Yamaguchi ; Andrea Splendiani ; M. Scott Marshall ; Chris Baker ; Jerven Bolleman. Vol. 3415 1. ed. Basel, 2023. pp. 11-21 (CEUR Workshop Proceedings).

@inproceedings{0017927962494a7cbc6ce52fc4e7f732,

title = "Privacy-Preserving Dashboard for F.A.I.R Head and Neck Cancer data supporting multi-centered collaborations",

abstract = "Research in modern healthcare requires vast volumes of data from various healthcare centers across the globe. It is not always feasible to centralize clinical data without compromising privacy. A tool addressing these issues and facilitating reuse of clinical data is the need of the hour. The Federated Learning approach, governed in a set of agreements such as the Personal Health Train (PHT) manages to tackle these concerns by distributing models to the data centers instead of the traditional approach of centralizing datasets. One of the prerequisites of PHT is using semantically interoperable datasets for the models to be able to find them. FAIR (Findable, Accessible, Interoperable, Reusable) principles help in building interoperable and reusable data by adding knowledge representation and providing descriptive metadata. However, the process of making data FAIR is not always easy and straight-forward. Our main objective is to disentangle this process by using domain and technical expertise and get data prepared for federated learning. This paper introduces applications that are easily deployable as Docker containers, which will automate parts of the aforementioned process and significantly simplify the task of creating FAIR clinical data. Our method bypasses the need for clinical researchers to have a high degree of technical skills. We demonstrate the FAIR-ification process by applying it to five Head and Neck cancer datasets (four public and one private). The PHT paradigm is explored by building a distributed visualization dashboard from the aggregated summaries of the FAIR-ified datasets. Using the PHT infrastructure for exchanging only statistical summaries or model coefficients allows researchers to explore data from multiple centers without breaching privacy.",

keywords = "FAIR, Federated Learning, Knowledge graphs, Linked Data, Ontologies, RDF, Semantic Web, SPARQL",

author = "Varsha Gouthamchand and Ananya Choudhury and Frank Hoebers and Frederik Wesseling and Mattea Welch and Sejin Kim and Benjamin Haibe-Kains and Joanna Kazmierska and Andre Dekker and {van Soest}, Johan and Leonard Wee",

note = "Publisher Copyright: {\textcopyright} 2023 Copyright for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0).; 14th International Conference on Semantic Web Applications and Tools for Health Care and Life Sciences, SWAT4HCLS 2023 ; Conference date: 13-02-2023 Through 16-02-2023",

year = "2023",

month = jan,

day = "1",

language = "English",

volume = "3415",

series = "CEUR Workshop Proceedings",

publisher = "Rheinisch-Westfaelische Technische Hochschule Aachen * Lehrstuhl Informatik V",

pages = "11--21",

editor = "Yamaguchi, Atsuko and Splendiani, Andrea and Marshall, {M. Scott} and Chris Baker and Jerven Bolleman",

booktitle = "Semantic Web Applications and Tools for Health Care and Life Sciences",

edition = "1",

}

Gouthamchand, V, Choudhury, A, Hoebers, F, Wesseling, F, Welch, M, Kim, S, Haibe-Kains, B, Kazmierska, J, Dekker, A, van Soest, J & Wee, L 2023, Privacy-Preserving Dashboard for F.A.I.R Head and Neck Cancer data supporting multi-centered collaborations. in A Yamaguchi, A Splendiani, MS Marshall, C Baker & J Bolleman (eds), Semantic Web Applications and Tools for Health Care and Life Sciences. 1 edn, vol. 3415, Basel, CEUR Workshop Proceedings, pp. 11-21, 14th International Conference on Semantic Web Applications and Tools for Health Care and Life Sciences, Basel, Switzerland, 13/02/23. <https://ceur-ws.org/Vol-3415/paper-2.pdf>

Privacy-Preserving Dashboard for F.A.I.R Head and Neck Cancer data supporting multi-centered collaborations. / Gouthamchand, Varsha ; Choudhury, Ananya ; Hoebers, Frank et al.
Semantic Web Applications and Tools for Health Care and Life Sciences. ed. / Atsuko Yamaguchi; Andrea Splendiani; M. Scott Marshall; Chris Baker; Jerven Bolleman. Vol. 3415 1. ed. Basel, 2023. p. 11-21 (CEUR Workshop Proceedings).

TY - GEN

T1 - Privacy-Preserving Dashboard for F.A.I.R Head and Neck Cancer data supporting multi-centered collaborations

AU - Gouthamchand, Varsha

AU - Choudhury, Ananya

AU - Hoebers, Frank

AU - Wesseling, Frederik

AU - Welch, Mattea

AU - Kim, Sejin

AU - Haibe-Kains, Benjamin

AU - Kazmierska, Joanna

AU - Dekker, Andre

AU - van Soest, Johan

AU - Wee, Leonard

N1 - Conference code: 14

PY - 2023/1/1

Y1 - 2023/1/1

AB - Research in modern healthcare requires vast volumes of data from various healthcare centers across the globe. It is not always feasible to centralize clinical data without compromising privacy. A tool addressing these issues and facilitating reuse of clinical data is the need of the hour. The Federated Learning approach, governed in a set of agreements such as the Personal Health Train (PHT) manages to tackle these concerns by distributing models to the data centers instead of the traditional approach of centralizing datasets. One of the prerequisites of PHT is using semantically interoperable datasets for the models to be able to find them. FAIR (Findable, Accessible, Interoperable, Reusable) principles help in building interoperable and reusable data by adding knowledge representation and providing descriptive metadata. However, the process of making data FAIR is not always easy and straight-forward. Our main objective is to disentangle this process by using domain and technical expertise and get data prepared for federated learning. This paper introduces applications that are easily deployable as Docker containers, which will automate parts of the aforementioned process and significantly simplify the task of creating FAIR clinical data. Our method bypasses the need for clinical researchers to have a high degree of technical skills. We demonstrate the FAIR-ification process by applying it to five Head and Neck cancer datasets (four public and one private). The PHT paradigm is explored by building a distributed visualization dashboard from the aggregated summaries of the FAIR-ified datasets. Using the PHT infrastructure for exchanging only statistical summaries or model coefficients allows researchers to explore data from multiple centers without breaching privacy.

KW - FAIR

KW - Federated Learning

KW - Knowledge graphs

KW - Linked Data

KW - Ontologies

KW - RDF

KW - Semantic Web

KW - SPARQL

M3 - Conference article in proceeding

VL - 3415

T3 - CEUR Workshop Proceedings

SP - 11

EP - 21

BT - Semantic Web Applications and Tools for Health Care and Life Sciences

A2 - Yamaguchi, Atsuko

A2 - Splendiani, Andrea

A2 - Marshall, M. Scott

A2 - Baker, Chris

A2 - Bolleman, Jerven

CY - Basel

T2 - 14th International Conference on Semantic Web Applications and Tools for Health Care and Life Sciences

Y2 - 13 February 2023 through 16 February 2023

ER -

Gouthamchand V, Choudhury A, Hoebers F, Wesseling F, Welch M, Kim S et al. Privacy-Preserving Dashboard for F.A.I.R Head and Neck Cancer data supporting multi-centered collaborations. In Yamaguchi A, Splendiani A, Marshall MS, Baker C, Bolleman J, editors, Semantic Web Applications and Tools for Health Care and Life Sciences. 1 ed. Vol. 3415. Basel. 2023. p. 11-21. (CEUR Workshop Proceedings).