PRELIMINARY DATA ANALYSIS IN HEALTHCARE MULTICENTRIC DATA MINING: A PRIVACY-PRESERVING DISTRIBUTED APPROACH

Andrea Damiani*, Carlotta Masciocchi, Luca Boldrini, Roberto Gatta, Nicola Dinapoli, Jacopo Lenkowicz, Giuditta Chiloiro, Maria Antonietta Gambacorta, Luca Tagliaferri, Rosa Autorino, Monica Maria Pagliara, Maria Antonietta Blasi, Johan van Soest, Andre Dekker, Vincenzo Valentini

*Corresponding author for this work

Research output: Contribution to journalArticleAcademicpeer-review

Abstract

The new era of cognitive health care systems offers a large amount of patient data that can be used to develop prediction models and clinical decision support systems. In this frame, the multi-institutional approach is strongly encouraged in order to reach more numerous samples for data mining and more reliable statistics. For these purposes, shared ontologies need to be developed for data management to ensure database semantic coherence in accordance with the various centers' ethical and legal policies. Therefore, we propose a privacy-preserving distributed approach as a preliminary data analysis tool to identify possible compliance issues and heterogeneity from the agreed multi-institutional research protocol before training a clinical prediction model. This kind of preliminary analysis appeared fast and reliable and its results corresponded to those obtained using the traditional centralized approach. A real time interactive dashboard has also been presented to show analysis results and make the workflow swifter and easier.
Original languageEnglish
Pages (from-to)71-81
Number of pages11
JournalJournal of E-Learning and Knowledge Society
Volume14
Issue number1
DOIs
Publication statusPublished - 1 Jan 2018

Keywords

  • distributed learning
  • distributed preliminary analysis
  • privacy-preserving
  • healthcare
  • data mining
  • DECISION-SUPPORT-SYSTEMS
  • ONCOLOGY

Cite this