Vertical federated learning: a structured literature review

Research output: Contribution to journal; (Systematic) Review article; peer-review

Abstract

Federated learning (FL) has emerged as a promising distributed learning paradigm with the added advantage of data privacy. With growing interest in collaboration among data owners, FL has gained significant attention from organizations. The idea of FL is to enable collaborating participants to train machine learning (ML) models on decentralized data without breaching privacy. Put simply, federated learning is the approach of "bringing the model to the data, instead of bringing the data to the model". When applied to data that is partitioned vertically across participants, federated learning builds a complete ML model by combining local models, each trained only on the distinct features held at its local site. This architecture is referred to as vertical federated learning (VFL), and it differs from conventional FL on horizontally partitioned data. Because of this difference, VFL comes with its own issues and challenges. Motivated by this comparatively less explored side of FL, this paper provides a comprehensive overview of existing methods and developments in VFL, covering communication, learning, privacy, and applications. We conclude by identifying gaps in the current literature and proposing potential future directions for research in VFL.
Original language: English
Article number: e1452
Pages (from-to): 3205-3243
Number of pages: 39
Journal: Knowledge and Information Systems
Volume: 67
Issue number: 4
Publication status: Published - Apr 2025

Keywords

  • Vertical federated learning
  • Privacy-preserving machine learning
  • Literature review
  • PRIVACY
