Preprocessing and analysis of volatilome data

Georgios Stavropoulos, Dahlia Salman, Yaser Alkhalifah, Frederik Jan van Schooten, Agnieszka Smolinska

Research output: Chapter in Book/Report/Conference proceeding › Chapter › Academic

Abstract

Biomarker discovery, i.e., finding disease- or condition-specific biological markers, is a crucial aspect of biomedical research. Volatile organic compounds (VOCs) are excreted by various biofluids, cells, tissues, and bacteria, and have been investigated extensively for their potential as markers of malfunctioning status in humans. The VOCs excreted by these media, typically detected using sophisticated analytical instrumentation, are numerous and biologically complex. Therefore, data preprocessing and analysis are crucial for the successful identification of valid VOC markers and their application in clinical practice. This chapter provides an overview of various preprocessing approaches suitable for volatilome data of diverse nature. The importance of normalization and scaling, often neglected in the field, is discussed. The most common and promising machine learning techniques, both unsupervised and supervised, are presented and discussed, followed by data fusion, a strategy rarely used in the volatilomics field. The chapter aims to equip the reader with a basic overview of suitable techniques for treating and successfully exploiting volatilome data.
Original language: English
Title of host publication: Breathborne Biomarkers and the Human Volatilome
Editors: Jonathan Beauchamp, Cristina Davis, Joachim Pleil
Publisher: Elsevier
Chapter: 38
Pages: 633-647
Number of pages: 15
ISBN (Electronic): 9780128199671
ISBN (Print): 9780128223970
Publication status: Published - 1 Jan 2020

Keywords

  • Data fusion
  • Machine learning
  • Multivariate
  • Supervised
  • Unsupervised
  • Volatile organic compounds (VOCs)
