Toward the integration of Omics data in epidemiological studies: still a "long and winding road"

Evangelina Lopez de Maturana; Silvia Pineda; Angela Brand; Kristel Van Steen; Nuria Malats

doi:10.1002/gepi.21992

Toward the integration of Omics data in epidemiological studies: still a "long and winding road"

Evangelina Lopez de Maturana, Silvia Pineda, Angela Brand, Kristel Van Steen, Nuria Malats^*

^*Corresponding author for this work

Research output: Contribution to journal › Article › Academic › peer-review

Abstract

Primary and secondary prevention can highly benefit a personalized medicine approach through the accurate discrimination of individuals at high risk of developing a specific disease from those at moderate and low risk. To this end precise risk prediction models need to be built. This endeavor requires a precise characterization of the individual exposome, genome, and phenome. Massive molecular omics data representing the different layers of the biological processes of the host and the nonhost will enable to build more accurate risk prediction models. Epidemiologists aim to integrate omics data along with important information coming from other sources (questionnaires, candidate markers) that has been proved to be relevant in the discrimination risk assessment of complex diseases. However, the integrative models in large-scale epidemiologic research are still in their infancy and they face numerous challenges, some of them at the analytical stage. So far, there are a small number of studies that have integrated more than two omics data sets, and the inclusion of non-omics data in the same models is still missing in most of studies. In this contribution, we aim at approaching the omics and non-omics data integration from the epidemiology scope by considering the massive inclusion of variables in the risk assessment and predictive models. We also provide already available examples of integrative contributions in the field, propose analytical strategies that allow considering both omics and non-omics data in the models, and finally review the challenges imbedding this type of research.

Original language	English
Pages (from-to)	558-569
Journal	Genetic Epidemiology
Volume	40
Issue number	7
DOIs	https://doi.org/10.1002/gepi.21992
Publication status	Published - Nov 2016

Keywords

challenges
epidemiology
exposure
genetic susceptibility
integration
outcome
omics data
statistical methods

Access to Document

10.1002/gepi.21992

Cite this

@article{c65405d4a2864252841e4c94d263a10d,

title = "Toward the integration of Omics data in epidemiological studies: still a {"}long and winding road{"}",

abstract = "Primary and secondary prevention can highly benefit a personalized medicine approach through the accurate discrimination of individuals at high risk of developing a specific disease from those at moderate and low risk. To this end precise risk prediction models need to be built. This endeavor requires a precise characterization of the individual exposome, genome, and phenome. Massive molecular omics data representing the different layers of the biological processes of the host and the nonhost will enable to build more accurate risk prediction models. Epidemiologists aim to integrate omics data along with important information coming from other sources (questionnaires, candidate markers) that has been proved to be relevant in the discrimination risk assessment of complex diseases. However, the integrative models in large-scale epidemiologic research are still in their infancy and they face numerous challenges, some of them at the analytical stage. So far, there are a small number of studies that have integrated more than two omics data sets, and the inclusion of non-omics data in the same models is still missing in most of studies. In this contribution, we aim at approaching the omics and non-omics data integration from the epidemiology scope by considering the massive inclusion of variables in the risk assessment and predictive models. We also provide already available examples of integrative contributions in the field, propose analytical strategies that allow considering both omics and non-omics data in the models, and finally review the challenges imbedding this type of research.",

keywords = "challenges, epidemiology, exposure, genetic susceptibility, integration, outcome, omics data, statistical methods",

author = "{Lopez de Maturana}, Evangelina and Silvia Pineda and Angela Brand and {Van Steen}, Kristel and Nuria Malats",

year = "2016",

month = nov,

doi = "10.1002/gepi.21992",

language = "English",

volume = "40",

pages = "558--569",

journal = "Genetic Epidemiology",

issn = "0741-0395",

publisher = "Wiley-Blackwell",

number = "7",

}

TY - JOUR

T1 - Toward the integration of Omics data in epidemiological studies: still a "long and winding road"

AU - Lopez de Maturana, Evangelina

AU - Pineda, Silvia

AU - Brand, Angela

AU - Van Steen, Kristel

AU - Malats, Nuria

PY - 2016/11

Y1 - 2016/11

N2 - Primary and secondary prevention can highly benefit a personalized medicine approach through the accurate discrimination of individuals at high risk of developing a specific disease from those at moderate and low risk. To this end precise risk prediction models need to be built. This endeavor requires a precise characterization of the individual exposome, genome, and phenome. Massive molecular omics data representing the different layers of the biological processes of the host and the nonhost will enable to build more accurate risk prediction models. Epidemiologists aim to integrate omics data along with important information coming from other sources (questionnaires, candidate markers) that has been proved to be relevant in the discrimination risk assessment of complex diseases. However, the integrative models in large-scale epidemiologic research are still in their infancy and they face numerous challenges, some of them at the analytical stage. So far, there are a small number of studies that have integrated more than two omics data sets, and the inclusion of non-omics data in the same models is still missing in most of studies. In this contribution, we aim at approaching the omics and non-omics data integration from the epidemiology scope by considering the massive inclusion of variables in the risk assessment and predictive models. We also provide already available examples of integrative contributions in the field, propose analytical strategies that allow considering both omics and non-omics data in the models, and finally review the challenges imbedding this type of research.

AB - Primary and secondary prevention can highly benefit a personalized medicine approach through the accurate discrimination of individuals at high risk of developing a specific disease from those at moderate and low risk. To this end precise risk prediction models need to be built. This endeavor requires a precise characterization of the individual exposome, genome, and phenome. Massive molecular omics data representing the different layers of the biological processes of the host and the nonhost will enable to build more accurate risk prediction models. Epidemiologists aim to integrate omics data along with important information coming from other sources (questionnaires, candidate markers) that has been proved to be relevant in the discrimination risk assessment of complex diseases. However, the integrative models in large-scale epidemiologic research are still in their infancy and they face numerous challenges, some of them at the analytical stage. So far, there are a small number of studies that have integrated more than two omics data sets, and the inclusion of non-omics data in the same models is still missing in most of studies. In this contribution, we aim at approaching the omics and non-omics data integration from the epidemiology scope by considering the massive inclusion of variables in the risk assessment and predictive models. We also provide already available examples of integrative contributions in the field, propose analytical strategies that allow considering both omics and non-omics data in the models, and finally review the challenges imbedding this type of research.

KW - challenges

KW - epidemiology

KW - exposure

KW - genetic susceptibility

KW - integration

KW - outcome

KW - omics data

KW - statistical methods

U2 - 10.1002/gepi.21992

DO - 10.1002/gepi.21992

M3 - Article

C2 - 27432111

SN - 0741-0395

VL - 40

SP - 558

EP - 569

JO - Genetic Epidemiology

JF - Genetic Epidemiology

IS - 7

ER -