Use of deep learning methods to translate drug-induced gene expression changes from rat to human primary hepatocytes

Shauna D O'Donovan; Kurt Driessens; Daniel Lopatta; Florian Wimmenauer; Alexander Lukas; Jelmer Neeven; Evgueni Smirnov; Michael Lenz; Gokhan Ertaylan; Danyel G J Jennen; Natal A W van Riel; Rachel Cavill; Ralf L M Peeters; Theo M C M de Kok

doi:10.1371/journal.pone.0236392

Corresponding author: Shauna D O'Donovan

Research output: Contribution to journal › Article › Academic › peer-review

Abstract

In clinical trials, animal and cell line models are often used to evaluate the potential toxic effects of a novel compound or candidate drug before progressing to human trials. However, relating the results of animal and in vitro model exposures to relevant clinical outcomes in the human in vivo system still proves challenging, relying on often putative orthologs. In recent years, multiple studies have demonstrated that the repeated dose rodent bioassay, the current gold standard in the field, lacks sufficient sensitivity and specificity in predicting toxic effects of pharmaceuticals in humans. In this study, we evaluate the potential of deep learning techniques to translate the pattern of gene expression measured following an exposure in rodents to humans, circumventing the current reliance on orthologs, and also from in vitro to in vivo experimental designs. Of the deep learning architectures applied in this study, the convolutional neural network (CNN) and a deep artificial neural network with bottleneck architecture significantly outperform classical machine learning techniques in predicting the time series of gene expression in primary human hepatocytes, given a measured time series of gene expression from primary rat hepatocytes following in vitro exposure to a previously unseen compound, across multiple toxicologically relevant gene sets. Across 76 genes that have been shown to be predictive of carcinogenicity, the average mean absolute error is reduced from 0.0172 for a random regression forest to 0.0166 for the CNN model (p < 0.05). These deep learning architectures also perform well when applied to predict time series of in vivo gene expression given measured time series of in vitro gene expression for rats.
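The error figures quoted in the abstract can be made concrete with a minimal sketch of an average mean absolute error across genes. The data layout (one list of expression values per gene), the function name, the gene labels, and the toy numbers are all illustrative assumptions and are not taken from the paper; only the two reported error values, 0.0172 and 0.0166, come from the abstract.

```python
from statistics import fmean


def average_mae_across_genes(measured, predicted):
    """Average the per-gene mean absolute error over all genes.

    Both arguments map a gene label to a list of expression values
    (hypothetical layout: values flattened over compounds and time points).
    """
    per_gene_mae = []
    for gene, truth in measured.items():
        guess = predicted[gene]
        if len(guess) != len(truth):
            raise ValueError(f"length mismatch for gene {gene}")
        # Mean absolute error for this gene's time series values.
        per_gene_mae.append(fmean(abs(t - g) for t, g in zip(truth, guess)))
    # Each gene contributes equally to the final score.
    return fmean(per_gene_mae)


# Toy values, invented purely for illustration.
measured = {"GENE_A": [0.0, 0.2, 0.4], "GENE_B": [1.0, 1.0, 1.0]}
predicted = {"GENE_A": [0.1, 0.2, 0.3], "GENE_B": [1.0, 1.2, 1.0]}
score = average_mae_across_genes(measured, predicted)

# Relative reduction implied by the two values reported in the abstract.
relative_reduction = (0.0172 - 0.0166) / 0.0172
```

With the reported values, the relative reduction works out to roughly 3.5%, so the improvement is statistically significant in the study but modest in absolute terms.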

Original language	English
Article number	e0236392
Number of pages	20
Journal	PLOS ONE
Volume	15
Issue number	8
DOIs	https://doi.org/10.1371/journal.pone.0236392
Publication status	Published - 11 Aug 2020

Keywords

REPRESENTATIONS
TOXICOGENOMICS
TOXICOLOGY
TOXICITY

Access to Document

10.1371/journal.pone.0236392 (Licence: CC BY)

Cite this

O'Donovan, S. D., Driessens, K., Lopatta, D., Wimmenauer, F., Lukas, A., Neeven, J., Smirnov, E., Lenz, M., Ertaylan, G., Jennen, D. G. J., van Riel, N. A. W., Cavill, R., Peeters, R. L. M., & de Kok, T. M. C. M. (2020). Use of deep learning methods to translate drug-induced gene expression changes from rat to human primary hepatocytes. PLOS ONE, 15(8), Article e0236392. https://doi.org/10.1371/journal.pone.0236392

@article{0aeb482bc57a49cfa29647bf5223a69c,

title = "Use of deep learning methods to translate drug-induced gene expression changes from rat to human primary hepatocytes",

abstract = "In clinical trials, animal and cell line models are often used to evaluate the potential toxic effects of a novel compound or candidate drug before progressing to human trials. However, relating the results of animal and in vitro model exposures to relevant clinical outcomes in the human in vivo system still proves challenging, relying on often putative orthologs. In recent years, multiple studies have demonstrated that the repeated dose rodent bioassay, the current gold standard in the field, lacks sufficient sensitivity and specificity in predicting toxic effects of pharmaceuticals in humans. In this study, we evaluate the potential of deep learning techniques to translate the pattern of gene expression measured following an exposure in rodents to humans, circumventing the current reliance on orthologs, and also from in vitro to in vivo experimental designs. Of the deep learning architectures applied in this study, the convolutional neural network (CNN) and a deep artificial neural network with bottleneck architecture significantly outperform classical machine learning techniques in predicting the time series of gene expression in primary human hepatocytes, given a measured time series of gene expression from primary rat hepatocytes following in vitro exposure to a previously unseen compound, across multiple toxicologically relevant gene sets. Across 76 genes that have been shown to be predictive of carcinogenicity, the average mean absolute error is reduced from 0.0172 for a random regression forest to 0.0166 for the CNN model (p < 0.05). These deep learning architectures also perform well when applied to predict time series of in vivo gene expression given measured time series of in vitro gene expression for rats.",

keywords = "REPRESENTATIONS, TOXICOGENOMICS, TOXICOLOGY, TOXICITY",

author = "O'Donovan, {Shauna D} and Kurt Driessens and Daniel Lopatta and Florian Wimmenauer and Alexander Lukas and Jelmer Neeven and Evgueni Smirnov and Michael Lenz and Gokhan Ertaylan and Jennen, {Danyel G J} and {van Riel}, {Natal A W} and Rachel Cavill and Peeters, {Ralf L M} and {de Kok}, {Theo M C M}",

note = "Funding Information: The research in this paper was supported by a grant from the Dutch Province of Limburg awarded to T.d.K, R.P, and N.v.R. Publisher Copyright: {\textcopyright} 2020 O'Donovan et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.",

year = "2020",

month = aug,

day = "11",

doi = "10.1371/journal.pone.0236392",

language = "English",

volume = "15",

journal = "PLOS ONE",

issn = "1932-6203",

publisher = "Public Library of Science",

number = "8",

}

O'Donovan, SD, Driessens, K, Lopatta, D, Wimmenauer, F, Lukas, A, Neeven, J, Smirnov, E, Lenz, M, Ertaylan, G, Jennen, DGJ, van Riel, NAW, Cavill, R, Peeters, RLM & de Kok, TMCM 2020, 'Use of deep learning methods to translate drug-induced gene expression changes from rat to human primary hepatocytes', PLOS ONE, vol. 15, no. 8, e0236392. https://doi.org/10.1371/journal.pone.0236392

TY - JOUR

T1 - Use of deep learning methods to translate drug-induced gene expression changes from rat to human primary hepatocytes

AU - O'Donovan, Shauna D

AU - Driessens, Kurt

AU - Lopatta, Daniel

AU - Wimmenauer, Florian

AU - Lukas, Alexander

AU - Neeven, Jelmer

AU - Smirnov, Evgueni

AU - Lenz, Michael

AU - Ertaylan, Gokhan

AU - Jennen, Danyel G J

AU - van Riel, Natal A W

AU - Cavill, Rachel

AU - Peeters, Ralf L M

AU - de Kok, Theo M C M

N1 - Funding Information: The research in this paper was supported by a grant from the Dutch Province of Limburg awarded to T.d.K, R.P, and N.v.R. Publisher Copyright: © 2020 O'Donovan et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

PY - 2020/8/11

Y1 - 2020/8/11

N2 - In clinical trials, animal and cell line models are often used to evaluate the potential toxic effects of a novel compound or candidate drug before progressing to human trials. However, relating the results of animal and in vitro model exposures to relevant clinical outcomes in the human in vivo system still proves challenging, relying on often putative orthologs. In recent years, multiple studies have demonstrated that the repeated dose rodent bioassay, the current gold standard in the field, lacks sufficient sensitivity and specificity in predicting toxic effects of pharmaceuticals in humans. In this study, we evaluate the potential of deep learning techniques to translate the pattern of gene expression measured following an exposure in rodents to humans, circumventing the current reliance on orthologs, and also from in vitro to in vivo experimental designs. Of the deep learning architectures applied in this study, the convolutional neural network (CNN) and a deep artificial neural network with bottleneck architecture significantly outperform classical machine learning techniques in predicting the time series of gene expression in primary human hepatocytes, given a measured time series of gene expression from primary rat hepatocytes following in vitro exposure to a previously unseen compound, across multiple toxicologically relevant gene sets. Across 76 genes that have been shown to be predictive of carcinogenicity, the average mean absolute error is reduced from 0.0172 for a random regression forest to 0.0166 for the CNN model (p < 0.05). These deep learning architectures also perform well when applied to predict time series of in vivo gene expression given measured time series of in vitro gene expression for rats.

KW - REPRESENTATIONS

KW - TOXICOGENOMICS

KW - TOXICOLOGY

KW - TOXICITY

U2 - 10.1371/journal.pone.0236392

DO - 10.1371/journal.pone.0236392

M3 - Article

C2 - 32780735

SN - 1932-6203

VL - 15

JO - PLOS ONE

JF - PLOS ONE

IS - 8

M1 - e0236392

ER -