Application of transfer learning to predict drug-induced human in vivo gene expression changes using rat in vitro and in vivo data

Shauna D.O. Donovan; Rachel Cavill; Florian Wimmenauer; Alexander Lukas; Tobias Stumm; Evgueni Smirnov; Michael Lenz; Gokhan Ertaylan; Danyel G.J. Jennen; Natal A.W. van Riel; Kurt Driessens; Ralf L.M. Peeters; Theo M.C.M. de Kok

doi:10.1371/journal.pone.0292030

Application of transfer learning to predict drug-induced human in vivo gene expression changes using rat in vitro and in vivo data

Shauna D.O. Donovan^*, Rachel Cavill, Florian Wimmenauer, Alexander Lukas, Tobias Stumm, Evgueni Smirnov, Michael Lenz, Gokhan Ertaylan, Danyel G.J. Jennen, Natal A.W. van Riel, Kurt Driessens, Ralf L.M. Peeters, Theo M.C.M. de Kok

^*Corresponding author for this work

Research output: Contribution to journal › Article › Academic › peer-review

Abstract

The liver is the primary site for the metabolism and detoxification of many compounds, including pharmaceuticals. Consequently, it is also the primary location for many adverse reactions. As the liver is not readily accessible for sampling in humans; rodent or cell line models are often used to evaluate potential toxic effects of a novel compound or candidate drug. However, relating the results of animal and in vitro studies to relevant clinical outcomes for the human in vivo situation still proves challenging. In this study, we incorporate principles of transfer learning within a deep artificial neural network allowing us to leverage the relative abundance of rat in vitro and in vivo exposure data from the Open TG-GATEs data set to train a model to predict the expected pattern of human in vivo gene expression following an exposure given measured human in vitro gene expression. We show that domain adaptation has been successfully achieved, with the rat and human in vitro data no longer being separable in the common latent space generated by the network. The network produces physiologically plausible predictions of human in vivo gene expression pattern following an exposure to a previously unseen compound. Moreover, we show the integration of the human in vitro data in the training of the domain adaptation network significantly improves the temporal accuracy of the predicted rat in vivo gene expression pattern following an exposure to a previously unseen compound. In this way, we demonstrate the improvements in prediction accuracy that can be achieved by combining data from distinct domains.

Original language	English
Article number	e0292030
Number of pages	17
Journal	PLOS ONE
Volume	18
Issue number	11
DOIs	https://doi.org/10.1371/journal.pone.0292030
Publication status	Published - 30 Nov 2023

Access to Document

10.1371/journal.pone.0292030Licence: CC BY

Cite this

Donovan, S. D. O., Cavill, R., Wimmenauer, F., Lukas, A., Stumm, T., Smirnov, E., Lenz, M., Ertaylan, G., Jennen, D. G. J., van Riel, N. A. W., Driessens, K., Peeters, R. L. M., & de Kok, T. M. C. M. (2023). Application of transfer learning to predict drug-induced human in vivo gene expression changes using rat in vitro and in vivo data. PLOS ONE, 18(11), Article e0292030. https://doi.org/10.1371/journal.pone.0292030

@article{d02fa80c925545159e483fabcf87b374,

title = "Application of transfer learning to predict drug-induced human in vivo gene expression changes using rat in vitro and in vivo data",

abstract = "The liver is the primary site for the metabolism and detoxification of many compounds, including pharmaceuticals. Consequently, it is also the primary location for many adverse reactions. As the liver is not readily accessible for sampling in humans; rodent or cell line models are often used to evaluate potential toxic effects of a novel compound or candidate drug. However, relating the results of animal and in vitro studies to relevant clinical outcomes for the human in vivo situation still proves challenging. In this study, we incorporate principles of transfer learning within a deep artificial neural network allowing us to leverage the relative abundance of rat in vitro and in vivo exposure data from the Open TG-GATEs data set to train a model to predict the expected pattern of human in vivo gene expression following an exposure given measured human in vitro gene expression. We show that domain adaptation has been successfully achieved, with the rat and human in vitro data no longer being separable in the common latent space generated by the network. The network produces physiologically plausible predictions of human in vivo gene expression pattern following an exposure to a previously unseen compound. Moreover, we show the integration of the human in vitro data in the training of the domain adaptation network significantly improves the temporal accuracy of the predicted rat in vivo gene expression pattern following an exposure to a previously unseen compound. In this way, we demonstrate the improvements in prediction accuracy that can be achieved by combining data from distinct domains.",

author = "Donovan, {Shauna D.O.} and Rachel Cavill and Florian Wimmenauer and Alexander Lukas and Tobias Stumm and Evgueni Smirnov and Michael Lenz and Gokhan Ertaylan and Jennen, {Danyel G.J.} and {van Riel}, {Natal A.W.} and Kurt Driessens and Peeters, {Ralf L.M.} and {de Kok}, {Theo M.C.M.}",

note = "Funding Information: Funding: The research in this paper was supported by the Dutch Province of Limburg. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. Publisher Copyright: Copyright: {\textcopyright} 2023 O{\textquoteright}Donovan et al.",

year = "2023",

month = nov,

day = "30",

doi = "10.1371/journal.pone.0292030",

language = "English",

volume = "18",

journal = "PLOS ONE",

issn = "1932-6203",

publisher = "Public Library of Science",

number = "11",

}

Donovan, SDO, Cavill, R, Wimmenauer, F, Lukas, A, Stumm, T, Smirnov, E, Lenz, M, Ertaylan, G, Jennen, DGJ, van Riel, NAW, Driessens, K , Peeters, RLM & de Kok, TMCM 2023, 'Application of transfer learning to predict drug-induced human in vivo gene expression changes using rat in vitro and in vivo data', PLOS ONE, vol. 18, no. 11, e0292030. https://doi.org/10.1371/journal.pone.0292030

TY - JOUR

T1 - Application of transfer learning to predict drug-induced human in vivo gene expression changes using rat in vitro and in vivo data

AU - Donovan, Shauna D.O.

AU - Cavill, Rachel

AU - Wimmenauer, Florian

AU - Lukas, Alexander

AU - Stumm, Tobias

AU - Smirnov, Evgueni

AU - Lenz, Michael

AU - Ertaylan, Gokhan

AU - Jennen, Danyel G.J.

AU - van Riel, Natal A.W.

AU - Driessens, Kurt

AU - Peeters, Ralf L.M.

AU - de Kok, Theo M.C.M.

N1 - Funding Information: Funding: The research in this paper was supported by the Dutch Province of Limburg. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. Publisher Copyright: Copyright: © 2023 O’Donovan et al.

PY - 2023/11/30

Y1 - 2023/11/30

N2 - The liver is the primary site for the metabolism and detoxification of many compounds, including pharmaceuticals. Consequently, it is also the primary location for many adverse reactions. As the liver is not readily accessible for sampling in humans; rodent or cell line models are often used to evaluate potential toxic effects of a novel compound or candidate drug. However, relating the results of animal and in vitro studies to relevant clinical outcomes for the human in vivo situation still proves challenging. In this study, we incorporate principles of transfer learning within a deep artificial neural network allowing us to leverage the relative abundance of rat in vitro and in vivo exposure data from the Open TG-GATEs data set to train a model to predict the expected pattern of human in vivo gene expression following an exposure given measured human in vitro gene expression. We show that domain adaptation has been successfully achieved, with the rat and human in vitro data no longer being separable in the common latent space generated by the network. The network produces physiologically plausible predictions of human in vivo gene expression pattern following an exposure to a previously unseen compound. Moreover, we show the integration of the human in vitro data in the training of the domain adaptation network significantly improves the temporal accuracy of the predicted rat in vivo gene expression pattern following an exposure to a previously unseen compound. In this way, we demonstrate the improvements in prediction accuracy that can be achieved by combining data from distinct domains.

AB - The liver is the primary site for the metabolism and detoxification of many compounds, including pharmaceuticals. Consequently, it is also the primary location for many adverse reactions. As the liver is not readily accessible for sampling in humans; rodent or cell line models are often used to evaluate potential toxic effects of a novel compound or candidate drug. However, relating the results of animal and in vitro studies to relevant clinical outcomes for the human in vivo situation still proves challenging. In this study, we incorporate principles of transfer learning within a deep artificial neural network allowing us to leverage the relative abundance of rat in vitro and in vivo exposure data from the Open TG-GATEs data set to train a model to predict the expected pattern of human in vivo gene expression following an exposure given measured human in vitro gene expression. We show that domain adaptation has been successfully achieved, with the rat and human in vitro data no longer being separable in the common latent space generated by the network. The network produces physiologically plausible predictions of human in vivo gene expression pattern following an exposure to a previously unseen compound. Moreover, we show the integration of the human in vitro data in the training of the domain adaptation network significantly improves the temporal accuracy of the predicted rat in vivo gene expression pattern following an exposure to a previously unseen compound. In this way, we demonstrate the improvements in prediction accuracy that can be achieved by combining data from distinct domains.

U2 - 10.1371/journal.pone.0292030

DO - 10.1371/journal.pone.0292030

M3 - Article

SN - 1932-6203

VL - 18

JO - PLOS ONE

JF - PLOS ONE

IS - 11

M1 - e0292030

ER -