Blood-based DNA methylation and exposure risk scores predict PTSD with high accuracy in military and civilian cohorts

Agaz Wani; Seyma Katrinli; Xiang Zhao; Nikolaos Daskalakis; Anthony Zannas; Allison Aiello; Dewleen Baker; Marco Boks; Leslie Brick; Chia-Yen Chen; Shareefa Dalvie; Catherine Fortier; Elbert Geuze; Jasmeet Hayes; Ronald Kessler; Anthony King; Nastassja Koen; Israel Liberzon; Adriana Lori; Jurjen Luykx; Adam Maihofer; William Milberg; Mark Miller; Mary Mufford; Nicole Nugent; Sheila Rauch; Kerry Ressler; Victoria Risbrough; Bart Rutten; Dan Stein; Murrary Stein; Robert Ursano; Mieke Verfaellie; Erin Ware; Derek Wildman; Erika Wolf; Caroline Nievergelt; Mark Logue; Alicia Smith; Monica Uddin; Eric Vermetten; Christiaan Vinkers

doi:10.21203/rs.3.rs-3952163/v1

Research output: Working paper / Preprint › Preprint

Abstract

BACKGROUND: Incorporating genomic data into risk prediction has become an increasingly useful approach for rapid identification of individuals most at risk for complex disorders such as PTSD. Our goal was to develop and validate Methylation Risk Scores (MRS) using machine learning to distinguish individuals who have PTSD from those who do not.

METHODS: Elastic Net was used to develop three risk score models using a discovery dataset (n = 1226; 314 cases, 912 controls) comprising 5 diverse cohorts with available blood-derived DNA methylation (DNAm) measured on the Illumina EPIC BeadChip. The first risk score, the exposure and methylation risk score (eMRS), used cumulative and childhood trauma exposure and DNAm variables; the second, the methylation-only risk score (MoRS), was based solely on DNAm data; the third, the methylation-only risk score with adjusted exposure variables (MoRSAE), used DNAm data adjusted for the two exposure variables. The potential of these risk scores to predict future PTSD based on pre-deployment data was also assessed. External validation of risk scores was conducted in four independent cohorts.

RESULTS: The eMRS model showed the highest accuracy (92%), precision (91%), recall (87%), and F1-score (89%) in classifying PTSD using 3730 features. While still highly accurate, the MoRS (accuracy = 89%) using 3728 features and MoRSAE (accuracy = 84%) using 4150 features showed a decline in classification power. eMRS significantly predicted PTSD in one of the four independent cohorts, the BEAR cohort (beta = 0.6839, p = 0.003), but not in the remaining three cohorts. Pre-deployment risk scores from all models (eMRS, beta = 1.92; MoRS, beta = 1.99; and MoRSAE, beta = 1.77) showed significant (p < 0.001) predictive power for post-deployment PTSD.

CONCLUSION: Results, especially those from the eMRS, reinforce earlier findings that methylation and trauma are interconnected and can be leveraged to increase the correct classification of those with vs. without PTSD. Moreover, our models can potentially be a valuable tool in predicting the future risk of developing PTSD. As more data become available, including additional molecular, environmental, and psychosocial factors in these scores may enhance their accuracy in predicting the condition and, relatedly, improve their performance in independent cohorts.
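As a reading aid, the reported eMRS metrics can be cross-checked with a few lines of arithmetic. The sketch below is not the authors' code; it only uses figures stated in the abstract, and the function names are illustrative. It recomputes the F1-score as the harmonic mean of the reported precision and recall, and computes the accuracy a trivial majority-class classifier would reach given the reported case/control counts. The abstract does not say whether the metrics were measured on the full discovery dataset or on a held-out split, so the baseline is rough context only.

```python
# Values reported in the abstract (eMRS model, discovery dataset).
N_TOTAL, N_CASES, N_CONTROLS = 1226, 314, 912
PRECISION, RECALL = 0.91, 0.87


def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall."""
    return 2 * precision * recall / (precision + recall)


def majority_baseline_accuracy(n_cases: int, n_controls: int) -> float:
    """Accuracy of a trivial classifier that always predicts the larger class."""
    return max(n_cases, n_controls) / (n_cases + n_controls)


f1 = f1_score(PRECISION, RECALL)
baseline = majority_baseline_accuracy(N_CASES, N_CONTROLS)

print(f"F1 from reported precision and recall: {f1:.2f}")   # 0.89
print(f"Majority-class baseline accuracy: {baseline:.3f}")  # 0.744
```

The recomputed F1 agrees with the reported 89%. With about 74% of participants being controls, the reported accuracies of 84% to 92% are best read against that baseline rather than against 50%.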

Original language	English
Publisher	Research Square Company
DOIs	https://doi.org/10.21203/rs.3.rs-3952163/v1
Publication status	Published - 15 Feb 2024

Keywords

DNA methylation
Machine learning
PTSD
Risk scores

Cite this

Wani, A., Katrinli, S., Zhao, X., Daskalakis, N., Zannas, A., Aiello, A., Baker, D., Boks, M., Brick, L., Chen, C.-Y., Dalvie, S., Fortier, C., Geuze, E., Hayes, J., Kessler, R., King, A., Koen, N., Liberzon, I., Lori, A., ... Vinkers, C. (2024). Blood-based DNA methylation and exposure risk scores predict PTSD with high accuracy in military and civilian cohorts. Research Square Company. https://doi.org/10.21203/rs.3.rs-3952163/v1

@techreport{f020a12137f04cd99d9ea8d79347dcaf,

title = "Blood-based DNA methylation and exposure risk scores predict PTSD with high accuracy in military and civilian cohorts",

abstract = "BACKGROUND: Incorporating genomic data into risk prediction has become an increasingly useful approach for rapid identification of individuals most at risk for complex disorders such as PTSD. Our goal was to develop and validate Methylation Risk Scores (MRS) using machine learning to distinguish individuals who have PTSD from those who do not. METHODS: Elastic Net was used to develop three risk score models using a discovery dataset (n = 1226; 314 cases, 912 controls) comprising 5 diverse cohorts with available blood-derived DNA methylation (DNAm) measured on the Illumina EPIC BeadChip. The first risk score, the exposure and methylation risk score (eMRS), used cumulative and childhood trauma exposure and DNAm variables; the second, the methylation-only risk score (MoRS), was based solely on DNAm data; the third, the methylation-only risk score with adjusted exposure variables (MoRSAE), used DNAm data adjusted for the two exposure variables. The potential of these risk scores to predict future PTSD based on pre-deployment data was also assessed. External validation of risk scores was conducted in four independent cohorts. RESULTS: The eMRS model showed the highest accuracy (92%), precision (91%), recall (87%), and F1-score (89%) in classifying PTSD using 3730 features. While still highly accurate, the MoRS (accuracy = 89%) using 3728 features and MoRSAE (accuracy = 84%) using 4150 features showed a decline in classification power. eMRS significantly predicted PTSD in one of the four independent cohorts, the BEAR cohort (beta = 0.6839, p = 0.003), but not in the remaining three cohorts. Pre-deployment risk scores from all models (eMRS, beta = 1.92; MoRS, beta = 1.99; and MoRSAE, beta = 1.77) showed significant (p < 0.001) predictive power for post-deployment PTSD. CONCLUSION: Results, especially those from the eMRS, reinforce earlier findings that methylation and trauma are interconnected and can be leveraged to increase the correct classification of those with vs. without PTSD. Moreover, our models can potentially be a valuable tool in predicting the future risk of developing PTSD. As more data become available, including additional molecular, environmental, and psychosocial factors in these scores may enhance their accuracy in predicting the condition and, relatedly, improve their performance in independent cohorts.",

keywords = "DNA methylation, Machine learning, PTSD, Risk scores",

author = "Agaz Wani and Seyma Katrinli and Xiang Zhao and Nikolaos Daskalakis and Anthony Zannas and Allison Aiello and Dewleen Baker and Marco Boks and Leslie Brick and Chia-Yen Chen and Shareefa Dalvie and Catherine Fortier and Elbert Geuze and Jasmeet Hayes and Ronald Kessler and Anthony King and Nastassja Koen and Israel Liberzon and Adriana Lori and Jurjen Luykx and Adam Maihofer and William Milberg and Mark Miller and Mary Mufford and Nicole Nugent and Sheila Rauch and Kerry Ressler and Victoria Risbrough and Bart Rutten and Dan Stein and Murrary Stein and Robert Ursano and Mieke Verfaellie and Erin Ware and Derek Wildman and Erika Wolf and Caroline Nievergelt and Mark Logue and Alicia Smith and Monica Uddin and Eric Vermetten and Christiaan Vinkers",

year = "2024",

month = feb,

day = "15",

doi = "10.21203/rs.3.rs-3952163/v1",

language = "English",

publisher = "Research Square Company",

address = "United States",

type = "WorkingPaper",

institution = "Research Square Company",

}

Wani, A, Katrinli, S, Zhao, X, Daskalakis, N, Zannas, A, Aiello, A, Baker, D, Boks, M, Brick, L, Chen, C-Y, Dalvie, S, Fortier, C, Geuze, E, Hayes, J, Kessler, R, King, A, Koen, N, Liberzon, I, Lori, A, Luykx, J, Maihofer, A, Milberg, W, Miller, M, Mufford, M, Nugent, N, Rauch, S, Ressler, K, Risbrough, V, Rutten, B, Stein, D, Stein, M, Ursano, R, Verfaellie, M, Ware, E, Wildman, D, Wolf, E, Nievergelt, C, Logue, M, Smith, A, Uddin, M, Vermetten, E & Vinkers, C 2024 'Blood-based DNA methylation and exposure risk scores predict PTSD with high accuracy in military and civilian cohorts' Research Square Company. https://doi.org/10.21203/rs.3.rs-3952163/v1

TY - UNPB

T1 - Blood-based DNA methylation and exposure risk scores predict PTSD with high accuracy in military and civilian cohorts

AU - Wani, Agaz

AU - Katrinli, Seyma

AU - Zhao, Xiang

AU - Daskalakis, Nikolaos

AU - Zannas, Anthony

AU - Aiello, Allison

AU - Baker, Dewleen

AU - Boks, Marco

AU - Brick, Leslie

AU - Chen, Chia-Yen

AU - Dalvie, Shareefa

AU - Fortier, Catherine

AU - Geuze, Elbert

AU - Hayes, Jasmeet

AU - Kessler, Ronald

AU - King, Anthony

AU - Koen, Nastassja

AU - Liberzon, Israel

AU - Lori, Adriana

AU - Luykx, Jurjen

AU - Maihofer, Adam

AU - Milberg, William

AU - Miller, Mark

AU - Mufford, Mary

AU - Nugent, Nicole

AU - Rauch, Sheila

AU - Ressler, Kerry

AU - Risbrough, Victoria

AU - Rutten, Bart

AU - Stein, Dan

AU - Stein, Murrary

AU - Ursano, Robert

AU - Verfaellie, Mieke

AU - Ware, Erin

AU - Wildman, Derek

AU - Wolf, Erika

AU - Nievergelt, Caroline

AU - Logue, Mark

AU - Smith, Alicia

AU - Uddin, Monica

AU - Vermetten, Eric

AU - Vinkers, Christiaan

PY - 2024/2/15

Y1 - 2024/2/15

AB - BACKGROUND: Incorporating genomic data into risk prediction has become an increasingly useful approach for rapid identification of individuals most at risk for complex disorders such as PTSD. Our goal was to develop and validate Methylation Risk Scores (MRS) using machine learning to distinguish individuals who have PTSD from those who do not. METHODS: Elastic Net was used to develop three risk score models using a discovery dataset (n = 1226; 314 cases, 912 controls) comprising 5 diverse cohorts with available blood-derived DNA methylation (DNAm) measured on the Illumina EPIC BeadChip. The first risk score, the exposure and methylation risk score (eMRS), used cumulative and childhood trauma exposure and DNAm variables; the second, the methylation-only risk score (MoRS), was based solely on DNAm data; the third, the methylation-only risk score with adjusted exposure variables (MoRSAE), used DNAm data adjusted for the two exposure variables. The potential of these risk scores to predict future PTSD based on pre-deployment data was also assessed. External validation of risk scores was conducted in four independent cohorts. RESULTS: The eMRS model showed the highest accuracy (92%), precision (91%), recall (87%), and F1-score (89%) in classifying PTSD using 3730 features. While still highly accurate, the MoRS (accuracy = 89%) using 3728 features and MoRSAE (accuracy = 84%) using 4150 features showed a decline in classification power. eMRS significantly predicted PTSD in one of the four independent cohorts, the BEAR cohort (beta = 0.6839, p = 0.003), but not in the remaining three cohorts. Pre-deployment risk scores from all models (eMRS, beta = 1.92; MoRS, beta = 1.99; and MoRSAE, beta = 1.77) showed significant (p < 0.001) predictive power for post-deployment PTSD. CONCLUSION: Results, especially those from the eMRS, reinforce earlier findings that methylation and trauma are interconnected and can be leveraged to increase the correct classification of those with vs. without PTSD. Moreover, our models can potentially be a valuable tool in predicting the future risk of developing PTSD. As more data become available, including additional molecular, environmental, and psychosocial factors in these scores may enhance their accuracy in predicting the condition and, relatedly, improve their performance in independent cohorts.

KW - DNA methylation

KW - Machine learning

KW - PTSD

KW - Risk scores

U2 - 10.21203/rs.3.rs-3952163/v1

DO - 10.21203/rs.3.rs-3952163/v1

M3 - Preprint

BT - Blood-based DNA methylation and exposure risk scores predict PTSD with high accuracy in military and civilian cohorts

PB - Research Square Company

ER -