Systematic analysis and prediction of genes associated with monogenic disorders on human chromosome X

E. Leitao, C. Schroder, I. Parenti, C. Dalle, A. Rastetter, T. Kuhnel, A. Kuechler, S. Kaya, B. Gerard, E. Schaefer, C. Nava, N. Drouot, C. Engel, J. Piard, B. Duban-Bedu, L. Villard, A.P.A. Stegmann, E.K. Vanhoutte, J.A.J. Verdonschot, F.J. KaiserF.T. Mau-Them, M. Scala, P. Striano, S.G.M. Frints, E. Argilli, E.H. Sherr, F. Elder, J. Buratti, B. Keren, C. Mignot, D. Heron, J.L. Mandel, J. Gecz, V.M. Kalscheuer, B. Horsthemke, A. Piton, C. Depienne*

*Corresponding author for this work

Research output: Contribution to journalArticleAcademicpeer-review

Abstract

Disease gene discovery on chromosome (chr) X is challenging owing to its unique modes of inheritance. We undertook a systematic analysis of human chrX genes. We observe a higher proportion of disorder-associated genes and an enrichment of genes involved in cognition, language, and seizures on chrX compared to autosomes. We analyze gene constraints, exon and promoter conservation, expression, and paralogues, and report 127 genes sharing one or more attributes with known chrX disorder genes. Using machine learning classifiers trained to distinguish disease-associated from dispensable genes, we classify 247 genes, including 115 of the 127, as having high probability of being disease-associated. We provide evidence of an excess of variants in predicted genes in existing databases. Finally, we report damaging variants in CDK16 and TRPC5 in patients with intellectual disability or autism spectrum disorders. This study predicts large-scale gene-disease associations that could be used for prioritization of X-linked pathogenic variants.Discovering disease genes on the X chromosome can be particularly challenging. Here, the authors use features of known disease genes and machine learning to predict genes that remain to be associated with disorders on this chromosome.
Original languageEnglish
Article number6570
Number of pages17
JournalNature Communications
Volume13
Issue number1
DOIs
Publication statusPublished - 2 Nov 2022

Keywords

  • CPG DENSITY
  • EXPRESSION
  • FRAMEWORK
  • INACTIVATION
  • INTELLECTUAL DISABILITY
  • LANDSCAPE
  • MUTATIONS
  • R/BIOCONDUCTOR PACKAGE
  • VARIANTS
  • VERTEBRATE

Cite this