A catalogue of 863 Rett-syndrome-causing MECP2 mutations and lessons learned from data integration

Friederike Ehrhart*, Annika Jacobsen, Maria Rigau, Mattia Bosio, Rajaram Kaliyaperumal, Jeroen F. J. Laros, Egon L. Willighagen, Alfonso Valencia, Marco Roos, Salvador Capella-gutierrez, Leopold M. G. Curfs, Chris T. Evelo

*Corresponding author for this work

Research output: Contribution to journalArticleAcademicpeer-review


Rett syndrome (RTT) is a rare neurological disorder mostly caused by a genetic variation in MECP2. Making new MECP2 variants and the related phenotypes available provides data for better understanding of disease mechanisms and faster identification of variants for diagnosis. This is, however, currently hampered by the lack of interoperability between genotype-phenotype databases. Here, we demonstrate on the example of MECP2 in RTT that by making the genotype-phenotype data more Findable, Accessible, Interoperable, and Reusable (FAIR), we can facilitate prioritization and analysis of variants. In total, 10,968 MECP2 variants were successfully integrated. Among these variants 863 unique confirmed RTT causing and 209 unique confirmed benign variants were found. This dataset was used for comparison of pathogenicity predicting tools, protein consequences, and identification of ambiguous variants. Prediction tools generally recognised the RTT causing and benign variants, however, there was a broad range of overlap Nineteen variants were identified that were annotated as both disease-causing and benign, suggesting that there are additional factors in these cases contributing to disease development.

Original languageEnglish
Article number10
Number of pages12
JournalScientific data
Issue number1
Publication statusPublished - 15 Jan 2021


  • GENE

Cite this