Automated extraction of potential migraine biomarkers using a semantic graph

Wytze J. Vlietstra*, Ronald Zielman, Robin M. van Dongen, Erik A. Schultes, Floris Wiesman, Rein Vos, Erik M. van Mulligen, Jan A. Kors

*Corresponding author for this work

Research output: Contribution to journalArticleAcademicpeer-review

Abstract

Problem: Biomedical literature and databases contain important clues for the identification of potential disease biomarkers. However, searching these enormous knowledge reservoirs and integrating findings across heterogeneous sources is costly and difficult. Here we demonstrate how semantically integrated knowledge, extracted from biomedical literature and structured databases, can be used to automatically identify potential migraine biomarkers.

Method: We used a knowledge graph containing more than 3.5 million biomedical concepts and 68.4 million relationships. Biochemical compound concepts were filtered and ranked by their potential as biomarkers based on their connections to a subgraph of migraine-related concepts. The ranked results were evaluated against the results of a systematic literature review that was performed manually by migraine researchers. Weight points were assigned to these reference compounds to indicate their relative importance.

Results: Ranked results automatically generated by the knowledge graph were highly consistent with results from the manual literature review. Out of 222 reference compounds, 163 (73%) ranked in the top 2000, with 547 out of the 644 (85%) weight points assigned to the reference compounds. For reference compounds that were not in the top of the list, an extensive error analysis has been performed. When evaluating the overall performance, we obtained a ROC-AUC of 0.974.

Discussion: Semantic knowledge graphs composed of information integrated from multiple and varying sources can assist researchers in identifying potential disease biomarkers. (C) 2017 The Author(s). Published by Elsevier Inc.

Original languageEnglish
Pages (from-to)178-189
Number of pages12
JournalJournal of Biomedical Informatics
Volume71
DOIs
Publication statusPublished - Jul 2017

Keywords

  • Knowledge graph
  • Graph semantics
  • Biomarker identification
  • Migraine biomarkers
  • Semantic subgraph
  • BIOLOGY DATABASE COLLECTION
  • NUCLEIC-ACIDS-RESEARCH
  • LITERATURE-BASED DISCOVERY
  • PLASMA VASOPRESSIN
  • ALZHEIMERS-DISEASE
  • ISSUE
  • PATHOPHYSIOLOGY
  • PREDICATIONS
  • KNOWLEDGE
  • UMLS

Cite this