The ChEMBL database as linked open data

E.L. Willighagen*, A.S. Waagmeester, O. Spjuth, P. Ansell, A.J. Williams, V. Tkachenko, J. Hastings, B. Chen, D.J. Wild

*Corresponding author for this work

Research output: Contribution to journalArticleAcademicpeer-review

Abstract

BACKGROUND: Making data available as Linked Data using Resource Description Framework (RDF) promotes integration with other web resources. RDF documents can natively link to related data, and others can link back using Uniform Resource Identifiers (URIs). RDF makes the data machine-readable and uses extensible vocabularies for additional information, making it easier to scale up inference and data analysis. RESULTS: This paper describes recent developments in an ongoing project converting data from the ChEMBL database into RDF triples. Relative to earlier versions, this updated version of ChEMBL-RDF uses recently introduced ontologies, including CHEMINF and CiTO; exposes more information from the database; and is now available as dereferencable, linked data. To demonstrate these new features, we present novel use cases showing further integration with other web resources, including Bio2RDF, Chem2Bio2RDF, and ChemSpider, and showing the use of standard ontologies for querying. CONCLUSIONS: We have illustrated the advantages of using open standards and ontologies to link the ChEMBL database to other databases. Using those links and the knowledge encoded in standards and ontologies, the ChEMBL-RDF resource creates a foundation for integrated semantic web cheminformatics applications, such as the presented decision support.
Original languageEnglish
Article number23
Number of pages12
JournalJournal of Cheminformatics
Volume5
Issue number1
DOIs
Publication statusPublished - 8 May 2013

Keywords

  • ChEMBL
  • Bioactivity
  • Semantic web
  • Resource Description Framework
  • Linked Data
  • CHEMICAL BIOLOGY DATA
  • ONTOLOGY
  • SYSTEMS
  • PROTEIN
  • FRAMEWORK
  • LINKING

Cite this