A model for capturing provenance of assertions about chemical substances

Kody Moodley, Amrapali Zaveri, Chunlei Wu, Michel Dumontier

Research output: Chapter in Book/Report/Conference proceeding › Conference article in proceeding › Academic › peer-review

Abstract

Chemical substance resources on the Web are often made accessible to researchers through public APIs (Application Programming Interfaces). A significant problem of missing provenance information arises when extracting and integrating data in such APIs. Even when provenance is stated, it is usually not done with any prescribed templates or terminology. This creates a burden on data producers and makes it challenging for API developers to automatically extract and analyse this information. Downstream, these consequences hinder efforts to automatically determine the veracity and quality of extracted data, critical for proving the integrity of associated research findings. In this paper, we propose a model for capturing provenance of assertions about chemical substances by systematically analyzing three sources: (i) Nanopublications, (ii) Wikidata and (iii) selected Minimal Information Standards (MISTS) for reporting biomedical studies. We analyse provenance terms used in these sources along with their frequency of use and synthesize our findings into a preliminary model for capturing provenance.
Original language: English
Title of host publication: Semantic Web Applications and Tools for Health Care and Life Sciences
Editors: Christopher J. O. Baker, Andra Waagmeester, Andrea Splendiani, Oya Deniz Beyan, M. Scott Marshall
Place of Publication: Antwerp, Belgium
Publisher: CEUR-WS.org
Volume: 2275
Publication status: Published - 2018
Event: Semantic Web Applications and Tools for Health Care and Life Sciences - Antwerp, Antwerp, Belgium
Duration: 3 Dec 2018 → 6 Dec 2018
Conference number: 11
http://www.swat4ls.org/workshops/antwerp2018/

Conference

Conference: Semantic Web Applications and Tools for Health Care and Life Sciences
Abbreviated title: SWAT4LS
Country/Territory: Belgium
City: Antwerp
Period: 3/12/18 → 6/12/18

Keywords

  • provenance
  • metadata
  • ontologies
