Quality Assessment of Biomedical Metadata Using Topic Modeling

Amrapali Zaveri, Michel Dumontier, Stuti Nayak

Research output: Chapter in Book/Report/Conference proceedingConference article in proceedingAcademicpeer-review

Abstract

There is an abundance of biomedical data present on the Web. However, this data is not re-usable because it is insu ciently described using rich metadata. The recently published FAIR principles specify desirable criteria that metadata and their corresponding datasets need to be Findable, Accessible, Interoperable, and Reusable. However, currently the biomedical metadata quality is poor which makes data reuse extremely di cult. To tackle this problem, we propose the use of topic modeling, specifically non-negative matrix factorization (NMF), as a first step towards dimensionality reduction when dealing with large amounts of data. In this position paper, as a use case, we apply NMF to the BioSamples metadata and present preliminary results.

Original languageEnglish
Title of host publicationQuality Assessment of Biomedical Metadata Using Topic Modeling.
Publication statusPublished - 2018
EventSemantic Web solutions for large-scale biomedical data analytics - Crete, Greece
Duration: 3 Jun 2018 → …

Workshop

WorkshopSemantic Web solutions for large-scale biomedical data analytics
Abbreviated titleSeWeBMeDA 2018
Country/TerritoryGreece
CityCrete
Period3/06/18 → …

Fingerprint

Dive into the research topics of 'Quality Assessment of Biomedical Metadata Using Topic Modeling'. Together they form a unique fingerprint.

Cite this