A preliminary investigation into SPARQL query complexity and federation in Bio2RDF

C. Buil-Aranda, M. Ugarte, Meritxell Arenas, M. Dumontier

Research output: Contribution to journalConference article in journalAcademicpeer-review

Abstract

When users query a SPARQL endpoint, they normally face an empty text box in which they have to write the desired queries. This obstructs the process of obtaining the data they want, since users rarely have any assistance in querying a (possibly huge) RDF database. In this paper we report a deep analysis of the server log files that record the queries that users send to the SPARQL endpoints, focusing in the Bio2RDF cluster. This log analysis reveals the large number of repeated queries that users submit, and how they pursue a trial and error process by adding and removing operations from the submitted queries to obtain the desired results. We also show how users try to connect to other RDF datasets in the Linked Open Data cloud. Our results offer insight into the interaction between users and a schema-light RDF dataset, and secondly, suggest improvements to SPARQL server optimizations in terms of optimization and results caching.

Original languageEnglish
Pages (from-to)196-204
Number of pages9
JournalCEUR Workshop Proceedings
Volume1378
Publication statusPublished - 2015
Externally publishedYes

Fingerprint

Dive into the research topics of 'A preliminary investigation into SPARQL query complexity and federation in Bio2RDF'. Together they form a unique fingerprint.

Cite this