TY - GEN
T1 - Reliable Granular References to Changing Linked Data
AU - Kuhn, Tobias
AU - Willighagen, Egon
AU - Evelo, Chris
AU - Queralt-rosinach, Núria
AU - Centeno, Emilio
AU - Furlong, Laura I.
PY - 2017/10/4
Y1 - 2017/10/4
N2 - Nanopublications are a concept to represent Linked Data in a granular and provenance-aware manner, which has been successfully applied to a number of scientific datasets. We demonstrated in previous work how we can establish reliable and verifiable identifiers for nanopublications and sets thereof. Further adoption of these techniques, however, was probably hindered by the fact that nanopublications can lead to an explosion in the number of triples due to auxiliary information about the structure of each nanopublication and repetitive provenance and metadata. We demonstrate here that this significant overhead disappears once we take the version history of nanopublication datasets into account, calculate incremental updates, and allow users to deal with the specific subsets they need. We show that the total size and overhead of evolving scientific datasets is reduced, and typical subsets that researchers use for their analyses can be referenced and retrieved efficiently with optimized precision, persistence, and reliability.
AB - Nanopublications are a concept to represent Linked Data in a granular and provenance-aware manner, which has been successfully applied to a number of scientific datasets. We demonstrated in previous work how we can establish reliable and verifiable identifiers for nanopublications and sets thereof. Further adoption of these techniques, however, was probably hindered by the fact that nanopublications can lead to an explosion in the number of triples due to auxiliary information about the structure of each nanopublication and repetitive provenance and metadata. We demonstrate here that this significant overhead disappears once we take the version history of nanopublication datasets into account, calculate incremental updates, and allow users to deal with the specific subsets they need. We show that the total size and overhead of evolving scientific datasets is reduced, and typical subsets that researchers use for their analyses can be referenced and retrieved efficiently with optimized precision, persistence, and reliability.
UR - https://springernature.figshare.com/articles/dataset/Reliable_Granular_References_to_Changing_Linked_Data/5230639/1
U2 - 10.1007/978-3-319-68288-4_26
DO - 10.1007/978-3-319-68288-4_26
M3 - Conference article in proceeding
SN - 978-3-319-68287-7
VL - 10587
T3 - Lecture Notes in Computer Science
SP - 436
EP - 451
BT - The Semantic Web – ISWC 2017
A2 - d'Amato, Claudia
A2 - Fernandez, Miriam
A2 - Tamma, Valentina
ER -