[Profile picture of Ruben Verborgh]

Ruben Verborgh

Semantically Annotating CEUR-WS Workshop Proceedings with RML

by Pieter Heyvaert, Anastasia Dimou, Ruben Verborgh, Erik Mannens, and Rik Van de Walle

In this paper, we present our solution for the first task of the second edition of the Semantic Publishing Challenge. The task requires extracting and semantically annotating information regarding CEUR-WS workshops, their chairs and conference affiliations, as well as their papers and their authors, from a set of html-encoded workshop proceedings volumes. Our solution builds on last year’s submission, while we address a number of shortcomings, assess the generated dataset for its quality and publish the queries as SPARQL query templates. This is accomplished using the RDF Mapping Language (RML) to define the mappings, RMLProcessor to execute them, RDFUnit to both validate the mapping documents and assess the generated dataset’s quality, and The DataTank to publish the SPARQL query templates. This results in an overall improved quality of the generated dataset that is reflected in the query results.

Full text BibTeX Mendeley

Published in 2016 in Proceedings of the 12th Extended Semantic Web Conference: Semantic Publishing Challenge.

Keywords: RML, SPARQL

Read this paper online

Cite this paper in your publications

Discuss this paper