[Profile picture of Ruben Verborgh]

Ruben Verborgh

Merging and Enriching DCAT Feeds to Improve Discoverability of Datasets

by Pieter Heyvaert, Pieter Colpaert, Ruben Verborgh, Erik Mannens, and Rik Van de Walle

Data Catalog Vocabulary (DCAT) is a W3C specification to describe datasets published on the Web. However, these catalogs are not easily discoverable based on a user’s needs. In this paper, we introduce the Node.js module "dcat-merger" which allows a user agent to download and semantically merge different DCAT feeds from the Web into one DCAT feed, which can be republished. Merging the input feeds is followed by enriching them. Besides determining the subjects of the datasets, using DBpedia Spotlight, two extensions were built: one categorizes the datasets according to a taxonomy, and the other adds spatial properties to the datasets. These extensions require the use of information available in DBpedia’s SPARQL endpoint. However, public SPARQL endpoints often suffer from low availability, so a Triple Pattern Fragments alternative is used. However, the need for DCAT Merger sparks the discussion for more high level functionality to improve a catalog’s discoverability.

BibTeX other citation formats

Published in 2015 in Proceedings of the 12th Extended Semantic Web Conference: Posters and Demos.

Keywords:

Read this article online

Cite this article in your work

Cite this article easily using its BibTeX entry:

@inproceedings{heyvaert_eswc_poster_2015,
  author = {Heyvaert, Pieter and Colpaert, Pieter and Verborgh, Ruben and Mannens, Erik and Van de Walle, Rik},
  title = {Merging and Enriching {DCAT} Feeds to Improve Discoverability of Datasets},
  booktitle = {Proceedings of the 12th Extended Semantic Web Conference: Posters and Demos},
  series = {Lecture Notes in Computer Science},
  editor = {Gandon, Fabien and Gu\'eret, Christophe and Villata, Serena and Breslin, John and Faron-Zucker, Catherine and Zimmermann, Antoine},
  volume = 9341,
  pages = {67--71},
  year = 2015,
  month = jun,
  publisher = {Springer},
  isbn = {978-3-319-25639-9},
  doi = {10.1007/978-3-319-25639-9_13},
}

Alternatively, pick a reference of your choice below:

IEEE
P. Heyvaert, P. Colpaert, R. Verborgh, E. Mannens, and R. Van de Walle, “Merging and Enriching DCAT Feeds to Improve Discoverability of Datasets,” in Proceedings of the 12th Extended Semantic Web Conference: Posters and Demos, 2015, vol. 9341, pp. 67–71.
ACM
Pieter Heyvaert, Pieter Colpaert, Ruben Verborgh, Erik Mannens, and Rik Van de Walle. 2015. Merging and Enriching DCAT Feeds to Improve Discoverability of Datasets. In Fabien Gandon, Christophe Guéret, Serena Villata, John Breslin, Catherine Faron-Zucker, & Antoine Zimmermann, eds. Proceedings of the 12th Extended Semantic Web Conference: Posters and Demos. Lecture Notes in Computer Science. Springer, 67–71.
LNCS
Heyvaert, P., Colpaert, P., Verborgh, R., Mannens, E., Van de Walle, R.: Merging and Enriching DCAT Feeds to Improve Discoverability of Datasets. In: Gandon, F., Guéret, C., Villata, S., Breslin, J., Faron-Zucker, C., and Zimmermann, A. (eds.) Proceedings of the 12th Extended Semantic Web Conference: Posters and Demos. pp. 67–71. Springer (2015).
APA
Heyvaert, P., Colpaert, P., Verborgh, R., Mannens, E., & Van de Walle, R. (2015). Merging and Enriching DCAT Feeds to Improve Discoverability of Datasets. In F. Gandon, C. Guéret, S. Villata, J. Breslin, C. Faron-Zucker, & A. Zimmermann (Eds.), Proceedings of the 12th Extended Semantic Web Conference: Posters and Demos (Vol. 9341, pp. 67–71). Springer.
MLA
Heyvaert, Pieter et al. “Merging and Enriching DCAT Feeds to Improve Discoverability of Datasets.” Proceedings of the 12th Extended Semantic Web Conference: Posters and Demos. Ed. Fabien Gandon et al. Vol. 9341. Springer, 2015. 67–71. Print. Lecture Notes in Computer Science.

Discuss this article