Linked Research Articles
Below are some of the peer-reviewed scientific articles I wrote as a main author.
I am additionally publishing them on this website as Linked Research.
These articles, among many others, are part of my full publication list.
Linked Research articles on this website
Re-Decentralizing the Web, For Good This Time
Linking the World’s Information: Essays on Tim Berners-Lee’s Invention of the World Wide Web, 2023
Originally designed as a decentralized ecosystem, the Web has undergone a significant centralization in recent years. In order to regain control over our digital self, over the digital aspects of our lives, we need to understand how we arrived at this point and how we can get back on track. This chapter explains the history of decentralization in a Web context, and details Tim Berners-Lee’s role in the continued battle for a free and open Web. The challenges and solutions are not purely technical in nature, but rather fit into a larger socio-economic puzzle, to which all of us are invited to contribute. Let us take back the Web for good, and leverage its full potential as envisioned by its creator.
The Semantic Web identity crisis: in search of the trivialities that never were
Semantic Web Journal, 2020
For a domain with a strong focus on unambiguous identifiers and meaning, the Semantic Web research field itself has a surprisingly ill-defined sense of identity. Started at the end of the 1990s at the intersection of databases, logic, and Web, and influenced along the way by all major tech hypes such as Big Data and machine learning, our research community needs to look in the mirror to understand who we really are. The key question amid all possible directions is pinpointing the important challenges we are uniquely positioned to tackle. In this article, we highlight the community’s unconscious bias toward addressing the Paretonian 80% of problems through research—
Decentralizing the Semantic Web through incentivized collaboration
International Semantic Web Conference: Blue Sky Track, 2018
Personal data is being centralized at an unprecedented scale, and this comes with widely known and far-reaching consequences, considering the recent data scandals with companies such as Equifax and Facebook. Decentralizing personal data storage allows people to take back control of their data, and Semantic Web technologies can facilitate data integration at runtime. However, such data processing over decentralized data requires far more expensive algorithms, while at the same time, less processing power is available in individual stores compared to large-scale data centers. This article presents a vision in which nodes in decentralized networks are incentivized to collaborate on data processing using a distributed ledger. By leveraging the collective processing capacity of all nodes, we can provide a sustainable alternative to the current generation of centralized solutions, and thereby put people back in control without compromising on functionality.
A Web API ecosystem through feature-based reuse
Internet Computing, 2018
The fast-growing Web API landscape brings clients more options than ever before—
Piecing the puzzle – Self-publishing queryable research data on the Web
Workshop on Linked Data on the Web, 2017
Publishing research on the Web accompanied by machine-readable data is one of the aims of Linked Research. Merely embedding metadata as RDFa in HTML research articles, however, does not solve the problems of accessing and querying that data. Hence, I created a simple ETL pipeline to extract and enrich Linked Data from my personal website, publishing the result in a queryable way through Triple Pattern Fragments. The pipeline is open source, uses existing ontologies, and can be adapted to other websites. In this article, I discuss this pipeline, the resulting data, and its possibilities for query evaluation on the Web. More than 35,000 RDF triples of my data are queryable, even with federated SPARQL queries because of links to external datasets. This proves that researchers do not need to depend on centralized repositories for readily accessible (meta-)data, but instead can—
Your JSON is not my JSON – A case for more fine-grained content negotiation
Workshop on Smart Descriptions & Smarter Vocabularies, 2016
Information resources can be expressed in different representations along many dimensions such as format, language, and time. Through content negotiation, HTTP clients and servers can agree on which representation is most appropriate for a given piece of data. For instance, interactive clients typically indicate they prefer HTML, whereas automated clients would ask for JSON or RDF. However, labels such as “JSON” and “RDF” are insufficient to negotiate between the rich variety of possibilities offered by today’s languages and data models. This position paper argues that, despite widespread misuse, content negotiation remains the way forward. However, we need to extend it with more granular options in order to serve different current and future Web clients sustainably.
Linked Research articles on other websites
In addition to the articles above, I have contributed to these Linked Research articles published on other websites:
- DFDP: A Declarative Form Description Pipeline for Decentralizing Web Forms (2024)
- Demonstration of Link Traversal SPARQL Query Processing over the Decentralized Solid Environment (2024)
- Link Traversal Query Processing over Decentralized Environments with Structural Assumptions (2023)
- In-Memory Indexing of Quoted RDF Triples (2023)
- How Does the Link Queue Evolve during Traversal-Based Query Processing? (2023)
- A Prospective Analysis of Security Vulnerabilities within Link Traversal-Based Query Processing (2022)
- What’s in a Pod? – A knowledge graph interpretation for the Solid ecosystem (2022)
- Leveraging Semantic Web technologies for digital interoperability in the Railway domain (2021)
- Optimizing Approximate Membership Metadata in Triple Pattern Fragments for Clients and Servers (2020)
- Comparing a Polling and Push-Based Approach for Live Open Data Interfaces (2020)
- Efficient Live Public Transport Data Sharing for Route Planning on the Web (2020)
- Reflections On: Triple Storage for Random-Access Versioned Querying of RDF Archives (2019)
- Predicting phase durations of traffic lights using live Open Traffic Lights data (2019)
- Decentralized Publication and Consumption of Transfer Footpaths (2019)
- Using an existing website as a queryable low-cost LOD publishing interface (2019)
- Triple Storage for Random-Access Versioned Querying of RDF Archives (2019)
- Demonstration of Comunica, a Web framework for querying heterogeneous Linked Data interfaces (2018)
- GraphQL-LD: Linked Data Querying with GraphQL (2018)
- Comunica: a Modular SPARQL Query Engine for the Web (2018)
- Components.js: A Semantic Dependency Injection Framework (2018)
- OSTRICH: Versioned Random-Access Triple Store (2018)
- Describing configurations of software experiments as Linked Data (2017)
- Decentralised Authoring, Annotations and Notifications for a Read–Write Web with dokieli (2017)
- Interoperability and FAIRness through a novel combination of Web technologies (2017)
- Challenges as Enablers for High Quality Linked Data: Insights from the Semantic Publishing Challenge (2017)
- Decentralized provenance-aware publishing with nanopublications (2016)
- Drawing Conclusions from Linked Data on the Web (2015)