Web-scale Provenance Reconstruction of Implicit Information Diffusion on Social Media
Fast, massive, and viral data diffused on social media affects a large share of the online population, and thus, the (prospective) information diffusion mechanisms behind it are of great interest to researchers. The (retrospective) provenance of such data is equally important because it contributes to the understanding of the relevance and trustworthiness of the information. Furthermore, computing provenance in a timely way is crucial for particular use cases and practitioners, such as online journalists that promptly need to assess particular pieces of information. Social media currently provide insufficient mechanisms for provenance tracking, publication and generation, while state-of-the-art on social media research focuses mainly on explicit diffusion mechanisms (like retweets in Twitter or reshares in Facebook).The implicit diffusion mechanisms remain understudied due to the difficulties of being captured and properly understood. From a technical side, the state of the art for provenance reconstruction evaluates small datasets after the fact, sidestepping requirements for scale and speed of current social media data. In this paper, we investigate the mechanisms of implicit information diffusion by computing its fine-grained provenance. We prove that explicit mechanisms are insufficient to capture influence and our analysis unravels a significant part of implicit interactions and influence in social media. Our approach works incrementally and can be scaled up to cover a truly Web-scale scenario like major events. The results show that (on a single machine) we can process datasets consisting of up to several millions of messages at rates that cover bursty behaviour, without compromising result quality. By doing that, we provide to online journalists and social media users in general, fine grained provenance reconstruction which sheds lights on implicit interactions not captured by social media providers. These results are provided in an online fashion which also allows for fast relevance and trustworthiness assessment.
Published in 2018 in Distributed and Parallel Databases.
- provenance
- social media
- information diffusion
- Web
- trust
- publication
- research
Read this article online
- Request a digital copy of this article.
- Comment on this article.
Cite this article in your work
Cite this article easily using its BibTeX entry:
@article{taxidou_dapd_2018,
author = {Taxidou, Io and Lieber, Sven and Fischer, Peter and De Nies, Tom and Verborgh, Ruben},
title = {Web-scale Provenance Reconstruction of Implicit Information Diffusion on Social Media},
journal = {Distributed and Parallel Databases},
volume = 36,
number = 1,
pages = {47--79},
issn = {0926-8782},
publisher = {Springer},
year = 2018,
month = mar,
doi = {10.1007/s10619-017-7211-3},
}
Alternatively, pick a reference of your choice below:
- ACM
- Io Taxidou, Sven Lieber, Peter Fischer, Tom De Nies, and Ruben Verborgh. 2018. Web-scale Provenance Reconstruction of Implicit Information Diffusion on Social Media. Distributed and Parallel Databases 36, 1 (March 2018), 47–79.
- APA
- Taxidou, I., Lieber, S., Fischer, P., De Nies, T., & Verborgh, R. (2018). Web-scale Provenance Reconstruction of Implicit Information Diffusion on Social Media. Distributed and Parallel Databases, 36(1), 47–79.
- IEEE
- I. Taxidou, S. Lieber, P. Fischer, T. De Nies, and R. Verborgh, “Web-scale Provenance Reconstruction of Implicit Information Diffusion on Social Media,” Distributed and Parallel Databases, vol. 36, no. 1, pp. 47–79, Mar. 2018.
- LNCS
- Taxidou, I., Lieber, S., Fischer, P., De Nies, T., Verborgh, R.: Web-scale Provenance Reconstruction of Implicit Information Diffusion on Social Media. Distributed and Parallel Databases. 36, 47–79 (2018).
- MLA
- Taxidou, Io, et al. “Web-Scale Provenance Reconstruction of Implicit Information Diffusion on Social Media.” Distributed and Parallel Databases, vol. 36, no. 1, Springer, Mar. 2018, pp. 47–79.
Discuss this article
- Discover all publications by Ruben Verborgh.
- Find related articles on Google Scholar.
- Post your questions or comments below.