Crowdsourcing Linked Data Quality Assessment

Presented at: The 12th International Semantic Web Conference (ISWC2013)

by Maribel Acosta, Amrapali Zaveri, Elena Simperl, Dimitris Kontokostas, Sören Auer, Jens Lehmann

Webpage: http://www.springer.com/computer/ai/book/978-3-642-41337-7
Webpage: https://github.com/lidingpku/iswc-archive/raw/master/paper/iswc-2013/82190257-crowdsourcing-linked-data-quality-assessment.pdf

In this paper we look into the use of crowdsourcing as a means to handle Linked Data quality problems that are challenging to be solved automatically. We analyzed the most common errors encountered in Linked Data sources and classified them according to the extent to which they are likely to be amenable to a specific form of crowdsourcing. Based on this analysis, we implemented a quality assessment methodology for Linked Data that leverages the wisdom of the crowds in different ways: (i) a contest targeting an expert crowd of researchers and Linked Data enthusiasts; complemented by (ii) paid microtasks published on Amazon Mechanical Turk. We empirically evaluated how this methodology could efficiently spot quality issues in DBpedia. We also investigated how the contributions of the two types of crowds could be optimally integrated into Linked Data curation processes. The results show that the two styles of crowdsourcing are complementary and that crowdsourcing-enabled quality assessment is a promising and affordable way to enhance the quality of Linked Data.

Crowdsourcing Linked Data Quality Assessment was presented at this event.


Resource URI on the dog food server: http://data.semanticweb.org/conference/iswc/2013/proceedings-2/paper-17


Explore this resource elsewhere: