MultiCrawler: A Pipelined Architecture for Crawling and Indexing Semantic Web Data

Presented at: 5th International Semantic Web Conference (ISWC2006)

by Andreas Harth, J├╝rgen Umbrich, Stefan Decker

Webpage: http://dx.doi.org/10.1007/11926078_19
Webpage: http://iswc2006.semanticweb.org/items/Harth2006dq.pdf

The goal of the work presented in this paper is to obtain large amounts of semistructured data from the web. We contrast our approach to conventional web crawlers, and describe and evaluate a five-step pipelined architecture to crawl and index data from both the traditional and the Semantic Web.


Resource URI on the dog food server: http://data.semanticweb.org/conference/iswc/2006/paper-40


Explore this resource elsewhere: