OXPath: Little Language, Little Memory, Great Value

Presented at: 20th International World Wide Web Conference (WWW2011)

by Andrew Sellers, Tim Furche, Georg Gottlob, Giovanni Grasso, Christian Schallhart

Webpage: http://wwwconference.org/www2011/proceeding/companion/p261.pdf

Data about everything is readily available on the web-but often only accessible through elaborate user interactions. For automated decision support, extracting that data is essential, but infeasible with existing heavy-weight data extraction systems. In this demonstration, we present OXPath, a novel approach to web extraction, with a system that supports informed job selection and integrates information from several different web sites. By carefully extending XPath, OXPath exploits its familiarity and provides a light-weight interface, which is easy to use and embed. We highlight how OXPath guarantees optimal page buffering, storing only a constant number of pages for non-recursive queries.

OXPath: Little Language, Little Memory, Great Value was presented at this event.

Keywords: World Wide Web


Resource URI on the dog food server: http://data.semanticweb.org/conference/www/2011/demo/oxpath-little-language-little-memory-great-value


Explore this resource elsewhere: