High-performance Computing Applied to Semantic Web Databases

Presented at: 8th Extended Semantic Web Conference (ESWC2011)

by Eric Goodman, Edward Jimenez, David Mizell, Sinan al-Saffar, Bob Adolf, David Haglin

To date, the application of high-performance computing resources to Semantic Web data has largely focused on commodity hardware and distributed-memory platforms. In this paper we make the case that more specialized hardware can offer superior scaling and close to an order of magnitude improvement in performance. In particular, we examine the Cray XMT. Its key characteristics, a large global shared memory and processors with a memory-latency-tolerant design, offer an environment conducive to programming for the Semantic Web and have engendered results that far surpass the current state of the art. We examine three fundamental pieces requisite for a fully functioning semantic database: dictionary encoding, RDFS inference, and query processing. We show scaling up to 512 processors (the largest configuration we had available) and the ability to process 20 billion triples completely in memory.

Keywords: Cray XMT, Dictionary Encoding, RDFS Inference, SPARQL, Semantic Web

Resource URI on the dog food server: http://data.semanticweb.org/conference/eswc/2011/paper/semantic-data-management/8
