Presented at: 5th European Semantic Web Conference (ESWC2008)
by Angela Maduko, Kemafor Anyanwu, Amit Sheth, Paul Schliekelman
Webpage: http://dx.doi.org/10.1007/978-3-540-68234-9_38Graphs are increasingly used to model data in many disciplines. Structure search which matches a query graph against a data graph, is a common information retrieval paradigm for graph structured data. A crucial factor in optimizing such searches is the ability to estimate the frequency of substructures within a query graph. In this work, we present and evaluate two techniques for estimating the frequency of subgraphs from a summary of the data graph. In the first technique, we assume that edge occurrences on edge sequences are position independent and summarize only the most informative dependencies. In the second technique, we prune small subgraphs based on a valuation scheme that blends information about their importance and estimation power. In both techniques, we assume conditional independence to estimate the frequencies of larger subgraphs. We validate the effectiveness of our techniques using experiments on real and synthetic datasets
Keywords: graph summaries, result cardinality estimation, subgraph cardinality estimation, Database Management System, SPARQL, Semantic Web
Resource URI on the dog food server: http://data.semanticweb.org/conference/eswc/2008/paper/330
Same as: http://revyu.com/things/eswc-2008-paper-graph-summaries-estimation
Same as: http://semanticweb.org/id/Graph_Summaries_for_Subgraph_Frequency_Estimation
Explore this resource elsewhere: