Distinguishing between Instances and Classes in the Wikipedia Taxonomy

Presented at: 5th European Semantic Web Conference (ESWC2008)

by Caecilia Zirn, Vivi Nastase, Michael Strube

Webpage: http://dx.doi.org/10.1007/978-3-540-68234-9_29

This paper presents an automatic method for differentiating between instances and classes in a large-scale taxonomy induced from the Wikipedia category network. The method exploits characteristics of the category names and the structure of the network. The approach we present is the first attempt to make this distinction automatically in a large-scale resource. In contrast, WordNet and Cyc rely on manual annotations. The result of the process is evaluated against ResearchCyc. On the subnetwork shared by our taxonomy and ResearchCyc, we report 84.52% accuracy.

Keywords: classes and instances, taxonomy, Natural Language Processing, NLP, Ontology (Computer Science), Semantic Web, Web Ontology Language


Resource URI on the dog food server: http://data.semanticweb.org/conference/eswc/2008/paper/36
Same as: http://revyu.com/things/eswc-2008-paper-distinguishing-between-taxonomy
Same as: http://semanticweb.org/id/Distinguishing_between_Instances_and_Classes_in_the_Wikipedia_Taxonomy

