Bringing the IPTC News Architecture into the Semantic Web

Presented at: 7th International Semantic Web Conference (ISWC2008)

by Raphael Troncy

Webpage: http://dx.doi.org/10.1007/978-3-540-88564-1_31
Webpage: https://github.com/lidingpku/iswc-archive/raw/master/paper/iswc-2008/53180478.pdf

For easing the exchange of news, the International Press Telecommunication Council (IPTC) has developed the NewsML Architecture (NAR), an XML-based model that is specialized into a number of languages such as NewsML G2 and EventsML G2. As part of this architecture, specific controlled vocabularies, such as the IPTC News Codes, are used to categorize news items together with other industry-standard thesauri. While news is still mainly in the form of text-based stories, these are often illustrated with graphics, images and videos. Media-specific metadata formats, such as EXIF, DIG35 and XMP, are used to describe the media. The use of different metadata formats in a single production process leads to interoperability problems within the news production chain itself. It also excludes linking to existing web knowledge resources and impedes the construction of uniform end-user interfaces for searching and browsing news content. In order to allow these different metadata standards to interoperate within a single information environment, we design an OWL ontology for the IPTC News Architecture, linked with other multimedia metadata standards. We convert the IPTC NewsCodes into a SKOS thesaurus and we demonstrate how the news metadata can then be enriched using natural language processing and multimedia analysis and integrated with existing knowledge already formalized on the Semantic Web. We discuss the method we used for developing the ontology and give rationale for our design decisions. We provide guidelines for re-engineering schemas into ontologies and formalize their implicit semantics. In order to demonstrate the appropriateness of our ontology infrastructure, we present an exploratory environment for searching and browsing news items.

Keywords: multimedia ontology, news ontology, ontology engineering best practices, semantic web browser, Semantic Web


Resource URI on the dog food server: http://data.semanticweb.org/conference/iswc/2008/paper/research/210


Explore this resource elsewhere: