Creating and Exploiting Multimodal Annotated Corpora

Presented at: The Sixth International Language Resources and Evaluation Conference (LREC2008)

by Philippe Blache, Roxane Bertrand, Gaëlle Ferré

Webpage: http://www.lrec-conf.org/proceedings/lrec2008/pdf/449_paper.pdf
Webpage: http://www.lrec-conf.org/proceedings/lrec2008/slides/449.pdf
Webpage: http://www.lrec-conf.org/proceedings/lrec2008/summaries/449.html

The paper presents a project of the Laboratoire Parole & Langage which aims at collecting, annotating and exploiting a corpus of spoken French in a multimodal perspective. The project directly meets the present needs in linguistics where a growing number of researchers become aware of the fact that a theory of communication which aims at describing real interactions should take into account the complexity of these interactions. However, in order to take into account such a complexity, linguists should have access to spoken corpora annotated in different fields. The paper presents the annotation schemes used in phonetics, morphology and syntax, prosody, gestuality at the LPL together with the type of linguistic description made from the annotations seen in two examples.

Keywords: Corpus (creation, annotation, etc.), Multimedia annotation and processing, Tools, systems, applications, Linguistics


Resource URI on the dog food server: http://data.semanticweb.org/conference/lrec/2008/papers/449


Explore this resource elsewhere: