Mind Your Metadata: Exploiting Semantics for Configuration, Adaptation, and Provenance in Scientific Workflows

Presented at: 10th International Semantic Web Conference (ISWC2011)

by Yolanda Gil, Pedro Szekely, Sandra Villamizar, Thomas Harmon, Varun Ratnakar, Shubham Gupta, Maria Muslea, Fabio Silva, Craig Knoblock

Scientific metadata containing semantic descriptions of scientific data is expensive to capture and is typically not used across entire data analytic processes. We present an approach where semantic metadata is generated as scientific data is being prepared, and then subsequently used to configure models and to customize them to the data. The metadata captured includes sensor descriptions, data characteristics, data types, and process documentation. This metadata is then used in a workflow system to select analytic models dynamically and to set up model parameters automatically. In addition, all aspects of data processing are documented, and the system is able to generate extensive provenance records for new data products based on the metadata. As a result, the system can dynamically select analytic models based on the metadata properties of the data it is processing, generating more accurate results. We show results in analyzing stream metabolism for watershed ecosystem management.

Mind Your Metadata: Exploiting Semantics for Configuration, Adaptation, and Provenance in Scientific Workflows was presented at this event.

Keywords: Semantic Web


Resource URI on the dog food server: http://data.semanticweb.org/conference/iswc/2011/paper/semantic-web-in-use/2


Explore this resource elsewhere: