Automatic Normalization and Annotation for Discovering Semantic Mappings
Contributo in Atti di convegno
Data di Pubblicazione:
2011
Citazione:
Automatic Normalization and Annotation for Discovering Semantic Mappings / Bergamaschi, Sonia; Beneventano, Domenico; Po, Laura; Sorrentino, Serena. - STAMPA. - 6585:(2011), pp. 85-100. ( Workshop on Search Computing - Trends and Developments, SeCo 2010 Como, Italy May 25-31, 2010) [10.1007/978-3-642-19668-3_8].
Abstract:
Schema matching is the problem of finding relationships among concepts across heterogeneous data sources (heterogeneous in format and in structure). Starting from the “hidden meaning” associated to schema labels (i.e. class/attribute names) it is possible to discover relationships among the elements of different schemata. Lexical annotation (i.e. annotation w.r.t. a thesaurus/lexical resource) helps in associating a “meaning” to schema labels. However, accuracy of semi-automatic lexical annotation methods on real-world schemata suffers from the abundance of non-dictionary words such as compound nouns and word abbreviations. In this work, we address this problem by proposing a method to perform schema labels normalization which increases the number of comparable labels. Unlike other solutions, the method semi-automatically expands abbreviations and annotates compound terms, without a minimal manual effort. We empirically prove that our normalization method helps in the identification of similarities among schema elements of different data sources, thus improving schema matching accuracy.
Tipologia CRIS:
Relazione in Atti di Convegno
Keywords:
lexical relationships; probabilistic annotation; word sense disambiguation; label normalization; semantic resource framework
Elenco autori:
Bergamaschi, Sonia; Beneventano, Domenico; Po, Laura; Sorrentino, Serena
Link alla scheda completa:
Link al Full Text:
Titolo del libro:
Search Computing: Trends and Developments
Pubblicato in: