Context Semantic Analysis: A Knowledge-Based Technique for Computing Inter-document Similarity
Conference Paper
Publication Date:
2016
Short description:
Context Semantic Analysis: A Knowledge-Based Technique for Computing Inter-document Similarity / Bergamaschi, Sonia; Beneventano, Domenico; Benedetti, Fabio. - 9939:(2016), pp. 164-178. ( 9th International Conference on Similarity Search and Applications, SISAP 2016 Tokyo, Japan October 24-26, 2016) [10.1007/978-3-319-46759-7_13].
abstract:
We propose a novel knowledge-based technique for inter-document similarity, called Context Semantic Analysis (CSA). Several specialized approaches built on top of specific knowledge base (e.g. Wikipedia) exist in literature but CSA differs from them because it is designed to be portable to any RDF knowledge base. Our technique relies on a generic RDF knowledge base (e.g. DBpedia and Wikidata) to extract from it a vector able to represent the context of a document. We show how such a Semantic Context Vector can be effectively exploited to compute inter-document similarity. Experimental results show that our general technique outperforms baselines built on top of traditional methods, and achieves a performance similar to the ones of specialized methods.
Iris type:
Relazione in Atti di Convegno
Keywords:
Knowledge graph Knowledge base Inter-document similarity Similarity measures
List of contributors:
Bergamaschi, Sonia; Beneventano, Domenico; Benedetti, Fabio
Book title:
Similarity Search and Applications
Published in: