Skip to Main Content (Press Enter)

Logo UNIMORE
  • ×
  • Home
  • Corsi
  • Insegnamenti
  • Professioni
  • Persone
  • Pubblicazioni
  • Strutture
  • Terza Missione
  • Attività
  • Competenze

UNI-FIND
Logo UNIMORE

|

UNI-FIND

unimore.it
  • ×
  • Home
  • Corsi
  • Insegnamenti
  • Professioni
  • Persone
  • Pubblicazioni
  • Strutture
  • Terza Missione
  • Attività
  • Competenze
  1. Pubblicazioni

Towards Progressive Search-driven Entity Resolution

Contributo in Atti di convegno
Data di Pubblicazione:
2018
Citazione:
Towards Progressive Search-driven Entity Resolution / Pietrangelo, A.; Simonini, G.; Bergamaschi, S.; Koumarelas, I.; Naumann, F.. - 2161:(2018). ( 26th Italian Symposium on Advanced Database Systems, SEBD 2018 Ethra Reserve, ita 2018).
Abstract:
Keyword-search systems for databases aim to answer a user query composed of a few terms with a ranked list of records. They are powerful and easy-to-use data exploration tools for a wide range of contexts. For instance, given a product database gathered scraping e-commerce websites, these systems enable even non-technical users to explore the item set (e.g., to check whether it contains certain products or not, or to discover the price of an item). However, if the database contains dirty records (i.e., incomplete and duplicated records), a pre-processing step to clean the data is required. One fundamental data cleaning step is Entity Resolution, i.e., the task of identifying and fusing together all the records that refer to the same real-word entity. This task is typically executed on the whole data, independently of: (i) the portion of the entities that a user may indicate through keywords, and (ii) the order priority that a user might express through an order by clause. This paper describes a first step to solve the problem of progressive search-driven Entity Resolution: resolving all the entities described by a user through a handful of keywords, progressively (according to an order by clause). We discuss the features of our method, named SearchER and showcase some examples of keyword queries on two real-world datasets obtained with a demonstrative prototype that we have built.
Tipologia CRIS:
Relazione in Atti di Convegno
Keywords:
Keyword search · Entity Resolution · Data Cleaning · Pay-as-you-go · Query-driven Data Integration
Elenco autori:
Pietrangelo, A.; Simonini, G.; Bergamaschi, S.; Koumarelas, I.; Naumann, F.
Autori di Ateneo:
BERGAMASCHI Sonia
SIMONINI GIOVANNI
Link alla scheda completa:
https://iris.unimore.it/handle/11380/1200661
Titolo del libro:
CEUR Workshop Proceedings
Pubblicato in:
CEUR WORKSHOP PROCEEDINGS
Journal
CEUR WORKSHOP PROCEEDINGS
Series
  • Utilizzo dei cookie

Realizzato con VIVO | Designed by Cineca | 26.5.0.0