Analyzing How BERT Performs Entity Matching

Article
Publication date:
2022
Citation:
Analyzing How BERT Performs Entity Matching / Paganelli, M., Del Buono, F., Baraldi, A., Guerra, F. - In: PROCEEDINGS OF THE VLDB ENDOWMENT. - ISSN 2150-8097. - 15:8(2022), pp. 1726-1738. (48th International Conference on Very Large Data Bases, VLDB 2022 aus 2022) [10.14778/3529337.3529356].
Abstract:
State-of-the-art Entity Matching (EM) approaches rely on transformer architectures, such as BERT, to generate highly contextualized embeddings of terms. The embeddings are then used to predict whether pairs of entity descriptions refer to the same real-world entity. BERT-based EM models have proven effective, but they act as black boxes for users, who have limited insight into the motivations behind their decisions. In this paper, we perform a multi-faceted analysis of the components of pre-trained and fine-tuned BERT architectures applied to an EM task. The main findings of our extensive experimental evaluation are: (1) the fine-tuning process applied to the EM task mainly modifies the last layers of the BERT components, but in different ways for tokens belonging to descriptions of matching and non-matching entities; (2) the special structure of EM datasets, where records are pairs of entity descriptions, is recognized by BERT; (3) the pair-wise semantic similarity of tokens is not a key piece of knowledge exploited by BERT-based EM models.
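As an illustration of the pair structure mentioned in the abstract, the following is a minimal sketch of how two entity descriptions might be laid out as a BERT-style sentence-pair input (`[CLS] left [SEP] right [SEP]` with segment ids). The function name `serialize_pair`, the example records, and the whitespace tokenization are assumptions made for illustration only; they are not taken from the paper, and a real model would use a WordPiece tokenizer.

```python
from typing import Dict, List, Tuple


def serialize_pair(
    left: Dict[str, str], right: Dict[str, str]
) -> Tuple[List[str], List[int]]:
    """Build a BERT-style sentence-pair input from two entity descriptions.

    Hypothetical helper: attribute values are concatenated and split on
    whitespace, which only approximates a real subword tokenizer.
    """
    left_tokens = " ".join(left.values()).split()
    right_tokens = " ".join(right.values()).split()

    # Standard BERT pair layout: [CLS] left [SEP] right [SEP]
    tokens = ["[CLS]"] + left_tokens + ["[SEP]"] + right_tokens + ["[SEP]"]

    # Segment ids: 0 covers [CLS], the left description and the first [SEP];
    # 1 covers the right description and the final [SEP].
    segment_ids = [0] * (len(left_tokens) + 2) + [1] * (len(right_tokens) + 1)
    return tokens, segment_ids


if __name__ == "__main__":
    # Made-up records, used only to show the layout.
    left_record = {"title": "iphone 12", "brand": "apple"}
    right_record = {"title": "apple iphone 12 64gb"}
    tokens, segment_ids = serialize_pair(left_record, right_record)
    print(tokens)
    print(segment_ids)
```

The segment ids are what let the model tell the two descriptions apart; a classifier on top of the `[CLS]` embedding would then predict match or non-match.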
CRIS type:
Journal article
Author list:
Paganelli, M.; Del Buono, F.; Baraldi, A.; Guerra, F.
University authors:
GUERRA Francesco
PAGANELLI MATTEO
Link to the full record:
https://iris.unimore.it/handle/11380/1291984
Link to the full text:
https://iris.unimore.it//retrieve/handle/11380/1291984/453876/p1726-paganelli.pdf
Published in:
PROCEEDINGS OF THE VLDB ENDOWMENT
Journal