Skip to Main Content (Press Enter)

Logo UNIMORE
  • ×
  • Home
  • Corsi
  • Insegnamenti
  • Professioni
  • Persone
  • Pubblicazioni
  • Strutture
  • Terza Missione
  • Attività
  • Competenze

UNI-FIND
Logo UNIMORE

|

UNI-FIND

unimore.it
  • ×
  • Home
  • Corsi
  • Insegnamenti
  • Professioni
  • Persone
  • Pubblicazioni
  • Strutture
  • Terza Missione
  • Attività
  • Competenze
  1. Pubblicazioni

Benchmarking BERT-based Models for Latin: A Case Study on Biblical References in Ancient Christian Literature

Contributo in Atti di convegno
Data di Pubblicazione:
2025
Citazione:
Benchmarking BERT-based Models for Latin: A Case Study on Biblical References in Ancient Christian Literature / Caffagni, Davide; Cocchi, Federico; Mambelli, Anna; Tutrone, Fabio; Zanella, Marco; Cornia, Marcella; Cucchiara, Rita. - 3937:(2025). ( 21st Conference on Information and Research Science Connecting to Digital and Library Science, IRCDL 2025 Udine, Italy February 20-21).
Abstract:
Transformer-based language models like BERT have revolutionized Natural Language Processing (NLP) research, but their application to historical languages remains underexplored. This paper investigates the adaptation of BERT-based embedding models for Latin, a language central to the study of the sacred texts of Christianity. Focusing on Jerome’s Vulgate, pre-Vulgate Latin translations of the Bible, and patristic commentaries such as Augustine’s De Genesi ad litteram, we address the challenges posed by Latin’s complex syntax, specialized vocabulary, and historical variations at the orthographic, morphological, and semantic levels. In particular, we propose fine-tuning existing BERT-based embedding models on annotated Latin corpora, using self-generated hard negatives to improve performance in detecting biblical references in early Christian literature in Latin. Experimental results demonstrate the ability of BERT-based models to identify citations of and allusions to the Bible(s) in ancient Christian commentaries while highlighting the complexities and challenges of this field. By integrating NLP techniques with humanistic expertise, this work provides a case study on intertextual analysis in Latin patristic works. It underscores the transformative potential of interdisciplinary approaches, advancing computational tools for sacred text studies and bridging the gap between philology and computational analysis.
Tipologia CRIS:
Relazione in Atti di Convegno
Keywords:
Ancient Languages; Bible; Ancient Christian Literature; Intertextuality; Sentence Embeddings; Sentence Similarity Search
Elenco autori:
Caffagni, Davide; Cocchi, Federico; Mambelli, Anna; Tutrone, Fabio; Zanella, Marco; Cornia, Marcella; Cucchiara, Rita
Autori di Ateneo:
CAFFAGNI DAVIDE
CORNIA MARCELLA
CUCCHIARA Rita
MAMBELLI Anna
Link alla scheda completa:
https://iris.unimore.it/handle/11380/1371269
Link al Full Text:
https://iris.unimore.it//retrieve/handle/11380/1371269/937377/Mambelli%20Anna_2025_IRCDL_Latin_Embeddings.pdf
Titolo del libro:
Proceedings of the 21st Conference on Information and Research Science Connecting to Digital and Library Science, IRCDL 2025
Pubblicato in:
CEUR WORKSHOP PROCEEDINGS
Journal
CEUR WORKSHOP PROCEEDINGS
Series
  • Dati Generali

Dati Generali

URL

https://ceur-ws.org/Vol-3937/short11.pdf
  • Utilizzo dei cookie

Realizzato con VIVO | Designed by Cineca | 26.5.0.0