Digital Maktaba Project: Proposing a Metadata-Driven Framework for Arabic Library Digitization
Contribution in conference proceedings
Publication date:
2025
Citation:
Digital Maktaba Project: Proposing a Metadata-Driven Framework for Arabic Library Digitization / El Ganadi, Amina; Gagliardelli, Luca; Aftar, Sania; Ruozzi, Federico. - 3937:(2025). (IRCDL 2025: 21st Conference on Information and Research Science Connecting to Digital and Library Science, Udine, Italy, February 20-21, 2025).
Abstract:
The rapid digitization of cultural heritage has underscored the need for robust digital libraries, particularly for underrepresented languages such as Arabic and Persian. This paper describes the methodologies and challenges involved in developing a metadata-driven Arabic digital library that uses bibliographic metadata extracted from the Diamond catalogue. It explores metadata schemas such as Dublin Core and integrates text recognition technologies and preservation strategies to address accessibility, scholarly use, and the long-term preservation of Arabic-script texts.
The paper examines specific challenges of processing Arabic script, including calligraphy, diacritics, and ligatures, and introduces solutions such as using frontispiece images to train OCR systems. It also discusses how integrated metadata could both enhance text recognition and improve user engagement by enabling refined search and better resource discovery. Finally, the paper outlines future directions for expanding metadata frameworks to ensure interoperability and the long-term preservation of cultural heritage.
CRIS type:
Paper in conference proceedings
Keywords:
Arabic Digital Library; Bibliographic Metadata; Cultural Heritage; Digital Maktaba Project; Digitization; Document Analysis; Natural Language Processing; OCR
Author list:
El Ganadi, Amina; Gagliardelli, Luca; Aftar, Sania; Ruozzi, Federico
Book title:
Proceedings of the 21st Conference on Information and Research Science Connecting to Digital and Library Science
Published in: