Skip to Main Content (Press Enter)

Logo UNIMORE
  • ×
  • Home
  • Corsi
  • Insegnamenti
  • Professioni
  • Persone
  • Pubblicazioni
  • Strutture
  • Terza Missione
  • Attività
  • Competenze

UNI-FIND
Logo UNIMORE

|

UNI-FIND

unimore.it
  • ×
  • Home
  • Corsi
  • Insegnamenti
  • Professioni
  • Persone
  • Pubblicazioni
  • Strutture
  • Terza Missione
  • Attività
  • Competenze
  1. Pubblicazioni

Handwritten Text Generation from Visual Archetypes

Contributo in Atti di convegno
Data di Pubblicazione:
2023
Citazione:
Handwritten Text Generation from Visual Archetypes / Pippi, V.; Cascianelli, S.; Cucchiara, R.. - 2023-:(2023), pp. 22458-22467. ( 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2023 can 2023) [10.1109/CVPR52729.2023.02151].
Abstract:
Generating synthetic images of handwritten text in a writer-specific style is a challenging task, especially in the case of unseen styles and new words, and even more when these latter contain characters that are rarely encountered during training. While emulating a writer's style has been recently addressed by generative models, the generalization towards rare characters has been disregarded. In this work, we devise a Transformer-based model for Few-Shot styled handwritten text generation and focus on obtaining a robust and informative representation of both the text and the style. In particular, we propose a novel representation of the textual content as a sequence of dense vectors obtained from images of symbols written as standard GNU Unifont glyphs, which can be considered their visual archetypes. This strategy is more suitable for generating characters that, despite having been seen rarely during training, possibly share visual details with the frequently observed ones. As for the style, we obtain a robust representation of unseen writers' calligraphy by exploiting specific pre-training on a large synthetic dataset. Quantitative and qualitative results demonstrate the effectiveness of our proposal in generating words in unseen styles and with rare characters more faithfully than existing approaches relying on independent one-hot encodings of the characters.
Tipologia CRIS:
Relazione in Atti di Convegno
Keywords:
Document analysis and understanding
Elenco autori:
Pippi, V.; Cascianelli, S.; Cucchiara, R.
Autori di Ateneo:
CASCIANELLI Silvia
CUCCHIARA Rita
Link alla scheda completa:
https://iris.unimore.it/handle/11380/1363934
Titolo del libro:
Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition
Pubblicato in:
PROCEEDINGS IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION
Series
  • Utilizzo dei cookie

Realizzato con VIVO | Designed by Cineca | 26.4.5.0