Fast and lightweight support for nested parallelism on cluster-based embedded many-cores
Contributo in Atti di convegno
Data di Pubblicazione:
2012
Citazione:
Fast and lightweight support for nested parallelism on cluster-based embedded many-cores / Marongiu, A., Burgio, P., Benini, L.. - STAMPA. - (2012), pp. 105-110. (15th Design, Automation and Test in Europe Conference and Exhibition, DATE 2012 Dresden, deu 12-16 March 2012) [10.1109/DATE.2012.6176441].
Abstract:
Several recent many-core accelerators have been architected as fabrics of tightly-coupled shared memory clusters. A hierarchical interconnection system is used – with a crossbarlike medium inside each cluster and a network-on-chip (NoC) at the global level – which make memory operations nonuniform (NUMA). Nested parallelism represents a powerful programming abstraction for these architectures, where a first level of parallelism can be used to distribute coarse-grained tasks to clusters, and additional levels of fine-grained parallelism can be distributed to processors within a cluster. This paper presents a lightweight and highly optimized support for nested parallelism on cluster-based embedded many-cores. We assess the costs to enable multi-level parallelization and demonstrate that our techniques allow to extract high degrees of parallelism.
Tipologia CRIS:
Relazione in Atti di Convegno
Keywords:
MANY-CORE EMBEDDED SYSTEMS; SHARED MEMORY EMBEDDED SYSTEMS; OPENMP; PROGRAMMING MODELS; SYNCHRONIZATION; NESTED PARALLELISM; CLUSTERED ARCHITECTURES
Elenco autori:
Marongiu, Andrea; Burgio, Paolo; Benini, Luca
Link alla scheda completa:
Titolo del libro:
Proceedings of Design, Automation & Test in Europe Conference & Exhibition (DATE), 2012
Pubblicato in: