Skip to Main Content (Press Enter)

Logo UNIMORE
  • ×
  • Home
  • Corsi
  • Insegnamenti
  • Professioni
  • Persone
  • Pubblicazioni
  • Strutture
  • Terza Missione
  • Attività
  • Competenze

UNI-FIND
Logo UNIMORE

|

UNI-FIND

unimore.it
  • ×
  • Home
  • Corsi
  • Insegnamenti
  • Professioni
  • Persone
  • Pubblicazioni
  • Strutture
  • Terza Missione
  • Attività
  • Competenze
  1. Pubblicazioni

Efficient Implementation of Genetic Algorithms on GP-GPU with Scheduled Persistent CUDA Threads

Contributo in Atti di convegno
Data di Pubblicazione:
2015
Citazione:
Efficient Implementation of Genetic Algorithms on GP-GPU with Scheduled Persistent CUDA Threads / Capodieci, Nicola; Burgio, Paolo. - 2016-:(2015), pp. 6-12. ( 7th International Symposium on Parallel Architectures, Algorithms, and Programming, PAAP 2015 Nanjing (China) 12-14 Dicembre 2015) [10.1109/PAAP.2015.13].
Abstract:
In this paper we present a heavily exploration oriented implementation of genetic algorithms to be executed on graphic processor units (GPUs) that is optimized with our novel mechanism for scheduling GPU-side synchronized jobs that takes inspiration from the concept of persistent threads. Persistent Threads allow an efficient distribution of work loads throughout the GPU so to fully exploit the CUDA (NVIDIA's proprietary Compute Unified Device Architecture) architecture. Our approach (named Scheduled Light Kernel, SLK) uses a specifically designed data structure for issuing sequences of commands from the CPU to the GPU able to minimize CPUGPU communications, exploit streams of concurrent execution of different device side functions within different Streaming Multiprocessors and minimize kernels launch overhead. Results obtained on two completely different experimental settings show that our approach is able to dramatically increase the performance of the tested genetic algorithms compared to the baseline implementation that (while still running on a GPU) does not exploit our proposed approach. Our proposed SLK approach does not require substantial code rewriting and is also compared to newly introduced features in the last CUDA development toolkit, such as nested kernel invocations for dynamic parallelism.
Tipologia CRIS:
Relazione in Atti di Convegno
Keywords:
GP-GPUs, Genetic Algorithms, Persistent Threads, CUDA
Elenco autori:
Capodieci, Nicola; Burgio, Paolo
Autori di Ateneo:
BURGIO PAOLO
CAPODIECI NICOLA
Link alla scheda completa:
https://iris.unimore.it/handle/11380/1129949
Titolo del libro:
Seventh International Symposium on Parallel Architectures, Algorithms, and Programming, PAAP 2015. Proceedings
  • Utilizzo dei cookie

Realizzato con VIVO | Designed by Cineca | 26.5.0.0