Skip to Main Content (Press Enter)

Logo UNIMORE
  • ×
  • Home
  • Corsi
  • Insegnamenti
  • Professioni
  • Persone
  • Pubblicazioni
  • Strutture
  • Terza Missione
  • Attività
  • Competenze

UNI-FIND
Logo UNIMORE

|

UNI-FIND

unimore.it
  • ×
  • Home
  • Corsi
  • Insegnamenti
  • Professioni
  • Persone
  • Pubblicazioni
  • Strutture
  • Terza Missione
  • Attività
  • Competenze
  1. Pubblicazioni

BigBench workload executed by using Apache Flink

Contributo in Atti di convegno
Data di Pubblicazione:
2017
Citazione:
BigBench workload executed by using Apache Flink / Bergamaschi, Sonia; Gagliardelli, Luca; Simonini, Giovanni; Zhu, Song. - In: PROCEDIA MANUFACTURING. - ISSN 2351-9789. - 11:(2017), pp. 695-702. ( 27th International Conference on Flexible Automation and Intelligent Manufacturing (FAIM) Modena, ITALY JUN 27-30, 2017) [10.1016/j.promfg.2017.07.169].
Abstract:
Many of the challenges that have to be faced in Industry 4.0 involve the management and analysis of huge amount of data (e.g. sensor data management and machine-fault prediction in industrial manufacturing, web-logs analysis in e-commerce). To handle the so-called Big Data management and analysis, a plethora of frameworks has been proposed in the last decade. Many of them are focusing on the parallel processing paradigm, such as MapReduce, Apache Hive, Apache Flink. However, in this jungle of frameworks, the performance evaluation of these technologies is not a trivial task, and strictly depends on the application requirements. The scope of this paper is to compare two of the most employed and promising frameworks to manage big data: Apache Flink and Apache Hive, which are general purpose distributed platforms under the umbrella of the Apache Software Foundation. To evaluate these two frameworks we use the benchmark BigBench, developed for Apache Hive. We re-implemented the most significant queries of Apache Hive BigBench to make them work on Apache Flink, in order to be able to compare the results of the same queries executed on both frameworks. Our results show that Apache Flink, if it is configured well, is able to outperform Apache Hive.
Tipologia CRIS:
Relazione in Atti di Convegno
Keywords:
Big Data Management; Apache Flink; Apache Hive; BigBench; Benchmarking
Elenco autori:
Bergamaschi, Sonia; Gagliardelli, Luca; Simonini, Giovanni; Zhu, Song
Autori di Ateneo:
BERGAMASCHI Sonia
GAGLIARDELLI LUCA
SIMONINI GIOVANNI
Link alla scheda completa:
https://iris.unimore.it/handle/11380/1145249
Link al Full Text:
https://iris.unimore.it//retrieve/handle/11380/1145249/624170/1-s2.0-S235197891730375X-main.pdf
Titolo del libro:
27TH INTERNATIONAL CONFERENCE ON FLEXIBLE AUTOMATION AND INTELLIGENT MANUFACTURING, FAIM2017
Pubblicato in:
PROCEDIA MANUFACTURING
Journal
  • Utilizzo dei cookie

Realizzato con VIVO | Designed by Cineca | 26.5.0.0