Easy spark

Y. van den Wildenberg, W.W.L. Nuijten, O. Papapetrou

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › Academic › peer-review

Abstract

Today's data deluge calls for novel, scalable data handling and processing solutions. Spark has emerged as a popular distributed in-memory computing engine for processing and analysing large amounts of data in parallel. However, parallel processing pipelines are designed in a fundamentally different way from traditional programs, and hence most programmers are either unable to start using Spark or do not utilise it to its full potential. This study describes an easier entry point into Spark. We design and implement a GUI that allows any programmer with knowledge of a standard programming language (e.g., Python or Java) to write Spark applications effortlessly and interactively, and to submit them to large clusters for execution.
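To illustrate the difference in design that the abstract refers to, the following is a minimal sketch in plain Python, not Spark code and not the GUI described in the paper. It contrasts a conventional loop with a pipeline built from independent per-partition work and an associative merge, which is the general shape a distributed engine can spread across machines. The function names, the sample data, and the split into two partitions are assumptions made for illustration only.

```python
from collections import Counter
from functools import reduce

# Hypothetical sample input; any list of text lines would do.
lines = ["spark makes big data simple", "big data needs big clusters"]


def word_count_loop(lines):
    """Traditional style: one loop mutating a single shared dictionary."""
    counts = {}
    for line in lines:
        for word in line.split():
            counts[word] = counts.get(word, 0) + 1
    return counts


def count_partition(partition):
    """Count words in one partition; needs no state from other partitions."""
    return Counter(word for line in partition for word in line.split())


def word_count_pipeline(partitions):
    """Pipeline style: map each partition independently, then merge."""
    partials = map(count_partition, partitions)  # each call is independent
    # Counter addition is associative, so partial results can be merged
    # in any grouping.
    return dict(reduce(lambda a, b: a + b, partials, Counter()))


# Assumed partitioning: one line per partition.
partitions = [lines[:1], lines[1:]]
loop_result = word_count_loop(lines)
pipeline_result = word_count_pipeline(partitions)
```

Both functions return the same counts. Only the second form separates the work into pieces that do not share state, which is the property a parallel engine relies on.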

Original language: English
Title: Proceedings of the Workshops of the EDBT/ICDT 2021 Joint Conference, Nicosia, Cyprus, March 23, 2021
Editors: Constantinos Costa, Evaggelika Pitoura
Publisher: CEUR-WS.org
Number of pages: 6
Status: Published - 2021
Event: 2021 Workshops of the EDBT/ICDT Joint Conference, EDBT/ICDT-WS 2021 - Nicosia, Cyprus
Duration: 23 Mar 2021 → …

Publication series

Name: CEUR Workshop Proceedings
Publisher: CEUR-WS.org
Volume: 2841
ISSN (print): 1613-0073

Conference

Conference: 2021 Workshops of the EDBT/ICDT Joint Conference, EDBT/ICDT-WS 2021
Country: Cyprus
City: Nicosia
Period: 23/03/21 → …

Bibliographical note

Publisher Copyright:
© 2021 Copyright for this paper by its author(s).

Copyright:
Copyright 2021 Elsevier B.V., All rights reserved.
