Applying big data paradigms to a large scale scientific workflow: Lessons learned and future directions

Author(s)
Kropf, Peter 
Institut d'informatique 
Lapin, Andrei 
Institut d'informatique 
Carretero, Jesus
Caíno-Lores, Silvina
Publication date
2020-06-01
In
Future Gener. Comput. Syst.
Vol.
110
First page
440
Last page
452
Peer reviewed
Yes
Keywords
  • Scientific workflows
  • Big data
  • Cloud computing
  • Apache Spark
  • Hydrology
Abstract
The increasing amount of data associated with the execution of scientific workflows has raised awareness
of their shift towards parallel, data-intensive problems. In this paper, we report our experience combining
traditional high-performance computing and grid-based approaches with Big Data analytics
paradigms in the context of scientific ensemble workflows. Our goal was to assess and discuss the
suitability of such data-oriented mechanisms for production-ready workflows, especially in terms of
scalability. We focused on two key elements of the Big Data ecosystem: the data-centric programming
model, and the underlying infrastructure that integrates storage and computation in each node. We
experimented with a representative MPI-based iterative workflow from the hydrology domain, EnKFHGS,
which we re-implemented using the Spark data analysis framework. We conducted experiments on
a local cluster, a private cloud running OpenNebula, and the Amazon Elastic Compute Cloud (Amazon EC2).
We analysed the results to synthesize the lessons learned from this experience and to discuss
promising directions for further research.
Identifiers
https://libra.unine.ch/handle/123456789/28360
10.1016/j.future.2018.04.014
Publication type
journal article
File(s) to download
 main article: 2020-06-15_258_2814.pdf (23.56 KB)