+ All Categories
Home > Documents > A Python-oriented environment for climate experiments at ...a High Performance Data Analytics ......

A Python-oriented environment for climate experiments at ...a High Performance Data Analytics ......

Date post: 02-Oct-2020
Category:
Upload: others
View: 2 times
Download: 0 times
Share this document with a friend
9
A Python-oriented environment for climate experiments at scale in the frame of the European Open Science Cloud D. Elia 1,3 , F. Antonio 1 , C. Palazzo 1 , P. Nassisi 1 , S. Bendoukha 2 , R. Kwee- Hinzmann 2 , S. Fiore 1 , T. Weigel 2 , H. Thiemann 2 , and G. Aloisio 1,3 1 Fondazione Centro Euro-Mediterraneo sui Cambiamenti Climatici (CMCC), Lecce, Italy 2 Deutsches Klimarechenzentrum (DKRZ), Hamburg, Germany 3 University of Salento, Lecce, Italy EGU General Assembly 2020 Online | 4-8 May 2020 D806 | EGU2020-17031
Transcript
Page 1: A Python-oriented environment for climate experiments at ...a High Performance Data Analytics ... (e.g. in Jupyter Notebooks) Programmatic access through the PyOphidia class Two modules

A Python-oriented environment for climate experiments at scale

in the frame of the European Open Science Cloud

D. Elia1,3, F. Antonio1, C. Palazzo1, P. Nassisi1, S. Bendoukha2, R. Kwee-Hinzmann2, S. Fiore1, T. Weigel2, H. Thiemann2, and G. Aloisio1,3 1 Fondazione Centro Euro-Mediterraneo sui Cambiamenti Climatici (CMCC), Lecce, Italy 2 Deutsches Klimarechenzentrum (DKRZ), Hamburg, Germany 3 University of Salento, Lecce, Italy

EGU General Assembly 2020 Online | 4-8 May 2020

D806 | EGU2020-17031

Page 2: A Python-oriented environment for climate experiments at ...a High Performance Data Analytics ... (e.g. in Jupyter Notebooks) Programmatic access through the PyOphidia class Two modules

ENES Climate Analytics Service (ECAS)

✔  The ENES Climate Analytics Service (ECAS), proposed by CMCC & DKRZ in the EU H2020 EOSC-Hub project, supports climate data analysis

✔  It is one of the EOSC-Hub Thematic Services

✔  ECAS builds on top of the Ophidia HPDA framework integrated with components from INDIGO-DataCloud, EUDAT and EGI

TheEuropeanCommissionlaunchedtheEuropeanOpenScienceCloud(EOSC)Ini9a9vetocapitaliseonthedatarevolu9on.EOSCwillprovideEuropeanscience,industryandpublicauthori9eswithworld-classdigitalinfrastructurethatbringstateoftheartcompu9nganddatastoragecapacitytothefinger9psofanyscien9stsandengineerintheEU.

https://www.eosc-hub.eu/services/ENES%20Climate%20Analytics%20Service

Page 3: A Python-oriented environment for climate experiments at ...a High Performance Data Analytics ... (e.g. in Jupyter Notebooks) Programmatic access through the PyOphidia class Two modules

ECASLab: a Python environment for data analysis

ECASLab provides a user-friendly environment for scientific analysis based on:

✔  The ECAS integrated service

✔  A JupyterHub instance providing a graphical environment for user’s experiments

✔  Bundled with a wide set of Python scientific modules for data manipulation, analysis and visualization, such as PyOphidia, NumPy, Pandas, Dask, Matplotlib, basemap, Cartopy

✔  A set of ECAS usage example notebooks (https://github.com/ECAS-Lab/ecas-notebooks) Two major instances are hosted by:

ü  CMCC https://ecaslab.cmcc.it

ü  DKRZ https://ecaslab.dkrz.de

Page 4: A Python-oriented environment for climate experiments at ...a High Performance Data Analytics ... (e.g. in Jupyter Notebooks) Programmatic access through the PyOphidia class Two modules

The Ophidia project Ophidia (http://ophidia.cmcc.it) is a  CMCC Foundation  research project addressing data challenges for eScience1 It provides: ✔  a High Performance Data Analytics (HPDA) framework for multi-dimensional scientific

data joining HPC paradigms with scientific data analytics approaches

✔  in-memory and server-side data analysis exploiting parallel computing techniques and database approaches

✔  a multi-dimensional, array-based, storage model and partitioning schema for scientific data leveraging the datacube abstraction

✔  end-to-end mechanisms to support complex experiments and large workflows on scientific datacubes, primarily in climate domain

1. S. Fiore, A. D’Anca, C. Palazzo, I. T. Foster, D. N. Williams, G. Aloisio, “Ophidia: toward big data analytics for escience”, ICCS 2013

Page 5: A Python-oriented environment for climate experiments at ...a High Performance Data Analytics ... (e.g. in Jupyter Notebooks) Programmatic access through the PyOphidia class Two modules

✔  PyOphidia provides a Python interface to submit commands to the Ophidia Server and to retrieve/deserialize the results (e.g. in Jupyter Notebooks)

Programmatic access through the PyOphidia class

✔  Two modules available:

✔  Client class: supports the submissions of Ophidia commands and workflows, as well as the management of session

✔  Cube class: provides the datacube type abstraction and the methods to manipulate, process and get information on cubes objects

https://github.com/OphidiaBigData/PyOphidia https://pypi.org/project/PyOphidia/ https://anaconda.org/conda-forge/pyophidia

Page 6: A Python-oriented environment for climate experiments at ...a High Performance Data Analytics ... (e.g. in Jupyter Notebooks) Programmatic access through the PyOphidia class Two modules

ECAS on the EGI Federated Cloud Infrastructure

✔  A ready-to-use ECAS single-node VMI is available from the EGI AppDB https://appdb.egi.eu/store/vappliance/ecas

✔  A multi-node ECAS cluster can be

dinamically provisioned on the EGI FedCloud through the Elastic Cloud Computing Cluster (EC3)

ECAS has been integrated into the EGI FedCloud, considering two scenarios1:

1. Elastic deployment of ECAS on EGI: https://www.egi.eu/about/newsletters/elastic-deployment-of-ecas-on-egi/

Page 7: A Python-oriented environment for climate experiments at ...a High Performance Data Analytics ... (e.g. in Jupyter Notebooks) Programmatic access through the PyOphidia class Two modules

A complete environment for climate experiments ECASLab provides a complete environment for supporting scientist in their daily research activities with a focus on those from the climate change domain

✔  It represents a single entrypoint to analysis tools, scientific datasets (e.g., from ESGF data archive) and computing resources

✔  It provides the capabilities for the implementation and execution of both interactive and complex experiments (workflows), such as multi-model CMIP-based data analysis1

ECAS is also one of the compute services made available to climate scientists by the EU H2020 IS-ENES3 project

1. S. Fiore, D. Elia, C. Palazzo, A. D’Anca, F. Antonio, D. N. Williams, I. Foster, G. Aloisio, “Towards an Open (Data) Science Analytics-Hub for Reproducible multi-model Climate Analysis at Scale”, 2018 IEEE International Conference on Big Data (Big Data)

Page 8: A Python-oriented environment for climate experiments at ...a High Performance Data Analytics ... (e.g. in Jupyter Notebooks) Programmatic access through the PyOphidia class Two modules

Free access to computing platforms for multi-model climate data analyses for CMIP6 and CORDEX !

The European Research Infrastructure for climate modelling IS-ENES offers a new charge-free server-side computing service:

✔  Reduce data transfer issues ✔  Direct access to petabytes of model data ✔  Run your own scripts and Jupyter notebooks ✔  Speed up the computational analysis Information on the call: https://portal.enes.org/data/data-metadata-service/analysis-platforms

Learn more at EGU sessions:

✔  CL5.7 Climate Services - Underpinning Science , 05 May, 10:45–12:30, EGU 2020-19121: https://meetingorganizer.copernicus.org/EGU2020/session/36737

✔  CL2.6 Detecting and attributing climate change: trends, extreme events, and impacts, 07 May, 08:30–10:15, EGU 2020-19340: https://meetingorganizer.copernicus.org/EGU2020/session/36768

IS-ENES3 is a project funded by the European Union’s Horizon 2020 research and innovation programme under grant agreement No 824084

Page 9: A Python-oriented environment for climate experiments at ...a High Performance Data Analytics ... (e.g. in Jupyter Notebooks) Programmatic access through the PyOphidia class Two modules

Acknowledgment

These activities are supported in part by EOC-Hub and IS-ENES3 projects:

✔  EOSC-hub receives funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No. 777536

✔  IS-ENES3 is a project funded by the European Union’s Horizon 2020 research and innovation programme under grant agreement No. 824084


Recommended