60 likes | 81 Views
Enter the epoch of Precision Cosmology with Euclid and Planck missions mapping the Universe with unprecedented accuracy. Large computer simulations are crucial, but face challenges in data storage and distribution. SIMCODE project tackles this with cutting-edge technologies for long-term data preservation and efficient scientific exploitation. With data formats like HDF5 and FITS, simulations will be used by a broad community for up to 10 years. The SIMCODE-DS pilot focuses on testing data preservation strategies for computational cosmology community's future needs. For detailed information, visit www.marcobaldi.it/SIMCODE.
E N D
PI: Marco Baldi (marco.baldi5@unibo.it) Presenter: Matteo Nori (matteo.nori3@unibo.it) Physics and Astronomy Department Bologna University SIMCODE-DS
Next decade: Euclid Last decade: Planck Simulations for Precision Cosmology ΛCDM cDE-CDM fNL(k) We are about to enter the epoch of “Precision Cosmology” (testing the cosmological model at ~1% precision) In the next decade, costly space missions such as the Euclid satellite will map the Universe with unprecedented accuracy mν ≠ 0 f(R) gravity Precision Cosmology NEEDS large computer simulations:
We need large storage facilities! In the next few years we will need to develop simulations with more than a trillion (i.e. 1012!!!) particles: Euclid Full Scale Runs PROBLEMS: - Where to preserve such big data for such a long time? - How to distribute/transfer/manage efficiently these data? - How to allow a full scientific exploitation of the simulations? NEED of a dedicated infrastructure with cutting-edge technologies! • One snapshot of a Euclid Full Scale simulation: ~180-200 TB • The complete simulation: a few PB • Data can have a variety of formats, including HDF5 and FITS Data should be used for scientific exploitation by a broad community and for an extended period of time (~5-10 yrs)
SIMCODE is a computational cosmology project funded by the Italian government through the High-Qualification programme SIR with a 1/2 Million Euro grant for a period of 3 years. The SIMCODE project SIMCODE will perform large cosmological simulations and investigate their observational footprints. SIMCODE therefore represents an intermediate step towards the development of pipelines and tools that will be employed in a few years for the next generation of simulations. The present SIMCODE-DS (Data Storage) pilot aims at testing possible strategies and tools for data preservation and distribution in view of these long term needs of the computational cosmology community.
Data production Data Sharing and distribution Long-term data storage The SIMCODE work flow recover data subset raw data: snapshots access raw data Level 0: raw data (within the collaboration) Large HPC facilities EUDAT facilities Additional data analysis processed data: maps and catalogs Level 1: processed data (external collaborators) Level 3: science-ready data (anyone) Data Analysis & Benchmarking