What is it about?
Data-intensive science requires scalable data management tools to distribute content from a curated data server, share scientific simulation or experiment data remotely with collaborators, and let researchers analyze hosted data, among other tasks. These needs led to the development of COSMO, which targets the sharing and accessibility challenges of large-scale data management in order to enable scientific collaboration at scale. COSMO includes data sharing, data management, and data analysis tools that can be accessed through simple and flexible web portals and APIs. The current COSMO version includes Globus endpoints, a data-sharing portal, a REST API, and a set of domain-agnostic recommendations for scientific data sharing.
Why is it important?
We developed a COSMO proof of concept in collaboration with the BlueTides simulation project, which produces state-of-the-art cosmological simulations containing close to one trillion simulated particles. Using the COSMO platform for remote data access led to a successful James Webb Space Telescope proposal to observe the first quasars in the first observing cycle, and to several related peer-reviewed journal publications.
Perspectives
We are also interested in applying the experience gained with the BlueTides simulations to other scientific domains, and in adding features to further improve the user experience. Our implementation is open source, and we welcome contributions. Please see our paper (https://dl.acm.org/doi/10.1145/3491418.3535166) and website (https://www.cmu.edu/psc/aibd/cosmo.html) for details.
Mei-Yu Wang
Carnegie Mellon University
Read the Original
This page is a summary of: COSMO: a Research Data Service Platform and Experiences from the BlueTides Project, July 2022, ACM (Association for Computing Machinery),
DOI: 10.1145/3491418.3535166.