학술논문

Dynamo -- Handling Scientific Data Across Sites and Storage Media
Document Type
Working Paper
Source
Computing and Software for Big Science 5, 11 (2021)
Subject
Computer Science - Distributed, Parallel, and Cluster Computing
High Energy Physics - Experiment
Language
Abstract
Dynamo is a full-stack software solution for scientific data management. Dynamo's architecture is modular, extensible, and customizable, making the software suitable for managing data in a wide range of installation scales, from a few terabytes stored at a single location to hundreds of petabytes distributed across a worldwide computing grid. This article documents the core system design of Dynamo and describes the applications that implement various data management tasks. A brief report is also given on the operational experiences of the system at the CMS experiment at the CERN Large Hadron Collider and at a small scale analysis facility.
Comment: 18 pages, 9 figures