학술논문

Support for HTCondor high-Throughput Computing Workflows in the REANA Reusable Analysis Platform
Document Type
Conference
Source
2019 15th International Conference on eScience (eScience) eScience (eScience), 2019 15th International Conference on. :630-631 Sep, 2019
Subject
Computing and Processing
Data analysis
Task analysis
Pipelines
Containers
Scalability
Engines
High energy physics
reproducible science
computational workflows
high-throughput computing
high-performance computing
data analysis
Language
Abstract
REANA is a reusable and reproducible data analysis platform allowing researchers to structure their analysis pipelines and run them on remote containerised compute clouds. REANA supports several different workflows systems (CWL, Serial, Yadage) and uses Kubernetes' job execution backend. We have designed an abstract job execution component that extends the REANA platform job execution capabilities to support multiple compute backends. We have tested the abstract job execution component with HTCondor and verified the scalability of the designed solution. The results show that the REANA platform would be able to support hybrid scientific workflows where different parts of the analysis pipelines can be executed on multiple computing backends.