학술논문

Towards a Service-based Adaptable Data Layer for Cloud Workflows
Document Type
Conference
Source
2023 IEEE 47th Annual Computers, Software, and Applications Conference (COMPSAC) COMPSAC Computers, Software, and Applications Conference (COMPSAC), 2023 IEEE 47th Annual. :904-911 Jun, 2023
Subject
Computing and Processing
Engineering Profession
General Topics for Engineers
Protocols
File systems
Soft sensors
Prototypes
Data transfer
Systems support
Container-native workflow data layer
IPFS
WebDAV
data transfers
performance analysis
Cloud computing
Language
Abstract
Many scientific workflows are data-driven and need to be continuously executed for the large volume of datasets transferred from distributed data sources. The overhead arising from data transfers must be considered when optimizing workflow performance. Many workflow systems support various data transfer protocols (DTPs) and file systems. However, challenges that hinder wide protocol adoption are mainly the need for more feasibility of adapting new solutions, such as decentralized ones. In this paper, we prototype a container-native data layer that supports multiple DTPs, e.g., FTP, WebDAV, and IPFS, for Cloud workflows. Based on this tool, we demonstrated the feasibility of using combinations of Docker, CWL, and Argo to deploy and execute several application scenarios adaptably. Besides, we analyzed the performance of data transfers and workflow execution time between IPFS and WebDAV, which can help users decide which one to handle data. Our results show that IPFS outperforms WebDAV in uploading large files, and the makespan via IPFS executed in Argo is comparable with WebDAV.