학술논문

Performance Evaluation of Data-Centric Workloads in Serverless Environments
Document Type
Conference
Source
2021 IEEE 14th International Conference on Cloud Computing (CLOUD) CLOUD Cloud Computing (CLOUD), 2021 IEEE 14th International Conference on. :491-496 Sep, 2021
Subject
Computing and Processing
Performance evaluation
Processor scheduling
Computational modeling
Scalability
Pipelines
Memory management
Parallel processing
Serverless
Distributed Computing
Kubernetes
Knative
Tekton
Performance Model
Storage
Language
ISSN
2159-6190
Abstract
Serverless computing is a cloud-based execution paradigm that allows provisioning resources on-demand, freeing developers from infrastructure management and operational concerns. It typically involves deploying workloads as stateless functions that take no resources when not in use, and is meant to scale transparently. To make serverless effective, providers impose limits on a per-function level, such as maximum duration, fixed amount of memory, and no persistent local storage. These constraints make it challenging for data-intensive workloads to take advantage of serverless because they lead to sharing significant amounts of data through remote storage. In this paper, we build a performance model for serverless workloads that considers how data is shared between functions, including the amount of data and the underlying technology that is being used. The model's accuracy is assessed by running a real workload in a cluster using Knative, a state-of-the-art serverless environment, showing a relative error of 5.52%. With the proposed model, we evaluate the performance of data-intensive workloads in serverless, analyzing parallelism, scalability, resource requirements, and scheduling policies. We also explore possible solutions for the data-sharing problem, like using local memory and storage. Our results show that the performance of data-intensive workloads in serverless can be up to 4.32= faster depending on how these are deployed.