학술논문

When Failure is (Not) an Option: Reliability Models for Microservices Architectures
Document Type
Conference
Source
2020 IEEE International Symposium on Software Reliability Engineering Workshops (ISSREW) ISSREW Software Reliability Engineering Workshops (ISSREW),2020 IEEE International Symposium on. :19-24 Oct, 2020
Subject
Computing and Processing
Circuit breakers
Reliability
Integrated circuit modeling
Integrated circuit reliability
Steady-state
Computational modeling
Power system protection
microservices
service mesh
sidecars
circuit breakers
reliability
availability
resilience
reliability models
probabilistic model checking
PRISM
Language
Abstract
Modern application development and deployment is rapidly evolving to microservices based architectures, in which thousands of microservices communicate with one another and can be independently scaled and updated. While these architectures enable flexibility of deployment and frequency of upgrades, the naive use of thousands of communicating and frequently updated microservices can significantly impact the reliability of applications. To address these challenges, service meshes are used to rapidly detect and respond to microservices failures without necessitating changes to the microservices themselves. However, there are inherent tradeoffs that service meshes must make with regards to how quickly they assume a microservice has failed and the subsequent impact on overall application reliability. We present in this paper a modeling framework for microservices and service mesh reliability that takes these tradeoffs into account.