학술논문

Automatic Database Troubleshooting of Azure SQL Databases
Document Type
Periodical
Source
IEEE Transactions on Cloud Computing IEEE Trans. Cloud Comput. Cloud Computing, IEEE Transactions on. 10(3):1604-1619 Sep, 2022
Subject
Computing and Processing
Communication, Networking and Broadcast Technologies
Databases
Cloud computing
Telemetry
Servers
Data science
Monitoring
Data mining
Azure SQL database
cloud platform
data science models
database troubleshooting
expert systems
Language
ISSN
2168-7161
2372-0018
Abstract
Exponentially growing number of workloads has been transferred from on-premises to the cloud environment during last decade. It incurs a constantly increasing load on the monitoring and troubleshooting capabilities of the platforms that host those applications nowadays. Data mining techniques and use of telemetry data have recently become an unavoidable means for tracking the behavior of cloud services. The research effort elaborated in this paper is focused on exploiting real-world data to build an automatic database troubleshooting system that exploits the combination of the more comprehensive statistical data science models and an expert system to perform the root cause analysis. An extensive evaluation study was conducted during the eight-month period with a plethora of Azure SQL production workloads for a varying number of databases. The obtained results confirmed the viability and cost-effectiveness of such an approach at the scale of the cloud.