학술논문

Modeling I/O Performance Variability Using Conditional Variational Autoencoders
Document Type
Conference
Source
2018 IEEE International Conference on Cluster Computing (CLUSTER) CLUSTER Cluster Computing (CLUSTER), 2018 IEEE International Conference on. :109-113 Sep, 2018
Subject
Communication, Networking and Broadcast Technologies
Computing and Processing
Predictive models
Data models
Measurement
Computational modeling
Training
Decoding
Testing
I/O performance variability, parallel filesystems, probabilistic machine learning, variational autoencoders
Language
ISSN
2168-9253
Abstract
Storage system performance modeling is crucial for efficient use of heterogeneous shared resources on leadership-class computers. Variability in application performance, particularly variability arising from concurrent applications sharing I/O resources, is a major hurdle in the development of accurate performance models. We adopt a deep learning approach based on conditional variational auto encoders (CVAE) for I/O performance modeling, and use it to quantify performance variability. We illustrate our approach using the data collected on Edison, a production supercomputing system at the National Energy Research Scientific Computing Center (NERSC). The CVAE approach is investigated by comparing it to a previously proposed sensitivity-based Gaussian process (GP) model. We find that the CVAE model performs slightly better than the GP model in cases where training and testing data come from different applications, since CVAE can inherently leverage the whole data from multiple applications whereas GP partitions the data and builds separate models for each partition. Hence, the CVAE offers an alternative modeling approach that does not need pre-processing; it has enough flexibility to handle data from a wide variety of applications without changing the inference approach.