학술논문

A Codesign Framework for Online Data Analysis and Reduction
Document Type
Conference
Source
2019 IEEE/ACM Workflows in Support of Large-Scale Science (WORKS) Workflows in Support of Large-Scale Science (WORKS), 2019 IEEE/ACM. :11-20 Nov, 2019
Subject
Computing and Processing
Analytical models
Data models
Computational modeling
Tools
Pipelines
Computer architecture
Data analysis
exascale; cheetah; savanna; codar; workflows; in situ; online; reduction; codesign; summit
Language
Abstract
In this paper we discuss our design of a toolset for automating performance studies of composed HPC applications that perform online data reduction and analysis. We describe Cheetah, a new framework for performing parametric studies on coupled applications. Cheetah facilitates understanding the impact of various factors such as process placement, synchronicity of algorithms, and storage vs. compute requirements for online analysis of large data. Ultimately, we aim to create a catalog of performance results that can help scientists understand tradeoffs when designing next-generation simulations that make use of online processing techniques. We illustrate the design choices of Cheetah by using a reaction-diffusion simulation (Gray-Scott) paired with an analysis application to demonstrate initial results of fine-grained process placement on Summit, a pre-exascale supercomputer at Oak Ridge National Laboratory.