학술논문

A high-performance and energy-efficient CT reconstruction algorithm for multi-terabyte datasets
Document Type
Conference
Source
2013 IEEE Nuclear Science Symposium and Medical Imaging Conference (2013 NSS/MIC) Nuclear Science Symposium and Medical Imaging Conference (NSS/MIC), 2013 IEEE. :1-7 Oct, 2013
Subject
Nuclear Engineering
Image reconstruction
Graphics processing units
Kernel
Reconstruction algorithms
Computed tomography
Performance evaluation
Scalability
Language
ISSN
1082-3654
Abstract
There has been much work done in implementing various GPU-based Computed Tomography reconstruction algorithms for medical applications showing tremendous improvement in computational performance. While many of these reconstruction algorithms could also be applied to industrial-scale datasets, the performance gains may be modest to non-existent due to a combination of algorithmic, hardware, or scalability limitations. Previous work presented showed an irregular dynamic approach to GPU-Reconstruction kernel execution for industrial-scale reconstructions that dramatically improved voxel processing throughput. However, the improved kernel execution magnified other system bottlenecks such as host memory bandwidth and storage read/write bandwidth, thus hindering performance gains. This paper presents a multi-GPU-based reconstruction algorithm capable of efficiently reconstructing large volumes (between 64 gigavoxels and 1 teravoxel volumes) not only faster than traditional CPU- and GPU-based reconstruction algorithms but also while consuming significantly less energy. The reconstruction algorithm exploits the irregular kernel approach from previous work as well as a modularized MIMD-like environment, heterogeneous parallelism, as well as macro- and micro-scale dynamic task allocation. The result is a portable and flexible reconstruction algorithm capable of executing on a wide range of architectures including mobile computers, workstations, supercomputers, and modestly-sized hetero or homogeneous clusters with any number of graphics processors.