학술논문

Performance Left on the Table: An Evaluation of Compiler Autovectorization for RISC-V
Document Type
Periodical
Source
IEEE Micro Micro, IEEE. 42(5):41-48 Jan, 2022
Subject
Computing and Processing
Codes
Benchmark testing
Programming
Registers
Program processors
Reduced instruction set computing
Instruction sets
Language
ISSN
0272-1732
1937-4143
Abstract
Next-generation length-agnostic vector instruction set architecture (ISA) designs, the RISC-V vector extension, and ARM’s scalable vector extension enable software portability across hardware implementations with different vector engines. While traditional, fixed-length single-instruction–multiple-data ISA instructions, such as Intel AVX and ARM Neon, enjoy mature compiler support for automatic vectorization, compiler support is still emerging for these length-agnostic ISAs. This work studies the compiler shortcomings that constitute the gap in autovectorization capabilities between length-agnostic and fixed-length architectures. We examine LLVM’s support for both the RISC-V vector extension and traditional vector ISAs. We study a set of synthetic scalar loops to compare the breadth of support in the two settings, and we examine a real benchmark suite to compare autovectorized to hand-vectorized RISC-V code. We use both studies to distill a set of recommendations for engineering improvements and future research in compilers and programming models for length-agnostic vector programming.