학술논문

Exploitation of operation-level parallelism in a processor of the CRAY X-MP
Document Type
Conference
Source
Proceedings., 1990 IEEE International Conference on Computer Design: VLSI in Computers and Processors Computer Design: VLSI in Computers and Processors, 1990. ICCD '90. Proceedings, 1990 IEEE International Conference on. :20-23 1990
Subject
Computing and Processing
Communication, Networking and Broadcast Technologies
Components, Circuits, Devices and Systems
Photonics and Electrooptics
Signal Processing and Analysis
Parallel processing
Clocks
Hardware
Processor scheduling
Program processors
Pipeline processing
Dynamic scheduling
Production
Application software
Vector processors
Language
Abstract
Available operation-level parallelism and its exploitation in the CRAY X-MP processor are studied. Considered are the sizes and contributions to execution time of basic blocks, instruction and operation issue rates and issue stalls, and operation execution overlap for entire executions of three large programs, FLO52, TRFD, and QCD1, taken from the Perfect Club benchmark set. The large basic blocks account for a significant portion of the overall execution time. It is also found that with the use of vector instructions, the X-MP is able to issue more than one operation per clock cycle, even though it can issue a maximum of one instruction per cycle.ETX