학술논문

VLIW across multiple superscalar processors on a single chip
Document Type
Conference
Source
Proceedings 1997 International Conference on Parallel Architectures and Compilation Techniques Parallel architectures and compilation techniques Parallel Architectures and Compilation Techniques., 1997. Proceedings., 1997 International Conference on. :166-175 1997
Subject
Computing and Processing
Communication, Networking and Broadcast Technologies
VLIW
Processor scheduling
Dynamic scheduling
Aggregates
Clocks
Microprocessors
Parallel processing
Program processors
Engines
Runtime
Language
Abstract
Advances in IC technology increase the integration density for higher clock rates and provide more opportunities for microprocessor design. The authors propose a new paradigm to exploit instruction-level parallelism (ILP) across multiple superscalar processors on a single chip by taking advantages of both VLIW-style static scheduling techniques and dynamic scheduling of superscalar architecture. In the proposed paradigm, ILP is exploited by a compiler from a sequential program and this VLIW-like-parallelized code is further parallelized by 2-way superscalar engines at run-time. Superscalar processors are connected by an aggregate function network, which can enforce the necessary static timing constraints and provide appropriate inter-processor data communication mechanisms that are needed for ILP. The aggregate function operations are statically scheduled and implement not only fine-grain communication and control, but also simple global computations resembling systolic array operations within the network.