학술논문

SCOC: High-radix switches made of bufferless clos networks
Document Type
Conference
Source
2015 IEEE 21st International Symposium on High Performance Computer Architecture (HPCA) High Performance Computer Architecture (HPCA), 2015 IEEE 21st International Symposium on. :402-414 Feb, 2015
Subject
Computing and Processing
Clocks
Ports (Computers)
Wires
Scheduling
System-on-chip
Topology
Computer architecture
Language
ISSN
1530-0897
2378-203X
Abstract
In today's datacenters handling big data and for exascale computers of tomorrow, there is a pressing need for high-radix switches to economically and efficiently unify the computing and storage resources that are dispersed across multiple racks. In this paper, we present SCOC, a switch architecture suitable for economical IC implementation that can efficiently replace crossbars for high-radix switch nodes. SCOC is a multi-stage bufferless network with O(N 2 /m) cost, where m is a design parameter, practically ranging between 4–16. We identify and resolve more than five fairness violations that are pertinent to hierarchical scheduling. Effectively, from a performance perspective, SCOC is indistinguishable from efficient flat crossbars. Computer simulations show that it competes well or even outperforms flat crossbars and hierarchical switches. We report data from our ASIC implementation at 32 nm of a SCOC 136×136 switch, with shallow buffers, connecting 25 Gb/s links. In this first incarnation, SCOC is used at the spines of a server-rack, fat-tree network. Internally, it runs at 9.9 Tb/s, thus offering a speedup of 1.45 ×, and provides a fall-through latency of just 61 ns.