Generalized Fault-Tolerance Topology Generation for Application-Specific Network-on-Chips

Song Chen,Mengke Ge,Zhigang Li,Jinglei Huang,Qi Xu,Feng Wu

Published 2019 in IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems

ABSTRACT

The network-on-chips (NoCs)-based communication architecture is a promising candidate for addressing communication bottlenecks in many-core processors and neural network processors. In this article, we consider the generalized fault-tolerance topology generation problem, where the link (physical channel) or switch failures can happen, for application-specific NoCs (ASNoCs). With a user-defined maximum number of faults, <inline-formula> <tex-math notation="LaTeX">$K$ </tex-math></inline-formula>, we propose an integer linear programming (ILP)-based method to generate ASNoC topologies, which can tolerate at most <inline-formula> <tex-math notation="LaTeX">$K$ </tex-math></inline-formula> faults in switches or links. Given the communication requirements between cores and their floorplan, we first propose a convex-cost-flow-based method to solve a core mapping (CM) problem for building connections between the cores and switches. Second, an ILP-based method is proposed to solve the routing path allocation (PA) problem, where <inline-formula> <tex-math notation="LaTeX">$K+1$ </tex-math></inline-formula> switch-disjoint routing paths are allocated for every communication flow between the cores. Finally, to reduce switch sizes, we propose to share the switch ports for the connections between the cores and switches and formulate the port sharing problem as a clique-partitioning problem, which is solved by iteratively finding a set of the maximum cliques. Additionally, we propose an ILP-based method to simultaneously solve the CM and routing PA problems when only physical link failures are considered. The experimental results show that the power consumption of fault-tolerance topologies increases almost linearly with <inline-formula> <tex-math notation="LaTeX">$K$ </tex-math></inline-formula> because of the routing path redundancy for fault tolerance. When both switch faults and link faults are considered, port sharing can reduce the average power consumption of fault-tolerance topologies with <inline-formula> <tex-math notation="LaTeX">$K=1$ </tex-math></inline-formula>, <inline-formula> <tex-math notation="LaTeX">$K=2$ </tex-math></inline-formula>, and <inline-formula> <tex-math notation="LaTeX">$K=3$ </tex-math></inline-formula> by 18.08%, 28.88%, and 34.20%, respectively. When considering only the physical link faults, the experimental results show that compared to the fault-tolerant topology generation (FTTG) algorithm, the proposed method reduces power consumption and hop count by 10.58% and 6.25%, respectively; compared to the de Bruijn Digraph (DBG)-based method, the proposed method reduces power consumption and hop count by 21.72% and 9.35%, respectively.

PUBLICATION RECORD

CITATION MAP

EXTRACTION MAP

CLAIMS

  • No claims are published for this paper.

CONCEPTS

  • No concepts are published for this paper.

REFERENCES

Showing 1-39 of 39 references · Page 1 of 1