TY - GEN
T1 - Network Partitioning and Avoidable Contention
AU - Oltchik, Yishai
AU - Schwartz, Oded
N1 - Publisher Copyright: © 2020 Owner/Author.
PY - 2020/7/6
Y1 - 2020/7/6
N2 - Network contention frequently dominates the run time of parallel algorithms and limits scaling performance. Most previous studies mitigate or eliminate contention by utilizing one of several approaches: communication-minimizing algorithms; hotspot-avoiding routing schemes; topology-aware task mapping; or improving global network properties, such as bisection bandwidth, edge-expansion, partitioning, and network diameter. In practice, parallel jobs often use only a fraction of a host system. How do processor allocation policies affect contention within a partition? We utilize edge-isoperimetric analysis of network graphs to determine whether a network partition defined by a processor allocation has optimal internal bisection. Increasing the bisection allows a more efficient use of the network resources, decreasing or completely eliminating the link contention. We study torus networks and characterize partition geometries that maximize internal bisection bandwidth, and examine the allocation policies of Mira and JUQUEEN, the two largest publicly-accessible Blue∼Gene/Q torus-based supercomputers. Our analysis shows that the bisection bandwidth of their partitions can often be improved by changing the partitions' geometries, yielding up to a x2∼speedup for contention-bound workloads. Benchmark experiments validate the predictions. Our analysis applies to allocation policies of other networks.
AB - Network contention frequently dominates the run time of parallel algorithms and limits scaling performance. Most previous studies mitigate or eliminate contention by utilizing one of several approaches: communication-minimizing algorithms; hotspot-avoiding routing schemes; topology-aware task mapping; or improving global network properties, such as bisection bandwidth, edge-expansion, partitioning, and network diameter. In practice, parallel jobs often use only a fraction of a host system. How do processor allocation policies affect contention within a partition? We utilize edge-isoperimetric analysis of network graphs to determine whether a network partition defined by a processor allocation has optimal internal bisection. Increasing the bisection allows a more efficient use of the network resources, decreasing or completely eliminating the link contention. We study torus networks and characterize partition geometries that maximize internal bisection bandwidth, and examine the allocation policies of Mira and JUQUEEN, the two largest publicly-accessible Blue∼Gene/Q torus-based supercomputers. Our analysis shows that the bisection bandwidth of their partitions can often be improved by changing the partitions' geometries, yielding up to a x2∼speedup for contention-bound workloads. Benchmark experiments validate the predictions. Our analysis applies to allocation policies of other networks.
KW - contention-minimization
KW - high-performance computing
KW - network topologies
KW - parallel computing
KW - torus networks
UR - http://www.scopus.com/inward/record.url?scp=85088662588&partnerID=8YFLogxK
U2 - https://doi.org/10.1145/3350755.3400242
DO - https://doi.org/10.1145/3350755.3400242
M3 - منشور من مؤتمر
T3 - Annual ACM Symposium on Parallelism in Algorithms and Architectures
SP - 563
EP - 565
BT - SPAA 2020 - Proceedings of the 32nd ACM Symposium on Parallelism in Algorithms and Architectures
T2 - 32nd ACM Symposium on Parallelism in Algorithms and Architectures, SPAA 2020
Y2 - 15 July 2020 through 17 July 2020
ER -