International Journal of Networking and Computing
Online ISSN : 2185-2847
Print ISSN : 2185-2839
ISSN-L : 2185-2839
Special Issue on Workshop on Advances in Parallel and Distributed Computational Models 2021
Comparing Distributed Termination Detection Algorithms for Modern HPC Platforms
George BosilcaAurélien BouteillerThomas HeraultValentin Le FèvreYves RobertJack Dongarra
Author information
JOURNAL OPEN ACCESS

2022 Volume 12 Issue 1 Pages 26-46

Details
Abstract

This paper revisits distributed termination detection algorithms in the context of High-Performance Computing (HPC) applications. We introduce an efficient variant of the Credit Distribution Algorithm (CDA) and compare it to the original algorithm (HCDA) as well as to its two primary competitors: the Four Counters algorithm (4C) and the Efficient Delay-Optimal Distributed algorithm (EDOD). We analyze the behavior of each algorithm for some simplified task-based kernels and show the superiority of CDA in terms of the number of control messages. We then compare the implementation of these algorithms over a task-based runtime system, PaRSEC and show the advantages and limitations of each approach in a real implementation.

Related papers from these authors
© 2022 International Journal of Networking and Computing
Previous article Next article
feedback
Top