International Journal of Networking and Computing
Online ISSN : 2185-2847
Print ISSN : 2185-2839
ISSN-L : 2185-2839
Predicting Performance of Communications and Computations under Memory Contention in Distributed HPC Systems
Alexandre Denis, Emmanuel Jeannot, Philippe Swartvagher
Journal, open access

2023, Volume 13, Issue 1, pp. 62-91

Abstract

To amortize the cost of MPI communications, distributed parallel HPC applications can overlap network communications with computations, in the hope of improving overall application performance. With this technique, computations and communications run at the same time. However, computations usually also move data. Since the data for computations and for communications go through the same memory system, memory contention may occur when computations are memory-bound and large messages are transmitted through the network at the same time. In this paper, we propose a model to predict the memory bandwidth for computations and for communications when they are executed side by side, according to data locality and taking contention into account. Developing the model gave a better understanding of where bottlenecks are located in the memory system and of the strategies the memory system applies under contention. The model was evaluated on many platforms with different characteristics and showed an average prediction error below 4%.
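
The contention scenario described in the abstract can be illustrated with a deliberately naive sketch. This is not the model proposed in the paper: it assumes a single shared bandwidth capacity and proportional sharing when demand exceeds it, and the function name, parameters, and numbers are all hypothetical. It only shows why a memory-bound computation and a large message transfer can slow each other down.

```python
# Naive illustration only: NOT the model from the paper.
# Every name and number here is a hypothetical assumption.

def share_bandwidth(capacity, comp_demand, comm_demand):
    """Split one shared memory bandwidth between two concurrent streams.

    capacity    -- total bandwidth the memory system can deliver (e.g. GB/s)
    comp_demand -- bandwidth the computation would use when running alone
    comm_demand -- bandwidth the communication would use when running alone

    If the combined demand fits within the capacity, there is no contention
    and each stream gets what it asks for. Otherwise both are scaled down by
    the same factor (proportional sharing, assumed here for simplicity).
    """
    total = comp_demand + comm_demand
    if total <= capacity:
        return comp_demand, comm_demand
    scale = capacity / total
    return comp_demand * scale, comm_demand * scale


if __name__ == "__main__":
    # Compute-bound kernel: combined demand stays under capacity.
    print(share_bandwidth(100.0, 40.0, 20.0))
    # Memory-bound kernel plus large messages: demand exceeds capacity.
    print(share_bandwidth(100.0, 120.0, 30.0))
```

A real memory system need not share bandwidth proportionally, and data locality also matters, which is why the abstract refers to the strategies of the memory system under contention.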

© 2023 International Journal of Networking and Computing