We consider parallelism of an extended IDR (s) method based on the secant method by means of the proposed Cache-Cache Balance technique. In this paper, we describe concept and outline of the Cache-Cache Balance technique. Through many numerical experiments, we will make clear effectiveness of the proposed Cache-Cache Balance implementation for parallelism.
This paper compares the performance of sparse Matrix-vector multiplication paralleled by the conventional Block-Cyclic distribution and its improved variant on parallel computer with shared memory. The underlying idea is to exchange nonzero entries of matrix assigned to each thread with block unit. Numerical results demonstrate that the proposed distribution using exchange nonzero entries of matrix with block unit gives or improves parallelism.