IEICE Transactions on Information and Systems
Online ISSN : 1745-1361
Print ISSN : 0916-8532
Regular Section
Performance Optimization for Sparse AtAx in Parallel on Multicore CPU
Yuan TAOYangdong DENGShuai MUZhenzhong ZHANGMingfa ZHULimin XIAOLi RUAN
Author information
JOURNAL FREE ACCESS

2014 Volume E97.D Issue 2 Pages 315-318

Details
Abstract

The sparse matrix operation, yy+AtAx, where A is a sparse matrix and x and y are dense vectors, is a widely used computing pattern in High Performance Computing (HPC) applications. The pattern poses challenge to efficient solutions because both a matrix and its transposed version are involved. An efficient sparse matrix format, Compressed Sparse Blocks (CSB), has been proposed to provide nearly the same performance for both Ax and Atx. We develop a multithreaded implementation for the CSB format and apply it to solve yy+AtAx. Experiments show that our technique outperforms the Compressed Sparse Row (CSR) based solution in POSKI by up to 2.5 fold on over 70% of benchmarking matrices.

Content from these authors
© 2014 The Institute of Electronics, Information and Communication Engineers
Previous article Next article
feedback
Top