Loading...
Thumbnail Image
Item

Optimizing Sparse Matrix-Matrix Multiplication on a Heterogeneous CPU-GPU Platform

Wu, Xiaolong
Citations
Altmetric:
Abstract

Sparse Matrix-Matrix multiplication (SpMM) is a fundamental operation over irregular data, which is widely used in graph algorithms, such as finding minimum spanning trees and shortest paths. In this work, we present a hybrid CPU and GPU-based parallel SpMM algorithm to improve the performance of SpMM. First, we improve data locality by element-wise multiplication. Second, we utilize the ordered property of row indices for partial sorting instead of full sorting of all triples according to row and column indices. Finally, through a hybrid CPU-GPU approach using two level pipelining technique, our algorithm is able to better exploit a heterogeneous system. Compared with the state-of-the-art SpMM methods in cuSPARSE and CUSP libraries, our approach achieves an average of 1.6x and 2.9x speedup separately on the nine representative matrices from University of Florida sparse matrix collection.

Comments
Description
Date
2015-12-16
Journal Title
Journal ISSN
Volume Title
Publisher
Research Projects
Organizational Units
Journal Issue
Keywords
Sparse matrix-matrix multiplication, Data locality, Pipelining, GPU
Citation
Wu, Xiaolong (2015). "Optimizing Sparse Matrix-Matrix Multiplication on a Heterogeneous CPU-GPU Platform." Thesis, Georgia State University. https://doi.org/10.57709/7906679
Embargo Lift Date
2015-12-04
Embedded videos