Liu Li(刘力),Liu Li,Yang Guangwen.[J].高技术通讯(英文),2013,19(4):339~345 |
|
Cache performance optimization of irregular sparse matrix multiplication on modern multi-core CPU and GPU |
|
DOI: |
中文关键词: |
英文关键词: sparse matrix multiplication, cache miss, scalability, multi-core CPU, GPU |
基金项目: |
Author Name | Affiliation | Liu Li(刘力) | | Liu Li | | Yang Guangwen | |
|
Hits: 934 |
Download times: 0 |
中文摘要: |
|
英文摘要: |
This paper focuses on how to optimize the cache performance of sparse matrix-matrix multiplication (SpGEMM). It classifies the cache misses into two categories: one is caused by the irregular distribution pattern of the multiplier-matrix, and the other is caused by the multiplicand. For each of them, the paper puts forward an optimization method respectively. The first hash based method removes cache misses of the 1st category effectively, and improves the performance by a factor of 6 on an Intel 8-core CPU for the best cases. For cache misses of the 2nd category, it proposes a new cache replacement algorithm, which achieves a cache hit rate much higher than other historical knowledge based algorithms, and the algorithm is applicable on CELL and GPU. To further verify the effectiveness of our methods, we implement our algorithm on GPU, and the performance perfectly scales with the size of on-chip storage. |
View Full Text
View/Add Comment Download reader |
Close |
|
|
|