TY - GEN
T1 - Faster matrix multiplication via sparse decomposition
AU - Beniamini, Gal
AU - Schwartz, Oded
N1 - Publisher Copyright: © 2019 Copyright held by the owner/author(s). Publication rights licensed to ACM.
PY - 2019/6/17
Y1 - 2019/6/17
N2 - Fast matrix multiplication algorithms are of practical use only if the leading coefficient of their arithmetic complexity is sufficiently small. Many algorithms with low asymptotic cost have large leading coefficients, and are thus impractical. Karstadt and Schwartz have recently demonstrated a technique that reduces the leading coefficient by introducing fast O(n2 logn) basis transformations, applied to the input and output matrices. We generalize their technique, by allowing larger bases for the transformations while maintaining low overhead. Thus we accelerate several matrix multiplication algorithms, beyond what is known to be possible using the previous technique. Of particular interest are a few new sub-cubic algorithms with leading coefficient 2, matching that of classical matrix multiplication. For example, we obtain an algorithm with arithmetic complexity of 2nlog3 23+o(nlog3 23) compared to 2n3 − n2 of the classical algorithm. Such new algorithms can outperform previous ones (classical included) even on relatively small matrices. We obtain lower bounds matching the coefficient of several of our algorithms, proving them to be optimal.
AB - Fast matrix multiplication algorithms are of practical use only if the leading coefficient of their arithmetic complexity is sufficiently small. Many algorithms with low asymptotic cost have large leading coefficients, and are thus impractical. Karstadt and Schwartz have recently demonstrated a technique that reduces the leading coefficient by introducing fast O(n2 logn) basis transformations, applied to the input and output matrices. We generalize their technique, by allowing larger bases for the transformations while maintaining low overhead. Thus we accelerate several matrix multiplication algorithms, beyond what is known to be possible using the previous technique. Of particular interest are a few new sub-cubic algorithms with leading coefficient 2, matching that of classical matrix multiplication. For example, we obtain an algorithm with arithmetic complexity of 2nlog3 23+o(nlog3 23) compared to 2n3 − n2 of the classical algorithm. Such new algorithms can outperform previous ones (classical included) even on relatively small matrices. We obtain lower bounds matching the coefficient of several of our algorithms, proving them to be optimal.
KW - Bilinear Algorithms
KW - Fast Matrix Multiplication
UR - http://www.scopus.com/inward/record.url?scp=85068641732&partnerID=8YFLogxK
U2 - 10.1145/3323165.3323188
DO - 10.1145/3323165.3323188
M3 - منشور من مؤتمر
T3 - Annual ACM Symposium on Parallelism in Algorithms and Architectures
SP - 11
EP - 22
BT - SPAA 2019 - Proceedings of the 31st ACM Symposium on Parallelism in Algorithms and Architectures
T2 - 31st ACM Symposium on Parallelism in Algorithms and Architectures, SPAA 2019
Y2 - 22 June 2019 through 24 June 2019
ER -