TY - JOUR
T1 - Acceleration of iterative refinement for singular value decomposition
AU - Uchino, Yuki
AU - Terao, Takeshi
AU - Ozaki, Katsuhisa
N1 - Publisher Copyright:
© 2023, The Author(s).
PY - 2024/2
Y1 - 2024/2
N2 - We propose fast numerical algorithms to improve the accuracy of singular vectors for a real matrix. Recently, Ogita and Aishima proposed an iterative refinement algorithm for singular value decomposition that is constructed with highly accurate matrix multiplications carried out six times per iteration. The algorithm runs for the problem that has no multiple and clustered singular values. In this study, we show that the same algorithm can be run with highly accurate matrix multiplications carried out five times. Also, we proposed four algorithms constructed with highly accurate matrix multiplications, two algorithms with the multiplications carried out four times, and the other two with the multiplications carried out five times. These algorithms adopt the idea of a mixed-precision iterative refinement method for linear systems. Numerical experiments demonstrate speed-up and quadratic convergence of the proposed algorithms. As a result, the fastest algorithm is 1.7 and 1.4 times faster than the Ogita-Aishima algorithm per iteration on a CPU and GPU, respectively.
AB - We propose fast numerical algorithms to improve the accuracy of singular vectors for a real matrix. Recently, Ogita and Aishima proposed an iterative refinement algorithm for singular value decomposition that is constructed with highly accurate matrix multiplications carried out six times per iteration. The algorithm runs for the problem that has no multiple and clustered singular values. In this study, we show that the same algorithm can be run with highly accurate matrix multiplications carried out five times. Also, we proposed four algorithms constructed with highly accurate matrix multiplications, two algorithms with the multiplications carried out four times, and the other two with the multiplications carried out five times. These algorithms adopt the idea of a mixed-precision iterative refinement method for linear systems. Numerical experiments demonstrate speed-up and quadratic convergence of the proposed algorithms. As a result, the fastest algorithm is 1.7 and 1.4 times faster than the Ogita-Aishima algorithm per iteration on a CPU and GPU, respectively.
KW - Accurate numerical computation
KW - Iterative refinement
KW - Mixed-precision computation
KW - Singular value decomposition
UR - http://www.scopus.com/inward/record.url?scp=85165195102&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85165195102&partnerID=8YFLogxK
U2 - 10.1007/s11075-023-01596-9
DO - 10.1007/s11075-023-01596-9
M3 - Article
AN - SCOPUS:85165195102
SN - 1017-1398
VL - 95
SP - 979
EP - 1009
JO - Numerical Algorithms
JF - Numerical Algorithms
IS - 2
ER -