A block-recursive approach to unitary matrix decomposition

G.I. Malaschonok, S.S. Sukharskyi

Abstract


Singular value decomposition is one of the fundamental tools of numerical linear algebra and is widely used in data processing, optimization, computer vision, and machine learning. The classical approach to its computation is based on the Householder algorithm, which reduces an arbitrary matrix to a tridiagonal or bidiagonal form followed by the computation of singular values. However, as the matrix size increases, significant difficulties arise in efficiently parallelizing such algorithms, which limits their performance on modern high-performance computing systems. One promising approach to overcoming these limitations is the use of block-recursive algorithms. Such algo rithms allow the original problem to be decomposed into independent block subproblems that can be processed in parallel, providing a high degree of scalability on multiprocessor systems and distributed-memory clusters. This paper proposes a new block-recursive algorithm for the orthogonal (or unitary) decomposition of symmetric matrices based on QR and QP decompositions. The algorithm constructs orthogonal (unitary) factors and pro duces a band matrix, reducing the computational complexity of subsequent stages of spectral decomposition. A complexity analysis is presented showing that the asymptotic order of the algorithm is determined by the complexity of matrix multiplication.Experimental performance studies were carried out in a parallel computing environment using GPU acceleration for matrix operations. The obtained results demonstrate the efficiency of the proposed algorithm as the problem size increases and confirm the agreement between the experimental com plexity and the theoretical estimates. The reconstruction error of the matrix remains at the level of machine precision for double-precision floating-point numbers, which indicates the numerical stability of the proposed approach.

Problems in programming 2026; 1: 102-111

 


Keywords


unitary matrix decomposition; block-recursive algorithm; parallel computing; graphics processing unit

References


Golub, G.H. and Van Loan, C.F., 2013. Matrix Computations. Baltimore: Johns Hopkins University Press.

Trefethen, L.N. and Bau, D., 1997. Numerical Linear Algebra. Philadelphia: SIAM.

Strassen, V., 1969. Gaussian elimination is not optimal. Numerische Mathematik, 13(4), pp.354–356.

Coppersmith, D. and Winograd, S., 1990. Matrix multiplication via arithmetic progressions. Journal of Symbolic Computation, 9(3), pp.251–280.

Schönhage, A., 1973. Unitäre Transformationen großer Matrizen. Numerische Mathematik, 20, pp.409–417.

Malaschonok, G., 2019. Recursive matrix algorithms, distributed dynamic control, scaling, stability. In: Computer Science and Information Technologies (CSIT). Yerevan, pp.112–115.

Malaschonok, G. and Ivaskevich, A., 2020. Quick recursive QR decomposition. In: Proceedings of the Conference on Mathematical Foundations of Informatics (MFOI-2020). Kyiv, pp.1–8.

Pershuta, P.V., 2024. Algorithm for reducing a symmetric matrix to tridiagonal form using QR and QP decompositions (Master’s thesis). Kyiv (in Ukrainian).

Demmel, J., Dumitriu, I., Holtz, O. and Klein berg, R., 2007. Fast linear algebra is stable. Numerische Mathematik, 108, pp.59–91.

Sidko, A.A., 2024. Execution environment for block-recursive matrix algorithms on a distrib uted-memory supercomputer (PhD dissertation). Kyiv (in Ukrainian).

Malashonok, H.I. and Sidko, A.A., 2018. Distributed computing: DAP-technology for paral lelization of recursive algorithms. Naukovi zapysky NaUKMA, 1, pp.25–32 (in Ukrainian).


Refbacks

  • There are currently no refbacks.