Skip to main content

LUP Student Papers

LUND UNIVERSITY LIBRARIES

Randomised Methods for Computing the QR Decomposition of a Tall-Skinny Matrix

Vijups, Artis LU (2025) In Bachelor's Theses in Mathematical Sciences NUMK11 20251
Centre for Mathematical Sciences
Mathematics (Faculty of Sciences)
Abstract
The QR decomposition is used to solve least squares problems, systems of equations and other important linear algebra tasks. We consider the computation of the QR decomposition for tall-skinny matrices, using the Cholesky decomposition as an intermediate step. To improve efficiency, we introduce randomised numerical linear algebra and sketching as a tool for improving computational efficiency. Numerical experiments are performed comparing the accuracy and runtime of randomised algorithms to classical methods and built-in functions from NumPy and SciPy. They demonstrate that, when applicable, the randomised methods return accurate results in less time than all alternatives.
Popular Abstract
Companies and researchers collect more data than ever. This collected data is usually stored as a large table of numbers called a matrix.

In many cases, matrices possess billions of measurements for hundreds of different, possibly related things. Since we have a lot more rows of data than things to measure, this table of numbers looks very tall and thin, so we call it a tall-skinny matrix.

Just like 12 can be split into 3 × 4 or 2 × 6, matrices can be decomposed into parts. There are different ways to split up a number; similarly, there are many ways to decompose a matrix.

One useful option is called the QR decomposition, as it helps to uncover relationships between the different things being measured.

When decomposing a... (More)
Companies and researchers collect more data than ever. This collected data is usually stored as a large table of numbers called a matrix.

In many cases, matrices possess billions of measurements for hundreds of different, possibly related things. Since we have a lot more rows of data than things to measure, this table of numbers looks very tall and thin, so we call it a tall-skinny matrix.

Just like 12 can be split into 3 × 4 or 2 × 6, matrices can be decomposed into parts. There are different ways to split up a number; similarly, there are many ways to decompose a matrix.

One useful option is called the QR decomposition, as it helps to uncover relationships between the different things being measured.

When decomposing a huge matrix, though, efficiency is crucial, since a weak computer or a slow method would fail to produce results in any reasonable amount of time. We need large and powerful computers to tackle such a problem, and we need methods that can make full use of all the available power.

In the last couple of decades, there have been several discoveries of faster ways to decompose matrices, and one such discovery is that of randomised methods. They involve, for example, randomly selecting a few hundred rows of the matrix, and doing some of the steps using this smaller matrix instead of the huge original one.

Conveniently, the results remain very accurate, because the smaller matrix typically retains many of the characteristics of the original matrix.

We compare some randomised methods to traditional ways of computing the QR decomposition. The methods are written in the Python programming language, also using two additional tools called NumPy and SciPy. Through different tests, we measure the accuracy of the results and the time in which they are computed.

Our findings are that the randomised methods are the fastest way to get consistently accurate results in most cases, but to unlock their potential on powerful computers, these methods would benefit from being implemented with tools that target such systems, possibly outside of Python. (Less)
Please use this url to cite or link to this publication:
author
Vijups, Artis LU
supervisor
organization
course
NUMK11 20251
year
type
M2 - Bachelor Degree
subject
keywords
Numerical Analysis, Numerical Linear Algebra, Randomization, Matrix Factorization, QR Decomposition, Cholesky Decomposition, Scalability
publication/series
Bachelor's Theses in Mathematical Sciences
report number
LUNFNA-4066-2025
ISSN
1654-6229
other publication id
2025:K22
language
English
id
9200396
date added to LUP
2025-09-01 13:52:44
date last changed
2025-09-01 13:52:44
@misc{9200396,
  abstract     = {{The QR decomposition is used to solve least squares problems, systems of equations and other important linear algebra tasks. We consider the computation of the QR decomposition for tall-skinny matrices, using the Cholesky decomposition as an intermediate step. To improve efficiency, we introduce randomised numerical linear algebra and sketching as a tool for improving computational efficiency. Numerical experiments are performed comparing the accuracy and runtime of randomised algorithms to classical methods and built-in functions from NumPy and SciPy. They demonstrate that, when applicable, the randomised methods return accurate results in less time than all alternatives.}},
  author       = {{Vijups, Artis}},
  issn         = {{1654-6229}},
  language     = {{eng}},
  note         = {{Student Paper}},
  series       = {{Bachelor's Theses in Mathematical Sciences}},
  title        = {{Randomised Methods for Computing the QR Decomposition of a Tall-Skinny Matrix}},
  year         = {{2025}},
}