Linear Algebra

Low-Rank Approximation

Visualize how truncating singular values reduces data size while preserving the essential structure of a matrix.

Low-Rank Approximation

Concept Overview

A low-rank approximation aims to find a matrix of lower rank that closely approximates a given matrix. By decomposing a large, dense matrix into a combination of a few essential features or components, we can significantly reduce data size, filter out noise, and reveal the underlying structure of the data. This is typically achieved by selectively truncating the Singular Value Decomposition (SVD).

Mathematical Definition

According to the Eckart-Young-Mirsky theorem, the best rank-k approximation to a matrix A under both the Frobenius norm and the 2-norm is obtained by truncating the SVD of A. If the full SVD of A is given by A = UΣVT, the rank-k approximation Ak is:

Ak = Σi=1k σiuiviT

where:

  • k is the chosen target rank (k < rank(A)).
  • σi are the largest singular values in descending order.
  • ui and vi are the corresponding left and right singular vectors.

Key Concepts

Truncation

Since singular values measure the "importance" or variance captured by each respective singular vector, discarding the smallest singular values (setting them to zero) effectively removes the components of the matrix that contribute least to its overall structure, often corresponding to noise.

Error Bounds

The approximation error when using the rank-k truncation is exactly bounded by the first omitted singular value:

||A - Ak||2 = σk+1
||A - Ak||F = √(Σi=k+1r σi2)

Historical Context

The fundamental theoretical groundwork for low-rank approximation was established in 1936 by Carl Eckart and Gale Young, who proved the theorem for the Frobenius norm. It was later independently rediscovered and extended to other norms by Leon Mirsky in 1960. Their theorem remains a cornerstone in modern data science and numerical linear algebra, solidifying the SVD's position as a primary tool for dimensionality reduction.

Real-world Applications

  • Image Compression: Representing high-resolution images as low-rank matrices to save storage space without significant loss of visual quality.
  • Recommendation Systems: Collaborative filtering (like the famous Netflix Prize) uses low-rank models to predict user preferences from a sparse matrix of past ratings.
  • Principal Component Analysis (PCA): Using truncated components to map high-dimensional statistical data to lower dimensions for easier visualization and processing.
  • Latent Semantic Analysis: In Natural Language Processing, decomposing word-document matrices to discover hidden topics or semantics in large text corpora.

Related Concepts

  • SVD Decomposition — the foundational matrix factorization technique used to compute the approximation.
  • Principal Component Analysis (PCA) — the statistical equivalent to SVD truncation.
  • Least Squares Approximation — another fundamental method for minimizing error in linear algebra.

Experience it interactively

Adjust parameters, observe in real time, and build deep intuition with Riano’s interactive Low-Rank Approximation module.

Try Low-Rank Approximation on Riano →

More in Linear Algebra