This course, Applications in Linear Algebra, describes various ways we can use linear algebra in the real world.
We begin by describing a brief but important theorem for this course.
Theorem: Uniqueness of Invertible Linear Systems
Suppose we have a linear system $A\mathbf{x} = \mathbf{b}$ with $n$ variables and $n$ equations. Then, if $A$ is an invertible matrix, the linear system has one and only one solution, and we can find it by taking $\mathbf{x} = A^{-1}\mathbf{b}$.
Notes
Notes for this course are given below.
- Leontief Input-Output
- Applications of Graphics
- Least Squares
- Markov Chains
- Heat Diffusion
- Matrix Exponentials and Rotations
- ..
- ..
Matrix Exponentials and Rotations
Contextualization
Suppose we want to solve the system of differential equations
$$\frac{d\mathbf{x}}{dt} = A\mathbf{x}, \qquad A = \begin{pmatrix} 0 & -1 \\ 1 & 0 \end{pmatrix}.$$
Recall from Heat Diffusion that we can find a solution $\mathbf{x}(t) = e^{At}\mathbf{x}(0)$, and to compute this, we want to diagonalize $A = PDP^{-1}$ to find $e^{At} = P e^{Dt} P^{-1}$.
But when we try to compute this, we find complex eigenvalues $\lambda = \pm i$ with complex eigenvectors!
This will give us a diagonalization $A = PDP^{-1}$ with $D = \begin{pmatrix} i & 0 \\ 0 & -i \end{pmatrix}$ and a complex matrix of eigenvectors $P$, and the subsequent solution
$$e^{At} = P \begin{pmatrix} e^{it} & 0 \\ 0 & e^{-it} \end{pmatrix} P^{-1}$$
By a theorem (Euler's Formula), it is true that for any real number $\theta$, $e^{i\theta} = \cos\theta + i\sin\theta$. So, $e^{\pm it} = \cos t \pm i \sin t$.
This gives us final solution
$$e^{At} = \begin{pmatrix} \cos t & -\sin t \\ \sin t & \cos t \end{pmatrix}!$$
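As a quick numerical sanity check, here is a minimal MATLAB sketch of the computation above. It assumes the example matrix is $A = \begin{pmatrix} 0 & -1 \\ 1 & 0 \end{pmatrix}$ as reconstructed above; expm computes the matrix exponential directly.

% Sanity check (assumes the example matrix above): e^(tA) should be the
% counter-clockwise rotation by t radians.
A = [0 -1; 1 0];
t = pi/3;                                    % any real t
R = expm(t*A);                               % matrix exponential e^(tA)
R_expected = [cos(t) -sin(t); sin(t) cos(t)];
disp(norm(R - R_expected))                   % ~0 up to rounding error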
We make some interesting notes from this:
- Even though we had to work with imaginary numbers, our final answer is a real matrix! This should make sense, as $e^{At}$ has to be a real matrix given that $A$ is real (a series of powers of a real matrix must also be real).
- Our matrix $e^{At}$ gives us a CCW rotation by $t$ radians!
We ask, why does this happen? What properties of $A$ cause this to happen?
It's not because $A$ started as a rotation matrix! If we tried this with another rotation matrix, we may not get the same answer.
Matrix Exponentials
We first continue our discussion of matrix exponentials.
Analogous to $e^x = \sum_{k=0}^{\infty} \frac{x^k}{k!}$, for an $n \times n$ matrix $A$,
$$e^A = \sum_{k=0}^{\infty} \frac{A^k}{k!} = I + A + \frac{A^2}{2!} + \frac{A^3}{3!} + \cdots$$
What about $e^{A+B}$? Is it true for matrices that $e^{A+B} = e^A e^B$?
No! And in fact, this fails because matrix multiplication is not commutative. While in $e^A e^B$, all terms are of the form $A^j B^k$ (all $A$ matrices first, then $B$ matrices after), powers like
$$(A+B)^2 = A^2 + AB + BA + B^2$$
flip the order of the matrix multiplication!
However, if we have matrices $A, B$ whose products commute ($AB = BA$), then this property holds!
Proposition
If $AB = BA$, then $e^{A+B} = e^A e^B$.
Corollary
Let $A$ be any $n \times n$ matrix.
- For any scalars $s, t$, $e^{sA} e^{tA} = e^{(s+t)A}$ (since $sA$ and $tA$ always commute).
So, $e^A$ has an inverse, given as $(e^A)^{-1} = e^{-A}$, for any square matrix $A$: taking $s = 1$ and $t = -1$ gives $e^A e^{-A} = e^{0} = I$.
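A minimal MATLAB sketch of these facts (the matrices here are small examples of my own, not ones from the notes): expm(A+B) and expm(A)*expm(B) disagree for a non-commuting pair, while scalar multiples of the same matrix always commute, which gives the corollary and the inverse formula.

% Non-commuting pair: the product rule fails.
A = [0 1; 0 0];  B = [0 0; 1 0];             % AB ~= BA
disp(norm(expm(A+B) - expm(A)*expm(B)))      % clearly nonzero
% Commuting case: sA and tA commute for any scalars s, t.
s = 2;  t = -3;
disp(norm(expm((s+t)*A) - expm(s*A)*expm(t*A)))  % ~0
disp(norm(expm(A)*expm(-A) - eye(2)))            % ~0: e^(-A) inverts e^A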
Theorem
For an $n \times n$ matrix $A$, $(e^A)^T = e^{A^T}$.
This happens because the transpose operation is linear and continuous, so we can transpose the series term-by-term!
Theorem
Let $A$ be an $n \times n$ matrix.
- If $\lambda$ is an eigenvalue for $A$, then $e^{\lambda}$ is an eigenvalue for $e^A$.
- More precisely, if $\mathbf{v}$ is an eigenvector for $A$ with eigenvalue $\lambda$, then $\mathbf{v}$ is an eigenvector for $e^A$ with eigenvalue $e^{\lambda}$.
The trace of an $n \times n$ matrix $A$ is the sum of the diagonal entries of $A$, denoted $\operatorname{tr}(A)$.
Theorem
For an $n \times n$ matrix $A$,
- $\operatorname{tr}(A)$ equals the sum of the eigenvalues of $A$.
- $\det(A)$ equals the product of the eigenvalues of $A$.
Using these facts, we can explain the following.
Theorem
For any $n \times n$ matrix $A$, $\det(e^A) = e^{\operatorname{tr}(A)}$.
In particular, if $A$ has real entries, then the determinant of $e^A$ will always be strictly positive.
We can also use this to tell when a matrix isn't the exponential of any real matrix (for example, when its determinant is negative or zero)!
Proof
If the eigenvalues of $A$ are $\lambda_1, \dots, \lambda_n$, then the eigenvalues of $e^A$ are $e^{\lambda_1}, \dots, e^{\lambda_n}$, and as the determinant is the product of the eigenvalues,
$$\det(e^A) = e^{\lambda_1} e^{\lambda_2} \cdots e^{\lambda_n} = e^{\lambda_1 + \cdots + \lambda_n} = e^{\operatorname{tr}(A)}.$$
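A one-line MATLAB check of this theorem on a random matrix (an illustration of mine, not part of the notes):

A = randn(4);                                % any real square matrix
disp(det(expm(A)) - exp(trace(A)))           % ~0 up to rounding error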
Rotations
So, for what $A$ is $e^A$ a rotation? To answer this question, we must first define what exactly a "rotation" matrix is.
An $n \times n$ matrix $Q$ is orthogonal if $Q$ satisfies $Q^T Q = I$ (equivalently, $Q^{-1} = Q^T$).
Theorem
Let $Q$ be $n \times n$. The following are equivalent:
- $Q$ is an orthogonal matrix.
- The columns of $Q$ form an orthonormal basis for $\mathbb{R}^n$, meaning they are orthogonal to each other and are unit vectors.
- $\|Q\mathbf{x}\| = \|\mathbf{x}\|$ for all $\mathbf{x} \in \mathbb{R}^n$. In other words, $Q$ preserves the length of vectors.
Rotations and reflections are orthogonal matrices by the length-preservation condition of the theorem! How do we tell whether a given orthogonal matrix is a rotation or a reflection?
Note that every orthogonal $Q$ satisfies $\det(Q) = \pm 1$.
Proof
Since $Q^T Q = I$, we have $\det(Q)^2 = \det(Q^T)\det(Q) = \det(Q^T Q) = \det(I) = 1$, so $\det(Q) = \pm 1$.
It turns out, rotations have determinant $1$, and reflections have determinant $-1$.
So, we can define a rotation matrix as an orthogonal matrix with determinant $1$. So, for $e^A$ to be a rotation, we need $e^A$ to be orthogonal with $\det(e^A) = 1$.
We know that $\det(e^A) = e^{\operatorname{tr}(A)} > 0$ for any real $A$. Furthermore, for $e^A$ to be orthogonal, we need that
$$(e^A)^T e^A = I, \quad \text{i.e.,} \quad e^{A^T} = (e^A)^{-1} = e^{-A}.$$
So, this relation holds whenever $A^T = -A$. $A$ is called skew-symmetric if $A^T = -A$.
Theorem
If $A$ is skew-symmetric, then $e^{tA}$ will be a rotation matrix for any real $t$!
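A small MATLAB sketch of this theorem, using a randomly generated skew-symmetric matrix (an illustration of mine, not an example from the notes):

M = randn(3);
A = M - M';                                  % A' = -A, so A is skew-symmetric
Q = expm(0.7*A);                             % any real t; here t = 0.7
disp(norm(Q'*Q - eye(3)))                    % ~0: Q is orthogonal
disp(det(Q))                                 % ~1: determinant 1, so a rotation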
Example: 2-Dimensional Rotations
Consider
$$A = \begin{pmatrix} 0 & -1 \\ 1 & 0 \end{pmatrix}.$$
Observe that this is a skew-symmetric matrix, and we saw that
$$e^{tA} = \begin{pmatrix} \cos t & -\sin t \\ \sin t & \cos t \end{pmatrix},$$
a rotation by $t$ radians.
We ask, are there any other $2 \times 2$ skew-symmetric matrices? For $A = \begin{pmatrix} a & b \\ c & d \end{pmatrix}$ to be skew-symmetric, we need $A^T = -A$, i.e.
$$\begin{pmatrix} a & c \\ b & d \end{pmatrix} = \begin{pmatrix} -a & -b \\ -c & -d \end{pmatrix}.$$
So, we need $a = -a$, $d = -d$, $b = -c$, and $c = -b$. This forces $a = d = 0$, and $b, c$ must be opposites of each other. So, we have general form
$$A = \begin{pmatrix} 0 & -c \\ c & 0 \end{pmatrix} = c \begin{pmatrix} 0 & -1 \\ 1 & 0 \end{pmatrix}.$$
So, up to scalar multiples, there are no other $2 \times 2$ skew-symmetric matrices!
Similarly, for the 3-dimensional case, we can find general form
$$A = \begin{pmatrix} 0 & -c & b \\ c & 0 & -a \\ -b & a & 0 \end{pmatrix}.$$
So, $e^{tA}$ will give us a family of rotations! But what rotation does it actually represent? In other words, what are the rotation's axis and angle?
Note that the axis must always go through the origin, as the transformation is linear.
Consider a skew-symmetric matrix of the form
$$A = \begin{pmatrix} 0 & -c & b \\ c & 0 & -a \\ -b & a & 0 \end{pmatrix}.$$
Observe that we can make a vector $\mathbf{v} = (a, b, c)$, which is an eigenvector for the matrix $A$ with eigenvalue $0$! This implies that $\mathbf{v}$ is also an eigenvector for $e^{A}$ (similarly, $e^{tA}$), with eigenvalue $e^0 = 1$.
This means that the transformation does nothing to the vector $\mathbf{v}$! Meaning, if $e^{tA}$ is a rotation matrix, then $\mathbf{v}$ must span the axis of rotation!
So, the family of rotations given by $e^{tA}$ has its axis of rotation along the line through the origin in the direction of $\mathbf{v} = (a, b, c)$. Furthermore, as $t$ changes, so does the angle of rotation.
$t$ may not be exactly the angle of rotation though, since $A$ may stretch vectors (as the next theorem shows, $t$ is exactly the angle when $(a, b, c)$ is a unit vector)!
Theorem: 3D Rotation Matrices
Let $\mathbf{u} = (a, b, c)$ be a unit vector, and let
$$A = \begin{pmatrix} 0 & -c & b \\ c & 0 & -a \\ -b & a & 0 \end{pmatrix}.$$
Then, the rotation about the line through the origin in the direction of $\mathbf{u}$ by $\theta$ radians is given by $e^{\theta A}$!
This theorem is 3D specific! Don’t try to generalize it to higher dimensions.
Example: Rotation Matrices
Use the above theorem to find .
Here, we want to rotate about unit vector . We can find our rotation by taking
And taking the exponential
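Since the example's specific axis and angle are not reproduced above, here is a hedged MATLAB sketch of the same workflow with made-up values (the vector u and angle theta below are hypothetical placeholders, not the example's data):

u = [1; 2; 2] / norm([1; 2; 2]);             % hypothetical unit vector (a, b, c)
theta = pi/4;                                % hypothetical angle in radians
a = u(1);  b = u(2);  c = u(3);
A = [ 0 -c  b;
      c  0 -a;
     -b  a  0 ];                             % skew-symmetric matrix built from u
R = expm(theta*A);                           % rotation about u by theta radians
disp(norm(R*u - u))                          % ~0: the axis u is left fixed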
Example: Rotation Matrices (2)
Find the matrix for the rotation by 27 degrees around the axis through the origin in the direction of
First, we take a unit vector in the same direction
Furthermore, we have angle $\theta = 27 \cdot \frac{\pi}{180}$ radians.
We can use this to find
And find our rotation matrix as
Example: Rotation Matrices (3)
We know this matrix is some 3D rotation. What is its axis / angle?
We can use the theorem to find this. First, we need to recognize the given exponent as $\theta A$, where $A$ is the matrix built from a unit vector, by factoring out the appropriate scalar multiple $\theta$.
So, we find that we have a rotation about
With rotation radians.
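For the reverse direction (recovering the axis and angle from a known rotation), here is a MATLAB sketch mirroring this example; the rotation R below is a hypothetical stand-in, and logm is used to recover the skew-symmetric exponent $\theta A$:

R = expm((pi/4) * [0 -1 0; 1 0 0; 0 0 0]);   % stand-in: rotation about the z-axis
S = real(logm(R));                           % S = theta*A, skew-symmetric
w = [S(3,2); S(1,3); S(2,1)];                % equals theta * (a, b, c)
theta = norm(w);                             % angle of rotation
u = w / theta;                               % unit vector along the axis
disp(theta);  disp(u.')                      % expect pi/4 and (0, 0, 1)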
Singular Value Decomposition
Review: Symmetric Matrices and the Spectral Theorem
An $n \times n$ matrix $A$ is symmetric if $A^T = A$.
Theorem: Spectral Theorem for Real Symmetric Matrices
Let $A$ be an $n \times n$ symmetric matrix with real entries. Then,
- $A$ has all real eigenvalues.
- Any pair of eigenvectors for $A$ with different eigenvalues are going to be orthogonal.
- $A$ is diagonalizable.
- There is an orthonormal basis for $\mathbb{R}^n$ that consists of eigenvectors for this matrix.
- $A$ can be orthogonally diagonalized; in other words, $A = QDQ^T$ for an orthogonal matrix $Q$ and a diagonal matrix $D$.
Recall that $Q^{-1} = Q^T$ is equivalent to $Q$ having orthonormal columns.
Example: Orthogonally Diagonalizing a Symmetric Matrix
Notice how this is a symmetric matrix, so it's orthogonally diagonalizable. Let's orthogonally diagonalize it.
- We find eigenvalues .
- We find eigenvectors .
Our eigenvectors are orthogonal, but not orthonormal! Thus, we need to rescale them to get unit eigenvectors.
So, we orthogonally diagonalize as
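Numerically, MATLAB's eig returns an orthonormal set of eigenvectors for a symmetric input, so an orthogonal diagonalization can be checked directly. A minimal sketch, using a stand-in symmetric matrix since the example's entries are not reproduced above:

A = [2 1; 1 2];                              % stand-in symmetric matrix
[Q, D] = eig(A);                             % Q has orthonormal columns here
disp(norm(Q'*Q - eye(2)))                    % ~0: Q is orthogonal
disp(norm(Q*D*Q' - A))                       % ~0: A = Q D Q'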
Singular Value Decompositions (SVDs)
Let $A$ be an $m \times n$ matrix with real entries. A singular value decomposition (SVD) for $A$ is a factorization of the form
$$A = U \Sigma V^T$$
Where
- $U$ is an $m \times m$ orthogonal matrix.
- $\Sigma$ is an $m \times n$ diagonal matrix, with non-negative (real) diagonal entries $\sigma_1, \sigma_2, \dots$, known as the singular values of $A$.
- $V$ is an $n \times n$ orthogonal matrix, where the columns are known as the right singular vectors of $A$.
Singular value decompositions are very general: every matrix, square or not, has one!
Note that even for a square diagonalizable $A$, its diagonalization need not be its singular value decomposition!
Example
How do we find singular value decompositions? First, note that for any $m \times n$ matrix $A$,
- $A^T A$ and $A A^T$ are symmetric matrices of size $n \times n$ and $m \times m$, respectively. Hence, by the spectral theorem, they are orthogonally diagonalizable.
- The eigenvalues of $A^T A$ and $A A^T$ are non-negative real numbers.
- $A^T A$ and $A A^T$ have the same eigenvalues (with the same multiplicities), except for the eigenvalue $0$ (the larger of the two matrices picks up extra zero eigenvalues).
Now suppose that $A$ has a singular value decomposition $A = U \Sigma V^T$.
Then,
$$A^T A = (U \Sigma V^T)^T (U \Sigma V^T) = V \Sigma^T U^T U \Sigma V^T = V (\Sigma^T \Sigma) V^T$$
This is an orthogonal diagonalization of our matrix! Similarly, we can find
$$A A^T = U (\Sigma \Sigma^T) U^T$$
So, we can find
- $V$ as an orthonormal basis of eigenvectors for $A^T A$ ($V$ plays the role of $Q$ in the orthogonal diagonalization).
- The singular values $\sigma_i$ as the (positive) square roots of $A^T A$'s eigenvalues.
- $U$ as an orthonormal basis of eigenvectors for $A A^T$ ($U$ plays the role of $Q$ in the orthogonal diagonalization).
Note that the equation $A = U \Sigma V^T$ is equivalent to $AV = U \Sigma$, i.e., to
$$A \mathbf{v}_i = \sigma_i \mathbf{u}_i$$
So, an additional requirement is that $A \mathbf{v}_i = \sigma_i \mathbf{u}_i$ for each $i$, and furthermore, if $\sigma_i \neq 0$, then this implies
$$\mathbf{u}_i = \frac{1}{\sigma_i} A \mathbf{v}_i.$$
Theorem: Singular Value Decompositions
Every $m \times n$ matrix $A$ has a singular value decomposition
$$A = U \Sigma V^T$$
Where
- The diagonal entries of $\Sigma$ are $\sigma_i = \sqrt{\lambda_i}$, the square roots of the eigenvalues $\lambda_i$ of $A^T A$.
- The columns of $V$ are an orthonormal basis of eigenvectors for $A^T A$.
- The columns of $U$ are an orthonormal basis of eigenvectors for $A A^T$,
Chosen such that
$$A \mathbf{v}_i = \sigma_i \mathbf{u}_i$$
For each $i$.
So, one strategy to find an SVD for $A$ is as follows:
- Find the eigenvectors and eigenvalues of $A^T A$ to get the $\mathbf{v}_i$'s and $\sigma_i$'s.
- Use the fact that $\mathbf{u}_i = \frac{1}{\sigma_i} A \mathbf{v}_i$ to get the $\mathbf{u}_i$'s for when $\sigma_i$ is non-zero.
- If necessary, get the rest of the $\mathbf{u}_i$'s (for $\sigma_i = 0$) by finding eigenvectors for $A A^T$.
By convention, we order the singular values in decreasing order: $\sigma_1 \ge \sigma_2 \ge \cdots \ge 0$.
This can become very unreasonable to do by hand for many matrices! We can use MATLAB to do an SVD for us, using the command
[U, S, V] = svd(A)
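A minimal MATLAB sketch comparing the by-hand strategy above with the built-in svd command, on a small stand-in matrix of my own (both report the same singular values; the singular vectors may differ in sign):

A = [1 1; 0 1; 1 0];                         % hypothetical 3x2 matrix
[V, L] = eig(A'*A);                          % eigenvectors/eigenvalues of A'A
[lam, order] = sort(diag(L), 'descend');     % order singular values decreasingly
V = V(:, order);
sigma = sqrt(lam);                           % singular values
U1 = (A*V) ./ sigma';                        % u_i = (1/sigma_i) A v_i (sigma_i ~= 0)
disp(sigma')
[U, S, V2] = svd(A);                         % built-in SVD for comparison
disp(diag(S)')                               % same singular values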
Example: Singular Value Decompositions
Find the SVD of
We start by finding $A^T A$, and then its eigenvalues and eigenvectors.
These (after normalizing) are our right singular vectors $\mathbf{v}_i$, with singular values $\sigma_i = \sqrt{\lambda_i}$!
We now find our $\mathbf{u}_i$'s, using $\mathbf{u}_i = \frac{1}{\sigma_i} A \mathbf{v}_i$ for the non-zero $\sigma_i$.
Finally, we need one more $\mathbf{u}_i$: a unit eigenvector for $A A^T$ with eigenvalue $0$. So, we solve $A A^T \mathbf{u} = \mathbf{0}$.
This gives us final result
Inverses and Pseudoinverses
Motivation
Using SVDs, we can define the concept of a “pseudoinverse” for non-invertible matrices!
Suppose $A$ is $n \times n$ and invertible, with SVD
$$A = U \Sigma V^T$$
Where all $\sigma_i > 0$ (otherwise, $A$ would be non-invertible). Then,
$$A^{-1} = (U \Sigma V^T)^{-1} = V \Sigma^{-1} U^T$$
This is the SVD of $A$'s inverse matrix!
Note how $U$ and $V$ got swapped, and all of the $\sigma_i$'s get inverted!
This gives us a way to define "inverse" matrices, even for matrices that don't have an inverse! This defines a "pseudoinverse".
Now, consider a general $m \times n$ matrix $A$ with SVD $A = U \Sigma V^T$.
We can define the Moore-Penrose Pseudoinverse of $A$ to be
$$A^+ = V \Sigma^+ U^T$$
Where $\Sigma^+$ is the transpose of the matrix $\Sigma$, with all non-zero singular values inverted ($\sigma_i \mapsto 1/\sigma_i$).
Some properties of the pseudo-inverse are as follows:
- If $A$ is $n \times n$ and invertible, then $A^+ = A^{-1}$.
- If $A$ is $m \times n$ with linearly independent columns, then $A^+ = (A^T A)^{-1} A^T$.
Example: Pseudo-Inverses
A has SVD
So, we can find its pseudo-inverse as
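Since the example's SVD is not reproduced above, here is a MATLAB sketch that builds the pseudoinverse from an SVD of a stand-in matrix and compares it with the built-in pinv:

A = [1 1; 0 1; 1 0];                         % hypothetical 3x2 matrix
[U, S, V] = svd(A);
Splus = S';                                  % transpose Sigma ...
Splus(Splus ~= 0) = 1 ./ Splus(Splus ~= 0);  % ... and invert the non-zero sigmas
Aplus = V*Splus*U';                          % Moore-Penrose pseudoinverse from the SVD
disp(norm(Aplus - pinv(A)))                  % ~0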
Now consider a linear system $A\mathbf{x} = \mathbf{b}$.
- If $A$ is $n \times n$ and invertible, then $A^+ \mathbf{b} = A^{-1} \mathbf{b}$ is the unique solution to the system.
- If $A$ is $m \times n$ with linearly independent columns, then $A^+ \mathbf{b} = (A^T A)^{-1} A^T \mathbf{b}$ is the unique least-squares solution to the system!
What if $A\mathbf{x} = \mathbf{b}$ has infinitely many solutions? Or what if it is inconsistent but has infinitely many least-squares solutions? What does $A^+ \mathbf{b}$ mean in these cases?
Theorem
The vector $A^+ \mathbf{b}$ is the least-squares solution of the system $A\mathbf{x} = \mathbf{b}$ with the smallest possible norm $\|\mathbf{x}\|$.
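A short MATLAB sketch of this theorem, using a hypothetical rank-deficient system (so the least-squares solution is not unique): pinv(A)*b satisfies the normal equations and, among all least-squares solutions, has the smallest norm.

A = [1 1; 2 2; 3 3];                         % rank 1: least-squares solutions are not unique
b = [1; 0; 1];
x = pinv(A)*b;                               % minimum-norm least-squares solution
disp(norm(A'*(A*x - b)))                     % ~0: x satisfies the normal equations A'Ax = A'b
disp(norm(x))                                % smallest norm among all least-squares solutions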
Image Compression
Matrix Approximations
Consider an $m \times n$ matrix $A$, with SVD $A = U \Sigma V^T$.
We ask, how could we approximate $A$ with a lower rank matrix?
$A$ has rank $r$ equal to the number of non-zero singular values!
Well, for a rank $k < r$, one way we could approximate $A$ is by dropping (setting to zero) all singular values from $\sigma_{k+1}$ to $\sigma_r$! Call the resulting matrix $A_k$.
We can find that this is actually the best rank-$k$ approximation to $A$ possible.
Theorem: Eckart-Young Theorem
$A_k$ is the best rank-$k$ approximation to $A$, with error
$$\|A - A_k\|_F = \sqrt{\sigma_{k+1}^2 + \cdots + \sigma_r^2}$$
given by the magnitude of the singular values we dropped.
This is known as the Frobenius norm, and we can also find it by taking the sum of the squares of the entries in the matrix $A - A_k$ (then square rooting).
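A minimal MATLAB sketch of this statement on a small stand-in matrix: build the best rank-k approximation by keeping the k largest singular values, and check that the Frobenius error matches the dropped singular values.

A = [3 1 1; 1 3 1];                          % hypothetical 2x3 matrix
[U, S, V] = svd(A);
s = diag(S);                                 % singular values, largest first
k = 1;
Ak = U(:, 1:k) * S(1:k, 1:k) * V(:, 1:k)';   % best rank-k approximation
disp(norm(A - Ak, 'fro'))                    % Frobenius-norm error
disp(norm(s(k+1:end)))                       % sqrt(sum of dropped sigma_i^2): matches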
Example
Find the best rank 1 approximation to
We can find SVD
With this, we can find rank 1 approximation by dropping the smallest singular value.
And furthermore, according to the theorem above, we can find error
Now say we do a rank 1 approximation on
We find a much larger error! Because the dropped singular value is much larger than in our previous example, we get a higher error, so we should expect our approximation to be a lot worse.
Image Compression
But why do we want to be able to approximate matrices like this?
Well, if we write
$$A = \sigma_1 \mathbf{u}_1 \mathbf{v}_1^T + \sigma_2 \mathbf{u}_2 \mathbf{v}_2^T + \cdots + \sigma_r \mathbf{u}_r \mathbf{v}_r^T$$
Each of these terms yields an $m \times n$ matrix of rank 1!
Theorem
If $A$ has rank $r$, then
$$A = \sum_{i=1}^{r} \sigma_i \mathbf{u}_i \mathbf{v}_i^T$$
Consequently, the lower rank approximations to $A$ are:
$$A_k = \sum_{i=1}^{k} \sigma_i \mathbf{u}_i \mathbf{v}_i^T, \quad k < r$$
We can find the lower rank approximations by dropping the terms with the smallest $\sigma_i$!
This gives us a way to store lower rank approximations! For example, instead of explicitly storing $A_1 = \sigma_1 \mathbf{u}_1 \mathbf{v}_1^T$, we only need to store $\sigma_1$, $\mathbf{u}_1$, and $\mathbf{v}_1$, and the computer can reconstruct the matrix for us!
This is a lot cheaper than storing $A$. If $A$ is $1000 \times 1000$, for example, then instead of storing the entire matrix (1 million entries), we only need to store $1 + 1000 + 1000 = 2001$ entries! If we store these entries in a file, then the computer can take these entries and regenerate the (approximate) image!
Generalizing, we can compute the matrix $A_k$ provided we know and store the collection
$$\{(\sigma_i, \mathbf{u}_i, \mathbf{v}_i)\}_{i=1}^{k}$$
Which would take $k(1 + m + n)$ entries, as opposed to the original $mn$ entries of the matrix! In fact, we can have our approximation take only $k(m + n)$ entries, if we multiply the $\sigma_i$'s into one of the vectors!
This will be useful if $k(m + n)$ is much smaller than $mn$.
So suppose we have a (grayscale) picture that is $m \times n$ pixels (could be $1000 \times 1000$, say). The color of each pixel is a shade of gray, encoded as a number between 0 (black) and 1 (white).
This gives us a matrix $A$, which we can compress using rank-$k$ approximations!
Now, is there a quantitative way to gauge the quality of our compressed image?
- The error in approximation is found as the Frobenius norm,
$$\|A - A_k\|_F = \sqrt{\sigma_{k+1}^2 + \cdots + \sigma_r^2}$$
- But the above error can be large if you have many pixels! So, normalizing the above error, we can find the relative error
$$\frac{\|A - A_k\|_F}{\|A\|_F}$$
It is convenient to work with the square of this! This tells us how bad the compression is. Now, if we want to know how good it is, we can subtract it from 1! This is known as the compression rate / image quality,
$$1 - \frac{\|A - A_k\|_F^2}{\|A\|_F^2} = \frac{\sigma_1^2 + \cdots + \sigma_k^2}{\sigma_1^2 + \cdots + \sigma_r^2},$$
tracking the percentage of variance preserved by the compression.
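A closing MATLAB sketch of these metrics, using a random stand-in for the image matrix (a real photograph has rapidly decaying singular values and compresses far better than random data):

A = rand(200, 300);                          % stand-in grayscale "image" with entries in [0, 1]
s = svd(A);                                  % singular values
k = 20;                                      % rank of the approximation
quality = sum(s(1:k).^2) / sum(s.^2);        % fraction of variance preserved (image quality)
storage = k*(200 + 300 + 1) / (200*300);     % stored entries relative to the original matrix
disp([quality, storage])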