(Linear) Discrete Dynamical Systems
We first review some concepts around eigenvalues and eigenvectors.
An eigenvector for $A$ is a non-zero vector $\vec{v}$ with the property that
$$A\vec{v} = \lambda\vec{v}$$
where $\lambda$ is some scalar value. In other words, $A\vec{v}$ is a scalar multiple of $\vec{v}$.
We can solve for eigenvectors using the following system:
$$(A - \lambda I)\vec{v} = \vec{0}$$
where $I$ is the identity matrix.
If $\lambda$ is an eigenvalue for $A$, then the following are also true.
- There is a non-zero $\vec{v}$ such that $A\vec{v} = \lambda\vec{v}$.
- There is a non-zero $\vec{v}$ such that $(A - \lambda I)\vec{v} = \vec{0}$.
- $A - \lambda I$ is not invertible.
Additionally, if $\lambda$ is an eigenvalue, then we know that $\lambda$ satisfies the characteristic equation
$$\det(A - \lambda I) = 0$$
So, generally, the process is:
- Find the eigenvalues for $A$ by solving the characteristic equation $\det(A - \lambda I) = 0$ for $\lambda$.
- Find eigenvectors for each eigenvalue by solving the system $(A - \lambda I)\vec{v} = \vec{0}$.
Example: Eigenvectors and Eigenvalues
We find the eigenvalues first by solving the characteristic equation $\det(A - \lambda I) = 0$.
We can now find our eigenvectors. Say we want an eigenvector for the eigenvalue $\lambda$. Then, we solve the system $(A - \lambda I)\vec{v} = \vec{0}$; any non-zero solution $\vec{v}$ is an eigenvector for $\lambda$.
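As a quick computational sketch (the $2 \times 2$ matrix below is a made-up example, not necessarily the one from these notes), numpy can carry out both steps:

```python
import numpy as np

# A hypothetical 2x2 matrix, used only to illustrate the process.
A = np.array([[4.0, 1.0],
              [2.0, 3.0]])

# np.linalg.eig returns the eigenvalues and a matrix whose columns are eigenvectors.
eigenvalues, eigenvectors = np.linalg.eig(A)
print("eigenvalues:", eigenvalues)      # the eigenvalues 5 and 2, in some order

# Check the defining property A v = lambda v for the first eigenpair.
lam, v = eigenvalues[0], eigenvectors[:, 0]
print(np.allclose(A @ v, lam * v))      # True
```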
An $n \times n$ matrix $A$ is diagonalizable if there is an invertible matrix $P$ and a diagonal matrix $D$ such that
$$A = PDP^{-1}$$
Theorem: Eigenvectors of Diagonalizable Matrices
An $n \times n$ matrix $A$ is diagonalizable if and only if $A$ has $n$ linearly independent eigenvectors $\vec{v}_1, \dots, \vec{v}_n$.
In this case, $A = PDP^{-1}$ where $P$’s columns are the eigenvectors, and $D$’s diagonal values are their respective eigenvalues.
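Continuing the sketch above, we can verify the factorization numerically for the same assumed example matrix:

```python
import numpy as np

# Same hypothetical matrix as above; P's columns are eigenvectors, D holds the eigenvalues.
A = np.array([[4.0, 1.0],
              [2.0, 3.0]])
eigenvalues, P = np.linalg.eig(A)
D = np.diag(eigenvalues)

# If A is diagonalizable, reassembling P D P^{-1} recovers A.
print(np.allclose(P @ D @ np.linalg.inv(P), A))  # True
```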
Let’s see how we can use these concepts.
Suppose we have a population of predators (e.g. hawks) who live among a population of prey (e.g. rats). Let $h_n$ be the number of hawks after $n$ months have passed, and $r_n$ be the number of rats (in thousands) after $n$ months. We’ll only consider whole numbers of months.
We can represent and make inferences about this system as follows. Collect the populations into the vector
$$\vec{x}_n = \begin{bmatrix} h_n \\ r_n \end{bmatrix}$$
Example: Finding $\vec{x}_n$
Assume that these populations evolve according to a linear model of the form
$$h_{n+1} = a\,h_n + b\,r_n, \qquad r_{n+1} = c\,h_n + d\,r_n,$$
that is, $\vec{x}_{n+1} = A\vec{x}_n$ for a fixed $2 \times 2$ matrix $A$ of coefficients.
Note that if we know this, then given the populations in some month $n$, we should be able to compute the populations in the next month $n + 1$!
Now say we know the initial populations $h_0$ and $r_0$, and hence the vector $\vec{x}_0$.
To find $\vec{x}_n$, we find that
$$\vec{x}_n = A\vec{x}_{n-1} = A(A\vec{x}_{n-2}) = \cdots = A^n\vec{x}_0$$
This gives us a nice way to find any $\vec{x}_n$ given our initial conditions $\vec{x}_0$!
We can plug in our values to find what the populations are for any month $n$.
For this to be useful for large $n$, we want to be able to find a formula for $A^n$. How could we do this?
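As a rough sketch of this iteration, assuming a made-up model matrix and made-up initial populations (not the values from this example):

```python
import numpy as np

# Hypothetical model matrix and initial populations (hawks, rats in thousands);
# these numbers are illustrative placeholders only.
A = np.array([[0.5, 0.4],
              [-0.1, 1.1]])
x = np.array([10.0, 20.0])       # x_0 = [h_0, r_0]

# Step forward one month at a time: x_{n+1} = A x_n.
for n in range(12):
    x = A @ x
print("populations after 12 months:", x)

# Equivalently, x_12 = A^12 x_0 in one shot.
x0 = np.array([10.0, 20.0])
print(np.linalg.matrix_power(A, 12) @ x0)
```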
Diagonal Matrices
If $A$ were diagonal, this would be really easy to solve! The power of any diagonal matrix is just its entries raised to each power individually:
$$\begin{bmatrix} d_1 & 0 \\ 0 & d_2 \end{bmatrix}^n = \begin{bmatrix} d_1^n & 0 \\ 0 & d_2^n \end{bmatrix}$$
What if $A$ is diagonalizable?
Then, interestingly enough,
$$A^n = (PDP^{-1})^n = PD^nP^{-1},$$
since all of the inner $P^{-1}P$ factors cancel. This can be used to conveniently find $A^n$! We can then multiply this with $\vec{x}_0$ to find a closed formula for $\vec{x}_n$.
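A small sketch of this trick, again with an assumed example matrix:

```python
import numpy as np

# Computing A^n via A^n = P D^n P^{-1}; A is again a made-up example.
A = np.array([[4.0, 1.0],
              [2.0, 3.0]])
eigenvalues, P = np.linalg.eig(A)
n = 10

# D^n is cheap: just raise each diagonal entry (eigenvalue) to the n-th power.
Dn = np.diag(eigenvalues ** n)
An = P @ Dn @ np.linalg.inv(P)

print(np.allclose(An, np.linalg.matrix_power(A, n)))  # True
```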
Example: Finding $A^n$
In our above system, we can diagonalize our matrix as $A = PDP^{-1}$, with the eigenvectors of $A$ as the columns of $P$ and the eigenvalues on the diagonal of $D$, and find
$$\vec{x}_n = A^n\vec{x}_0 = PD^nP^{-1}\vec{x}_0$$
We can see what happens as $n \to \infty$: the term involving the largest eigenvalue dominates, and in this example the populations won’t die off. For large $n$,
$$\vec{x}_n \approx c\,\lambda_1^n\,\vec{v}_1,$$
where $\lambda_1$ is the largest eigenvalue, $\vec{v}_1$ is its eigenvector, and $c$ is a constant determined by $\vec{x}_0$. Note that the entries of $\vec{x}_n$ are then (approximately) proportional to the entries of $\vec{v}_1$, which tells us what the stable proportion of hawks to rats is in the long term!
Another application of these systems is in Recurrence Relations. Consider the following example.
Example: Recurrence Relations
The Fibonacci Numbers are given as
$$F_0 = 0, \quad F_1 = 1, \quad F_{n+1} = F_n + F_{n-1} \text{ for } n \geq 1$$
Can we find a closed formula for $F_n$?
To know what comes next, we need a pair of consecutive Fibonacci numbers. Let $\vec{x}_n = \begin{bmatrix} F_{n+1} \\ F_n \end{bmatrix}$. Then,
$$\vec{x}_{n+1} = \begin{bmatrix} F_{n+2} \\ F_{n+1} \end{bmatrix} = \begin{bmatrix} 1 & 1 \\ 1 & 0 \end{bmatrix} \begin{bmatrix} F_{n+1} \\ F_n \end{bmatrix} = A\vec{x}_n$$
We’ve found a formula for $\vec{x}_{n+1}$ in terms of $\vec{x}_n$ (with initial condition $\vec{x}_0 = \begin{bmatrix} F_1 \\ F_0 \end{bmatrix} = \begin{bmatrix} 1 \\ 0 \end{bmatrix}$).
We can solve for $\vec{x}_n$ as
$$\vec{x}_n = A^n\vec{x}_0 = PD^nP^{-1}\vec{x}_0$$
to find a closed form solution.
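A short sketch of both routes, assuming the convention $F_0 = 0$, $F_1 = 1$ (the golden-ratio constants below come from diagonalizing the matrix):

```python
import numpy as np

# Fibonacci via the companion matrix: [F_{n+1}, F_n]^T = A^n [F_1, F_0]^T.
A = np.array([[1, 1],
              [1, 0]])

def fib(n):
    x = np.array([1, 0])                      # x_0 = [F_1, F_0]
    x = np.linalg.matrix_power(A, n) @ x
    return x[1]                               # second entry is F_n

print([fib(n) for n in range(10)])            # [0, 1, 1, 2, 3, 5, 8, 13, 21, 34]

# Diagonalizing A gives Binet's closed form: F_n = (phi^n - psi^n) / sqrt(5).
phi, psi = (1 + 5**0.5) / 2, (1 - 5**0.5) / 2
print(round((phi**9 - psi**9) / 5**0.5))      # 34, matches fib(9)
```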
Markov Chains
Suppose every year, we have the following changes:
- 10% of those living in the city decide to move to the suburbs.
- 5% of those living in the suburbs decide to move to the city.
Let $c_n$ denote the proportion of the total population living in the city after $n$ years, and $s_n$ denote the proportion of the total population living in the suburbs after $n$ years. Note that by this definition,
$$c_n + s_n = 1$$
Let $\vec{x}_n = \begin{bmatrix} c_n \\ s_n \end{bmatrix}$. This gives us the system
$$\vec{x}_{n+1} = \begin{bmatrix} 0.90 & 0.05 \\ 0.10 & 0.95 \end{bmatrix} \vec{x}_n = M\vec{x}_n$$
This is a special kind of linear discrete system called a Markov Chain! The matrix $M$ has the special property that all of its column sums equal 1.
Now suppose we initially have 40% of the population in the city and 60% of the population in the suburbs, so $\vec{x}_0 = \begin{bmatrix} 0.4 \\ 0.6 \end{bmatrix}$. Then, using our matrix, we can deduce things about our system! For example, what happens to $\vec{x}_n$ as $n \to \infty$?
While we could diagonalize our matrix like in linear discrete systems, the unique properties of Markov Chains give us a more convenient way to do this!
Theorem
Say we have a Markov Chain $\vec{x}_{n+1} = M\vec{x}_n$ such that
$$\vec{x}_\infty = \lim_{n \to \infty} \vec{x}_n$$
exists and is non-zero. Then, $\vec{x}_\infty$ is an eigenvector for $M$ with eigenvalue $1$.
Given the above theorem, we can find what we converge to by solving for the eigenvalues and eigenvectors! In our previous example, we find eigenvalue $\lambda = 1$ and eigenvectors that are scalar multiples of $\begin{bmatrix} 1 \\ 2 \end{bmatrix}$.
Note that by the assumption that the proportions should sum to 1, there is only one eigenvector that satisfies our requirements: $\vec{q} = \begin{bmatrix} 1/3 \\ 2/3 \end{bmatrix}$.
This is what our system will converge to for our input, and in fact, for any valid input to the system!
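A quick numerical check of this example (the matrix below is assembled from the stated 10% and 5% moving rates):

```python
import numpy as np

# Transition matrix: columns are "from city" and "from suburbs".
M = np.array([[0.90, 0.05],
              [0.10, 0.95]])
x = np.array([0.40, 0.60])    # 40% city, 60% suburbs initially

# Iterating the chain converges to the steady state.
for _ in range(200):
    x = M @ x
print(x)                      # approximately [1/3, 2/3]

# The same vector is the eigenvector of M for eigenvalue 1, rescaled to sum to 1.
vals, vecs = np.linalg.eig(M)
q = vecs[:, np.argmin(np.abs(vals - 1))]
print(q / q.sum())            # approximately [0.333..., 0.666...]
```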
A probability vector is a vector $\vec{x}$ such that $x_i \geq 0$ for all $i$, and
$$\sum_i x_i = 1$$
An $n \times n$ matrix $M$ is called a stochastic matrix if every column of $M$ is a probability vector.
Theorem: Stochastic Matrices and Probability Vectors
If $M$ is stochastic and $\vec{x}$ is a probability vector, then $M\vec{x}$ is a probability vector.
Furthermore, the product of two stochastic matrices is stochastic (as each column of the product is a matrix-vector product of a stochastic matrix with a probability vector).
So, if $M$ is stochastic, then $M^k$ is stochastic for any positive integer $k$.
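A quick numerical sanity check of this fact, reusing the city/suburbs matrix from above:

```python
import numpy as np

# Powers of a stochastic matrix stay stochastic: every column still sums to 1.
M = np.array([[0.90, 0.05],
              [0.10, 0.95]])
for k in (1, 2, 5, 10):
    print(k, np.linalg.matrix_power(M, k).sum(axis=0))  # each column sum is 1.0
```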
A Markov Chain is a dynamical system
$$\vec{x}_{n+1} = M\vec{x}_n$$
where $M$ is stochastic, and $\vec{x}_0$ is a probability vector. By the above theorem, it follows that all $\vec{x}_n$ are also probability vectors.
In a Markov Chain, the stochastic matrix $M$ is sometimes called a transition matrix.
Theorem
If $M$ is stochastic, then $\lambda = 1$ is an eigenvalue for $M$.
Proof (Sketch)
It can be shown that $M$ and $M^T$ always have the same eigenvalues. Because the rows of $M^T$ are probability vectors (they are the columns of $M$), the all-ones vector times $M^T$ is itself, so $1$ is an eigenvalue of $M^T$, and hence of $M$.
A stochastic matrix $M$ is called regular if there is some integer $k \geq 1$ such that all entries of $M^k$ are strictly positive (nonzero).
If $M^k$ has all positive entries, then so does $M^j$ for every $j \geq k$. So, one way to verify that a matrix is regular is just to raise it to a high power and check whether all of the entries are positive!
Example: Regular Stochastic Matrices
The first example matrix has all positive entries, so it is regular with $k = 1$.
The second example matrix has some zero entries, but one of its powers has all positive entries, so it is regular (with $k$ equal to that power).
The third example matrix is upper triangular, and a product of upper triangular matrices is upper triangular, so every power of it keeps a zero entry below the diagonal. Thus, it is not regular.
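A sketch of that check; the two matrices below are made-up illustrations mirroring the regular and non-regular cases above:

```python
import numpy as np

def is_regular(M, max_power=50):
    # A stochastic matrix is regular if some power has strictly positive entries.
    # Checking powers up to max_power is a practical (not exhaustive) test.
    P = np.eye(M.shape[0])
    for k in range(1, max_power + 1):
        P = P @ M
        if np.all(P > 0):
            return True, k
    return False, None

# M1 has a zero entry but M1^2 is all-positive (regular, k = 2);
# M2 is upper triangular, so no power is ever all-positive (not regular).
M1 = np.array([[0.0, 0.5],
               [1.0, 0.5]])
M2 = np.array([[1.0, 0.3],
               [0.0, 0.7]])
print(is_regular(M1))   # (True, 2)
print(is_regular(M2))   # (False, None)
```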
A steady state vector for a stochastic matrix $M$ is a probability vector $\vec{q}$ such that
$$M\vec{q} = \vec{q}$$
In other words, a probability vector that is an eigenvector for $M$ for $\lambda = 1$.
Theorem
Suppose $M$ is a regular stochastic matrix. Then, there is a unique steady state vector $\vec{q}$ for $M$.
Further, if $\vec{x}_0$ is any probability vector, then
$$\lim_{n \to \infty} M^n \vec{x}_0 = \vec{q}$$
This is an important result we use to solve Markov Chains!
Example
The weather in Columbus is either good, indifferent, or bad on any given day.
- If good today, then for tomorrow we have a 60% chance of good, 30% chance of indifferent, and 10% chance of bad weather.
- If indifferent today, then for tomorrow we have a 40% chance of good, 30% chance of indifferent, and 30% chance of bad weather.
- If bad today, then for tomorrow we have a 40% chance of good, 50% chance of indifferent, and 10% chance of bad weather.
What is the probability that any given day has good weather?
We start by creating the transition matrix (rows and columns ordered good, indifferent, bad; column $j$ gives tomorrow’s probabilities given today’s weather $j$)
$$P = \begin{bmatrix} 0.6 & 0.4 & 0.4 \\ 0.3 & 0.3 & 0.5 \\ 0.1 & 0.3 & 0.1 \end{bmatrix}$$
This is a Markov Chain with a regular $P$! So, by our theorem, we can find a unique steady state vector $\vec{q}$.
Recall the steady state vector has to be a probability vector! So, we need to normalize the eigenvector we find for $\lambda = 1$. Solving $(P - I)\vec{q} = \vec{0}$ and scaling so the entries sum to 1 gives
$$\vec{q} = \begin{bmatrix} 1/2 \\ 1/3 \\ 1/6 \end{bmatrix}$$
So in the long term, $1/2$ of the days are good, $1/3$ are indifferent, and $1/6$ of the days are bad.
Note that if $\vec{x}_0$ contains probabilities for the weather today, then $P^n \vec{x}_0$ contains the probabilities $n$ days from now.
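A sketch of this computation, with the transition matrix encoding the probabilities listed above:

```python
import numpy as np

# Weather transition matrix (rows/columns ordered good, indifferent, bad).
P = np.array([[0.6, 0.4, 0.4],
              [0.3, 0.3, 0.5],
              [0.1, 0.3, 0.1]])

# Steady state = eigenvector for eigenvalue 1, rescaled into a probability vector.
vals, vecs = np.linalg.eig(P)
q = np.real(vecs[:, np.argmin(np.abs(vals - 1))])
q = q / q.sum()
print(q)   # approximately [0.5, 0.3333, 0.1667] = [1/2, 1/3, 1/6]
```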
Interpreting $P^n$
The $(i, j)$ entry of $P$, $P_{ij}$, represents the probability of moving from state $j$ to state $i$. What about the entries of $P^n$?
The $(i, j)$ entry of $P^n$ represents the probability of starting at state $j$ and ending at state $i$ after $n$ steps.
Based on this, if $P$ is regular, then we know that for every pair of states $i, j$, there is some integer $n$ such that it is possible to get from state $j$ to state $i$ in $n$ steps!
Example: Random Walks
Consider the following maze with 5 rooms.
```mermaid
graph LR
  1 o--o 2 & 3;
  2 o--o 3 & 4;
  4 o--o 3;
  5 o--o 3 & 4;
```
A mouse runs through this maze with 5 rooms. At each time step (every second), the mouse will leave its current room and move to an adjacent room, choosing the next room uniformly at random.
This is a particular type of Markov Chain called a random walk. This gives us the matrix
$$M = \begin{bmatrix} 0 & 1/3 & 1/4 & 0 & 0 \\ 1/2 & 0 & 1/4 & 1/3 & 0 \\ 1/2 & 1/3 & 0 & 1/3 & 1/2 \\ 0 & 1/3 & 1/4 & 0 & 1/2 \\ 0 & 0 & 1/4 & 1/3 & 0 \end{bmatrix}$$
where column $j$ gives an equal chance of moving to each room adjacent to room $j$.
We can analyze this matrix as we’ve done previously!
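A sketch of building this matrix directly from the maze’s adjacency structure and finding its steady state:

```python
import numpy as np

# Column j of the random-walk matrix is a uniform distribution over the rooms
# adjacent to room j, read off the maze diagram above.
neighbors = {1: [2, 3], 2: [1, 3, 4], 3: [1, 2, 4, 5], 4: [2, 3, 5], 5: [3, 4]}

M = np.zeros((5, 5))
for j, adj in neighbors.items():
    for i in adj:
        M[i - 1, j - 1] = 1 / len(adj)

# The steady state spends more time in rooms with more doors.
vals, vecs = np.linalg.eig(M)
q = np.real(vecs[:, np.argmin(np.abs(vals - 1))])
print(q / q.sum())   # proportional to each room's number of doors: [2, 3, 4, 3, 2] / 14
```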
Google Pagerank Algorithm
The Google Pagerank algorithm is an application of Markov Chains.
Google needs to rank webpages based on which are “important”. This will be based entirely on how the webpages link to each other. The basic idea is that a webpage is important if many other pages link to it. However, being linked to by an important webpage should carry more weight than being linked to by a page no one cares about.
Google’s idea is to treat websurfing like a random walk in which the websurfer is randomly clicking links. A page is ranked based on the percentage of time the random walk spends on that page (in the steady state).
Some issues with this:
- Some pages don’t have outbound links.
- Sometimes, websurfers load up a new page without following a link.
We assume the following about the random websurfer (RW):
- RW starts at some page.
- If there are outbound links, there is an 85% chance that RW follows one of the links, each link being equally likely. There is a 15% chance that RW instead visits a page chosen at random from all possible pages (not following a link).
- If the page has no outbound links, there is a 100% chance RW visits a random page, chosen from all possible pages.
- RW continues this forever.
Example: Google Pagerank
Suppose there are 4 webpages, linked as follows:
```mermaid
graph LR
  3 --> 1;
  2 --> 3;
  1 --> 2 & 3;
  4 --> 2 & 3;
```
This gives us the transition matrix
$$G = 0.85\begin{bmatrix} 0 & 0 & 1 & 0 \\ 1/2 & 0 & 0 & 1/2 \\ 1/2 & 1 & 0 & 1/2 \\ 0 & 0 & 0 & 0 \end{bmatrix} + 0.15\begin{bmatrix} 1/4 & 1/4 & 1/4 & 1/4 \\ 1/4 & 1/4 & 1/4 & 1/4 \\ 1/4 & 1/4 & 1/4 & 1/4 \\ 1/4 & 1/4 & 1/4 & 1/4 \end{bmatrix}$$
where column $j$ describes where a surfer currently on page $j$ goes next.
Note that if a page had no outbound links, then that page would have a 100% chance of going somewhere random. We handle this by filling that page’s column, in both the 0.85 (link) matrix and the 0.15 (random jump) matrix, with an equal chance of going to any website.
We can then use this matrix and find its steady state vector!
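A sketch of the whole computation, with the link matrix read off the diagram above:

```python
import numpy as np

# Link matrix A: column j spreads page j's outbound links equally.
A = np.array([[0,   0, 1, 0  ],    # page 1 is linked to by page 3
              [0.5, 0, 0, 0.5],    # page 2 is linked to by pages 1 and 4
              [0.5, 1, 0, 0.5],    # page 3 is linked to by pages 1, 2, and 4
              [0,   0, 0, 0  ]])   # nothing links to page 4

# Google matrix: 85% follow a link, 15% jump to a uniformly random page.
G = 0.85 * A + 0.15 * np.full((4, 4), 1 / 4)

# The PageRank vector is the steady state of G.
vals, vecs = np.linalg.eig(G)
q = np.real(vecs[:, np.argmin(np.abs(vals - 1))])
print(q / q.sum())   # page 4 ranks lowest, since nothing links to it
```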
Absorbing Markov Chains
Here we consider a different type of Markov chain, where the underlying system has distinct ending points. In these systems, we ask about the ending state of the system, rather than (as we’re used to) the equilibrium point as time goes to infinity.
In tennis, you have to win by at least 2 points. If both players have scored 3 points, the game enters “deuce”.
During a deuce, the game flow is
```mermaid
graph LR
1[Deuce] -. Player A Scores .-> 2[Advantage A];
1 -. Player B Scores .-> 3[Advantage B];
2 -. Player B Scores .-> 1;
3 -. Player A Scores .-> 1;
2 -. Player A Scores .-> 4[A Wins];
3 -. Player B Scores .-> 5[B Wins];
```
Suppose at any given moment, Player A scores the next point with probability $p$, so Player B scores with probability $1 - p$. We’ll assume $p = 0.6$.
What is the probability that Player A wins?
We can model this as a Markov Chain
```mermaid
graph LR
1[1] -. "0.6" .-> 2[2];
1 -. "0.4" .-> 3[3];
2 -. "0.4" .-> 1;
3 -. "0.6" .-> 1;
2 -. "0.6" .-> 4[4];
3 -. "0.4" .-> 5[5];
4 -. "1" .-> 4;
5 -. "1" .-> 5;
```
Note the transitions at the end points back to themselves. These are called absorbing states, and we define them so that we have a valid Markov Chain.
This gives us the transition matrix (states ordered 1 through 5 as in the diagram)
$$T = \begin{bmatrix} 0 & 0.4 & 0.6 & 0 & 0 \\ 0.6 & 0 & 0 & 0 & 0 \\ 0.4 & 0 & 0 & 0 & 0 \\ 0 & 0.6 & 0 & 1 & 0 \\ 0 & 0 & 0.4 & 0 & 1 \end{bmatrix}$$
States 4,5 are called absorbing states, as they are states you can’t leave. States 1,2,3 are called transient states. This is an absorbing Markov Chain.
Note that $T$ is not regular, and we are not interested in simply finding the steady state vectors of the system.
If the game starts in deuce, then our starting point is $\vec{x}_0 = \begin{bmatrix} 1 & 0 & 0 & 0 & 0 \end{bmatrix}^T$, and
$$\vec{x}_n = T^n\vec{x}_0$$
gives the probabilities of being in each of the 5 states after $n$ scores. If we carry out some of these computations for increasingly large $n$, we start to see our final probabilities for the game’s outcome, and we can say that Player A has a 69.23% chance to win, and Player B has a 30.77% chance to win!
In general, we find that our answers came from
$$\lim_{n \to \infty} T^n \vec{x}_0$$
So, we want a better understanding of $T^n$. Given our $T$, let’s first separate our transient states from our absorbing states, both in columns and rows. This gives us a $2 \times 2$ grid of blocks,
$$T = \begin{bmatrix} Q & 0 \\ S & I \end{bmatrix}$$
where $Q$ (transitions among transient states) and $S$ (transitions from transient to absorbing states) are matrices, $0$ is a zero matrix, and $I$ is the identity matrix.
Consider powers of $T$:
$$T^2 = \begin{bmatrix} Q^2 & 0 \\ S(I + Q) & I \end{bmatrix}, \qquad T^3 = \begin{bmatrix} Q^3 & 0 \\ S(I + Q + Q^2) & I \end{bmatrix}$$
Continuing this for any $n$, we find the general form
$$T^n = \begin{bmatrix} Q^n & 0 \\ S(I + Q + \cdots + Q^{n-1}) & I \end{bmatrix}$$
So, since $Q^n \to 0$ (every transient state is eventually left for good),
$$\lim_{n \to \infty} T^n = \begin{bmatrix} 0 & 0 \\ S(I + Q + Q^2 + \cdots) & I \end{bmatrix}$$
Interestingly, the sum $I + Q + Q^2 + \cdots$ is equivalent to $(I - Q)^{-1}$! This is where we’ll find all of our interesting probabilities.
Theorem: Absorbing Markov Chains
For an absorbing Markov Chain,
- The $(i, j)$ entry of $S(I - Q)^{-1}$ contains the probability of ending in absorbing state $i$ given that we start in transient state $j$.
- The $(i, j)$ entry of $(I - Q)^{-1}$ contains the expected number of visits to transient state $i$ given we start in transient state $j$.
- The sum of column $j$ of $(I - Q)^{-1}$ is the expected number of time steps until we reach an absorbing state, given we start in transient state $j$.
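As a closing sketch, here is this computation for the tennis example, with the transient states (deuce, advantage A, advantage B) ordered before the absorbing states (A wins, B wins), using the $p = 0.6$ assumption from above:

```python
import numpy as np

# Blocks of T for the tennis chain.
Q = np.array([[0.0, 0.4, 0.6],    # transient -> transient
              [0.6, 0.0, 0.0],
              [0.4, 0.0, 0.0]])
S = np.array([[0.0, 0.6, 0.0],    # transient -> absorbing (A wins, B wins)
              [0.0, 0.0, 0.4]])

N = np.linalg.inv(np.eye(3) - Q)   # (I - Q)^{-1}: expected visits to transient states
B = S @ N                          # absorption probabilities

print(B[:, 0])        # starting from deuce: [0.6923..., 0.3077...]
print(N.sum(axis=0))  # expected number of scores until the game ends, per starting state
```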