In this section, we discuss methods for analyzing the efficiency of algorithms, focusing on two main methods of analysis: Big-O notation and recurrence relations.

Big-O Notation

Motivation

Suppose we have 2 different algorithms, with runtimes $T_1(n)$ and $T_2(n)$, operating on a list of length $n$. Now suppose we run these algorithms on lists of various sizes, and obtain the following runtime data:

| $n$ | $T_1(n)$ | $T_2(n)$ |
|-----|----------|----------|
| 10  | 6        | 1        |
| 20  | 12       | 6        |
| 30  | 18       | 17       |
| 40  | 24       | 25       |
| 50  | 28       | 40       |
| 60  | 30       | 63       |
| 70  | 38       | 82       |

We could use this table to compare the two algorithms based on their time values. For example, we can see that $T_1$ is approximately better than $T_2$ for $n \ge 40$.

Is there a better way to formalize this comparison?

Well, yes! Suppose we know these functions stay within the bounds $T_1(n) \le c_1 \cdot n$ and $T_2(n) \le c_2 \cdot n^2$ for some constants $c_1, c_2$.

These bounds could be used to more formally compare the two algorithms!

The purpose of Big-O Notation is to provide a more formalized argument for performing these comparisons.

Big-O Notation

Suppose we have a function $f(x)$.

Big-O

We say $f(x) = O(g(x))$ to mean:

There exist constants $C > 0$ and $k$ such that $|f(x)| \le C \cdot |g(x)|$ for all $x > k$.

In other words, $g(x)$ is a function such that for large values ($x > k$), $f(x)$ is less than some multiple of $g(x)$!

We call $g(x)$ the Big-O of $f(x)$. We can intuitively think of $g(x)$ as an upper bound for $f(x)$.

Given this definition, to prove that a function $f(x) = O(g(x))$, we need to find some $C$ and $k$ and explicitly show they satisfy this definition. See the example below.
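For instance (an illustrative choice of $f$), consider proving $3x^2 + 5 = O(x^2)$. For $x > 3$, we have $5 < x^2$, so

$$3x^2 + 5 \le 3x^2 + x^2 = 4x^2.$$

Thus, taking $C = 4$ and $k = 3$ satisfies the definition, and $3x^2 + 5 = O(x^2)$.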

Non-Uniqueness of Big-O Functions

Note that Big-O functions are not unique! There are a multitude of functions $g$ that can satisfy the Big-O definition for a given $f$.

For example, suppose $f(x) = O(x^2)$. If this is the case, then we can also say that $f(x) = O(x^3)$, $f(x) = O(x^4)$, and so on, since any function bounding $x^2$ from above also bounds $f(x)$.

Big-Omega

Additionally, we can say $f(x) = \Omega(g(x))$ to mean:

There exist constants $C > 0$ and $k$ such that $|f(x)| \ge C \cdot |g(x)|$ for all $x > k$.

In other words, $g(x)$ is a function such that for large values ($x > k$), $f(x)$ is greater than some multiple of $g(x)$!

We call $g(x)$ the Big-Omega of $f(x)$. We can intuitively think of $g(x)$ as a lower bound for $f(x)$.

Similar to Big-O, we can prove $f(x) = \Omega(g(x))$ by finding values $C$ and $k$ satisfying our definition. See the example below.
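Continuing the illustrative choice from before, we can show $3x^2 + 5 = \Omega(x^2)$: for all $x > 0$,

$$3x^2 + 5 \ge 3x^2 \ge x^2,$$

so taking $C = 1$ and $k = 0$ satisfies the definition.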

Big-Theta

Note that while Big-O provides an upper bound, and Big-Omega provides a lower bound, a variety of functions can serve as these upper / lower bounds! Thus, Big-O and Big-Omega alone may not necessarily provide a good benchmark for a function.

To address this, we define $f(x) = \Theta(g(x))$ to mean

$$f(x) = O(g(x)) \quad \text{and} \quad f(x) = \Omega(g(x))$$

In other words, $g(x)$ is a function that serves as both an upper and lower bound for $f(x)$!

We call $g(x)$ the Big-Theta of $f(x)$. To prove $f(x) = \Theta(g(x))$, we simply show that $f(x) = O(g(x))$ and $f(x) = \Omega(g(x))$.

We should generally prefer to use Big-Theta when possible.

If we cannot easily find explicit values satisfying our definition, we can also employ limits to prove $f(x) = O(g(x))$ (or $\Omega$, or $\Theta$).

Theorem: Limit Theorem, Big-O

Let $f(x)$ and $g(x)$ be functions such that

$$L = \lim_{x \to \infty} \frac{f(x)}{g(x)}$$

exists.

Then, the following are true:

1. If $L < \infty$, then $f(x) = O(g(x))$.
2. If $L > 0$, then $f(x) = \Omega(g(x))$.

Thus, if both hold true (i.e. $0 < L < \infty$), then $f(x) = \Theta(g(x))$!

We apply the limit theorem in an example below.
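For instance, taking $f(x) = 3x^2 + 5$ and $g(x) = x^2$ (the same illustrative function as before),

$$\lim_{x \to \infty} \frac{3x^2 + 5}{x^2} = \lim_{x \to \infty} \left( 3 + \frac{5}{x^2} \right) = 3,$$

which is both finite and positive. Hence $3x^2 + 5 = \Theta(x^2)$, with no explicit $C$ or $k$ required.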

Code Analysis with Big-O

Nice Functions

Big-O notation is especially important in computer science, as it serves as a benchmark for comparing algorithms with one another.

Typically we’ll have some algorithm that depends on some varying $n$, where $n$ can be the length of a list, the number of loop iterations, etc. Based on this algorithm, we can form a function $T(n)$ representing its runtime complexity.

How can we compare these functions, when there are potentially limitless functions to compare?

Well, computer scientists have settled on a collection of nice functions which can easily be compared with one another. A non-comprehensive list of these functions is given below, in order of slowest to fastest growing:

$$1, \quad \log_2 n, \quad n, \quad n \log_2 n, \quad n^2, \quad n^3, \quad 2^n, \quad n!$$

Use of $\log_2 n$ over $\log n$ (in any other base) does not typically matter, as logarithms in different bases differ only by a constant factor, but we tend to prefer $\log_2 n$ when we’re restricted to integers.

Choosing the respective Big-Theta function for our time function $T(n)$ will give us an idea of how fast (or slow) our algorithm runs!

Big-O Algorithm Analysis: Intuition

Apart from using definitions and limit theorems, we can often intuitively find a function’s Big-Theta! For a given function $f(n)$, the Big-Theta is often just its fastest growing term (ignoring constant coefficients).

For example, $f(n) = 3n^2 + 5n + 7$ has fastest growing term $3n^2$, so $f(n) = \Theta(n^2)$.

Note that a slow-growing $\Theta$ for an algorithm does not necessarily mean it’s good for all values $n$! For instance, $1000n = \Theta(n)$ exceeds $n^2 = \Theta(n^2)$ for all $n < 1000$.

Determining $T(n)$

Say we have a block of code with some input whose size is $n$, and we want to know $T(n)$.

Note that every statement in a block of code has overhead, and takes time. However, some statements “matter more” than others, as they contribute more meaningfully to the time complexity.

We can generalize this in the following rules:

  1. Precedence: We don’t need to include a time function ($g(n)$) provided there is a faster growing function in the analysis.
  2. Loops: We don’t need to include the time it takes for a loop iteration to occur, provided the loop body takes time.
  3. Conditionals: We don’t need to include the time it takes for a conditional to evaluate, provided the conditional body takes time.

See the below example.

Example: Algorithm Analysis

Suppose we have the following, with each statement taking some amount of time:

sum = 0              # a time
for i in range(n):   # b time for each iteration
    sum += i         # c time

We would find total time $T(n) = a + (b + c) \cdot n$. However, if all we want is the most relevant time complexity, then

  1. We can ignore $a$, as it has an insignificant effect on the time complexity compared to the loop.
  2. We can ignore $b$, as the body of the loop takes time ($c$).

This gives us $T(n) \approx c \cdot n = \Theta(n)$.
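The same rules extend to nested loops. As a hypothetical second example (the function name is our own):

def pair_sum(n):
    total = 0                  # constant time; ignored by the precedence rule
    for i in range(n):         # loop overhead ignored, since the body takes time
        for j in range(n):     # inner loop runs n times per outer iteration
            total += i * j     # constant time, executed n * n times total
    return total

The innermost statement executes $n^2$ times, so $T(n) = \Theta(n^2)$.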

Auxiliary Space

Oftentimes, alongside time complexity analysis, it may be helpful to analyze the amount of extra space (auxiliary space) the algorithm uses, in terms of the input size $n$.

When calculating auxilliary space, we do not count the memory used by the input.
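As a sketch (function names are our own), contrast two ways of summing a list:

def sum_in_place(lst):
    total = 0            # one extra variable: O(1) auxiliary space
    for x in lst:
        total += x
    return total

def sum_with_copy(lst):
    copy = list(lst)     # a full copy of the input: O(n) auxiliary space
    total = 0
    for x in copy:
        total += x
    return total

Both run in $\Theta(n)$ time, but the first uses $O(1)$ auxiliary space while the second uses $O(n)$; the input list itself is not counted in either case.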

Recurrence Relations

While Big-O Notation is powerful, we will sometimes have difficulty determining the $T(n)$ of an algorithm, which will prevent us from determining a Big-O. Many of these cases, in fact, turn out to involve recursive functions.

Here, we propose a method of determining the time complexity of recursive functions, using recurrence relations.

A recurrence relation is an equation defining the value of a function (often $T(n)$) in terms of its own earlier values.

Typically, we’ll have one (or more) base cases as well.

Example: Recurrence Relations

$$T(n) = T(n/2) + c, \qquad T(1) = b$$

where $b$ and $c$ are constants.

This is in fact the recurrence relation for binary search, as the time required to process a list of length $n$ equals the constant-time check of the list’s center, plus the time required to process the resulting sublist of length $n/2$.
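To see where this recurrence comes from, here is a standard recursive binary search sketch (our own illustration):

def binary_search(lst, target, lo=0, hi=None):
    # T(n) = T(n/2) + c: constant work per call, then recurse on half the range.
    if hi is None:
        hi = len(lst)
    if lo >= hi:                 # base case: empty range (the b term)
        return -1
    mid = (lo + hi) // 2         # constant-time check of the center (the c term)
    if lst[mid] == target:
        return mid
    elif lst[mid] < target:
        return binary_search(lst, target, mid + 1, hi)   # right half
    else:
        return binary_search(lst, target, lo, mid)       # left half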

Given these recurrence relations, we can solve for specific values of $n$! We can do this by recursively plugging values into our recurrence relation, until we reach a base case.

Example: Solving for Specific Values

Consider the previous recurrence relation.

We can find various times $T(n)$ given specific input sizes $n$.
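For instance, taking $n = 8$ (a power of 2, so the halving stays an integer):

$$T(8) = T(4) + c = T(2) + 2c = T(1) + 3c = b + 3c.$$

Similarly, $T(4) = b + 2c$ and $T(2) = b + c$.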

Note that recurrence relations will often need ceilings or floors (e.g. $T(\lceil n/2 \rceil)$) to ensure that our values remain integers (otherwise things don’t make sense).

Technique 1: Digging Down

We can use these recurrence relations to determine the $\Theta$ of a function!

  1. Using the relation to expand itself, we can continuously expand it until we find some general pattern.
  2. Then, we solve for when this pattern ends with regard to the base case.
  3. Finally, we sub in this base case to find our final time complexity.

This technique is known as digging down.

Note that this technique only works for simple relations which have an easily identifiable pattern.

Example: Digging Down

Consider the previous recurrence relation.

Notice that

$$T(n) = T(n/2) + c = T(n/4) + 2c = T(n/8) + 3c = \cdots = T(n/2^k) + kc$$

Note that by our base case, we will stop this expansion when

$$\frac{n}{2^k} = 1 \quad \Longleftrightarrow \quad k = \log_2 n$$

Giving us a final value of

$$T(n) = T(1) + c \log_2 n = b + c \log_2 n$$

This gives us a time complexity of $\Theta(\log n)$!
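As a quick sanity check (a sketch of ours, with arbitrary constants $b = c = 1$), we can compare the recurrence against this closed form:

import math

def T(n, b=1, c=1):
    # Binary search recurrence: T(1) = b, T(n) = T(n // 2) + c.
    if n == 1:
        return b
    return T(n // 2, b, c) + c

# For powers of 2, the closed form b + c * log2(n) matches exactly.
for n in [1, 2, 8, 64, 1024]:
    assert T(n) == 1 + math.log2(n)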

Technique 2: Recurrence Trees

Another method of solving recurrence relations involves the use of a recurrence tree. See the example below.

Consider the recurrence relation given by

$$T(n) = 2T(n/2) + n, \qquad T(1) = 1$$

Say we want to find some value $T(n)$. We can do this by drawing a tree to represent our summation, where every node represents the non-recursive work of one call ($n$ at the root, $n/2$ for each of its two children, and so on). The combination of all the nodes in this tree will therefore sum to $T(n)$.

graph TD
      r[n] -.-> t1[n/2] & t2[n/2];

      t1 -.-> t3[n/4] & t4[n/4];
      t2 -.-> t5[n/4] & t6[n/4];
      
      t3 -.-> t7[n/8] & t8[n/8];
      t4 -.-> t9[n/8] & t10[n/8];
      t5 -.-> t11[n/8] & t12[n/8];
      t6 -.-> t13[n/8] & t14[n/8];

Note that the nodes on every level sum to $n$ total work. Thus, we can find where our levels end by finding when

$$\frac{n}{2^k} = 1 \quad \Longleftrightarrow \quad k = \log_2 n$$

We can then sum up all our results to find our total time. We make a table to represent this for us.

| Level | Num Nodes | Time / Node | Level Total |
|-------|-----------|-------------|-------------|
| $0$ | $1$ | $n$ | $n$ |
| $1$ | $2$ | $n/2$ | $n$ |
| $2$ | $4$ | $n/4$ | $n$ |
| $\vdots$ | $\vdots$ | $\vdots$ | $\vdots$ |
| $k$ | $2^k$ | $n/2^k$ | $n$ |
| $\log_2 n$ | $n$ | $1$ | $n$ |

Using this table, we find the total time as the sum of the level totals (rightmost column): there are $\log_2 n + 1$ levels, each contributing $n$, so $T(n) = n(\log_2 n + 1) = \Theta(n \log n)$.

Master Theorem

Note that in many of the above recurrence relation examples, there’s a common pattern among them. It appears as though we can generalize most recurrence relations in the form:

$$T(n) = a \cdot T(n/b) + f(n)$$

Given this general form, can we build a general rule for solving such recurrence relations?

Well, yes! Such a rule is given by the master theorem, described below.

Note that the master theorem provides one of the lowest level generalizations for recurrence relations. There are numerous other higher-level theorems we do not cover!

Theorem: Master Theorem

Suppose we have a recurrence relation given by $T(n) = a \cdot T(n/b) + f(n)$, with $a \ge 1$ and $b > 1$.

Then, we have 3 possible cases:

  1. If $f(n) = \Theta(n^d)$ and $d < \log_b a$, then $T(n) = \Theta(n^{\log_b a})$.

  2. If $f(n) = \Theta(n^d)$ and $d = \log_b a$, then $T(n) = \Theta(n^d \log n)$.

  3. If $f(n) = \Theta(n^d)$ and $d > \log_b a$, then $T(n) = \Theta(n^d)$.

Note: Case (3) requires a regularity condition on $f(n)$, which generally will be satisfied by most functions (and thus will not be our primary focus).
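As a small sketch (the function and its interface are our own), the polynomial form of these cases is mechanical enough to encode directly:

import math

def master_theorem(a, b, d):
    # Classify T(n) = a * T(n/b) + Theta(n^d) using the three cases above.
    critical = math.log(a, b)          # log_b(a)
    if d < critical:
        return f"Theta(n^{critical:.3g})"
    elif d == critical:
        return f"Theta(n^{d} log n)"
    else:
        return f"Theta(n^{d})"

print(master_theorem(2, 2, 1))   # merge-sort-style: Theta(n^1 log n)
print(master_theorem(1, 2, 0))   # binary-search-style: Theta(n^0 log n) = Theta(log n)
print(master_theorem(4, 2, 1))   # Theta(n^2)

Note that comparing $d$ against $\log_b a$ with floating-point equality only works for exact values like these; it is a sketch, not a robust tool.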

Example: Master Theorem (1)

Consider the recurrence $T(n) = 2T(n/2) + n$ (the same relation as in the recurrence tree example). First, observe that $a = 2$ and $b = 2$. Furthermore, observe that $f(n) = n = \Theta(n^1)$, so $d = 1$.

We can see that $\log_b a = \log_2 2 = 1$, and furthermore, $d = \log_b a$. Thus, we can use property (2) of the master theorem.

By property (2) of the master theorem, we have

$$T(n) = \Theta(n^1 \log n) = \Theta(n \log n)$$