Here, we will explore the theory behind symmetric key encryption: encryption in which both parties send secure messages to each other using a single shared (and private) key.

Sometimes, we call symmetric key encryption schemes ciphers.

By the end, we will have built the theory behind the symmetric key schemes used in the internet today.

One of the driving principles behind symmetric key encryption is as follows:

Kerckhoffs' Principle

A good cipher scheme should be able to be public, without adversaries being able to use it to decrypt messages. To do this, both parties share a secret key for encryption / decryption which adversaries are assumed to not know.

Secure schemes do not get their security from the obscurity of their design.

We always have to assume that the crypto designs are public! This is more suitable for large-scale usage of cryptography, and exposes schemes to the public eye so that they can be revised and strengthened.

Historical Ciphers

Let’s first examine some historical schemes before going into the theory.

For each of the following historical schemes, we will discuss the encryption algorithm, the decryption algorithm, the key-space (set of all possible keys) and secret key, and how to break the scheme.

Atbash Cipher

The Atbash Cipher substitutes the first letter with the last, the 2nd with the 2nd last, and so on. To encrypt and decrypt, simply swap the letters.

Because there is nothing that is secret, there is no key for this algorithm.
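As a minimal sketch of the substitution described above, the cipher can be written in a few lines of Python. The function name `atbash` and the choice to leave non-letters untouched are assumptions made for illustration.

```python
import string

_LOWER = string.ascii_lowercase
_UPPER = string.ascii_uppercase

# Map a->z, b->y, ... (and A->Z, B->Y, ...); other characters are unchanged.
ATBASH_TABLE = str.maketrans(_LOWER + _UPPER, _LOWER[::-1] + _UPPER[::-1])


def atbash(text: str) -> str:
    """Encrypt or decrypt: the substitution is its own inverse."""
    return text.translate(ATBASH_TABLE)
```

Applying `atbash` twice returns the original text, which reflects that encryption and decryption are the same keyless operation.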

Shift / Caesar Cipher

The Caesar Cipher replaces each letter by the letter which is $n$ positions away in the alphabet. On a message, this has the effect of “shifting” every letter $n$ positions in the alphabet.

As there are 26 letters in the alphabet, there are 26 possible shifts we could do. Thus, our key-space has a size of 26.

Because the key-space is so small, we can easily brute-force this cipher by trying all possible shifts! Thus, this cipher is susceptible to a brute-force search.

For a secure scheme, it’s necessary that the key space is large! However, a large key space does not imply security.
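The shift cipher and its brute-force attack can be sketched as follows. This is an illustrative version restricted to lowercase letters; the function names are assumptions, not part of the notes.

```python
import string

ALPHABET = string.ascii_lowercase


def shift_encrypt(message: str, n: int) -> str:
    """Shift each lowercase letter n positions forward, wrapping around."""
    out = []
    for ch in message:
        if ch in ALPHABET:
            out.append(ALPHABET[(ALPHABET.index(ch) + n) % 26])
        else:
            out.append(ch)  # leave spaces and punctuation as they are
    return "".join(out)


def shift_decrypt(ciphertext: str, n: int) -> str:
    """Decryption is a shift by n positions in the other direction."""
    return shift_encrypt(ciphertext, -n)


def brute_force(ciphertext: str) -> list[tuple[int, str]]:
    """Return every (key, candidate plaintext) pair; there are only 26."""
    return [(n, shift_decrypt(ciphertext, n)) for n in range(26)]
```

An attacker would scan the 26 candidates returned by `brute_force` and keep the one that reads as sensible text.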

Scytale Cipher

The Scytale Cipher stacks letters of the message into $n$ equal-sized rows. Reading the message column-by-column will give you the ciphertext (encryption), and reading the message row-by-row will give you the plain-text (decryption).

As the number of rows is limited only by the length of the message, the key-space can be made arbitrarily large!

That does not make this cipher secure however, as we may be able to recover the key! Starting from the first letter, we can take letters that are offset $i$ positions from the start, to try to form a word. Once we get something that makes sense, we’ll have recovered our key!

Information about our ciphertext can let us reverse engineer the key!
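A minimal sketch of the Scytale cipher as described above, assuming the message length is an exact multiple of the number of rows (a real implementation would need a padding rule). The function names are hypothetical.

```python
def scytale_encrypt(message: str, rows: int) -> str:
    """Write the message row by row into `rows` rows, read it column by column."""
    if len(message) % rows != 0:
        raise ValueError("message length must be a multiple of rows")
    width = len(message) // rows
    grid = [message[r * width:(r + 1) * width] for r in range(rows)]
    return "".join(grid[r][c] for c in range(width) for r in range(rows))


def scytale_decrypt(ciphertext: str, rows: int) -> str:
    """Column c of the grid occupies ciphertext[c*rows:(c+1)*rows]; read rows back."""
    width = len(ciphertext) // rows
    return "".join(ciphertext[c * rows + r] for r in range(rows) for c in range(width))


def candidate_plaintexts(ciphertext: str) -> dict[int, str]:
    """Try every row count that divides the length; a reader picks the sensible one."""
    n = len(ciphertext)
    return {rows: scytale_decrypt(ciphertext, rows)
            for rows in range(1, n + 1) if n % rows == 0}
```

`candidate_plaintexts` illustrates the attack: only a handful of row counts are possible for a given ciphertext length, so the key is easy to recover by inspection.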

Monoalphabetic Substitution

The Monoalphabetic Substitution maps every plaintext letter to a different ciphertext character, and substitutes them as so. Using this map, we can encrypt and decrypt our message.

The size of our key-space is $26!$ (about $4 \times 10^{26}$ ), which is really large! So, brute force search is not feasible, but because of the 1-1 mapping, we could do a frequency analysis to recover the key!

Frequency Analysis

Frequency analysis is based on the fact that letters in (grammatically correct) English generally have different usage frequencies. If we know the frequencies of the ciphertext characters, we can guess what the mappings are.
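The first step of a frequency analysis, counting how often each ciphertext letter occurs, can be sketched like this. The helper name `letter_frequencies` is an assumption for illustration.

```python
from collections import Counter


def letter_frequencies(text: str) -> list[tuple[str, float]]:
    """Relative frequency of each letter in `text`, most common first."""
    letters = [ch for ch in text.lower() if ch.isalpha()]
    counts = Counter(letters)
    total = len(letters)
    return [(ch, n / total) for ch, n in counts.most_common()]
```

An attacker would compare this ranking against a table of typical English letter frequencies and guess, for example, that the most common ciphertext letter stands for a very common plaintext letter such as `e`.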

Symmetric Key Encryption

Formal Definition

A symmetric encryption scheme is defined by 3 algorithms: Gen, Enc, Dec. Let $M$ be our message space, with $∣ M ∣ > 1$ .

Gen: The key-generation algorithm, which outputs a key $k$ according to some distribution (must be probabilistic).
- $K$ denotes the keyspace, the set of all possible keys.
Enc: The encryption algorithm, which takes as input a key $k \in K$ and a message $m \in M$ and outputs a ciphertext $c \leftarrow Enc_{k}(m)$ (could be probabilistic).
- $C$ denotes the ciphertext space, the set of all possible ciphertexts.
Dec: The decryption algorithm, which takes as input a key $k \in K$ and a ciphertext $c \in C$ and outputs a message $m := Dec_{k}(c)$ (must be deterministic).

Correctness mandates that $De c_{k} (E n c_{k} (m)) = m$ . If this does not hold, our encryption scheme won’t be very practical.

Each of the spaces $K, M, C$ can be given as random variables with probability distributions. Let $P r [...]$ denote the probability function. Then:

$P r [K = k], k \in K$ denotes the probability that the key output by Gen yields $k$ .

Typically, we will assume that $K$ has the uniform probability distribution (each key is of equal probability)
$P r [M = m], m \in M$ denotes the probability that the message is equal to $m$ , modeling some prior knowledge the adversary may have about the message.
$P r [C = c], c \in C$ denotes the probability that the ciphertext is $c$ , which is fully determined by the distribution on $K$ , $M$ , and Enc.
- $C = E n c_{K} (M)$

We assume that the distributions over $K$ and $M$ are independent.

Perfect Secrecy

We say an encryption scheme over a message space $M$ is perfectly secret if $\forall$ probability distributions over $M$ , $\forall m \in M, \forall c \in C$ such that $P r [C = c] > 0$ ,

$Pr[M = m \mid C = c] = Pr[M = m]$

In other words, the probability that our message is $m$ does not change even if we know what the ciphertext $c$ is. Sometimes, we denote $P r [M = m]$ as the a priori distribution, and $P r [M = m ∣ C = c]$ as the a posteriori distribution.

This means that seeing $c$ does not give us any information about the message!

There are a few equivalent definitions to perfect secrecy.

Equivalence 1

An encryption scheme over a message space $M$ is perfectly secret if and only if $\forall$ probability distribution over $M$ , $\forall m \in M$ , $\forall c \in C$ ,
$P r [C = c ∣ M = m] = P r [C = c]$
In other words, this is saying that the ciphertext is independent of the message (as we can still vary the key $k$ ).

Proof (One-Way)

Suppose that we have a perfectly secret scheme. We wish to show that $P r [C = c ∣ M = m] = P r [C = c]$ is also true.

Fix message distribution $M$ , $m \in M$ , $c \in C$ . By definition, we know that the following must hold by perfect secrecy.
$P r [M = m ∣ C = c] = P r [M = m]$
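and by Bayes' Rule (assuming $Pr[M = m] > 0$ ),
$Pr[M = m \mid C = c] = \frac{Pr[C = c \mid M = m] \, Pr[M = m]}{Pr[C = c]}$
Setting the two expressions equal and dividing both sides by $Pr[M = m]$ gives
$\frac{Pr[C = c \mid M = m]}{Pr[C = c]} = 1 \implies Pr[C = c \mid M = m] = Pr[C = c]$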

Equivalence 2: Perfect Indistinguishability

An encryption scheme over a message space $M$ is perfectly secret if and only if $\forall$ probability distribution $M$ , $\forall m_{0}, m_{1} \in M$ , and $\forall c \in C$ ,
$P r [C = c ∣ M = m_{0}] = P r [C = c ∣ M = m_{1}]$

Example: Perfect Secrecy Example

An encryption scheme with message space $M$ is perfectly secret if and only if $\forall$ probability distribution over $M$ , $\forall m, m^{'} \in M$ and $\forall c \in C$ , we have
$Pr[M = m \mid C = c] = Pr[M = m^{'} \mid C = c]$
Prove or refute this.

This is false. For every perfectly secret encryption scheme, we can always choose a distribution on $M$ for which this is false.

By way of contradiction, suppose this is true. Now, choose a distribution such that
$P r [M = m] > P r [M = m^{'}]$
Then, by definition of perfect secrecy,
$P r [M = m ∣ C = c] = P r [M = m] > P r [M = m^{'}] = P r [M = m^{'} ∣ C = c]$
Which is a contradiction!

Example: Perfect Secrecy Example (2)

An encryption scheme with message space $M$ is perfectly secret if and only if $\forall$ probability distribution over $M$ , $\forall k \in K$ and $\forall c \in C$ with $Pr[C = c] > 0$ ,
$P r [K = k ∣ C = c] = P r [K = k]$

Explain this definition in English.

Why is this a bad definition? Describe an encryption scheme that leaks information about the message but still satisfies the definition.

Seeing the ciphertext does not tell us any information about the key.

This is bad, because we could just choose an encryption scheme with only one key, so it’s completely deterministic! For example, we do a shift cipher with only possible key $k = 13$ . Then, for any ciphertext, $P r [K = k ∣ C = c] = 1 = P r [K = k]$ , but we can easily reverse the encryption scheme.

Alternatively, note that this definition says nothing about the message! So, we could have a scheme that generates a random key but leaves the message unencrypted: the ciphertext then reveals the whole message while telling us nothing about the key.

Example: Perfect Secrecy Example (3)

$M = \{0, 1, \dots, n - 1\}$ , $K = \{0, 1, \dots, n - 1\}$ . Gen() chooses a key uniformly at random from $K$ , $Enc_{k}(m) = m + k$ , $Dec_{k}(c) = c - k$ . Is this perfectly secret?

No. By way of contradiction, suppose it is. Let $n = 20$ , let $M$ have the uniform distribution, and let $m = 10, c = 5$ . Since $k \geq 0$ , every encryption of $m = 10$ is at least 10, so $c = 5$ can never be an encryption of 10 (while $Pr[C = 5] > 0$ ). Then,
$Pr[M = 10 \mid C = 5] = 0 \neq Pr[M = 10] = \frac{1}{20}$
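For small $n$, the posterior in this example can be checked by exhaustive enumeration. The sketch below assumes uniform and independent message and key distributions, as in the example; the function name `posterior` is hypothetical.

```python
from fractions import Fraction


def posterior(n: int, m: int, c: int) -> Fraction:
    """Pr[M = m | C = c] for Enc_k(m) = m + k, with M and K uniform on {0..n-1}."""
    pair_prob = Fraction(1, n * n)  # each (message, key) pair is equally likely
    joint = Fraction(0)             # Pr[M = m and C = c]
    total = Fraction(0)             # Pr[C = c]
    for msg in range(n):
        for key in range(n):
            if msg + key == c:
                total += pair_prob
                if msg == m:
                    joint += pair_prob
    if total == 0:
        raise ValueError("ciphertext has probability zero")
    return joint / total
```

With $n = 20$, `posterior(20, 10, 5)` is 0 while the prior is $1/20$, which matches the argument above. Exact fractions are used so that the comparison does not depend on floating-point rounding.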

The One-Time Pad

Here, we describe a perfectly secret symmetric encryption scheme — the One-Time Pad. It works as so:

Fix integer $ℓ > 0$ . $M, K, C$ are all equal to $\{0, 1\}^{ℓ}$ .
Gen: Choose a string from $K = \{0, 1\}^{ℓ}$ according to the uniform distribution
Enc: Given $k \in \{0, 1\}^{ℓ}$ , $m \in \{0, 1\}^{ℓ}$ , output $c := k \oplus m$ .
Dec: Given $k \in \{0, 1\}^{ℓ}$ , $c \in \{0, 1\}^{ℓ}$ , output $m := k \oplus c$ .

Note that the notation $\{0, 1\}^{ℓ}$ stands for the set of binary strings (each digit 0 or 1) of length $ℓ$ .

Example: One-Time Pad Example

Let’s see an example of the one-time pad. Let $ℓ = 3$ , and suppose we have message $m = 011$ , key $k = 101$ .

The one-time pad encrypts the message by XORing it with the key, bit by bit.
011 XOR 101 = 110
So $c = 110$ . To decrypt, we XOR the ciphertext with our key again.
110 XOR 101 = 011
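The same computation can be sketched in Python on bit strings. Representing bit strings as Python `str` values of `'0'`/`'1'` characters and the function names below are assumptions made for readability.

```python
import secrets


def gen(ell: int) -> str:
    """Sample a uniform ell-bit key as a string of '0'/'1' characters."""
    return "".join(secrets.choice("01") for _ in range(ell))


def xor_bits(a: str, b: str) -> str:
    """Bitwise XOR of two equal-length bit strings."""
    if len(a) != len(b):
        raise ValueError("operands must have the same length")
    return "".join("1" if x != y else "0" for x, y in zip(a, b))


def enc(k: str, m: str) -> str:
    return xor_bits(k, m)


def dec(k: str, c: str) -> str:
    return xor_bits(k, c)
```

Here `enc("101", "011")` reproduces the ciphertext `110` from the example, and `dec` recovers the message because XORing with the same key twice cancels out.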

Theorem: OTP Secrecy

The one-time pad encryption scheme is perfectly secret.

Proof

Recall that a scheme is perfectly secret if for all distributions on $M$ , $\forall c \in C$ , $\forall m_{0}, m_{1} \in M$ ,
$P r [C = c ∣ M = m_{0}] = P r [C = c ∣ M = m_{1}]$
Fix distribution over $M$ , $c \in C$ , $m_{0}, m_{1}$ .

For any $c \in C$ and any $m \in M$ with $Pr[M = m] > 0$ ,
$Pr[C = c \mid M = m] = Pr[M \oplus K = c \mid M = m]$ (definition of the OTP scheme)
$= \frac{Pr[M \oplus K = c \land M = m]}{Pr[M = m]}$ (definition of conditional probability)
$= \frac{Pr[m \oplus K = c \land M = m]}{Pr[M = m]} = \frac{Pr[K = c \oplus m \land M = m]}{Pr[M = m]}$
$= \frac{Pr[K = c \oplus m] \, Pr[M = m]}{Pr[M = m]}$ ( $K$ , $M$ independent)
$= Pr[K = c \oplus m] = \frac{1}{2^{ℓ}}$
Thus,
$P r [C = c ∣ M = m_{0}] = \frac{1}{2 ^{ℓ}} = P r [C = c ∣ M = m_{1}]$
By perfect indistinguishability, OTP is perfectly secret.
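For a small $ℓ$, the key step of the proof can also be checked by brute force: for every pair $(m, c)$, exactly one key maps $m$ to $c$. The sketch below encodes bit strings as integers, which is an implementation choice rather than part of the proof.

```python
from fractions import Fraction


def otp_conditional(ell: int, m: int, c: int) -> Fraction:
    """Pr[C = c | M = m] for the one-time pad with a uniform ell-bit key."""
    hits = sum(1 for k in range(2 ** ell) if m ^ k == c)  # keys mapping m to c
    return Fraction(hits, 2 ** ell)


def all_conditionals_equal(ell: int) -> bool:
    """Check that Pr[C = c | M = m] = 1/2^ell for every message and ciphertext."""
    target = Fraction(1, 2 ** ell)
    return all(otp_conditional(ell, m, c) == target
               for m in range(2 ** ell) for c in range(2 ** ell))
```

This is only a sanity check for one fixed length, not a replacement for the proof, which covers every $ℓ$ and every message distribution.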

The one-time pad is one of the few perfectly secret algorithms, and in fact, many schemes are variants / equivalent to the one-time pad. Thus, for proofs it’s often a good idea to start from the OTP and modify the scheme from there.

Example: Perfectly Secret Example

Prove or refute: An encryption scheme with message space $M$ is perfectly secret if and only if for every probability distribution over $M$ and every $c_{0}, c_{1} \in C$ we have $P r [C = c_{0}] = P r [C = c_{1}]$ .

Not true. To show why, we will construct a perfectly secret scheme that violates this. To do this, we will start from the one-time-pad (this is a good technique).

For a message of length $ℓ$ , let’s take a key of length $ℓ + 1$ , $k ∣∣ b$ , where $k \in \{0, 1\}^{ℓ}$ is uniform and $b \in \{0, 1\}$ is a biased bit (so $Pr[b = 0] \neq Pr[b = 1]$ ) chosen independently of $k$ . Then,
$c = (m \oplus k) ∣∣ b$

$∣∣$ stands for the concatenating of binary strings together.

Because of the biased bit, our ciphertexts do not have the same probability, but no information is released!

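Formally, choose any distribution over $M$ and take $c_{0} = c ∣∣ 0, c_{1} = c ∣∣ 1$ for some $c \in \{0, 1\}^{ℓ}$ . Writing $B$ for the biased key bit, and using that $m \oplus k$ is uniform and independent of $B$ ,
$Pr[C = c ∣∣ 0] = \frac{1}{2^{ℓ}} \cdot Pr[B = 0] \neq \frac{1}{2^{ℓ}} \cdot Pr[B = 1] = Pr[C = c ∣∣ 1]$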

The one-time pad is a powerful scheme, but it doesn’t come without its flaws:

The key length is the same as the message length, so for every bit communicated over a public channel, a bit must be shared privately.
- This is an inherent problem in perfectly secret encryption schemes! We prove this in the following theorem.
Key can only be used once.

This makes it very difficult to use the one-time pad in practice.

Theorem: Limitations of Perfect Secrecy

Let us have a perfectly secret encryption scheme over message space $M$ , key space $K$ . Then, it must be true that $∣ K ∣ \geq ∣ M ∣$ .

In most cases, this means that the lengths of the keys must be the same or longer than the lengths of our messages!

Proof

By way of contradiction, assume we have a perfectly secret encryption scheme where $∣ K ∣ < ∣ M ∣$ .

To obtain our contradiction, we must show that there exists a probability distribution over $M$ , a message $m \in M$ , and a ciphertext $c \in C$ such that
$Pr[M = m \mid C = c] \neq Pr[M = m]$

Often, when contradicting perfect secrecy, we use the uniform distribution on $M$ .

Let $M$ have the uniform distribution, and fix any ciphertext $c \in C$ with $Pr[C = c] > 0$ . Perfect secrecy would require that for every $m \in M$ ,
$Pr[M = m \mid C = c] = Pr[M = m]$
Let’s do a brute force search over the key-space. Let $M (c) = \{Dec_{k}(c) : k \in K\}$ be the set of all messages that $c$ decrypts to under some key. Because decryption is deterministic, each key contributes at most one message, so $∣ M (c) ∣ \leq ∣ K ∣ < ∣ M ∣$ (it can be smaller if multiple keys decrypt to the same message).

So, there exists some message $m^{*} \in M$ that is not in $M (c)$ . No key decrypts $c$ to $m^{*}$ , so by correctness $c$ can never be an encryption of $m^{*}$ . Then,
$Pr[M = m^{*} \mid C = c] = 0 \neq Pr[M = m^{*}] = \frac{1}{∣ M ∣}$
This is a contradiction! So, our scheme cannot be perfectly secret.

Shannon's Theorem

Let Gen, Enc, Dec be an encryption scheme with message space $M$ , for which $∣ M ∣ = ∣ K ∣ = ∣ C ∣$ . Then, the scheme is perfectly secret if and only if:

Every key $k \in K$ is chosen with equal probability $1/∣ K ∣$ by Gen.

For every $m \in M, c \in C$ , there exists a unique key $k \in K$ such that $Enc_{k}(m)$ outputs $c$ .

Note that this only applies when $∣ M ∣ = ∣ K ∣ = ∣ C ∣$ ! If this condition is not true, then we cannot use Shannon’s Theorem.

Example: Shannon's Theorem Example

Let $M = \{0, \dots, n - 1\}, K = \{0, \dots, n - 1\}$ . Let $Gen()$ choose a key uniformly at random.

$Enc_{k}(m) = (m + k) \bmod n$

$Dec_{k}(c) = (c - k) \bmod n$

Shannon’s theorem applies here, since $∣ M ∣ = ∣ K ∣ = ∣ C ∣ = n$ ! First, we know by assumption that all keys are chosen uniformly.

Second, for any $m, c$ , we can explicitly solve for the single $k$ such that $Enc_{k}(m) = c$ . We find
$(m + k) \bmod n = c \implies k = (c - m) \bmod n$
Both conditions hold, so the scheme is perfectly secret.
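The second condition of Shannon's theorem can be verified exhaustively for a small modulus. The following sketch counts, for each pair $(m, c)$, how many keys map $m$ to $c$; the helper names are assumptions for illustration.

```python
def keys_mapping(n: int, m: int, c: int) -> list[int]:
    """All keys k in {0..n-1} with (m + k) mod n == c."""
    return [k for k in range(n) if (m + k) % n == c]


def has_unique_keys(n: int) -> bool:
    """True if every (message, ciphertext) pair is linked by exactly one key."""
    return all(len(keys_mapping(n, m, c)) == 1
               for m in range(n) for c in range(n))
```

For every pair, the single key found is $(c - m) \bmod n$, in agreement with the formula derived above.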

The Computational Approach

Motivation

The limitation theorem above shows that any perfectly secret scheme needs $∣ K ∣ \geq ∣ M ∣$ . Thus, achieving perfect secrecy is impractical in many real-world situations.

Here, we explore a more relaxed definition for security, known as the computational approach. The computational approach only requires the following:

Security is only guaranteed against efficient adversaries that run for some feasible amount of time.
- Efficient adversaries are adversaries that can run in polynomial time (we also call them PPT adversaries).
Adversaries can potentially succeed with some very small probability.
- An adversary can always succeed given enough time (for example, by brute-forcing the key), but that takes super-polynomial time. Within polynomial time, its chance of success must be negligible.

We formally define these notions below.

Formal Definitions

Under the computational approach, schemes now have an additional parameter called the security parameter ( $n$ ).

Prior to running our scheme, we can set our security parameter, which tells us the run time of the adversary and its success probability as functions of $n$ . This gives a sort of guarantee against adversaries running in polynomial time.

We can think of the security parameter as the length of the key.

Efficient Adversaries

An adversary is efficient if it runs in probabilistic polynomial time (PPT). To run in polynomial time means that there exists some polynomial $p$ such that the adversary runs for time at most $p (n)$ when the security parameter is $n$ .

Example: Polynomial Time Functions

$2 n^{2} + 3 n + 5$ is a polynomial function in $n$

$(\log n)^{\log n}$ is not a polynomial function in $n$

$2^{\log n}$ is a polynomial function in $n$ (with a base-2 logarithm, it equals $n$ )

To check whether a function $f$ is polynomially bounded, it may help to take the log and see whether $\log f (n)$ is bounded above by $c \log n$ for some constant $c$ (this is the logarithm of the polynomial $n^{c}$ !)

Given a security parameter $n$ , saying that an adversary runs in time $p (n) = n^{2}$ means that it gets at most $n^{2}$ steps when the security parameter is $n$ .

Negligible Probability

A small probability of success means a negligible probability. A function $f$ is negligible if for every polynomial $p$ and all sufficiently large $n$ , it holds that

$f (n) < \frac{1}{p (n)}$

In other words, as $n \to \infty$ , the function shrinks faster than the inverse of every polynomial. Equivalently, for every constant $c > 0$ and all sufficiently large $n$ ,

$f (n) < \frac{1}{n^{c}}$

The product of any polynomial with a negligible function is a negligible function!

To know if a function is negligible, take its reciprocal and see if it’s super-polynomial or not! If the reciprocal grows faster than every polynomial, then $f$ is negligible.

Example: Negligible Functions

$1/ 2^{n}$ is negligible

$1/ \log n$ is not negligible

$1/ n^{2}$ is not negligible
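The definition can be explored numerically. The sketch below tests, for a fixed exponent $c$ and a given $n$, whether $f(n)$ is still at least $1/n^{c}$. It gives evidence only, since negligibility is a statement about all $c$ and all sufficiently large $n$; the helper names are hypothetical.

```python
from fractions import Fraction


def exceeds_inverse_poly(f, c: int, n: int) -> bool:
    """True if f(n) >= 1/n^c, i.e. f is not yet below the inverse polynomial at n."""
    return f(n) >= Fraction(1, n ** c)


def inv_exp(n: int) -> Fraction:
    """The function 1/2^n."""
    return Fraction(1, 2 ** n)


def inv_square(n: int) -> Fraction:
    """The function 1/n^2."""
    return Fraction(1, n ** 2)
```

For example, with $c = 5$ the function $1/2^{n}$ is still above $1/n^{5}$ at $n = 10$ but below it at $n = 100$, whereas $1/n^{2}$ never drops below $1/n^{3}$.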

Practical Implications of Computational Security

Why do we define computational security as so?

Let’s say for key size $n$ , any adversary running in time $2^{n /2}$ breaks the scheme with probability $1/ 2^{n /2}$ . Meanwhile, Gen, Dec, Enc take time $n^{2}$ .

If $n = 128$ , then

Gen,Enc,Dec take time $16, 384$

Adversary runs in time $2^{64} \approx 1.8 \times 10^{19}$ !

If $n = 256$ , then

Gen,Enc,Dec take time $65, 536$

Adversary runtime is multiplied by $2^{64}$ ! Becomes $2^{128} \approx 3.4 \times 10^{38}$ .

This makes it really easy to adjust schemes to the adversary time! As adversary capabilities increase, we can shift our security parameter so that they will continue to be unable to break our scheme!
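The arithmetic in this example is easy to reproduce. The sketch below uses the hypothetical cost functions from the example ( $n^{2}$ for the honest parties, $2^{n/2}$ for the adversary) and assumes $n$ is even.

```python
def honest_cost(n: int) -> int:
    """Running time of Gen, Enc, Dec in the example: n^2."""
    return n ** 2


def adversary_cost(n: int) -> int:
    """Running time of the adversary in the example: 2^(n/2), for even n."""
    return 2 ** (n // 2)


for n in (128, 256):
    print(n, honest_cost(n), f"{adversary_cost(n):.2e}")
```

Doubling $n$ multiplies the honest cost by 4, while the adversary's cost is squared.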

Private Key Encryption

A private-key encryption scheme is a tuple of probabilistic polynomial-time algorithms Gen, Enc, Dec such that:

Gen takes security parameter $1^{n}$ (1 repeated $n$ times), and outputs a key $k$ , denoted $k \leftarrow Gen(1^{n})$ . WLOG, assume $∣ k ∣ \geq n$ .
Enc takes a key $k$ and a message $m \in \{0, 1\}^{*}$ , and outputs a ciphertext $c$ , denoted $c \leftarrow Enc_{k}(m)$ .
Dec takes a key $k$ and a ciphertext $c$ , and outputs a message $m := Dec_{k}(c)$ .

By correctness, we mandate that for every $n$ , every $k$ output by $Gen(1^{n})$ , and every $m$ , it holds that $Dec_{k}(Enc_{k}(m)) = m$ (so decryption must be deterministic).

Security in the Presence of an Eavesdropper (EAV)

To define security under the computational approach, we think of an experiment.

Consider a private-key encryption scheme $Π = (Gen, Enc, Dec)$ , any adversary $A$ , and any value $n$ for the security parameter. We define the experiment (a random variable)

$PrivK_{A, Π}^{eav}(n)$

Where the adversary and our challenger play the following game:

The adversary chooses two messages $m_{0}, m_{1}$ of the same length from the message space.
The challenger will generate a key $k \leftarrow Gen(1^{n})$ and choose a bit $b \in \{0, 1\}$ uniformly at random. Based on $b$ , the challenger then encrypts one of the messages, $c \leftarrow Enc_{k}(m_{b})$ .
The adversary receives the ciphertext $c$ , and now has to guess $b^{'} \in \{0, 1\}$ , denoting which message they think was encrypted.
We check if the adversary was right, and set the random variable to 1 if the adversary was right ( $b^{'} = b$ ), 0 if wrong ( $b^{'} \neq b$ ).

We say $Π$ has indistinguishable encryptions in the presence of an eavesdropper (EAV-secure) if $\forall$ PPT adversaries $A$ , $\exists$ a negligible function $n e g l$ such that

$Pr[PrivK_{A, Π}^{eav}(n) = 1] \leq \frac{1}{2} + negl(n)$

In other words, the adversary has close to a 50/50 chance of correctly guessing the message (as there are only 2 messages to choose from), with some negligible offset that drops to 0 quickly as $n \to \infty$ ! This tells us that they don’t really get any information about the message!
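The experiment can be simulated to build intuition. The sketch below plays the game against the one-time pad on $n$-bit integers with one arbitrary adversary strategy; both the strategy and the use of Python's non-cryptographic `random` module (seeded for reproducibility) are assumptions made for illustration only.

```python
import random


def priv_k_eav(n: int, rng: random.Random) -> int:
    """One run of the eavesdropping experiment against the one-time pad."""
    # Adversary picks two n-bit messages: all zeros and all ones.
    m0, m1 = 0, (1 << n) - 1
    # Challenger samples a key and a hidden bit, then encrypts m_b.
    k = rng.getrandbits(n)
    b = rng.randrange(2)
    c = k ^ (m0, m1)[b]
    # Adversary's strategy (arbitrary): guess the lowest bit of c.
    b_guess = c & 1
    return 1 if b_guess == b else 0


def estimate_success(n: int, trials: int, seed: int = 0) -> float:
    """Empirical estimate of Pr[PrivK = 1] over many independent runs."""
    rng = random.Random(seed)
    return sum(priv_k_eav(n, rng) for _ in range(trials)) / trials
```

Because the one-time pad is perfectly secret, the estimate stays close to $1/2$ for this strategy, and the same would hold for any other guessing rule.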

This is a weak notion of security and is not very useful in practice!

Similar to the different definitions of perfect secrecy, computational security also has equivalent definitions— though these other definitions are outside of the scope of this class.

Pseudorandom Generator (PRG)

Here, we define a pseudorandom generator. These can be used to create schemes that are computationally secure, with $∣ K ∣ < ∣ M ∣$ (making the scheme more practical).

Recall that this wouldn’t be possible for perfect secrecy!

A pseudorandom generator (PRG) is a deterministic algorithm $G$ , that takes as input a short random seed $s$ , and outputs a longer string $G (s)$ . It has the property that no polynomial time algorithm can “distinguish” $G (s)$ from a truly random string $r$ , despite being a deterministic function.

What this generator essentially does is “stretch” a small amount of true randomness ( $s$ ) to a larger amount of pseudorandomness, without compromising on security.

To have a PRG, we also define a game.

Ideal World: We sample a truly random bit string $r$ of length $ℓ (n)$ , $r \in \{0, 1\}^{ℓ (n)}$ . It must hold that $ℓ (n) > n$ , where $ℓ (n)$ is called the expansion factor ( $G$ ’s output must be longer than the input).
Real World: We sample a truly random bit string $s$ of length $n$ , and compute $G (s)$ .
We give either the ideal world or real world bit string to $D$ , the distinguisher. For a PRG, any efficient distinguisher cannot tell which “world” the string is from (if it got $r$ or $G (s)$ ).

Formally, the probability that the distinguisher outputs 1 is about the same in either world: there exists a negligible function $negl$ such that

$∣ Pr[D (r) = 1] - Pr[D (G (s)) = 1] ∣ \leq negl(n)$

This means that the distinguisher has no way of really knowing which is the ideal world!

Here, $G$ only gets $n$ bits of true randomness, but needs to “stretch” that randomness to $ℓ (n)$ bits!

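Given a PRG, we can only assume the following:

If the input seed is uniformly random, the PRG’s output is pseudorandom (indistinguishable from uniform by any efficient distinguisher). Nothing is guaranteed for non-uniform inputs.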

Example: PRG Proof (1)

Let $G$ be a PRG where $∣ G (s) ∣ = ∣ s ∣ + 1$ .

Let $G^{'} (s) = G (s ∣∣ \bar{s})$ , where $\bar{s}$ is the bitwise negation of $s$ . Is $G^{'}$ a PRG?

We know that $s ∣∣ \bar{s}$ is not uniformly random, as $\bar{s}$ depends on $s$ . Because the input is not uniform, we cannot assume $G^{'}$ is a PRG.

Let’s construct a pathological example. Let’s construct $G$ from another PRG, $\tilde{G}$ , which is a function from $\{0, 1\}^{n} \to \{0, 1\}^{2 n + 1}$ as so:
$G (s_{1} ∣∣ s_{2}) := \tilde{G} (s_{1}) ∣∣ \tilde{G} (\bar{s}_{2})$
We must show that $G$ is a PRG, and $G^{'}$ is not a PRG. Let’s start with $G$ .

$G$ is a PRG: if $s_{1} ∣∣ s_{2}$ is a uniformly random string, then $s_{1}$ and $s_{2}$ are independent and uniform, and so is $\bar{s}_{2}$ . So, $\tilde{G}$ is applied to two independent uniform seeds and each output is pseudorandom; their concatenation is then pseudorandom as well.

$G^{'} (s) = G (s ∣∣ \bar{s}) = \tilde{G} (s) ∣∣ \tilde{G} (\bar{\bar{s}}) = \tilde{G} (s) ∣∣ \tilde{G} (s)$ . Because $\tilde{G}$ is deterministic, the output is the same string concatenated with itself! So, the output is easy to tell apart from a uniform string.

Formally, define a distinguisher $D$ , which gets some input $w$ . We choose a polynomial-time algorithm for $D$ that distinguishes the output of $G^{'}$ from true randomness with non-negligible advantage.

One algorithm $D$ could do is split the string in half, return 1 (PRG output) if the halves are the same, 0 otherwise. With this algorithm, we have the following probabilities:

$P r [D (G^{'} (s)) = 1] = 1$ , as we will always be able to tell the PRG as its two halves are always the same (shown above)

$P r [D (r) = 1] = \frac{1}{2 ^{2 n + 1}}$ , as for any string in the first half we need to match it in the second half.

So,
$∣ P r [D (G^{'} (s)) = 1] - P r [D (r) = 1] ∣ = 1 - \frac{1}{2 ^{2 n + 1}} \geq \frac{1}{2}$
This is not a negligible function, as when $n$ increases this goes to 1! So, we can define a constant function $1/2$ which this is greater than as $n \to \infty$ .

So, we have a $D$ that breaks the security of $G^{'}$
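The distinguisher from this example is short enough to write out. The sketch below works on bit strings given as Python `str` values and also counts, by exhaustive enumeration for a small length, how often a truly random string fools it; the names `D` and `fraction_accepted` are illustrative.

```python
from itertools import product


def D(w: str) -> int:
    """Output 1 (guess "PRG output") iff the two halves of w are identical."""
    half, odd = divmod(len(w), 2)
    return 1 if not odd and w[:half] == w[half:] else 0


def fraction_accepted(half_len: int) -> float:
    """Fraction of all bit strings of length 2*half_len on which D outputs 1."""
    strings = ("".join(bits) for bits in product("01", repeat=2 * half_len))
    return sum(D(w) for w in strings) / 2 ** (2 * half_len)
```

On any string of the form $x ∣∣ x$ the distinguisher outputs 1, while among uniformly random strings whose halves have length $h$ only a $2^{-h}$ fraction is accepted, which is the gap computed above.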

Using PRGs, we can create a fixed-length encryption scheme that is computationally secure and gets around the $∣ K ∣ \geq ∣ M ∣$ bound of perfect secrecy. Let $G$ be a PRG taking $\{0, 1\}^{n} \to \{0, 1\}^{ℓ (n)}$ , $ℓ (n) > n$ , and let $K = \{0, 1\}^{n}, M = \{0, 1\}^{ℓ (n)}$ .

Gen outputs a uniformly random key $k \in K$ .
Enc takes the key and runs it through the pseudorandom generator to get a pad $p := G (k) \in \{0, 1\}^{ℓ (n)}$ . It then XORs the pad with the message to get our ciphertext $c := p \oplus m$ .
Dec reverses this by XORing the same pad with the ciphertext.
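The three algorithms above can be sketched in Python as follows. SHAKE-256 from the standard library is used here as a stand-in for the PRG $G$ ; treating it as a PRG is an assumption of this sketch, not something the notes establish, and the byte-level interface is chosen only for convenience.

```python
import hashlib
import secrets


def prg(seed: bytes, out_len: int) -> bytes:
    """Stand-in for G: expand a short seed to out_len bytes (assumed pseudorandom)."""
    return hashlib.shake_256(seed).digest(out_len)


def gen(n_bytes: int = 16) -> bytes:
    """Gen: sample a uniformly random key."""
    return secrets.token_bytes(n_bytes)


def enc(key: bytes, message: bytes) -> bytes:
    """Enc: stretch the key into a pad with the PRG, then XOR it with the message."""
    pad = prg(key, len(message))
    return bytes(p ^ m for p, m in zip(pad, message))


def dec(key: bytes, ciphertext: bytes) -> bytes:
    """Dec: recompute the same pad and XOR it off again."""
    return enc(key, ciphertext)
```

With a 16-byte key and a 64-byte message, the key is shorter than the message, which is exactly what perfect secrecy rules out. As with the one-time pad, each key must be used for a single message only.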

This is a computationally-secure scheme, and $∣ K ∣ < ∣ M ∣$ , breaking the Shannon bound! We can prove this below.

Proof

We wish to show that given $G$ is a PRG, our scheme is EAV-secure. So, $\forall$ PPT adversaries $A$ , $\exists negl$ such that
$Pr[PrivK_{A, Π}^{eav}(n) = 1] \leq \frac{1}{2} + negl(n)$
In many cryptographic proofs, we cannot prove the definition directly. So, we prove by contradiction or contrapositive.

So, we will show that if $\exists$ a PPT adversary $A$ and a non-negligible function $ρ (n)$ such that
$Pr[PrivK_{A, Π}^{eav}(n) = 1] \geq \frac{1}{2} + ρ (n)$
then $G$ cannot be a PRG.

Let $A$ be this adversary. We claim that there then exists a PPT distinguisher $D$ with
$∣ Pr[D (r) = 1] - Pr[D (G (s)) = 1] ∣ \geq ρ (n)$
To show this, we will construct the distinguisher $D$ against the PRG so that it uses $A$ as a subroutine.

This is a proof by reduction! It is the same idea as in complexity theory: if one algorithm solves the boolean satisfiability problem in polynomial time, then it can be used to solve all NP-complete problems in polynomial time!

$D$ is a distinguisher that takes a string $w \in \{0, 1\}^{ℓ (n)}$ and returns 0 or 1, guessing which world the string came from. It contains and can use $A$ .

$A$ is an adversary that will first send $m_{0}, m_{1}$ to the challenger ( $D$ ). It expects a ciphertext, and guesses if $c$ is an encryption of $m_{0}$ or $m_{1}$ .

This is almost an algorithm! We just need to fill it in with a few steps.

How does $D$ generate the challenger ciphertext for $A$ ?

$D$ randomly chooses a $b \in \{0, 1\}$ , and sends back $w \oplus m_{b}$ .

After getting the response $b^{'}$ from $A$ , what response does $D$ output?

If $b = b^{'}$ , then return 1

If $b \neq b^{'}$ , then return 0

We now analyze the probabilities to show that $G$ is not actually a PRG.

The probability $Pr[D (G (s)) = 1]$ is the probability $A$ guesses correctly if $w = G (s)$ . In that case $A$ sees exactly an encryption under our scheme, so this is $Pr[PrivK_{A, Π}^{eav}(n) = 1] \geq \frac{1}{2} + ρ (n)$

The probability $Pr[D (r) = 1]$ is the probability that $A$ guesses correctly when $w = r$ is truly random, i.e. when the ciphertext is a one-time pad encryption. By perfect secrecy, this is $Pr[D (r) = 1] = \frac{1}{2}$ , as no matter what $A$ guesses, it cannot do better or worse.

So,
$∣ Pr[D (G (s)) = 1] - Pr[D (r) = 1] ∣ \geq \frac{1}{2} + ρ (n) - \frac{1}{2} = ρ (n)$
As $ρ (n)$ is non-negligible, $D$ breaks the security of the PRG. Because $A$ is PPT and $D$ only does a polynomial amount of extra work around one call to $A$ , $D$ is also PPT, and we are done.

However, this computationally secure scheme is not very practical.

The length of the message is fixed
The scheme can only be used once, as if the same key is used twice, the scheme would no longer be secure!

Stream Ciphers

To make something that can be used in practice, recall that for two PRGs, inputting the output of one into another will still yield a pseudo-random string! So, if we chain these PRGs together, we’ll get a “stream” of unique keys we can encrypt with.

This is the idea behind the stream cipher.

Let $G$ be a PRG that takes as input ${0, 1}^{n}$ and outputs ${0, 1}^{n + 1}$ .
Let $s_{0}$ represent some initial state, which is a truly random key.

Both the sender and receiver will store a state, which will let them generate a pseudorandom stream of 0s and 1s to use in a one-time pad. For every application of the PRG, the 1st bit will be used for the one-time pad, and the remaining bits will be used as the next state.

$s_{0} = k$

$s_{i + 1} = G (s_{i})_{2}, \dots, G (s_{i})_{n + 1}$

$pad_{i + 1} = G (s_{i})_{1}$

Then, for some message, we can generate the ciphertext for the $(i + 1)^{\text{th}}$ bit as

$c_{i + 1} = m_{i + 1} \oplus pad_{i + 1}$

Both the sender and receiver are in sync (so long as they have the same initial key), so the receiver can decrypt the ciphertext with the same pad!

Because the stream cipher never reuses pad bits, we have a secure scheme that can be used to send multiple variable-length messages! However, this requires that the sender and receiver stay in sync: if any bit is dropped, their states diverge and decryption will fail.
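The state update and pad extraction described in this section can be sketched as follows. The function `G` below is a toy stand-in built from SHAKE-256 that maps $n$ bits to $n + 1$ bits; treating it as a PRG is an assumption, and the class name `StreamCipher` is hypothetical.

```python
import hashlib


def G(state: tuple[int, ...]) -> tuple[int, ...]:
    """Toy stand-in for a PRG {0,1}^n -> {0,1}^(n+1), built from SHAKE-256."""
    n = len(state)
    digest = hashlib.shake_256(bytes(state)).digest((n + 1 + 7) // 8)
    bits = [(byte >> i) & 1 for byte in digest for i in range(8)]
    return tuple(bits[: n + 1])


class StreamCipher:
    """Holds the shared state s_i; sender and receiver each keep their own copy."""

    def __init__(self, key: tuple[int, ...]):
        self.state = tuple(key)  # s_0 = k

    def next_pad_bit(self) -> int:
        out = G(self.state)
        self.state = out[1:]     # bits 2..n+1 become the next state
        return out[0]            # bit 1 is the next pad bit

    def process(self, bits: list[int]) -> list[int]:
        """Encrypt or decrypt: XOR each bit with the next pad bit."""
        return [b ^ self.next_pad_bit() for b in bits]
```

A sender and a receiver initialised with the same key can exchange several messages of different lengths, as long as the receiver processes every ciphertext bit in the same order in which it was produced.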

Key Idea

It’s possible to use a PRG to create a practical encryption scheme, with the caveat that the sender and receiver need to share a state!

Chosen Plain-Text Attack Security (CPA)

EAV-security was a bit of a weak and limited notion. Here, we define a notion that can actually be used in practice.

This notion can be used for multiple messages, and is a very standard notion of security!

Consider a private-key encryption scheme $Π = (Gen, Enc, Dec)$ , any adversary $A$ , and any value $n$ for the security parameter.

We define the following experiment, $PrivK_{A, Π}^{cpa}(n)$ .

The challenger (sender / receiver) will generate some key $k \leftarrow Gen(1^{n})$ .
The adversary gets oracle access to the encryption algorithm, denoted $A^{Enc_{k}(\cdot)}$ . This means that $A$ does not know $k$ , but gets to use the encryption scheme. They’re allowed to choose a plaintext message $m$ and get back a ciphertext $c$ , for as many messages as they want (in polynomial time).
When the adversary has sampled enough input-output pairs, it sends two messages $m_{0}, m_{1}$ of the same length to the challenger.
The challenger picks a bit $b \in \{0, 1\}$ uniformly at random, and based on this chooses one of the messages to encrypt, $c \leftarrow Enc_{k}(m_{b})$ .
After receiving the ciphertext, $A$ gets oracle access to the encryption scheme to query again.
When ready, the adversary outputs $b^{'}$ , guessing what message the challenger chose.
$PrivK_{A, Π}^{cpa}(n) = 1$ if $b^{'} = b$ , and 0 if $b^{'} \neq b$ .

We say our scheme has indistinguishable encryptions under a chosen-plaintext attack (CPA-secure) if for all PPT adversaries $A$ , there exists a negligible function such that

$Pr[PrivK_{A, Π}^{cpa}(n) = 1] \leq \frac{1}{2} + negl(n)$

While this is a one-shot game, we can prove that CPA-security gives us security for multiple encryptions!

Theorem

Any private-key encryption scheme that has indistinguishable encryptions under a chosen-plaintext attack also has indistinguishable multiple encryptions under a chosen-plaintext attack!

This means that we can use CPA-secure schemes with the same key, as many times as we want while maintaining security!

Info

If the adversary $A$ is allowed to query the oracle, what is stopping the adversary from just querying $m_{0}, m_{1}$ ? They could just query $m_{0}, m_{1}$ and see what the ciphertext is!

Because of this, any scheme that satisfies CPA-security must necessarily be probabilistic, meaning that encrypting the same message twice will (with overwhelming probability) produce different ciphertexts.

This motivates the following theorem.

Theorem

If $Π$ is an encryption scheme where Enc is a deterministic function of the key and the message, then $Π$ cannot be CPA-secure.

Proof

Let $A$ be an adversary.

$A$ chooses two messages $m_{0}, m_{1}$ with $m_{0} \neq m_{1}$ .

$A$ queries its oracle on $m_{0}$ to get $c_{0}$ .

$A$ sends $m_{0}, m_{1}$ to the challenger and gets back $c$ .

If $c_{0} = c$ , $A$ predicts $b^{'} = 0$ . Otherwise, $A$ predicts $b^{'} = 1$ .

Clearly, $A$ is efficient as it only queries the oracle once.

Furthermore, because Enc is deterministic and correct, $c = c_{0}$ exactly when $b = 0$ , so $A$ wins the game with probability 1, which is not bounded by $1/2 + negl(n)$ for any negligible function.
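The attack in this proof can be run against any deterministic scheme. The sketch below uses a toy deterministic encryption (a fixed pad derived from the key with SHAKE-256, chosen only for illustration) and plays the CPA game with the adversary from the proof; all names are hypothetical.

```python
import hashlib
import random


def det_enc(key: bytes, message: bytes) -> bytes:
    """Toy deterministic encryption: XOR with a pad derived only from the key."""
    pad = hashlib.shake_256(key).digest(len(message))
    return bytes(p ^ m for p, m in zip(pad, message))


def run_cpa_game(rng: random.Random) -> int:
    """One run of the CPA experiment; returns 1 if the adversary wins."""
    key = bytes(rng.getrandbits(8) for _ in range(16))  # challenger's key

    def oracle(m: bytes) -> bytes:
        return det_enc(key, m)  # encryption oracle hides the key

    m0, m1 = b"attack at dawn", b"attack at dusk"  # distinct, same length
    c0 = oracle(m0)                   # adversary queries the oracle on m0
    b = rng.randrange(2)              # challenger's hidden bit
    c = oracle((m0, m1)[b])           # challenge ciphertext
    b_guess = 0 if c == c0 else 1     # compare against the known encryption
    return 1 if b_guess == b else 0
```

Since the oracle returns the same ciphertext for the same message, the adversary wins every run regardless of the key or the hidden bit.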

Pseudorandom Functions

We will use pseudorandom functions to create schemes that are CPA-secure! As it does not make sense for a fixed function to be pseudorandom, we will define pseudorandomness on our selection from a set of fixed functions.

A keyed function $F : \{0, 1\}^{*} \times \{0, 1\}^{*} \to \{0, 1\}^{*}$ (written $F_{k}(x) = F(k, x)$) is a two-input function, where

The first input is the key, denoted $k$ .
The second input is $x$ .

We say $F$ is efficient if there is a polynomial-time algorithm that can compute $F (k, x)$ given $k$ and $x$ . We will only be interested in efficient keyed functions.

$F (\cdot, \cdot)$ is public and polynomial-time efficient.

Let $D$ be a PPT distinguisher. We define the following experiment:

$D$ gets access to an oracle $O$ which is either equal to $F_{k}$ ( $k$ chosen uniformly at random) or $f : \{0, 1\}^{n} \to \{0, 1\}^{n}$ , a uniformly chosen random function. ( $D$ does not know which is the case).
- $f$ is a function with input set $\{0, 1\}^{n}$ that assigns to each input one output in $\{0, 1\}^{n}$ . To choose $f$ uniformly at random, independently assign a uniform output from $\{0, 1\}^{n}$ to each input. After being chosen, it behaves deterministically.
$D$ may query the oracle for any $x$ , at which point the oracle returns $O (x)$ .
- Because the oracle computes a deterministic function, it returns the same result if queried twice on the same input.
$D$ may interact freely with the oracle, denoted $D^{O (\cdot)} (1^{n})$ , choosing its queries based on the previous outputs (as long as it runs in polynomial time).
$D$ returns 1 if it thinks the oracle is our pseudorandom function $F_{k}$ , and 0 otherwise.

Then, a keyed function $F : \{0, 1\}^{*} \times \{0, 1\}^{*} \to \{0, 1\}^{*}$ is pseudorandom if for all PPT distinguishers $D$, there exists a negligible function $\text{negl}$ such that

$\left| \Pr[D^{F_{k}(\cdot)}(1^{n}) = 1] - \Pr[D^{f(\cdot)}(1^{n}) = 1] \right| \leq \text{negl}(n)$

In other words, for any polynomial-time distinguisher, the probability that it outputs 1 when given the pseudorandom function differs only negligibly from the probability that it outputs 1 when given a truly random function.

$F_{k}$ , for uniform key $k$ , is indistinguishable from a function chosen uniformly at random from the set of all functions with the same domain and range ( $f (\cdot)$ ).

To disprove that a function is a PRF, the question is: can we find a set of inputs whose outputs are correlated?

Example: Pseudo-Random Functions (Disproof)

Let $F$ be a PRF. For $F_{k}' : \{0, 1\}^{n - 1} \to \{0, 1\}^{2 n}$ defined below, determine whether $F'$ is a PRF.
$F_{k}'(x) = F_{k}(0 \| x) \| F_{k}(x \| 1)$
Suppose we plug in the following values of $x$.
$F_{k}'(0^{n - 1}) = F_{k}(0^{n}) \| F_{k}(0^{n - 1} 1)$
$F_{k}'(0^{n - 2} 1) = F_{k}(0^{n - 1} 1) \| F_{k}(0^{n - 2} 1^{2})$
Notice how the second half of the first output and the first half of the second output are the same! Thus, these two inputs have correlated outputs, and the distinguisher could use this fact to determine if it has the PRF or not. It can query these two inputs, and return 1 if these halves are the same!

With this attack,
$\left| \Pr[D^{F_{k}'(\cdot)}(1^{n}) = 1] - \Pr[D^{f(\cdot)}(1^{n}) = 1] \right| = 1 - \frac{1}{2^{n}}$

The probability for the truly random function comes from the fact that we need all $n$ bits of half of the second output to match the first, and under a uniform distribution, this is probability $1/ 2^{n}$ .
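The distinguisher above can be run against both worlds. This is a sketch under stated assumptions: `F` is a stand-in PRF built from HMAC-SHA256 and truncated to $n = 128$ bits, bit-strings are represented as Python strings of `'0'`/`'1'`, and `random_function` lazily samples a uniform function. All of these names are illustrative, not from the notes.

```python
import hashlib
import hmac
import secrets

N = 128  # block length n, in bits

def F(key: bytes, x: str) -> str:
    """Stand-in PRF on n-bit strings, built from HMAC-SHA256 (an assumption)."""
    assert len(x) == N
    digest = hmac.new(key, x.encode(), hashlib.sha256).digest()
    return format(int.from_bytes(digest, "big") >> (256 - N), f"0{N}b")

def F_prime(key: bytes, x: str) -> str:
    """The flawed construction F'_k(x) = F_k(0 || x) || F_k(x || 1)."""
    assert len(x) == N - 1
    return F(key, "0" + x) + F(key, x + "1")

def random_function():
    """Lazily sampled uniform function {0,1}^(n-1) -> {0,1}^(2n)."""
    table = {}
    def f(x: str) -> str:
        if x not in table:
            table[x] = format(secrets.randbits(2 * N), f"0{2 * N}b")
        return table[x]
    return f

def distinguisher(oracle) -> int:
    """Output 1 iff the two correlated output halves match."""
    y1 = oracle("0" * (N - 1))         # second half is F_k(0^(n-1) 1)
    y2 = oracle("0" * (N - 2) + "1")   # first half is F_k(0^(n-1) 1)
    return 1 if y1[N:] == y2[:N] else 0

key = secrets.token_bytes(32)
print(distinguisher(lambda x: F_prime(key, x)))  # 1
print(distinguisher(random_function()))          # 0, except with probability 2^-128
```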

Example: Pseudo-Random Functions (Proof)

Let $F$ be a PRF. For $F_{k}' : \{0, 1\}^{n - 1} \to \{0, 1\}^{2 n}$ defined below, determine whether $F'$ is a PRF.
$F_{k}'(x) = F_{k}(0 \| x) \| F_{k}(1 \| x)$
This is a PRF! To see why, think of the input/output table for $F_{k}'$.

Because of the bit in front, we’re partitioning the input space of $F_{k}$: the first half of every output comes from inputs starting with 0, and the second half from inputs starting with 1. So, distinct queries never cause $F_{k}$ to be evaluated twice on the same input, and no two outputs are correlated. Thus, this PRF is secure.

To prove this, show that the space of inputs to $F$ is partitioned, so that, as $F$ is a PRF, all the values used look independent and uniform. An intuitive argument is acceptable for this course.

CPA-Security with PRFs

Like with PRGs, we will now use PRFs to construct a CPA-secure scheme.

Let Gen output a key $k \in \{0, 1\}^{n}$, chosen uniformly at random. Also, let $F$ be a pseudorandom function. To encrypt a message $m \in \{0, 1\}^{n}$:

Choose a uniformly random string $r \in \{0, 1\}^{n}$.
Use the random string $r$ with $F_{k}$ to create a pad, $w = F_{k}(r)$.
XOR this pad with our message to get $c_{2} = w \oplus m$. Let $c_{1} = r$ be our random string. Then, our ciphertext is given as $c = c_{1} \| c_{2}$.
- We need the random string in the ciphertext to be able to decrypt our message: the receiver recomputes $w = F_{k}(c_{1})$ and outputs $m = w \oplus c_{2}$.

Intuitively, this is secure because despite knowing $r$, we don’t know $k$, and the security of PRFs guarantees that without knowing $k$, we cannot differentiate $F_{k}$ from a random function!
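The construction can be sketched in a few lines. This is an illustration only: HMAC-SHA256 is used as a stand-in for the PRF $F_{k}$ (an assumption, since the notes leave $F$ abstract), the block length is fixed at 32 bytes, and the names `gen`, `enc`, and `dec` are hypothetical.

```python
import hashlib
import hmac
import secrets

N_BYTES = 32  # block length n, in bytes

def gen() -> bytes:
    """Gen: sample a uniform key."""
    return secrets.token_bytes(N_BYTES)

def prf(key: bytes, x: bytes) -> bytes:
    """Stand-in for F_k(x); HMAC-SHA256 is an assumption, not the notes' F."""
    return hmac.new(key, x, hashlib.sha256).digest()

def enc(key: bytes, m: bytes) -> bytes:
    """Enc: c = r || (F_k(r) XOR m) with a fresh uniform r."""
    assert len(m) == N_BYTES
    r = secrets.token_bytes(N_BYTES)
    pad = prf(key, r)
    return r + bytes(a ^ b for a, b in zip(pad, m))

def dec(key: bytes, c: bytes) -> bytes:
    """Dec: split c into r || c2 and output F_k(r) XOR c2."""
    r, c2 = c[:N_BYTES], c[N_BYTES:]
    pad = prf(key, r)
    return bytes(a ^ b for a, b in zip(pad, c2))

key = gen()
message = b"x" * N_BYTES
print(dec(key, enc(key, message)) == message)  # True
```

Because a fresh $r$ is drawn on every call, encrypting the same message twice gives different ciphertexts, which is exactly what the deterministic-encryption theorem requires.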

Theorem

If $F$ is a pseudorandom function, then the construction above is a CPA-secure private-key encryption scheme for messages of length $n$ .

Proof

We will prove this via the contrapositive (as is typical of these proofs). Suppose the construction above is not CPA-secure. Then, there exist some PPT adversary $A$ and a non-negligible function $ρ$ such that
$\Pr[\text{PrivK}_{A, Π}^{\text{cpa}}(n) = 1] \geq \frac{1}{2} + ρ(n)$
We wish to show that $F$ is not a pseudo-random function. In other words, that there exist a PPT distinguisher $D$ and a non-negligible function $ρ'$ such that
$\left| \Pr[D^{F_{k}(\cdot)}(1^{n}) = 1] - \Pr[D^{f(\cdot)}(1^{n}) = 1] \right| \geq ρ'(n)$
Define a distinguisher $D$ , who has $A$ within it. $D$ is playing a PRF game; $A$ is playing a CPA-security game.

$D$ gets an oracle $O$ , talks to the oracle, and returns 0/1 indicating if it thinks the oracle is the PRF.

$A$ can make queries to the oracle, sends $m_{0}, m_{1}$ to get $c$ , makes queries again, and then guesses 0/1 indicating what message it thinks $c$ is from.

So, $D$ would do the following:

First, $D$ gets an oracle $O$ which is either $F_{k}$ or $f$ .

$A$ will be allowed to query the encryption scheme as needed. For each query $m$ from $A$, $D$ will choose a fresh uniformly random string $r \in \{0, 1\}^{n}$, query $O(r)$, and return $(r \| m \oplus O(r))$ to $A$.

$A$ now chooses $m_{0}, m_{1}$, and sends them to $D$.

$D$ will choose $b \in \{0, 1\}$ uniformly at random and encrypt $m_{b}$ as shown above (with a fresh $r$). It sends this challenge ciphertext back to $A$, who may keep making encryption queries.

$A$ will return an answer $b'$. If $b' = b$, then $D$ outputs 1. Otherwise, $D$ outputs 0.

In the case that $O = F_{k}$,
$\Pr[D^{F_{k}}(1^{n}) = 1] = \Pr[\text{PrivK}_{A, Π}^{\text{cpa}}(n) = 1] \geq \frac{1}{2} + ρ(n)$
as $A$’s view is exactly the CPA game against our construction, and $D$ outputs 1 exactly when $A$ guesses correctly.

In the case that $O = f$,
$\Pr[D^{f}(1^{n}) = 1] \leq \Pr[\text{Bad}] + (1 - \Pr[\text{Bad}]) \frac{1}{2} \leq \Pr[\text{Bad}] + \frac{1}{2} \leq \frac{q(n)}{2^{n}} + \frac{1}{2}$
Here, Bad is the event that the same $r$ is chosen when generating the challenge ciphertext and in a CPA-query. If Bad does not happen, the pad $f(r)$ is uniform and independent of everything else $A$ sees, so this is the one-time pad game and $A$ guesses correctly with probability 1/2. If Bad happens, $A$ can recover the pad and guess correctly 100% of the time.

$q(n)$ stands for the number of CPA queries, and gives us a bound on this event happening: each query reuses the challenge $r$ with probability $2^{-n}$. As $q$ is polynomial, $q(n)/2^{n}$ is negligible.

$\left| \Pr[D^{F_{k}}(1^{n}) = 1] - \Pr[D^{f}(1^{n}) = 1] \right| \geq \frac{1}{2} + ρ(n) - \frac{1}{2} - \frac{q(n)}{2^{n}} = ρ(n) - \frac{q(n)}{2^{n}}$
And as a non-negligible function minus a negligible function is non-negligible, this is $\geq$ some non-negligible function. Thus, $F$ is not a PRF.

Pseudo-Random Permutations

A pseudorandom permutation is exactly the same as a pseudorandom function, except that for every key $k$ , $F_{k}$ must be a permutation and it must be indistinguishable from a random permutation.

Permutation means, every function $F_{k}$ is a bijection! So, on the truth table, every input $x$ has a 1-1 mapping to a unique output, where every output has an input!

So, for the input-output table, each output $y \in {0, 1}^{n}$ appears exactly once!

This means that pseudo-random permutations are invertible! Furthermore, if you know the key, it should be possible to efficiently invert it!

Using pseudo-random permutations, we can define the following experiment.

The challenger chooses a random key $k$ from the key-space. This selects a pseudo-random permutation $F_{k}$.
The adversary gets oracle $O$, which is either the pseudo-random permutation $F_{k}$, or a truly random permutation $f$.
- A truly random permutation is one chosen uniformly from all permutations: each output $y$ is randomly assigned to exactly one input $x$!
The adversary can query the oracle in the forward and backward direction, which is either $f(x), f^{-1}(y)$, or $F_{k}(x), F_{k}^{-1}(y)$.
The adversary guesses which function the oracle is.

A strong pseudo-random permutation (PRP) is one in which no efficient adversary can tell which world the oracle belongs to.

$\left| \Pr[A^{f(\cdot), f^{-1}(\cdot)}(1^{n}) = 1] - \Pr[A^{F_{k}(\cdot), F_{k}^{-1}(\cdot)}(1^{n}) = 1] \right| \leq \text{negl}(n)$

There are $(2^{n})!$ possible permutations from $\{0, 1\}^{n} \to \{0, 1\}^{n}$ .

PRPs are also known as block ciphers.

Previously, we created a CPA-secure encryption scheme for 1 block messages. We can now use PRPs to create a scheme that can encrypt multiple blocks!

Let message $m$ have the following blocks $m_{1}, m_{2}, \dots, m_{ℓ}$ . How can we encrypt this message?

One way we could do this is by running the 1-block scheme $ℓ$ times! This is okay in terms of security, but wastes a lot of resources!
- We need to generate a fresh random string $r$ for every block, which could be costly.
- Every block needs its random string concatenated with it, doubling the size of each ciphertext block! This uses a lot of data.

The various ways we can use PRPs to encrypt blocks of messages are known as modes of operation. They are described below.

Electronic Codebook (ECB): Encrypt each message block with the PRP, and concatenate them together.
$F_{k}(m_{1}) \| F_{k}(m_{2}) \| F_{k}(m_{3}) \| \dots$
To decrypt, simply run the inverse of the function on each ciphertext block.
$F_{k}^{-1}(c_{1}) \| F_{k}^{-1}(c_{2}) \| F_{k}^{-1}(c_{3}) \| \dots$

This is not a secure scheme: it is deterministic, so it will give us the same output for the same input, and equal message blocks produce equal ciphertext blocks. However, it is intuitive and makes a lot of sense!
Cipher Block Chaining (CBC): Start with some initialization vector $IV$, a truly random bit-string that is sent along with the ciphertext. Then, for every block, we can compute its ciphertext as
$c_{0} = IV$
$F_{k}(m_{i} \oplus c_{i - 1}) = c_{i}$
In other words, to get our next ciphertext, we XOR the block with the previous ciphertext and run it through the PRP.

To decrypt, run the inverse on the ciphertext block, and XOR it with the previous ciphertext block.
$F_{k}^{-1}(c_{i}) \oplus c_{i - 1} = m_{i}$
Output Feedback (OFB): Start with some random initialization vector $IV$. Then, to generate the ciphertext, we will first generate a key-stream by repeatedly running the $IV$ through the PRP.
$z_{0} = IV$
$F_{k}(z_{i - 1}) = z_{i}$
Then, we XOR each key-stream block with our message block to get our ciphertext.
$z_{i} \oplus m_{i} = c_{i}$
To decrypt, we regenerate the key-stream from the $IV$ and XOR each key-stream block with the ciphertext.
$z_{i} \oplus c_{i} = m_{i}$
Counter (CTR): Start with some random number which will serve as a counter, $ctr$. Then, to generate the ciphertext, we will run $ctr + i$ through the PRP and XOR the output with the $i$-th message block, so the counter is incremented after each message block.
$F_{k}(ctr + i) \oplus m_{i} = c_{i}$
To decrypt, we repeat the same process. Note that $F_{k}$ is never inverted.
$F_{k}(ctr + i) \oplus c_{i} = m_{i}$

CBC, OFB, and CTR (modes 2, 3, and 4 above) all achieve similar levels of security!

In looking at each mode, we should also think about if the mode is parallelizable!
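To make the modes concrete, here is a minimal sketch of ECB, CBC, and CTR. Python's standard library has no block cipher, so the sketch assumes a toy stand-in for $F_{k}$: a 4-round Feistel network whose round function is HMAC-SHA256 (`prp` and `prp_inv` are hypothetical names, for illustration only and not a vetted cipher). Messages are lists of 16-byte blocks.

```python
import hashlib
import hmac
import secrets

BLOCK = 16  # block length in bytes

def xor(a: bytes, b: bytes) -> bytes:
    return bytes(x ^ y for x, y in zip(a, b))

def _round(key: bytes, i: int, half: bytes) -> bytes:
    return hmac.new(key, bytes([i]) + half, hashlib.sha256).digest()[:BLOCK // 2]

def prp(key: bytes, block: bytes) -> bytes:
    """Toy invertible F_k: a 4-round Feistel network (illustration only)."""
    left, right = block[:BLOCK // 2], block[BLOCK // 2:]
    for i in range(4):
        left, right = right, xor(left, _round(key, i, right))
    return left + right

def prp_inv(key: bytes, block: bytes) -> bytes:
    left, right = block[:BLOCK // 2], block[BLOCK // 2:]
    for i in reversed(range(4)):
        left, right = xor(right, _round(key, i, left)), left
    return left + right

def ecb_encrypt(key, blocks):
    return [prp(key, m) for m in blocks]          # blocks are independent

def cbc_encrypt(key, blocks):
    out = [secrets.token_bytes(BLOCK)]            # c_0 = IV
    for m in blocks:                              # inherently sequential
        out.append(prp(key, xor(m, out[-1])))     # c_i = F_k(m_i xor c_{i-1})
    return out

def cbc_decrypt(key, cts):
    return [xor(prp_inv(key, c), prev) for prev, c in zip(cts, cts[1:])]

def _ctr_pad(key, ctr, i):
    return prp(key, ((ctr + i) % 2 ** (8 * BLOCK)).to_bytes(BLOCK, "big"))

def ctr_encrypt(key, blocks):
    ctr = secrets.randbits(8 * BLOCK)
    head = ctr.to_bytes(BLOCK, "big")             # ctr is sent in the clear
    return [head] + [xor(_ctr_pad(key, ctr, i), m) for i, m in enumerate(blocks, 1)]

def ctr_decrypt(key, cts):
    ctr = int.from_bytes(cts[0], "big")
    return [xor(_ctr_pad(key, ctr, i), c) for i, c in enumerate(cts[1:], 1)]

key = secrets.token_bytes(16)
msg = [b"A" * BLOCK, b"A" * BLOCK, b"B" * BLOCK]  # two identical blocks
ecb, cbc, ctr = ecb_encrypt(key, msg), cbc_encrypt(key, msg), ctr_encrypt(key, msg)
print(ecb[0] == ecb[1])  # True: ECB leaks that blocks 1 and 2 are equal
```

In this sketch, ECB and CTR handle every block independently (so they parallelize), while CBC encryption must run block by block because each step needs the previous ciphertext.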

Practical Constructions for Block Ciphers

How do we construct strong PRPs (block ciphers) in practice?

Below, we’ll discuss some of the ideas that were used to define PRPs. Suppose we’re working with a 128-bit input and 128-bit output.

By the end, we’ll end on how AES-128 works.

For each idea, we’ll ask the questions:

Does the construction give us a permutation (does each input have a unique output)?
Is the construction indistinguishable from a random permutation (secure)?

Substitution-Permutation Networks (SPNs)

Here’s an idea:

Shannon Confusion Paradigm

Over small enough domains, random permutations can be stored and evaluated efficiently. For example, if we only need to generate a random permutation table for an 8-bit input, we only need to generate $2^{8} = 256$ entries!

So, what if we could extend this? For a 128-bit input, let our key $k$ specify 16 permutations $f_{1}, \dots f_{16}$ on 8-bit blocks. Then, for an input $x$ , break it into 8-bit blocks $x_{1} x_{2} \dots x_{16}$ , and generate ciphertext
$c = f_{1} (x_{1}) ∣∣ f_{2} (x_{2}) ∣∣ \dots ∣∣ f_{16} (x_{16})$

This is a permutation, as we can invert ciphertext $c = y_{1} \| y_{2} \| \dots \| y_{16}$ by taking $f_{1}^{-1}(y_{1}) \| \dots \| f_{16}^{-1}(y_{16})$ .

This is good, but is not indistinguishable from a truly random permutation. To see why, query two messages, where only one block $x_{i}$ is different. Then, every output block between the two messages should be the same except block $i$ .

This fails because each output block depends only on the input block in the same position, which exposes information. We add an additional step to address this, where we permute the bits of the output so that they are spread across different blocks (and then repeat both steps).

This is known as the Shannon Confusion-Diffusion Paradigm! The first step is known as confusion, and the re-ordering step is known as diffusion.

A practical implementation of this paradigm is known as the Substitution-Permutation Network (SPN). An SPN applies the following operations, in what’s known as a round:

Key Mixing: The message is XORed with key $k_{i}$ , which is derived from the master key.
- A different key is used each round, and is derived from the master key using a key schedule (which often just takes different subsets of the master key’s bits).
Substitution: The message is split into blocks, and each block is run through an S-box, a fixed and public permutation on some $n$ -bit input and $n$ -bit output.
Permutation: The bits are permuted through a public mixing function.

The SPN runs multiple of these rounds to obtain an output that looks random. After the last round, the network always finishes by running one more final key-mixing step.

Note that the substitution and permutation functions are completely public! The security comes from the keys that are XORed with the message at the key mixing steps.
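The round structure can be sketched on a toy 16-bit block. Everything concrete here is an assumption made for illustration: the 4-bit S-box values, the bit-mixing function (a 4x4 transpose of the bits), and the list of round keys, which is supplied directly instead of being derived by a key schedule.

```python
# A public 4-bit S-box (any permutation of 0..15 works for the sketch).
SBOX = [0xE, 0x4, 0xD, 0x1, 0x2, 0xF, 0xB, 0x8,
        0x3, 0xA, 0x6, 0xC, 0x5, 0x9, 0x0, 0x7]
INV_SBOX = [SBOX.index(v) for v in range(16)]

def substitute(x: int, box) -> int:
    """Apply a 4-bit S-box to each of the four nibbles of a 16-bit word."""
    return sum(box[(x >> (4 * j)) & 0xF] << (4 * j) for j in range(4))

def permute(x: int) -> int:
    """Public mixing: bit i moves to position 4 * (i % 4) + i // 4.
    This transposes the 4x4 bit matrix, so it is its own inverse, and the
    four output bits of one S-box feed four different S-boxes next round."""
    y = 0
    for i in range(16):
        if (x >> i) & 1:
            y |= 1 << (4 * (i % 4) + i // 4)
    return y

def spn_encrypt(round_keys, x: int) -> int:
    *mix_keys, final_key = round_keys
    for k in mix_keys:                       # one round per key
        x = permute(substitute(x ^ k, SBOX)) # key mix, S-boxes, bit mixing
    return x ^ final_key                     # final key-mixing step

def spn_decrypt(round_keys, y: int) -> int:
    *mix_keys, final_key = round_keys
    y ^= final_key
    for k in reversed(mix_keys):             # undo the rounds in reverse
        y = substitute(permute(y), INV_SBOX) ^ k
    return y

round_keys = [0x1A2B, 0x3C4D, 0x5E6F, 0x7081, 0x92A3]  # 4 rounds + final mix
c = spn_encrypt(round_keys, 0xBEEF)
print(spn_decrypt(round_keys, c) == 0xBEEF)  # True
```

Since every step is invertible, the whole network is a permutation for any choice of round keys, which answers the first of the two questions asked of each construction.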

How many rounds are needed for this to be secure?

Theorem: The Avalanche Effect

For a random permutation, when a single input bit is changed, we should expect each bit of $f (x)$ to change with probability 1/2. So,

We design S-boxes so that changing a single bit of the input to an S-box changes at least 2 bits in the output of the S-box

We design the mixing function so that the output bits of any given S-box are used as input to multiple S-boxes in the next round

Because of this, on round $i$ , if 1 bit of the input is changed, we can expect it to influence $2^{i}$ bits of the output. So, for AES-128, we need at least 7 rounds ( $2^{7} = 128$ ) for this to hold / give us the security we need!

In the case that we don’t have enough rounds, it’s possible to break SPN with a key-recovery attack. Suppose we have oracle access to our SPN.

Example: Key Recovery Attack vs. 1-Round SPN

Suppose we have a 1-round SPN on 16-bit messages, using 4-bit S-boxes, where our key is the 32-bit string $k = k_{1} \| k_{2}$ ( $k_{1}$ used for the first key mix, $k_{2}$ used for the final key mix).

Using a brute-force approach, there are $2^{32}$ possible key values! However, we can actually significantly reduce this.

To understand why, suppose we queried the oracle to get input output pair $(x, y)$ . For any $k_{2}$ , there exists 1 unique $k_{1}$ that gives us $(x, y)$ . So, an attacker could do the following:

Query the oracle for $(x, y)$ .

Guess one of the $2^{16}$ possible values of the 16-bit key $k_{2}$ .

Invert the SPN for our message (XOR $y$ with $k_{2}$ , inverse mix, inverse S-box, then XOR with $x$ ) to find the corresponding $k_{1}$ for this $k_{2}$ .

Now, query the oracle for another message, and see if our $(k_{1} \| k_{2})$ pair gives us the correct output.

This means we only need to evaluate the SPN about $2 \cdot 2^{16}$ times! This is a lot more efficient than our original brute-force approach.

However, we could actually do even better than this! The key to this is realizing that we can actually compute our $(k_{1}, k_{2})$ combinations for each S-box block at a time.

Take some S-box block, say the block for $x_{1}$ to $x_{4}$ .

Figure out which bits of the output $y$ (and of $k_{2}$ ) come from this S-box after the mixing step. Guess these 4 bits of $k_{2}$ , XOR them with the corresponding bits of $y$ , apply the inverse S-box, and XOR the result with this block of $x$ to get the corresponding 4 bits of $k_{1}$ .

Now, for any given block, we only need to try the $2^{4}$ possible bit combinations of $k_{2}$ to find the corresponding $k_{1}$ for that block!

Repeating this for each of the 4 blocks, we now only evaluate our SPN about $4 \cdot 2^{4}$ times to get 4 lists (one per S-box), which contain all candidate values of $(k_{1}, k_{2})$ ! We can run this on extra messages to determine which key combination is correct.
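The per-S-box attack can be sketched end to end. The concrete cipher below is an assumption (a hypothetical S-box and a bit-transpose mixing step, computing $y = P(S(x \oplus k_{1})) \oplus k_{2}$), and the sketch uses 16 chosen plaintexts as its "extra messages" so that, for this particular S-box, exactly one candidate survives per S-box.

```python
import secrets

SBOX = [0xE, 0x4, 0xD, 0x1, 0x2, 0xF, 0xB, 0x8,
        0x3, 0xA, 0x6, 0xC, 0x5, 0x9, 0x0, 0x7]
INV_SBOX = [SBOX.index(v) for v in range(16)]

def substitute(x: int) -> int:
    return sum(SBOX[(x >> (4 * j)) & 0xF] << (4 * j) for j in range(4))

def permute(x: int) -> int:
    """Public bit mixing (a 4x4 bit transpose, so it is its own inverse)."""
    y = 0
    for i in range(16):
        if (x >> i) & 1:
            y |= 1 << (4 * (i % 4) + i // 4)
    return y

def spn1(k1: int, k2: int, x: int) -> int:
    """1-round SPN: key mix, S-boxes, bit mixing, final key mix."""
    return permute(substitute(x ^ k1)) ^ k2

def recover_key(oracle):
    # 16 chosen plaintexts: every nibble value appears in every S-box slot.
    # Applying the (public) inverse mixing to y leaves S(x ^ k1) ^ P(k2).
    pairs = [(x, permute(oracle(x))) for x in (0x1111 * v for v in range(16))]
    k1 = k2_unmixed = 0
    for j in range(4):                         # attack each S-box on its own
        def nib(w, j=j):
            return (w >> (4 * j)) & 0xF
        x0, y0 = pairs[0]
        candidates = []
        for guess in range(16):                # 2^4 guesses for this part of k2
            k1_j = INV_SBOX[nib(y0) ^ guess] ^ nib(x0)
            if all((SBOX[nib(x) ^ k1_j] ^ guess) == nib(y) for x, y in pairs):
                candidates.append((k1_j, guess))
        k1_j, k2_j = candidates[0]
        k1 |= k1_j << (4 * j)
        k2_unmixed |= k2_j << (4 * j)
    return k1, permute(k2_unmixed)             # re-apply mixing to get k2

k1, k2 = secrets.randbits(16), secrets.randbits(16)
print(recover_key(lambda x: spn1(k1, k2, x)) == (k1, k2))  # True
```

The attacker makes only $4 \cdot 2^{4}$ key guesses in total, compared with $2^{32}$ for naive brute force.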

AES

Advanced Encryption Standard (AES) essentially uses the SPN paradigm for security.

Feistel Network

Feistel Networks are another way we can build block ciphers, though we will not cover them for the sake of time.

Feistel networks essentially take a PRF, and convert it into a secure PRP.

DES

Data Encryption Standard (DES) uses the Feistel Network.

Message Authentication Codes (MAC)

In the previous section, we discussed how to create a secure encryption scheme. However, secrecy is not everything! Even without knowing the original message in the previous schemes, attackers can still modify the ciphertext in transit, with no way for the sender / receiver to tell.

In other words, there’s secrecy, but not integrity! In this section, we will discuss how we can create schemes that maintain integrity.

Secure MACs

Suppose we have a sender, receiver, and eavesdropper, where only the sender and receiver know the key $k$ . Say the sender doesn’t care about secrecy, and just wants the receiver to be able to verify that the message came from them, and was not modified in transit. How can they do this?

They do this using a message authentication code (MAC) scheme $Π = (\text{Gen}, \text{Mac}, \text{Vrfy})$ .

Both the sender and receiver share a key $k$ randomly generated with Gen.
The sender runs a MAC algorithm $\text{Mac}_{k}(m)$ , which generates a tag $t$ that can be used for authentication.
The sender sends the message and tag together, $(m, t)$ , to the receiver.
The receiver runs a verification algorithm $\text{Vrfy}_{k}(m, t)$ , which returns 1 (valid) or 0 (invalid) indicating if the message was sent by the sender or not (based on if the tag matches the message).

Correctness guarantees that $\text{Vrfy}_{k}(m, \text{Mac}_{k}(m)) = 1$ . In other words, the verify algorithm should always accept if the message truly wasn’t modified.

Like before, we will formalize this scheme with an experiment.

Experiment $\text{MacForge}_{A, Π}(n)$

Let $Π$ be a MAC scheme, $A$ be an adversary, and $n$ be the security parameter. Let the challenger play the part of the sender / receiver.

The challenger first generates a random key $\text{Gen}(1^{n}) \to k$ .
$A$ gets oracle access to the MAC algorithm, $A^{\text{Mac}_{k}(\cdot)}$ . In other words, they can query any message they want, and get back a tag for that message. Let $Q$ be the set of all messages $m'$ queried by $A$ .
$A$ tries to forge a valid tag by sending a message and tag pair $(m, t)$ to the challenger.
The adversary wins, $\text{MacForge}_{A, Π}(n) = 1$ , if:
- $m \notin Q$ . In other words, the adversary did not query and get the tag for $m$ (if they queried $m$ , this is known as a replay attack, which has to be dealt with separately in practice).
- $\text{Vrfy}_{k}(m, t) = 1$ . In other words, the challenger was tricked into finding that the tag matches the message.

We say $Π$ is existentially unforgeable under an adaptive chosen message attack if $\forall$ PPT $A$ , $\exists$ negligible function $\text{negl}$ such that

$\Pr[\text{MacForge}_{A, Π}(n) = 1] \leq \text{negl}(n)$

Note that unlike the secrecy definitions, this probability must itself be negligible, not just at most $\frac{1}{2} + \text{negl}(n)$ : here the adversary has no trivial strategy that succeeds half of the time.

For shorthand, we will just say the scheme is secure in this case.

Strong Security

There’s also a stronger notion of security for MAC schemes, which we’ll call strong security. In this case, $Q$ is the set of all message, tag pairs $(m^{'}, t^{'})$ , and the adversary wins if

$(m, t) \notin Q$

$\text{Vrfy}_{k}(m, t) = 1$

In other words, the adversary is allowed to reuse a message it already queried, as long as the tag is different from the ones it received.

In practice, we don’t worry about this too much as the schemes we use are deterministic, so there’s only one valid tag per message (so strong security and normal security are the same).

Example: MAC Disproof

Suppose we have a MAC, where for message $m = m_{1} \| \dots \| m_{ℓ}$ , $m_{i} \in \{0, 1\}^{n}$ , we choose $r \in \{0, 1\}^{n}$ at random and compute $t = r \| F_{k}(m_{1} \oplus r) \| \dots \| F_{k}(m_{ℓ} \oplus r)$ .

This is, in fact, not a secure MAC. We can show this below.

Let $A$ be an adversary which does the following:

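Pick any message $m$ and query the oracle to get its tag $t$ .

From the tag $t$ , take the random bit-string $r$ at the beginning.

Now create the forgery, where $m'$ is the message in which each block of $m$ is XORed with $r$ from above. Let $t'$ be $t$ with $r$ replaced by $0^{n}$ , that is, $t' = 0^{n} \| F_{k}(m_{1} \oplus r) \| \dots \| F_{k}(m_{ℓ} \oplus r)$ .

This is a valid forgery whenever $r \neq 0^{n}$ (so that $m' \neq m$ ): verifying $m'$ with random string $0^{n}$ recomputes $F_{k}(m_{i}' \oplus 0^{n}) = F_{k}(m_{i} \oplus r)$ , which matches $t'$ . So the adversary breaks the MAC with probability $1 - 2^{-n}$ , which is non-negligible!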
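The forgery can be checked mechanically. This sketch assumes HMAC-SHA256 truncated to 16 bytes as a stand-in for $F_{k}$ ; messages and tags are lists of 16-byte blocks, and the names `mac`, `vrfy`, and `forge` are hypothetical.

```python
import hashlib
import hmac
import secrets

N = 16  # block length n, in bytes

def F(key: bytes, x: bytes) -> bytes:
    """Stand-in PRF (an assumption): HMAC-SHA256 truncated to n bytes."""
    return hmac.new(key, x, hashlib.sha256).digest()[:N]

def xor(a: bytes, b: bytes) -> bytes:
    return bytes(x ^ y for x, y in zip(a, b))

def mac(key, blocks):
    """The flawed MAC: t = r || F_k(m_1 xor r) || ... || F_k(m_l xor r)."""
    r = secrets.token_bytes(N)
    return [r] + [F(key, xor(m, r)) for m in blocks]

def vrfy(key, blocks, tag) -> bool:
    r, rest = tag[0], tag[1:]
    return len(rest) == len(blocks) and all(
        hmac.compare_digest(F(key, xor(m, r)), t) for m, t in zip(blocks, rest))

def forge(mac_oracle, blocks):
    """One oracle query on m, then output (m', t') for a never-queried m'."""
    tag = mac_oracle(blocks)
    r = tag[0]
    forged_msg = [xor(m, r) for m in blocks]   # m'_i = m_i xor r
    forged_tag = [bytes(N)] + tag[1:]          # replace r with 0^n
    return forged_msg, forged_tag

key = secrets.token_bytes(32)
msg = [b"pay alice 10 usd", b"from bob account"]
forged_msg, forged_tag = forge(lambda b: mac(key, b), msg)
print(vrfy(key, forged_msg, forged_tag))  # True
```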

Constructing Secure MACs (Fixed-Length)

We can construct secure MACs using pseudorandom functions!

Let $F$ be a pseudorandom function. We define a fixed-length MAC for messages of length $n$ as follows:

Gen: Outputs a random key $k$ .
Mac: For key $k$ and message $m$ , output $F_{k} (m) = t$ .
Vrfy: For key $k$ , message $m$ , tag $t$ , output 1 if and only if $t = F_{k} (m)$ .

This only works for fixed-length messages. Later, we’ll use this to create schemes that work for variable length messages.
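As a minimal sketch of this construction, the three algorithms can be written as follows, again assuming HMAC-SHA256 as a stand-in for the PRF $F_{k}$ and a fixed message length of 32 bytes (both are assumptions for illustration, as are the function names).

```python
import hashlib
import hmac
import secrets

N = 32  # fixed message length n, in bytes

def gen() -> bytes:
    """Gen: output a uniform key k."""
    return secrets.token_bytes(N)

def mac(key: bytes, m: bytes) -> bytes:
    """Mac: t = F_k(m), defined only for messages of length n."""
    assert len(m) == N
    return hmac.new(key, m, hashlib.sha256).digest()

def vrfy(key: bytes, m: bytes, t: bytes) -> bool:
    """Vrfy: accept iff t equals the recomputed F_k(m)."""
    return len(m) == N and hmac.compare_digest(mac(key, m), t)

key = gen()
m = b"transfer 100 to account 12345678"
t = mac(key, m)
print(vrfy(key, m, t))  # True
```

The sketch compares tags with `hmac.compare_digest`, a constant-time comparison; this is a practical precaution against timing leaks and is separate from the security definition above.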

Theorem: MAC Security

If $F$ is a PRF, then the construction above is a secure fixed-length MAC for messages of length $n$ .

Proof

By contrapositive, suppose the construction is not a secure MAC. So, there exist a PPT adversary $A$ and a non-negligible function $ρ$ such that
$\Pr[\text{MacForge}_{A, Π}(n) = 1] \geq ρ(n)$
We must show that there exists a PPT distinguisher $D$ such that
$\left| \Pr[D^{f(\cdot)}(1^{n}) = 1] - \Pr[D^{F_{k}(\cdot)}(1^{n}) = 1] \right| \geq ρ'(n)$
for some non-negligible function $ρ'(n)$ .

Let $D$ do the following, playing the PRF experiment.

Let $D$ get the oracle, which is either random function $f$ , or $F_{k}$ .

$D$ gives $A$ oracle access. For any MAC query from $A$ , $D$ forwards message $m$ to the oracle to get $t = O(m)$ , sends it back to $A$ , and adds $m$ to the set $Q$ of queried messages.

When ready, $A$ sends $(m^{*}, t^{*})$ to $D$ as a forgery.

If $m^{*} \in Q$ , $D$ outputs 0. If $m^{*} \notin Q$ , $D$ forwards $m^{*}$ to the oracle and checks if $O(m^{*}) = t^{*}$ . If so, $D$ outputs 1, and 0 otherwise.

Note that $D$ runs $A$ and does only a polynomial amount of extra work, so $D$ runs in PPT. We have the following probabilities.
$\Pr[D^{F_{k}(\cdot)}(1^{n}) = 1] = \Pr[\text{MacForge}_{A, Π}(n) = 1] \geq ρ(n)$
$\Pr[D^{f(\cdot)}(1^{n}) = 1] \leq \frac{1}{2^{n}}$ (since $m^{*} \notin Q$ , the value $f(m^{*})$ is uniform and independent of everything $A$ has seen)
$\left| \Pr[D^{f(\cdot)}(1^{n}) = 1] - \Pr[D^{F_{k}(\cdot)}(1^{n}) = 1] \right| \geq ρ(n) - \frac{1}{2^{n}}$
which is non-negligible. Thus, $F$ is not a PRF.

Constructing Secure MACs (Variable-Length)

Let’s see how we can extend fixed-length MACs to variable-length messages, known as domain extension for MACs.

Suppose we have a MAC $Π$ , which works for fixed-length messages of length $n$ . For a message of any length,

$m = m_{1} m_{2} \dots m_{ℓ}$

How can we generate a tag for this message?

Naive Extension

One way we could do this is by running each block of the message through the MAC to get a tag for each block.

Unfortunately, this is not secure. This is because there’s nothing “tying” the blocks of the messages together. So, an adversary can easily reorder the blocks (and their tags) to break the security of the scheme.

Let’s see some ways we can extend the MAC.

CBC MAC

CBC-MAC: We generate one tag by running each message block through $F_{k}$ , XORing the result with the next message block, and repeating this process. Formally,

$c_{1} = F_{k}(m_{1})$
$c_{2} = F_{k}(m_{2} \oplus c_{1})$
$c_{3} = F_{k}(m_{3} \oplus c_{2})$
$\vdots$
$t = c_{ℓ} = F_{k}(m_{ℓ} \oplus c_{ℓ - 1})$

This is secure only if the message and forgery are both required to have a fixed length $ℓ$ ! If these conditions are relaxed, we can perform a length extension attack to break the security of the MAC. Let’s see how below.

Example: Length Extension Attack

Suppose the message and forgery are not required to have a fixed block-length. Then, let $A$ be the adversary who queries two messages $m_{1}, m_{2}$ of length 3 blocks each. $A$ will use these messages to then generate a forgery on a 6 block message $m' = m_{1}' m_{2}' m_{3}' m_{4}' m_{5}' m_{6}'$ .

$A$ can query $m_{1} = m_{1}^{'} ∣∣ m_{2}^{'} ∣∣ m_{3}^{'}$ , to get tag $t_{1}$ .

Then, $A$ can query $m_{2} = (m_{4}^{'} \oplus t_{1}) ∣∣ m_{5}^{'} ∣∣ m_{6}^{'}$ to get tag $t_{2}$ .

This tag is our forgery! Return forgery $(m^{'}, t^{'})$ where $t^{'} = t_{2}$ .

This creates a forgery that breaks the security of our MAC!

This happens because we’re allowed to query the oracle on a prefix of the forged message: the tag of the prefix is exactly the chaining value that CBC-MAC carries into the next block. This relation breaks the security.
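The length extension attack can be verified with a short simulation. This is a sketch assuming HMAC-SHA256 truncated to 16 bytes as a stand-in for $F_{k}$ ; the helper names are hypothetical, and messages are lists of 16-byte blocks.

```python
import hashlib
import hmac
import secrets

N = 16  # block length, in bytes

def F(key: bytes, x: bytes) -> bytes:
    """Stand-in PRF (an assumption): HMAC-SHA256 truncated to n bytes."""
    return hmac.new(key, x, hashlib.sha256).digest()[:N]

def xor(a: bytes, b: bytes) -> bytes:
    return bytes(x ^ y for x, y in zip(a, b))

def cbc_mac(key, blocks) -> bytes:
    """Basic CBC-MAC: c_1 = F_k(m_1), c_i = F_k(m_i xor c_{i-1}), t = c_l."""
    c = bytes(N)                      # XOR with zeros gives c_1 = F_k(m_1)
    for m in blocks:
        c = F(key, xor(m, c))
    return c

def forge(mac_oracle, target):
    """Forge a tag for the 6-block message `target` with two 3-block queries."""
    t1 = mac_oracle(target[:3])                           # tag of the prefix
    t2 = mac_oracle([xor(target[3], t1)] + target[4:])    # absorbs t1 into block 4
    return target, t2

key = secrets.token_bytes(32)
queried = []
def oracle(blocks):
    queried.append(list(blocks))
    return cbc_mac(key, blocks)

target = [bytes([i]) * N for i in range(1, 7)]
forged_msg, forged_tag = forge(oracle, target)
print(forged_tag == cbc_mac(key, forged_msg))  # True, and target was never queried
```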

To fix CBC-MAC, we will start the MAC not on $m_{1}$ directly, but on a prefix-free encoding of the message. This is a scheme that maps an input message into a space of valid code-words, where for any two distinct valid code-words, neither is a prefix of the other.

One easy way to do this is by encoding the length of the message! So, CBC-MAC would do the following:

$c_{0} = F_{k}(\text{length of } m)$
$c_{1} = F_{k}(m_{1} \oplus c_{0})$
$c_{2} = F_{k}(m_{2} \oplus c_{1})$
$\vdots$
$t = c_{ℓ} = F_{k}(m_{ℓ} \oplus c_{ℓ - 1})$

The prefix-free encoding ensures that messages cannot prefix each other in the MAC scheme!

Essentially, what we are doing is prepending an extra message block to the message, which indicates its length.

Hash and MAC

Hash-And-Mac: We use a hash function to compress messages of arbitrary length down to the MAC input size, then create a MAC tag from the fixed-size output.

Formally, a hash function (with output length $ℓ$ ) is a pair of PPT algorithms Gen, H satisfying the following:

Gen takes input security parameter $1^{n}$ and outputs a key $s$ .
H takes as input a key $s$ and a string $x \in \{0, 1\}^{*}$ , and outputs a string $H^{s}(x) \in \{0, 1\}^{ℓ(n)}$ .

If $H^{s}$ is only defined for inputs $x \in {0, 1}^{ℓ^{'} (n)}$ and $ℓ^{'} (n) > ℓ (n)$ , then we say that Gen, H is a fixed-length hash function for inputs of length $ℓ^{'}$ . In this case, $H$ is also called a compression function (shrinks the size of the input).

We define the following experiment, $\text{HashColl}_{A, Π}(n)$ .

Generate key $s$ by running $\text{Gen}(1^{n})$ .
The adversary $A$ is given $s$ and outputs $x, x'$ .
The output of the experiment is 1 if and only if $x \neq x'$ and $H^{s}(x) = H^{s}(x')$ .

We say a hash function is collision resistant if for all PPT adversaries $A$ , there exists a negligible function $\text{negl}$ such that

$\Pr[\text{HashColl}_{A, Π}(n) = 1] \leq \text{negl}(n)$

With hash functions, we can easily create MACs for variable length messages! We first hash our message into the MAC input domain, and then run the MAC algorithm on the result.

$\text{Mac}_{k, s}'(m) = \text{Mac}_{k}(H^{s}(m))$

Theorem

If $Π$ is a secure MAC for messages of length $ℓ$ , and $Π_{H}$ is a collision-resistant hash function with output length $ℓ$ , then the construction above is a secure MAC for arbitrary length messages.

Proof (Sketch)

Suppose, for the sake of contradiction, that there is an adversary $A$ breaking Hash-And-Mac. We show that under this assumption, this adversary can break either the secure MAC or the hash function.

If $A$ can break the scheme, then it can produce $(m, t)$ with $t = \text{Mac}_{k}(H^{s}(m))$ and $m \notin Q$ .

Case 1: There is some queried message $m' \in Q$ with $H^{s}(m) = H^{s}(m')$ . We can check the list of queries to find $m'$ , and since $m \neq m'$ , the pair $(m, m')$ breaks the collision-resistant hash function.

Case 2: $H^{s}(m) \neq H^{s}(m')$ for all $m' \in Q$ . Then $\tilde{m} = H^{s}(m)$ was never given as an input to $\text{Mac}_{k}$ , yet $t$ is a valid tag for it, so $(\tilde{m}, t)$ is a forgery that breaks the MAC.

But how do we create a compression function in practice?

Creating a Compression Function in Practice

One way we can create a compression function is by using the Davies-Meyer Construction.

Let $F$ be a block cipher with $n$ -bit key and $ℓ$ -bit block length. Then, for an $(n + ℓ)$ -bit input $k \| m$ , we can compress the input as
$h(k \| m) = F_{k}(m) \oplus m$

Note that the XOR with $m$ is necessary, as without it, we could easily generate a collision: $F$ ’s key is part of the public input, so for any other key $k' \neq k$ we can decrypt $F_{k}(m)$ under $k'$ to get $m' = F_{k'}^{-1}(F_{k}(m))$ , and then $k' \| m'$ collides with $k \| m$ .

Intuitively, this works because the output of $F_{k} (m)$ looks “random”, so it serves as a bit-mask on $m$ . This will modify $m$ arbitrarily, giving us a “random” looking output.

Security can be proven only if we model $F$ not merely as a PRP, but as an ideal cipher: every key selects an independent, uniformly random permutation, which the adversary can only evaluate (forwards and backwards) through oracle queries. A PRP alone is not enough, as it only promises security when the key is secret, whereas here the key is chosen by the adversary.

Given a compression function with fixed input-output lengths, we can then extend it to accept variable-length inputs using the Merkle-Damgard Transform (used by SHA-1 and SHA-256):

Split the message into blocks of equal length: $X_{1}, X_{2}, \dots, X_{B}$ .
Take an initialization vector, $Z_{0} = I V$ .
Apply the compression function $h^{s}$ block by block, $Z_{i} = h^{s}(Z_{i - 1} \| X_{i})$ for every $1 \leq i \leq B$ , to get $Z_{B}$ : $Z_{1} = h^{s}(Z_{0} \| X_{1})$ , $Z_{2} = h^{s}(Z_{1} \| X_{2})$ , $\dots$
Finally, output $H^{s}(x) = h^{s}(Z_{B} \| L)$ , where $L$ is the length of the message.

In practice, the initialization vector is a fixed constant, like $0^{n}$ .

This works as long as our function is a compression function (output is smaller than the input)!

We need to pad with the length, to distinguish messages that aren’t perfectly aligned with the blocks. If we were to pad these messages with 0s instead, we could easily create a collision.
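The transform can be sketched as follows. The compression function here is an assumption made purely for illustration: an unkeyed stand-in `h` that maps 32 bytes to 16 bytes by truncating SHA-256 (the key $s$ is omitted), with the last block zero-padded and the length block appended at the end as described above.

```python
import hashlib

BLOCK = 16  # bytes per message block X_i
OUT = 16    # bytes per chaining value Z_i

def h(z: bytes, x: bytes) -> bytes:
    """Stand-in compression function: (OUT + BLOCK) bytes -> OUT bytes."""
    assert len(z) == OUT and len(x) == BLOCK
    return hashlib.sha256(z + x).digest()[:OUT]

def merkle_damgard(message: bytes) -> bytes:
    # Zero-pad so the message splits into equal-length blocks X_1..X_B.
    padded = message + bytes(-len(message) % BLOCK)
    z = bytes(OUT)                                   # Z_0 = IV = 0^n
    for i in range(0, len(padded), BLOCK):
        z = h(z, padded[i:i + BLOCK])                # Z_i = h(Z_{i-1} || X_i)
    length_block = len(message).to_bytes(BLOCK, "big")
    return h(z, length_block)                        # final step: h(Z_B || L)

# These two messages are identical after zero-padding; only the final
# length block tells them apart.
print(merkle_damgard(b"ab") != merkle_damgard(b"ab\x00"))  # True
```

Without the final length block, `b"ab"` and `b"ab\x00"` would pad to the same single block and collide trivially, which is the situation the length padding is meant to rule out.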

Theorem: Security of Merkle-Damgard

If the underlying compression function $h$ is collision resistant, then so is $H$ , the Merkle-Damgard Transform of this function.

Proof (Sketch)

We wish to show (by contrapositive) that if we can find a collision in $H$ , say $x \neq x'$ with $H^{s}(x) = H^{s}(x')$ , we can generate a collision for $h$ .

Case 1: If the lengths of the two messages are not equal, we can easily create a collision on $h$ by taking the last inputs to $h^{s}$ in the transform. These inputs, $Z_{B} \| L$ and $Z_{B'}' \| L'$ , differ because $L \neq L'$ , but produce the same output.

Case 2: If the lengths are equal, compare the last inputs to $h^{s}$ . If they differ (that is, $Z_{B} \neq Z_{B}'$ ), we have two different inputs generating a collision. If they are equal, step back and repeat this comparison for the previous inputs, $Z_{B - 1} \| X_{B}$ and $Z_{B - 1}' \| X_{B}'$ , which both map to $Z_{B}$ .

There must be some step $i$ where $Z_{i - 1} \| X_{i} \neq Z_{i - 1}' \| X_{i}'$ , as otherwise, the messages are the same (which is a contradiction).

On its own, Merkle-Damgard is not a secure MAC: for example, using $H^{s}(k \| m)$ as a tag is vulnerable to length-extension attacks.

Constructing a $h$ for Merkle Damgard

For a hash function outputting $ℓ$ bits, to get a collision with 100% probability, we need to try $2^{ℓ} + 1$ inputs (by pigeon hole principle).

However, we don’t need 100% probability of success to make the function insecure! By the birthday bound, we really only need about $2^{ℓ /2}$ inputs to find a collision with a reasonable probability. So, regardless of how good our hash function is, an attacker only needs about $2^{ℓ /2}$ samples to break the security!

The birthday bound states that for $q$ uniform, independent, random samples from a set of size $N$ , the probability that two samples collide is bounded by
$\frac{q(q - 1)}{4 N} \leq \text{Coll}(q, N) \leq \frac{q^{2}}{2 N}$
(the lower bound needs $q \leq \sqrt{2 N}$ ). Because of this, $ℓ$ has to be pretty large.

For example, as SHA-1 outputs length $ℓ = 160$ , a generic birthday attack could find a collision with about $2^{160/2} = 2^{80}$ hash evaluations.
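A birthday search is easy to try on a deliberately tiny hash. This sketch assumes SHA-256 truncated to 24 bits as the toy hash function (`truncated_hash` is a hypothetical helper), so a collision is expected after roughly $2^{12}$ attempts instead of $2^{24}$ .

```python
import hashlib
from itertools import count

def truncated_hash(x: bytes, bits: int) -> int:
    """Toy hash: the top `bits` bits of SHA-256(x)."""
    digest = hashlib.sha256(x).digest()
    return int.from_bytes(digest, "big") >> (256 - bits)

def find_collision(bits: int):
    """Hash 0, 1, 2, ... until two inputs share an output (birthday search)."""
    seen = {}
    for i in count():
        x = str(i).encode()
        y = truncated_hash(x, bits)
        if y in seen:
            return seen[y], x, i + 1      # colliding inputs and number of tries
        seen[y] = x

def coll_upper_bound(q: int, n: int) -> float:
    return q * q / (2 * n)

def coll_lower_bound(q: int, n: int) -> float:
    return q * (q - 1) / (4 * n)

x1, x2, tries = find_collision(24)
print(x1 != x2 and truncated_hash(x1, 24) == truncated_hash(x2, 24))  # True
print(coll_upper_bound(2 ** 12, 2 ** 24))  # 0.5
```

By the pigeonhole principle the search stops after at most $2^{24} + 1$ inputs, but in practice it finishes after a few thousand, in line with the bound.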

Sponge Construction

Sponge: Recently, a new paradigm has emerged for domain extension. Let $f$ be a completely random permutation $\{0, 1\}^{r + c} \to \{0, 1\}^{r + c}$ .

This is the paradigm used in SHA-3.

Using $f$ , we will first absorb the input in one stage, then squeeze out the output in another (this is why it’s known as a sponge).

Absorb:
1. Start with an internal state of all 0’s, split into an $r$ section and a $c$ section. In other words, our state starts as $0^{r} ∣∣ 0^{c}$ .
2. For every block of the message $p_{i}$ , XOR it with the $r$ section, and run $f (r \oplus p_{i} ∣∣ c)$ . The first $r$ bits of the output are our new $r$ section, with the rest being the new $c$ section.
3. Repeat this for all blocks of the message, until the entire input is absorbed.
Squeeze:
1. Take the first $r$ bits of the state, and output them.
2. If more bits are needed, run the state through $f$ and output the next $r$ bits.

$r$ is our bitrate, how fast we can absorb / squeeze data; $c$ is our capacity, how secure our method is.

It can be shown that if $p_{0}$ is our key $k$ , and the remaining are our message blocks, this sponge method can be used directly as a MAC; it is indifferentiable from a random oracle, if $f$ is a truly random permutation.

Authenticated Encryption

CCA Security

We’ve seen a secure way to encrypt our message for secrecy, and a secure way to authenticate our message for integrity. Now, let’s tie the two together!

Chosen Ciphertext Attack (CCA) Security is a standard security notion, which is even stronger than CPA security. We define the following game.

Consider a private-key encryption scheme $Π$ , some adversary $A$ , and security parameter $n$ . Define the following experiment:

$\text{PrivK}_{A, Π}^{\text{cca}} (n)$

1. The challenger generates a key $k \leftarrow \text{Gen} (1^{n})$ .
2. The adversary gets oracle access to both the encryption and decryption algorithms, $A^{\text{Enc}_{k}, \text{Dec}_{k}}$ . They can query messages to get ciphertexts, and ciphertexts to get messages.
3. The adversary chooses 2 messages of equal length, $m_{0}, m_{1}$ , and sends them to the challenger.
4. The challenger chooses a uniform bit $b \in \{0, 1\}$ , encrypts $m_{b}$ , and sends the resulting challenge ciphertext $c$ back.
5. The adversary can again query the encryption and decryption oracles. When ready, it returns $b^{'}$ , guessing which message was encrypted.
   - To make sure this game is not trivially winnable, in this step, the adversary is NOT allowed to query the challenge ciphertext $c$ to the decryption oracle. It can query anything else though, as long as it is not $c$ . We’ll denote this slight limitation as $A^{\text{Dec}_{k}^{*}}$ .
6. The experiment outputs 1 if $b^{'} = b$ , 0 otherwise.

We say $Π$ has indistinguishable encryptions under a chosen-ciphertext attack (CCA secure) if $\forall$ PPT adversaries $A$ , $\exists$ a negligible function $\text{negl}$ such that

$\Pr [\text{PrivK}_{A, Π}^{\text{cca}} (n) = 1] \leq \frac{1}{2} + \text{negl} (n)$

This is a really strong notion of security!
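To see why this is stronger than CPA security, here is a hedged sketch of the game played against a scheme of the form $⟨ r, F_{k} (r) \oplus m ⟩$ , with HMAC-SHA256 assumed as the pseudorandom function $F$ . All names (`prf`, `enc`, `dec`, `cca_experiment`, `choose`, `guess`) and the fixed 16-byte message length are illustrative assumptions. The adversary flips one bit of the challenge, which yields a different ciphertext that it is allowed to decrypt.

```python
import hashlib
import hmac
import secrets

BLOCK = 16  # fixed message length in bytes (assumed for this toy)


def prf(key, r):
    """HMAC-SHA256 truncated to BLOCK bytes, standing in for a PRF F_k."""
    return hmac.new(key, r, hashlib.sha256).digest()[:BLOCK]


def xor(a, b):
    return bytes(x ^ y for x, y in zip(a, b))


def enc(key, m):
    r = secrets.token_bytes(BLOCK)
    return r, xor(prf(key, r), m)


def dec(key, c):
    r, s = c
    return xor(prf(key, r), s)


def cca_experiment(choose, guess):
    """Run the CCA game once; return 1 if the adversary guesses b correctly."""
    key = secrets.token_bytes(32)
    challenge = None

    def enc_oracle(m):
        return enc(key, m)

    def dec_oracle(c):
        if c == challenge:
            raise ValueError("the challenge ciphertext may not be queried")
        return dec(key, c)

    m0, m1 = choose(enc_oracle, dec_oracle)
    b = secrets.randbelow(2)
    challenge = enc(key, (m0, m1)[b])
    return 1 if guess(challenge, enc_oracle, dec_oracle) == b else 0


def choose(enc_oracle, dec_oracle):
    # m0 is all zero bytes, m1 is all 0xFF bytes.
    return bytes(BLOCK), bytes([0xFF]) * BLOCK


def guess(c, enc_oracle, dec_oracle):
    r, s = c
    flipped = s[:-1] + bytes([s[-1] ^ 1])  # a different ciphertext, so it is allowed
    m = dec_oracle((r, flipped))           # equals m_b with its last bit flipped
    return 0 if m[0] == 0 else 1


if __name__ == "__main__":
    print(sum(cca_experiment(choose, guess) for _ in range(100)))
```

This adversary wins every run, so the scheme is not CCA secure even though the same construction is CPA secure when $F$ is a pseudorandom function.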

Authenticated encryption schemes satisfy this type of security, by using MACs to tag the ciphertext for authentication. Then, $\text{Dec}$ throws an error (denoted $⊥$ ) if it detects an invalid ciphertext. By doing this, the adversary can only decrypt the ciphertexts it has already seen, and can’t make its own!

Unforgeability for Encryption

To define an authenticated encryption scheme, we also need to define an additional experiment $\text{EncForge}_{A, Π} (n)$ :

1. Run $\text{Gen} (1^{n})$ to obtain key $k$ .
2. The adversary $A$ is given input $1^{n}$ , and access to the encryption oracle $\text{Enc}_{k} (\cdot)$ . The adversary outputs a ciphertext $c$ .
3. Let $m = \text{Dec}_{k} (c)$ , and let $Q$ denote the set of all queries that $A$ submitted to the oracle. The output of the experiment is 1 if and only if $m \neq ⊥$ and $m \notin Q$ .

We say a private-key encryption scheme is unforgeable if $\forall$ PPT adversaries $A$ , there is a negligible function $\text{negl}$ such that

$\Pr [\text{EncForge}_{A, Π} (n) = 1] \leq \text{negl} (n)$

Authenticated Encryption Schemes

We say a private-key encryption scheme is an authenticated encryption scheme if it is CCA-secure and unforgeable.

This is essentially a secure way of combining CPA-security and MACs!

Here are some generic constructions.

Encrypt and Authenticate

Encrypt-and-Authenticate: We run encryption and message authentication independently, in parallel.

$c \leftarrow \text{Enc}_{k_{E}} (m)$ , $t \leftarrow \text{Mac}_{k_{M}} (m)$ , output $⟨ c, t ⟩$

Do NOT use the same key for both schemes.

On its own, $c$ preserves secrecy, and $t$ preserves integrity, but combining them does not guarantee both! This is because the tag $t$ is computed on the message itself and can leak info on $m$ , which will break the secrecy of the message.

In fact, if the MAC is deterministic (which it often is in practice), then CPA-security does not hold for the combined $⟨ c, t ⟩$ : the same message always produces the same tag, so an adversary can tell when a message has been sent twice.
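The leak from a deterministic tag can be seen in a small sketch. It assumes a toy randomized encryption built from HMAC-SHA256 as a PRF and HMAC-SHA256 again as the deterministic MAC; the function names are hypothetical and messages are limited to 32 bytes.

```python
import hashlib
import hmac
import secrets


def enc(k_e, m):
    """Toy randomized encryption <r, F_k(r) XOR m> for messages of at most 32 bytes."""
    if len(m) > 32:
        raise ValueError("toy scheme only handles messages up to 32 bytes")
    r = secrets.token_bytes(16)
    pad = hmac.new(k_e, r, hashlib.sha256).digest()
    return r, bytes(x ^ y for x, y in zip(pad, m))


def mac(k_m, m):
    """Deterministic MAC: HMAC-SHA256 over the plaintext."""
    return hmac.new(k_m, m, hashlib.sha256).digest()


def encrypt_and_authenticate(k_e, k_m, m):
    # Encryption and tagging both run on the plaintext, independently.
    return enc(k_e, m), mac(k_m, m)


if __name__ == "__main__":
    k_e, k_m = secrets.token_bytes(32), secrets.token_bytes(32)
    c1, t1 = encrypt_and_authenticate(k_e, k_m, b"attack at dawn")
    c2, t2 = encrypt_and_authenticate(k_e, k_m, b"attack at dawn")
    c3, t3 = encrypt_and_authenticate(k_e, k_m, b"attack at dusk")
    print(c1 != c2, t1 == t2, t1 != t3)
```

The ciphertexts differ because of the fresh randomness $r$ , but the repeated tag reveals that the first two messages are equal, which is enough to win the CPA game.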

Authenticate then Encrypt

Authenticate-then-Encrypt: We first generate a tag, then compute our ciphertext by encrypting the message together with the tag.

$t \leftarrow \text{Mac}_{k_{M}} (m)$ , $c \leftarrow \text{Enc}_{k_{E}} (m ∣∣ t)$ , output $c$

Now, because the tag is encrypted along with the message, it is no longer visible and cannot leak info on $m$ . However, this combination is not guaranteed to provide CCA-security in general!

Encrypt then Authenticate

Encrypt-then-Authenticate: We first encrypt the message, and then compute a tag on the result.

$c \leftarrow \text{Enc}_{k_{E}} (m)$ , $t \leftarrow \text{Mac}_{k_{M}} (c)$ , output $⟨ c, t ⟩$

This is secure as long as the encryption scheme is CPA-secure and the MAC is strongly secure! Because only the encryption scheme sees the message, the tag cannot leak anything about $m$ , so secrecy is preserved, and the MAC on the ciphertext provides integrity without compromising on secrecy!

This also gives us CCA-security, as it renders the decryption oracle useless: it will return $⊥$ unless it is given a ciphertext the adversary already obtained from the encryption oracle (and we’re not allowed to query the challenge ciphertext $⟨ c, t ⟩$ ).
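A minimal sketch of this construction, again assuming a toy encryption scheme built from HMAC-SHA256 as a PRF and HMAC-SHA256 as the MAC. The function names are hypothetical, messages are limited to 32 bytes, and `None` plays the role of $⊥$ .

```python
import hashlib
import hmac
import secrets

NONCE_LEN = 16  # length of the random value r (assumed for this toy)


def _pad(k_e, r):
    """PRF output used as a one-time pad for this ciphertext."""
    return hmac.new(k_e, r, hashlib.sha256).digest()


def encrypt_then_authenticate(k_e, k_m, m):
    """Encrypt m under k_e, then tag the ciphertext (not the message) under k_m."""
    if len(m) > 32:
        raise ValueError("toy scheme only handles messages up to 32 bytes")
    r = secrets.token_bytes(NONCE_LEN)
    c = r + bytes(x ^ y for x, y in zip(_pad(k_e, r), m))
    t = hmac.new(k_m, c, hashlib.sha256).digest()
    return c, t


def verify_then_decrypt(k_e, k_m, c, t):
    """Return the message, or None (standing in for the error symbol) if the tag is invalid."""
    expected = hmac.new(k_m, c, hashlib.sha256).digest()
    if not hmac.compare_digest(t, expected):
        return None  # reject before touching the ciphertext
    r, s = c[:NONCE_LEN], c[NONCE_LEN:]
    return bytes(x ^ y for x, y in zip(_pad(k_e, r), s))


if __name__ == "__main__":
    k_e, k_m = secrets.token_bytes(32), secrets.token_bytes(32)
    c, t = encrypt_then_authenticate(k_e, k_m, b"hello")
    print(verify_then_decrypt(k_e, k_m, c, t))
    tampered = c[:-1] + bytes([c[-1] ^ 1])
    print(verify_then_decrypt(k_e, k_m, tampered, t))
```

Because the tag is checked first, a modified ciphertext is rejected without ever being decrypted, so the bit-flipping trick that works against unauthenticated encryption gives the adversary nothing.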

Authenticated encryption of this kind is what protects traffic on the internet after key establishment!
