Number Theory

Groups

A group is a set $G$ along with a binary operation $\circ$ for which the following conditions hold:

Closure: For all $g, h \in G$ , $g \circ h \in G$ .
Identity: There exists an identity $e \in G$ such that $\forall g \in G$ , $e \circ g = g = g \circ e$ .
Inverse: For all $g \in G$ , there exists an element $h \in G$ called the inverse such that $g \circ h = e = h \circ g$ .
Associativity: For all $g_{1}, g_{2}, g_{3} \in G$ , $g_{1} \circ (g_{2} \circ g_{3}) = (g_{1} \circ g_{2}) \circ g_{3}$

When $G$ has a finite number of elements, we say $G$ is finite and let $∣ G ∣$ denote the order (size) of the group.

We say a group $G$ with operation $\circ$ is abelian if it has the property of commutativity: $\forall g, h \in G, g \circ h = h \circ g$ .

For the purposes of this class, we will only deal with finite, abelian groups.

Example Group: Modular Arithmetic (Addition)

We say that two numbers $a, b$ are congruent modulo $p$ , denoted
$a \equiv b \bmod p$
if $p$ divides $(a - b)$ , or in other words, $p ∣ (a - b)$ .

For example, all of the following are true.
$2 \equiv 15 \bmod 13$
$28 \equiv 15 \bmod 13$
$-11 \equiv 15 \bmod 13$

Addition works in modular space as normal. You perform regular addition, and then take modulo $p$ . For example,
$8 + 10 mod 13 \equiv 18 mod 13 \equiv 5 mod 13$
Addition has the following properties. Consider the set of numbers $Z_{p} = {0, 1, \dots p - 1}$ :

Identity: $\forall a \in Z_{p}$ , adding 0 to it yields the same number

Additive Inverse: $\forall a \in Z_{p}$ , $\exists b \in Z_{p}$ such that $a + b \bmod p = 0$ , given as $b = (p - a) \bmod p$ (so $a = 0$ is its own inverse).

Closure: Any addition operations will yield a result that is still inside of $Z_{p}$ .

Associativity: You can take modulo at any point in the operation. For example, $((a + b) mod p + c) mod p = (a + (b + c) mod p) mod p$

The set $Z_{p}$ with our addition modulo operator defines a group!

Modular Arithmetic under Multiplication, Prime

In the context of cryptography, we are interested in multiplicative groups over the integers, as this introduces computational problems believed to be hard to solve. One example of this is the multiplication modulo $p$ group.

For now, we will only consider prime groups, but we will later generalize to composite groups.

Let $Z_{p}^{*} = \{1, \dots, p - 1\}$ , with the multiplication mod $p$ operation. For $Z_{p}^{*}$ to be a group it must be true that $p$ is prime. Without a prime $p$ , some elements won’t have a multiplicative inverse! For example, 2 has no inverse modulo 4.

Multiplicative Inverses

We argue below that $Z_{p}^{*}$ satisfies the inverse property (the rest are trivial to prove). In other words, $\forall a \in Z_{p}^{*}$ , there exists a $b$ such that $a * b mod p = 1 mod p$ .

Example: Brute Force Inverse

Suppose we want to find the multiplicative inverse of $9 mod 11$ .

One way to do this is by brute force: iteratively try all 10 numbers in $Z_{11}^{*}$ until we find our inverse. We will consider this brute-force search to be exponential time! This is because running time is measured in the length of the input: a modulus written with $n$ bits can be as large as $2^{n}$ , so trying every element takes on the order of $2^{n}$ steps.
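As a minimal sketch of the brute-force search just described (the function name `brute_force_inverse` is ours, chosen for illustration), we can simply try every candidate in turn:

```python
def brute_force_inverse(a, p):
    """Try every b in {1, ..., p-1} until a * b = 1 (mod p)."""
    for b in range(1, p):
        if (a * b) % p == 1:
            return b
    return None  # no inverse exists (only possible when gcd(a, p) != 1)


print(brute_force_inverse(9, 11))  # 5, since 9 * 5 = 45 = 4 * 11 + 1
```

The loop runs up to $p - 1$ times, which is why this approach does not scale to cryptographic sizes.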

However, there’s a faster way to find the multiplicative inverse, through the Euclidean Algorithm! This algorithm is based off the following assertion:

Theorem: Euclidean Algorithm

Let $a, p$ be positive integers. Then, there exist integers $X, Y$ such that $X a + Y p = \gcd (a, p)$ .

The Euclidean algorithm can be used to compute $\gcd (a, p)$ in polynomial time. We can then extend it (the Extended Euclidean Algorithm) to also compute $X, Y$ in polynomial time.

To compute $\gcd (a, b)$ , this algorithm needs at most about $2 \log_{2} (b)$ division steps.

If we can use the Euclidean Algorithm to find $X, Y$ , then we can find a multiplicative inverse. This is because for $p$ prime and $a \in Z_{p}^{*}$ , $\gcd (a, p) = 1$ , so

$X a + Y p = \gcd (a, p) = 1$

We can rearrange our terms to find that $Y p = (1 - X a)$ , telling us that $p$ divides $(1 - X a)$ . Because of this, we know that

$X a \equiv 1 \bmod p$

In other words, $X$ is our multiplicative inverse for $a$ !

Example: Inverses with the Euclidean Algorithm

Suppose we want to find the multiplicative inverse of $a = 9$ for $p = 23$ . In other words, we want to find
$9 X + 23 Y = g c d (9, 23) = 1$
We can do this by iteratively dividing as follows. Let $b = 23, a = 9$ . Every iteration, divide $b$ by $a$ with remainder, to get
$b = k * a + c$
Then, let $b = a, a = c$ and repeat.
$23 = 2 * 9 + 5$
$9 = 1 * 5 + 4$
$5 = 1 * 4 + 1$
$4 = 4 * 1 + 0$
Once we find a 0 for $c$ , we can work our way back up to find our inverse. If we rearrange every expression (ignoring the last) to be in terms of $c$ , we’ll find that
$23 = 2 * 9 + 5 ⟹ 5 = 23 - 2 * 9$
$9 = 1 * 5 + 4 ⟹ 4 = 9 - 1 * 5$
$5 = 1 * 4 + 1 ⟹ 1 = 5 - 1 * 4$
So, starting from the bottom, we can plug each equation back into the previous to get an expression in terms of 9, 23.
$1 = 5 - 1 * 4$
$1 = 5 - 1 * (9 - 1 * 5)$
$1 = (23 - 2 * 9) - 1 * (9 - 1 * (23 - 2 * 9))$
$1 = 2 * 23 - 5 * 9$
Thus, we find $X = - 5$ , so our multiplicative inverse is $- 5 \equiv 18 \bmod 23$ (check: $9 * 18 = 162 = 7 * 23 + 1$ ).
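The back-substitution above can be folded into the forward pass by tracking the coefficients as we divide. The following is a minimal sketch of that idea; the names `extended_gcd` and `mod_inverse` are ours, and the iterative bookkeeping is one common way to organize the computation rather than the only one:

```python
def extended_gcd(a, b):
    """Return (g, X, Y) with X*a + Y*b == g == gcd(a, b)."""
    # Invariant: old_r == old_x*a + old_y*b  and  r == x*a + y*b.
    old_r, r = a, b
    old_x, x = 1, 0
    old_y, y = 0, 1
    while r != 0:
        k = old_r // r                      # quotient of this division step
        old_r, r = r, old_r - k * r         # remainder becomes the new divisor
        old_x, x = x, old_x - k * x
        old_y, y = y, old_y - k * y
    return old_r, old_x, old_y


def mod_inverse(a, p):
    """Multiplicative inverse of a modulo p, if gcd(a, p) == 1."""
    g, X, _ = extended_gcd(a, p)
    if g != 1:
        raise ValueError("a has no inverse modulo p")
    return X % p                            # map X into {0, ..., p-1}


print(extended_gcd(9, 23))  # (1, -5, 2): -5 * 9 + 2 * 23 = 1
print(mod_inverse(9, 23))   # 18
```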

Polynomial Time of Multiplicative Inverses

Note that when we use the Euclidean Algorithm, our “b” value is at least halved every two rounds. So, the algorithm finishes within about $2 \log_{2} (b)$ rounds. Since $\log_{2} (b)$ is the number of bits of $b$ , our time complexity is polynomial in the length of the input!

Thus, we can not only find multiplicative inverses, but find them efficiently.

Modular Exponentiation

What about Modular Exponentiation? Given $a, m, N$ , can we efficiently compute $a^{m} mod N$ ?

This is the result of multiplying $a$ by itself $m$ times, and taking the modulus of the result.

One way we could compute this is as follows:

def ModExp(a, m, N):
    # Naive approach: multiply by a, m times, reducing mod N at every step.
    temp = 1
    for i in range(1, m + 1):
        temp = temp * a % N
    return temp

But this has runtime $O (m)$ , which is exponential in the bit length of $m$ ! For an efficient algorithm, we need the number of multiplications to be on the logarithmic order in $m$ .

We can, in fact, achieve an efficient algorithm with repeated squaring. Let $m = m_{n - 1} m_{n - 2} \dots m_{1} m_{0}$ be the bits of $m$ .

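def ModExp(a, m, N):
    s = a % N                         # s holds a^(2^i) mod N
    temp = 1
    for i in range(m.bit_length()):   # n = number of bits of m
        if (m >> i) & 1:              # m_i, the i-th bit of m
            temp = temp * s % N
        s = s * s % N                 # square (note: ^ is XOR in Python)
    return temp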

This uses only $O (\log_{2} (m))$ multiplications! So, we can also perform Modular Exponentiation efficiently.

In the context of a prime $p$ , we can also use Fermat’s Little Theorem to speed up our computation.

Theorem: Fermat's Little Theorem

For prime $p$ , integer $a$ , $a^{p} \equiv a mod p$ .

Corollary

For prime $p$ and $a$ such that $g c d (a, p) = 1$ , $a^{p - 1} \equiv 1 mod p$ .

This theorem can be generalized to any finite group!

Theorem: Generalized Fermat's

Let $G$ be a finite group with $m = ∣ G ∣$ . Then, for any element $g \in G$ , applying the group operation to it $m$ times will yield the identity (written 1).
$g^{m} = 1$

Recall that for our group $Z_{p}^{*}$ , we have $m = p - 1$ elements. So, this is making the same assertion as our previous corollary.
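To see how Fermat's Little Theorem can speed up exponentiation modulo a prime, here is a small sketch using Python's built-in three-argument `pow`. The helper names `fermat_holds` and `reduced_pow` are ours, and the trick assumes $\gcd (a, p) = 1$ :

```python
def fermat_holds(p):
    """Check a^(p-1) = 1 (mod p) for every a in {1, ..., p-1}."""
    return all(pow(a, p - 1, p) == 1 for a in range(1, p))


def reduced_pow(a, m, p):
    """Compute a^m mod p for prime p and gcd(a, p) = 1,
    reducing the exponent modulo p - 1 first."""
    return pow(a, m % (p - 1), p)


print(fermat_holds(13))                 # True: 13 is prime
print(fermat_holds(15))                 # False: 15 is composite
m = 10**18 + 9
print(reduced_pow(7, m, 13) == pow(7, m, 13))  # True
```

Because $a^{p - 1} \equiv 1 \bmod p$ , every full block of $p - 1$ in the exponent contributes a factor of 1 and can be dropped.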

Modular Arithmetic under Multiplication, N Composite

Using primes to construct groups is very limiting, as there are only so many primes we could use. What about multiplicative groups modulo $N$ , where $N$ is composite? Can we create such groups?

For numbers $\{1, \dots, N - 1\}$ , only numbers $a$ such that $\gcd (a, N) = 1$ have a multiplicative inverse, by the Extended Euclidean Algorithm. Because all of the others do not have a multiplicative inverse, to obtain a valid group, we must exclude these values from our group.

So, we will define our group $Z_{N}^{*}$ as follows:

$Z_{N}^{*} = \{a \in \{1, \dots, N - 1\} : \gcd (a, N) = 1\}$

Then, $Z_{N}^{*}$ is an abelian, multiplicative group.

In practice, we will often create composite groups where $N = p \cdot q$ , for distinct primes $p, q$ . We can create this group as follows:

For $p$ , remove numbers $1 p, 2 p, 3 p, 4 p, \dots, q * p$ .
For $q$ , remove numbers $1 q, 2 q, 3 q, 4 q, \dots, p * q$ .

Counting from the $N$ numbers $\{1, \dots, N\}$ , this gives us the order of the group, denoted $ϕ (N)$ (the Euler totient function):

$ϕ (N) = N - p - q + 1 = (p - 1) (q - 1)$

We add 1 as we’re double counting the $q * p$ element in our removal.

Generalizing this, for $N = \prod_{i} p_{i}^{e_{i}}$ , where the $p_{i}$ are distinct primes and $e_{i} \geq 1$ ,

$ϕ (N) = \prod_{i} p_{i}^{e_{i} - 1} (p_{i} - 1)$

This gives us a very easy way to create large groups quickly! We take prime factors, and multiply them together to get a large group!

However, given only $N$ , recovering its prime factorization (and with it $ϕ (N)$ ) is believed to be a very difficult problem!

By the generalized Fermat theorem applied to $Z_{N}^{*}$ , which has order $ϕ (N)$ , we know that for any $a$ such that $\gcd (a, N) = 1$ ,

$a^{ϕ (N)} \equiv 1 \bmod N$

This can be used as a “quick / easy” verification that someone has found $ϕ (N)$ !

A corollary of this theorem is that $g^{x} = g^{x \bmod ϕ (N)}$ in $Z_{N}^{*}$ (since every $ϕ (N)$ steps, we wrap around). This makes things easy if we can find the prime factorization of $N$ .
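The counting argument and the corollary can be checked numerically on a small modulus. This is a sketch with names of our own choosing (`phi_bruteforce`, `phi_from_factors`), using the toy value $N = 3 \cdot 11 = 33$ as an assumption:

```python
from math import gcd


def phi_bruteforce(N):
    """Count the elements of Z_N^* directly (for N >= 2)."""
    return sum(1 for a in range(1, N) if gcd(a, N) == 1)


def phi_from_factors(factors):
    """Totient from a factorization given as {prime: exponent}."""
    result = 1
    for p, e in factors.items():
        result *= p ** (e - 1) * (p - 1)
    return result


N = 3 * 11
print(phi_bruteforce(N), phi_from_factors({3: 1, 11: 1}))  # 20 20

# a^phi(N) = 1 (mod N) for every a coprime to N
print(all(pow(a, 20, N) == 1 for a in range(1, N) if gcd(a, N) == 1))  # True

# exponents can be reduced modulo phi(N)
print(pow(2, 1234, N) == pow(2, 1234 % 20, N))  # True
```

Note that `phi_from_factors` needs the factorization as input, which is exactly the information that is hard to obtain from $N$ alone.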

Cyclic Groups

For a finite group $G$ of order $m$ and $g \in G$ , consider

$⟨ g ⟩ = \{g^{0}, g^{1}, \dots, g^{m - 1}\}$

Here, $⟨ g ⟩$ always forms a cyclic subgroup of $G$ . However, as there may be repeats, $⟨ g ⟩$ may be a subgroup with a smaller order than $m$ .

If the order of $⟨ g ⟩$ is equal to the order of $G$ , we say that $G$ is a cyclic group and $g$ is a generator of $G$ . In other words, by repeatedly applying the group operation to $g$ (for $Z_{p}^{*}$ , modular exponentiation of $g$ ), we can cycle through all values in $G$ .

Example: Cyclic Group + Generator

Define $Z_{13}^{*}$ . Then, 2 is a generator of $Z_{13}^{*}$ .

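| Input | Output | Input | Output |
| --- | --- | --- | --- |
| $2^{0}$ | 1 | $2^{6}$ | 12 |
| $2^{1}$ | 2 | $2^{7}$ | 11 |
| $2^{2}$ | 4 | $2^{8}$ | 9 |
| $2^{3}$ | 8 | $2^{9}$ | 5 |
| $2^{4}$ | 3 | $2^{10}$ | 10 |
| $2^{5}$ | 6 | $2^{11}$ | 7 |
| $2^{12}$ | 1 | | |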

Let $G$ be a finite group and $g \in G$ . The order of $g$ is the smallest positive integer $i$ such that $g^{i} = 1$ .

Proposition: Generators

1. Let $G$ be a finite group, $g \in G$ with order $i$ . Then, for any integer $x$ , we have $g^{x} = g^{x \bmod i}$ .
2. Let $G$ be a finite group and $g \in G$ with order $i$ . Then, $g^{x} = g^{y}$ if and only if $x \equiv y \bmod i$ .
3. Let $G$ be a finite group of order $m$ , and $g \in G$ with order $i$ . Then, $i ∣ m$ .

Proposition (3) is particularly important! This is because if $m$ is prime, then the only element orders we can get are 1 and $m$ ! Furthermore, because $i = 1$ is only possible with the identity element, this means all other elements are generators of $G$ .
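To make the notion of order concrete, the sketch below computes the order of every element of $Z_{13}^{*}$ by repeated multiplication. The helper `element_order` is our own name; the loop is a direct reading of the definition, not an efficient method:

```python
def element_order(g, p):
    """Smallest i >= 1 with g^i = 1 (mod p), for g in Z_p^*."""
    i, x = 1, g % p
    while x != 1:
        x = x * g % p
        i += 1
    return i


p = 13
orders = {g: element_order(g, p) for g in range(1, p)}
print(orders)

generators = [g for g, i in orders.items() if i == p - 1]
print(generators)  # [2, 6, 7, 11]

# Every order divides the group order m = 12.
print(all((p - 1) % i == 0 for i in orders.values()))  # True
```

Since 12 is not prime, most elements here are not generators, which is what motivates looking for groups of prime order.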

We want to find these prime order groups, as they give us a basis for cryptographic problems.

Theorem

If $p$ is prime, then $Z_{p}^{*}$ is a cyclic group of order $p - 1$ .

Using the above theorem, we can construct a subgroup of $Z_{p}^{*}$ that is of prime order!

Prime Order Cyclic Groups

Let $Z_{p}^{*}$ , where $p$ is a strong prime: $p = 2 q + 1$ , where $q$ is also prime. By the above theorem, $Z_{p}^{*}$ is a cyclic group of order $p - 1 = 2 q$ .

We will cleverly take 1/2 of the elements of $Z_{p}^{*}$ , to get a group of order $q$ , which is prime!

Because $Z_{p}^{*}$ is cyclic, it has a generator $g$ . Choose this generator.

By definition of a generator, we can get every element in this group using it! So, let’s take every even power of $g$ , giving us a subgroup of order $q$ . This gives us a prime order group!

We take even powers because they are closed under the group operation: multiplying two even powers of $g$ , or raising an even power to any exponent, still gives an even power! Equivalently, this subgroup is $⟨ g^{2} ⟩$ .
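As a toy illustration of this construction, the sketch below uses $p = 23 = 2 \cdot 11 + 1$ with generator $g = 5$ of $Z_{23}^{*}$ . These concrete values and the name `prime_order_subgroup` are our assumptions for the example:

```python
def prime_order_subgroup(p, g):
    """Even powers of a generator g of Z_p^*, where p = 2q + 1."""
    q = (p - 1) // 2
    h = g * g % p                       # h = g^2 generates the even powers
    return sorted({pow(h, k, p) for k in range(q)})


p, q, g = 23, 11, 5
sub = prime_order_subgroup(p, g)
print(sub)       # [1, 2, 3, 4, 6, 8, 9, 12, 13, 16, 18]
print(len(sub))  # 11, which is q

# Closure: the product of two elements stays inside the subgroup.
print(all(a * b % p in sub for a in sub for b in sub))  # True
```

The elements that come out are exactly the squares modulo 23, which matches the description of taking half of the elements of $Z_{p}^{*}$ .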

Cryptographic Problems on Cyclic Groups

Cyclic groups form the basis of many cryptographic problems. In particular, there are 3 main problems on cyclic groups, each building on the last.

Discrete Logarithm

We define the Discrete-Log Experiment $\mathsf{DLog}_{A, \mathcal{G}} (n)$ as follows:

1. Run the group-generation algorithm $\mathcal{G} (1^{n})$ to get $(G, q, g)$ , where $G$ is a cyclic group of order $q$ with generator $g$ .
2. Choose an $h \in G$ uniformly.
3. Adversary $A$ is given $G, q, g, h$ , and outputs a guess $x \in Z_{q}$ .
4. The output of the experiment is 1 if $g^{x} = h$ and 0 otherwise.

As $q$ is typically on the magnitude of $2^{2048}$ or $2^{1024}$ , it would be extremely inefficient to do a brute force attack.

We say the Discrete-Logarithm Problem is hard relative to $\mathcal{G}$ if for all PPT algorithms $A$ , there exists a negligible function $\mathsf{negl}$ such that

$\Pr [\mathsf{DLog}_{A, \mathcal{G}} (n) = 1] \leq \mathsf{negl} (n)$
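For intuition about why brute force is hopeless at real sizes, here is a sketch of the naive attack on a toy group. The function name `brute_force_dlog` and the use of $Z_{13}^{*}$ with $g = 2$ are our choices for illustration:

```python
def brute_force_dlog(g, h, p):
    """Return x with g^x = h (mod p) by trying x = 0, 1, 2, ...
    Returns None if h is not a power of g."""
    x, cur = 0, 1
    while cur != h % p:
        cur = cur * g % p
        x += 1
        if cur == 1:        # wrapped around without meeting h
            return None
    return x


print(brute_force_dlog(2, 11, 13))  # 7, since 2^7 = 128 = 11 (mod 13)
print(brute_force_dlog(3, 2, 13))   # None: 2 is not a power of 3 mod 13
```

The loop may run for up to $q$ iterations, so with $q$ around $2^{1024}$ it would never finish in practice.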

This is the hardest problem on cyclic groups! All of the following problems are based off of this.

Computational Diffie-Hellman

We define the Computational Diffie-Hellman (CDH) problem as follows.

Given $(G, q, g)$ and uniform $h_{1} = g^{x_{1}}$ , $h_{2} = g^{x_{2}}$ , compute $g^{x_{1} \cdot x_{2}}$ .

Note that $h_{1} \cdot h_{2} = g^{x_{1} + x_{2}}$ , which won’t solve our problem.

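This problem is based on the Discrete Logarithm problem: if we could solve Discrete Log, we could recover $x_{1}, x_{2}$ in polynomial time and compute our result. So, CDH is no harder than Discrete Log.

The converse is not known to hold, so the hardness of CDH is a separate (stronger) assumption. It is believed to be hard in the same groups where Discrete Log is believed to be hard.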

Decisional Diffie-Hellman

We define the Decisional Diffie-Hellman (DDH) problem as follows.

Define a distinguisher $D$ , who gets the group $G$ , order $q$ , generator $g$ , and one of the following:
- Ideal World: $g^{x}, g^{y}, g^{z}$ , 3 independent group elements with no correlation to each other.
- Real World: $g^{x}, g^{y}, g^{x y}$ , 3 group elements where the 3rd is related to the first two through the CDH problem.
The distinguisher gets one of the worlds, and has to guess the world that they’re in.

We say that the DDH problem is hard if every PPT distinguisher $D$ can guess which world it is in with only negligible advantage:

$∣ \Pr [D (G, q, g, g^{x}, g^{y}, g^{z}) = 1] - \Pr [D (G, q, g, g^{x}, g^{y}, g^{x y}) = 1] ∣ \leq \mathsf{negl} (n)$

Note that DDH is not hard over $Z_{p}^{*}$ for prime $p$ . This is because for $a \in Z_{p}^{*}$ , we can compute the Legendre symbol

$\left( \frac{a}{p} \right) = a^{(p - 1) / 2} \bmod p$

which is 1 if $a$ is a perfect square in the group (if $a = b^{2} \bmod p$ , then $(b^{2})^{(p - 1) /2} \equiv b^{p - 1} \equiv 1 \bmod p$ ), and -1 if $a$ is not. This is a single modular exponentiation, so we can compute it efficiently and use it to distinguish the ideal and real world.

Attack

Note that if we compute the Legendre symbol on the 3 group elements we’re given, then:

For $g^{x}, g^{y}, g^{z}$ , we can get any of the 8 patterns by computing the Legendre symbol on them.

For $g^{x}, g^{y}, g^{x y}$ , there are some patterns we cannot get. If $g^{x y}$ ’s symbol is 1, then at least one of $g^{x}$ or $g^{y}$ must have a symbol of 1. If $g^{x y}$ ’s symbol is -1, then both $g^{x}$ and $g^{y}$ must have a symbol of -1.

If we compute these patterns and match one of the patterns that is possible in the $g^{x}, g^{y}, g^{x y}$ case, we return that we’re in the real world. This gives us a distinguishing algorithm with constant advantage: the real world always matches, while the ideal world matches only about half of the time.
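The attack can be simulated on a toy group. The sketch below assumes $p = 23$ with generator $g = 5$ of $Z_{23}^{*}$ ; the names `legendre`, `looks_real`, and `estimate` are ours, and the fixed seed is only there to make the run repeatable:

```python
import random


def legendre(a, p):
    """Legendre symbol of a in Z_p^*: 1 for squares, -1 otherwise."""
    s = pow(a, (p - 1) // 2, p)
    return -1 if s == p - 1 else s


def looks_real(p, gx, gy, gz):
    """True if the symbol pattern is possible for (g^x, g^y, g^xy)."""
    lx, ly, lz = legendre(gx, p), legendre(gy, p), legendre(gz, p)
    expected = 1 if (lx == 1 or ly == 1) else -1
    return lz == expected


def estimate(p, g, trials, real, seed=0):
    """Fraction of samples on which the distinguisher says 'real'."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(trials):
        x, y, z = (rng.randrange(p - 1) for _ in range(3))
        third = pow(g, x * y, p) if real else pow(g, z, p)
        hits += looks_real(p, pow(g, x, p), pow(g, y, p), third)
    return hits / trials


print(estimate(23, 5, 2000, real=True))   # 1.0
print(estimate(23, 5, 2000, real=False))  # close to 0.5
```

The gap between the two fractions is the distinguisher's advantage, and it does not shrink as $p$ grows.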

Elliptic Curves

Here, we will define Elliptic Curve groups. This is another group that can be used for Diffie-Hellman, and is the go-to method in cryptography right now.

Points on the Elliptic Curve

A finite field is a finite set of elements with two operations, addition and multiplication: the elements form an abelian group under addition, and the nonzero elements form an abelian group under multiplication.

In particular, the identity element for addition (0) is not required to have a multiplicative inverse.

With fields, we can now define polynomial equations over the elements!

Let $Z_{p}$ be a finite field for prime $p \geq 5$ . Now consider equation $E$ in variables $x, y$ of the form:

$y^{2} = x^{3} + A x + B \bmod p$

Where $A, B \in Z_{p}$ are constants such that $4 A^{3} + 27 B^{2} \neq 0 \bmod p$ (ensuring the cubic polynomial has no repeated roots).

Define $E (Z_{p})$ as the set of pairs $(x, y)$ satisfying the above equation as well as a special value of $O$ .

$E (Z_{p}) = \{(x, y) : x, y \in Z_{p} \land y^{2} = x^{3} + A x + B \bmod p\} \cup \{O\}$

These elements are called the points on the Elliptic Curve $E$ , where the special value $O$ is called the point at infinity.

Example: Finding Elliptic Curve Points

To find the points on an Elliptic Curve:

1. First, find the quadratic residues (squares) over $Z_{p}$ . These will be our possible $y^{2}$ values, along with the $y$ values that produce them.
2. Now, take $y^{2} = f (x) = x^{3} + A x + B$ . Plug in all values for $x$ .
    - Every value of $x$ such that $f (x)$ is a non-zero quadratic residue yields 2 points on our curve.
    - Every value of $x$ such that $f (x)$ is not a quadratic residue yields no points on the curve.
    - Every value of $x$ such that $f (x) \equiv 0 \bmod p$ gives 1 point on the curve.
3. Given an $x$ yielding a quadratic residue, we find $y$ by matching $f (x)$ with the $y^{2}$ values found in (1).

Consider $y^{2} = x^{3} + 3 x + 3 mod 7$ . First, we find our quadratic residues as ${0, 1, 2, 4}$ .

Take $f (0) = 3 mod 7$ . This is not a quadratic residue.

Take $f (1) = 0 mod 7$ . This gives us 1 point on the curve $(1, 0)$ .

Take $f (2) = 3 mod 7$ . This is not a quadratic residue.

Take $f (3) = 4 mod 7$ . This is a quadratic residue with roots 2,5, giving us points $(3, 2), (3, 5)$ .
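The same procedure can be carried out for every $x$ in $Z_{7}$ at once. The following sketch (with a function name, `curve_points`, of our choosing) follows the steps above: tabulate the squares first, then look up $f (x)$ for each $x$ :

```python
def curve_points(A, B, p):
    """All affine points (x, y) on y^2 = x^3 + A*x + B over Z_p."""
    # Step 1: map each square y^2 mod p to the y values producing it.
    roots = {}
    for y in range(p):
        roots.setdefault(y * y % p, []).append(y)
    # Steps 2-3: evaluate f(x) and look up its square roots, if any.
    points = []
    for x in range(p):
        fx = (x**3 + A * x + B) % p
        points.extend((x, y) for y in roots.get(fx, []))
    return points


pts = curve_points(3, 3, 7)
print(pts)           # [(1, 0), (3, 2), (3, 5), (4, 3), (4, 4)]
print(len(pts) + 1)  # 6 points in E(Z_7), counting the point at infinity O
```

Continuing past $x = 3$ , only $x = 4$ contributes further points, since $f (4) = 2$ is a quadratic residue with roots 3 and 4.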

Elliptic Curve Groups

For any elliptic curve, we will guarantee the property that every line intersecting $E (Z_{p})$ in 2 points, intersects it in exactly 3 points:

A point $P$ is counted 2 times if the line is tangent to the curve at $P$ .
The point at infinity is counted when the line is vertical.

With this property, we will define a group on the Elliptic Curve elements. We define the binary operation addition ( $+$ ) as follows:

For any two points $P_{1}, P_{2}$ , draw the line through them and take the 3rd point where it intersects the curve; $P_{1} + P_{2}$ is the reflection of that point across the $x$ -axis (that is, $(x, y) \mapsto (x, - y)$ ).
We say $O$ is the additive identity, $P + O = O + P = P$ .

Under this operation, we can find that for two points $P_{1} = (x_{1}, y_{1}), P_{2} = (x_{2}, y_{2}) \neq O$ , we can calculate their addition as follows (division mod $p$ means multiplying by the multiplicative inverse):

- If $x_{1} \neq x_{2}$ , then $P_{1} + P_{2} = (x_{3}, y_{3})$ with $x_{3} = [m^{2} - x_{1} - x_{2} \bmod p], y_{3} = [m \cdot (x_{1} - x_{3}) - y_{1} \bmod p]$ for $m = \frac{y_{2} - y_{1}}{x_{2} - x_{1}} \bmod p$ .
- If $x_{1} = x_{2}$ but $y_{1} \neq y_{2}$ , then $P_{1} = - P_{2}$ and so $P_{1} + P_{2} = O$ .
- If $P_{1} = P_{2}$ and $y_{1} = 0$ , then $P_{1} + P_{2} = 2 P_{1} = O$ .
- If $P_{1} = P_{2}$ and $y_{1} \neq 0$ , then $P_{1} + P_{2} = 2 P_{1} = (x_{3}, y_{3})$ with $x_{3} = [m^{2} - 2 x_{1} \bmod p], y_{3} = [m \cdot (x_{1} - x_{3}) - y_{1} \bmod p]$ where $m = \frac{3 x_{1}^{2} + A}{2 y_{1}} \bmod p$ .
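The four cases translate fairly directly into code. This is a minimal sketch, assuming Python 3.8+ for modular inverses via `pow(x, -1, p)`, representing $O$ as `None`, and using the toy curve $y^{2} = x^{3} + 3 x + 3 \bmod 7$ ; the names `ec_add` and `ec_mul` are ours:

```python
O = None  # the point at infinity


def ec_add(P1, P2, A, p):
    """Add two points on y^2 = x^3 + A*x + B over Z_p."""
    if P1 is O:
        return P2
    if P2 is O:
        return P1
    x1, y1 = P1
    x2, y2 = P2
    if x1 == x2 and (y1 + y2) % p == 0:
        return O                      # P1 = -P2, or doubling with y1 = 0
    if P1 == P2:
        m = (3 * x1 * x1 + A) * pow((2 * y1) % p, -1, p) % p   # tangent slope
    else:
        m = (y2 - y1) * pow((x2 - x1) % p, -1, p) % p          # chord slope
    x3 = (m * m - x1 - x2) % p
    y3 = (m * (x1 - x3) - y1) % p
    return (x3, y3)


def ec_mul(k, P, A, p):
    """Compute kP by double-and-add (the analogue of repeated squaring)."""
    R = O
    while k > 0:
        if k & 1:
            R = ec_add(R, P, A, p)
        P = ec_add(P, P, A, p)
        k >>= 1
    return R


A, p = 3, 7
print(ec_add((3, 2), (3, 2), A, p))  # (3, 5)
print(ec_add((4, 3), (3, 2), A, p))  # (1, 0)
print(ec_add((3, 2), (3, 5), A, p))  # None, i.e. O
print(ec_mul(6, (4, 3), A, p))       # None: (4, 3) has order 6
```

The curve constant $B$ never appears in the addition formulas, so `ec_add` takes only $A$ and $p$ .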

DDH Over Elliptic Curves

Under this, we can perform Decisional Diffie Hellman over Elliptic Curves. In other words, we want to distinguish $(a P, b P, ab P)$ from $(a P, b P, c P)$ .

Here, $a P$ denotes $P$ added to itself $a$ times, so $ab P$ is the point $P$ multiplied by the scalar $a b$ , and $c$ is an independent, uniformly chosen scalar.

Theorem: Hasse Bound

For prime $p$ , and Elliptic Curve $E (Z_{p})$ , we know that the size of the group satisfies
$p + 1 - 2 \sqrt{p} \leq ∣ E (Z_{p}) ∣ \leq p + 1 + 2 \sqrt{p}$

Diffie Hellman Key Exchange

Using the Diffie-Hellman problems, we can define a key-exchange protocol.

EAV-Security for Key Exchange

First, to define security for this protocol, let’s define the key-exchange experiment $\mathsf{KE}_{A, Π}^{\mathsf{eav}} (n)$ .

1. Two parties holding $1^{n}$ execute the protocol $Π$ . This gives a transcript $\mathsf{trans}$ containing all messages sent by the parties, and a key $k$ output by each of the parties.
2. A uniform bit $b \in \{0, 1\}$ is chosen. If $b = 0$ , set $\hat{k} = k$ , and if $b = 1$ then choose $\hat{k}$ uniformly at random.
3. Adversary $A$ is given $\mathsf{trans}$ and $\hat{k}$ , and outputs a bit $b^{'}$ , its guess for whether $\hat{k}$ is the real key or a random one.
4. The output of the experiment is 1 if $b^{'} = b$ , and 0 otherwise.

We say a key-exchange protocol $Π$ is secure in the presence of an eavesdropper if for all PPT adversaries $A$ , there exists a negligible function $\mathsf{negl}$ such that

$\Pr [\mathsf{KE}_{A, Π}^{\mathsf{eav}} (n) = 1] \leq \frac{1}{2} + \mathsf{negl} (n)$

Diffie-Hellman Key Exchange

One protocol satisfying this is the Diffie Hellman Key Exchange. It works as follows.

Both parties agree ahead of time on some group $G$ , order $q$ , and generator $g$ .
Alice will randomly choose $x \in Z_{q}$ , and take $h_{1} = g^{x}$ . Alice sends $h_{1}$ to Bob.
Bob will randomly choose $y \in Z_{q}$ , and similarly compute $h_{2} = g^{y}$ . Bob sends $h_{2}$ to Alice.
Alice computes $k_{A} = h_{2}^{x}$ , and Bob computes $k_{B} = h_{1}^{y}$ . This gives them the same key.
- This is secure against an eavesdropper, as the adversary cannot see $x, y$ ! So, even if the adversary sees $h_{1}, h_{2}$ , it cannot easily compute $g^{x y}$ : the obvious approach requires recovering $x$ or $y$ (a discrete logarithm, which is a hard problem).
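The message flow above can be sketched in a few lines. The parameters below are toy values we picked for illustration ( $p = 23$ , and $g = 2$ generating the subgroup of order $q = 11$ in $Z_{23}^{*}$ ), far too small for real use; the function names are ours:

```python
import secrets


def dh_keypair(p, q, g):
    """Pick a uniform secret exponent in Z_q and the matching public value."""
    x = secrets.randbelow(q)
    return x, pow(g, x, p)


def dh_shared(their_public, my_secret, p):
    """Raise the other party's public value to our own secret exponent."""
    return pow(their_public, my_secret, p)


p, q, g = 23, 11, 2          # toy group: 2 has order 11 modulo 23

x, h1 = dh_keypair(p, q, g)  # Alice sends h1 to Bob
y, h2 = dh_keypair(p, q, g)  # Bob sends h2 to Alice

k_alice = dh_shared(h2, x, p)
k_bob = dh_shared(h1, y, p)
print(k_alice == k_bob)      # True: both equal g^(xy) mod p
```

Only $h_{1}$ and $h_{2}$ cross the wire; the exponents `x` and `y` never leave the party that chose them.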

This is a protocol used everywhere! Typically, we use ECDH, Elliptic Curve Diffie Hellman.

Theorem: Security of Diffie Hellman Key Exchange

If the DDH problem is hard relative to $G$ , then the Diffie-Hellman key exchange protocol $Π$ is secure in the presence of an eavesdropper.

Proof (Sketch)

Intuitively, this is because the eavesdropper sees only $g^{x}, g^{y}$ , and if DDH is hard then no PPT distinguisher can tell the real key $g^{x y}$ apart from a uniform group element.

MiTM Attack Against Diffie-Hellman Key Exchange

Diffie-Hellman Key Exchange works well, but it assumes that the adversary is only eavesdropping on the communications. If the adversary had the ability to modify the messages in transit, they can perform a man in the middle attack.

Given parties $A, B$ , an adversary who intercepts their messages can complete a separate key exchange with each of $A$ and $B$ , so that each party unknowingly shares a key with the adversary.

Later, when $A$ and $B$ try to communicate, the adversary can decrypt the messages, and re-encrypt them before sending them to the other party.

This is possible because there’s no way for a client to know who they’re communicating with!

Public Key Encryption

One way we can prevent the aforementioned man-in-the-middle attack is by using public key encryption. A public key encryption scheme is a triplet of PPT algorithms such that:

Gen takes a security parameter and outputs a pair of keys $p k, s k$ called the public key, and secret key, respectively.
Enc encrypts the message $m$ under the public key $p k$
Dec decrypts the ciphertext $c$ under the secret key $s k$ .

CPA-Security for Public Key Encryption

For a public key encryption scheme, we define the CPA experiment $\mathsf{PubK}_{A, Π}^{\mathsf{cpa}} (n)$ :

1. Gen is run to obtain the public key $p k$ and secret key $s k$ .
2. The adversary $A$ is given $p k$ , and returns a pair of equal length messages $m_{0}, m_{1}$ in the message space.
3. A uniform bit $b \in \{0, 1\}$ is chosen, and $m_{b}$ is encrypted and sent back to $A$ .
4. $A$ outputs a bit $b^{'}$ , its guess for which message was encrypted. The output of the experiment is 1 if $b^{'} = b$ , and 0 otherwise.

We say the scheme is CPA-Secure if for all PPT adversaries $A$ , there is a negligible function $\mathsf{negl}$ such that

$\Pr [\mathsf{PubK}_{A, Π}^{\mathsf{cpa}} (n) = 1] \leq \frac{1}{2} + \mathsf{negl} (n)$

Under a public key encryption scheme, we can send an encrypted message to the other party using $p k$ . They can then verify their identity by decrypting with the secret key, which only they have.

But how do we create a public key encryption scheme?

El Gamal Encryption

We can convert any key exchange into a public key encryption scheme!

Below, we show how we can convert Diffie-Hellman Key Exchange into a public key scheme called El Gamal Encryption. To see how, consider the following:

In Diffie-Hellman, $R$ sends $h_{1}$ to $S$ , who sends $h_{2}$ to $R$ . Then, both parties generate a shared key.
However, after receiving $h_{1}$ , $S$ can already generate the shared key! So, $S$ can generate the shared key, and encrypt the message with this shared key. It then sends $h_{2}$ and this ciphertext to $R$ .
$R$ can then generate the shared key, decrypting the ciphertext to get the message.

Formally, El Gamal Encryption works as follows:

Gen: Obtain $G, q, g$ . Choose a uniform $x \in Z_{q}$ , and compute $h = g^{x}$ . The public key and secret key are defined as $p k = (G, q, g, h = g^{x})$ and $s k = (G, q, g, x)$ .
Enc: Given $p k = (G, q, g, h = g^{x})$ and message $m \in G$ , choose a uniform $y \in Z_{q}$ and create ciphertext $c = (g^{y}, h^{y} \cdot m)$ .
Dec: Given $s k = (G, q, g, x)$ and ciphertext $c = (c_{1}, c_{2})$ , we can find message $m = c_{2} \cdot (c_{1}^{x})^{- 1}$ .

Note that in decryption, $(c_{1}^{x})^{- 1}$ stands for applying the group operation to $c_{1}$ a total of $x$ times (to find $c_{1}^{x} = g^{x y} = h^{y}$ ), then finding its multiplicative inverse.
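Here is a minimal sketch of the three algorithms over a toy group. We assume the subgroup of order $q = 11$ in $Z_{23}^{*}$ generated by $g = 2$ , represent keys as plain tuples, and use Python 3.8+ for the modular inverse; none of these choices come from the scheme itself:

```python
import secrets


def gen(p, q, g):
    """Return (pk, sk) with pk = (p, q, g, h = g^x) and sk = (p, q, g, x)."""
    x = secrets.randbelow(q)
    return (p, q, g, pow(g, x, p)), (p, q, g, x)


def enc(pk, m):
    """Encrypt a group element m as (g^y, h^y * m) for fresh uniform y."""
    p, q, g, h = pk
    y = secrets.randbelow(q)
    return pow(g, y, p), pow(h, y, p) * m % p


def dec(sk, c):
    """Recover m = c2 * (c1^x)^(-1)."""
    p, q, g, x = sk
    c1, c2 = c
    shared = pow(c1, x, p)              # equals g^(xy) = h^y
    return c2 * pow(shared, -1, p) % p  # multiply by its inverse mod p


pk, sk = gen(23, 11, 2)
m = 13                                  # 13 = 2^7 mod 23 lies in the subgroup
c = enc(pk, m)
print(dec(sk, c) == m)                  # True
```

Encrypting the same message twice will usually give different ciphertexts, because a fresh $y$ is drawn each time.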

Theorem: Security of El Gamal

If the DDH problem is hard relative to $G$ , then the El Gamal encryption scheme is CPA-secure.

Digital Signatures

Using public key encryption, we can also define a signature scheme.

We define a digital signature scheme as follows:

Gen: Takes a security parameter $1^{n}$ , and outputs a public key $p k$ and secret key $s k$ .
Sign: A signing algorithm that takes a secret key $s k$ , and a message $m$ from some message space. It outputs a signature $\mathsf{Sign}_{s k} (m) = σ$ .
Vrfy: Takes the public key $p k$ , a message $m$ , and a signature $σ$ , and outputs a bit $b$ that is 1 if the signature is valid, 0 otherwise.

Signature Security

For security, we define the $\mathsf{SigForge}_{A, Π} (n)$ experiment:

1. Gen is run to obtain $p k, s k$ .
2. The adversary $A$ is given $p k$ and access to an oracle $\mathsf{Sign}_{s k} (\cdot)$ .
3. The adversary outputs $(m, σ)$ . Let $Q$ denote the set of all queries that $A$ asked the oracle.
4. $A$ succeeds if and only if $\mathsf{Vrfy}_{p k} (m, σ) = 1$ and $m \notin Q$ (not queried before).
5. If $A$ succeeds, the output of the experiment is 1, and 0 otherwise.

We say the scheme is secure if for all PPT adversaries $A$ , there exists a negligible function $\mathsf{negl}$ such that

$\Pr [\mathsf{SigForge}_{A, Π} (n) = 1] \leq \mathsf{negl} (n)$

Schnorr Identification Scheme

To construct a signature from the discrete logarithm, we will first construct an identification scheme. This is a scheme which can be used to prove knowledge of a secret key without revealing the secret key.

After this, we will perform a Fiat-Shamir Transform to convert an identification scheme into a signature scheme.

The Schnorr Identification Scheme works as follows. Consider two parties, the prover $P$ and the verifier $V$ :

Prover $P$ has secret key $x$ , and verifier $V$ has public key $y = g^{x}$ .
$P$ chooses uniform $k \in Z_{q}$ , and computes $I = g^{k}$ . $P$ sends this to $V$ .
$V$ now chooses a challenge, a uniform $r \in Z_{q}$ . $V$ sends this to $P$ .
$P$ computes $s = [r x + k mod q]$ , and sends this to $V$ .
$V$ can now check whether $g^{s} \cdot y^{- r} = I$ .
- If the prover is legitimate, then the verifier will find that $g^{s} \cdot y^{- r} = g^{r x + k} \cdot g^{- r x} = g^{k}$ .
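One round of the protocol can be sketched as follows. The group parameters ( $p = 23$ , $q = 11$ , $g = 2$ ) are toy values of our choosing, the function names are hypothetical, and both parties are simulated in a single process purely for illustration (Python 3.8+ is assumed for negative exponents in `pow`):

```python
import secrets

P, Q, G = 23, 11, 2   # toy group: G has prime order Q modulo P


def keygen():
    x = secrets.randbelow(Q)          # prover's secret key
    return x, pow(G, x, P)            # (x, y = g^x)


def commit():
    k = secrets.randbelow(Q)          # fresh randomness for each run
    return k, pow(G, k, P)            # (k, I = g^k)


def challenge():
    return secrets.randbelow(Q)       # verifier's uniform r


def respond(x, k, r):
    return (r * x + k) % Q            # s = r*x + k mod q


def verify(y, I, r, s):
    return pow(G, s, P) * pow(y, -r, P) % P == I


x, y = keygen()
k, I = commit()
r = challenge()
s = respond(x, k, r)
print(verify(y, I, r, s))             # True for an honest prover
```

A response that is off by even one, such as `(s + 1) % Q`, fails the check because it shifts the left-hand side by a factor of $g$ .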

This scheme is secure, and does not actually leak what $x$ is. Intuitively, this is because the uniformly chosen $k$ functions as a one-time pad: it masks $r x$ in $s = r x + k \bmod q$ , which obscures $x$ .

To formally show this, we will want to prove the following.

Proof: Prover Knows $x$

We first show that under this scheme, the prover knows $x$ . To do this, suppose we have a “knowledge extractor”. This extractor takes a prover that succeeds in the scheme. Given this prover, we can use it to compute the discrete log of $y = g^{x}$ in polynomial time.

This shows that the prover knows $x$ , as a polynomial computation of the discrete log would be impossible otherwise.

What our extractor will do is find two “paths”, starting with $I$ , that are accepted in our scheme. After we find these paths, we have for some initial $I$ , one path $r_{1}, s_{1}$ , and another $r_{2}, s_{2}$ with $r_{1} \neq r_{2}$ . Using these, we can compute $x$ .
$g^{s_{1}} \cdot y^{- r_{1}} = I = g^{s_{2}} \cdot y^{- r_{2}}$
$g^{s_{1} - s_{2}} = y^{r_{1} - r_{2}}$
$g^{\frac{s_{1} - s_{2}}{r_{1} - r_{2}}} = y$
$x = \frac{s_{1} - s_{2}}{r_{1} - r_{2}} \bmod q$

This is called the Forking Lemma. We can get 2 accepting transcripts in polynomial time using rewinding.
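The final step of the extraction is plain modular arithmetic in $Z_{q}$ . As a small sketch (the function name `extract_secret` and the sample numbers are ours, with Python 3.8+ assumed for the modular inverse):

```python
def extract_secret(r1, s1, r2, s2, q):
    """Recover x from two accepting transcripts that share the same I."""
    if (r1 - r2) % q == 0:
        raise ValueError("the two challenges must differ")
    # x = (s1 - s2) / (r1 - r2) mod q, with division as a modular inverse
    return (s1 - s2) * pow((r1 - r2) % q, -1, q) % q


# Toy transcripts for q = 11, secret x = 7, and shared randomness k = 4.
q, x, k = 11, 7, 4
r1, r2 = 3, 9
s1 = (r1 * x + k) % q    # 3
s2 = (r2 * x + k) % q    # 1
print(extract_secret(r1, s1, r2, s2, q))  # 7
```

This is also why the prover must never reuse the same $k$ for two different challenges.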

Proof: No Information is Leaked about $x$

We also show that under this scheme, no information is leaked about $x$ . To do this, we construct a polynomial-time simulator which, given only $y$ , can produce transcripts of the identification protocol without knowing $x$ .

The simulator outputs correctly distributed transcripts $(I, r, s)$ as follows:

1. Sample $(r, s)$ from their marginal distribution: both are chosen uniformly from $Z_{q}$ .
2. Compute the consistent $I$ , which depends on $r, s$ , as $I = g^{s} \cdot y^{- r}$ .

By definition of the scheme, there is exactly 1 possible $I$ value that works for a given $(r, s)$ !

These are all accepting transcripts of the identification protocol, with the same distribution as in a real execution with an honest verifier.

This is called Honest Verifier Zero Knowledge.
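The simulator is short enough to write out. In this sketch the toy group ( $p = 23$ , $q = 11$ , $g = 2$ ) and the names `simulate_transcript` and `accepts` are our assumptions; note that the secret $x$ appears only when setting up the example, never inside the simulator:

```python
import secrets


def simulate_transcript(p, q, g, y):
    """Produce an accepting transcript (I, r, s) from the public key alone."""
    r = secrets.randbelow(q)                 # uniform challenge
    s = secrets.randbelow(q)                 # uniform response
    I = pow(g, s, p) * pow(y, -r, p) % p     # the unique consistent I
    return I, r, s


def accepts(p, g, y, I, r, s):
    """The verifier's check g^s * y^(-r) == I."""
    return pow(g, s, p) * pow(y, -r, p) % p == I


p, q, g = 23, 11, 2
y = pow(g, 7, p)                             # public key for some secret x = 7
I, r, s = simulate_transcript(p, q, g, y)
print(accepts(p, g, y, I, r, s))             # True
```

The simulator picks the messages in the "wrong" order ( $r$ and $s$ first, then $I$ ), which is only possible because it does not have to commit to $I$ before seeing the challenge.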

Fiat-Shamir Transform

After constructing an identification scheme, we then apply the Fiat-Shamir Transform to it to obtain a signature scheme.

This transform works as follows, giving us the Schnorr Signature Scheme.

Gen: Obtain keys $p k = y = g^{x}, s k = x$ .
Sign: Given a private key $s k$ and message $m$ ,
1. Compute $I = g^{k}$ for uniform $k \in Z_{q}$ .
2. Instead of a random $r$ , compute $r = H (I ∣∣ m)$ , where $H$ is a hash function.
3. Use this $r$ to compute $s = [r x + k \bmod q]$ .
4. Return signature $(r, s)$ .
Vrfy: Given public key $p k = y$ , message $m$ , and signature $(r, s)$ , compute $I = g^{s} \cdot y^{- r}$ . Rehash $H (I ∣∣ m)$ , and return 1 if this hash equals the $r$ in the signature.
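Putting the pieces together gives a compact sketch of the signature scheme. Several details here are our assumptions rather than part of the description above: the toy group ( $p = 23$ , $q = 11$ , $g = 2$ ), instantiating $H$ with SHA-256 reduced modulo $q$ , and the byte encoding of $I ∣∣ m$ :

```python
import hashlib
import secrets

P, Q, G = 23, 11, 2   # toy group: G has prime order Q modulo P


def H(I, m):
    """Hash I || m into Z_q (SHA-256 is an assumed instantiation of H)."""
    data = str(I).encode() + b"||" + m
    return int.from_bytes(hashlib.sha256(data).digest(), "big") % Q


def keygen():
    x = secrets.randbelow(Q)
    return x, pow(G, x, P)            # (sk, pk)


def sign(x, m):
    k = secrets.randbelow(Q)
    I = pow(G, k, P)
    r = H(I, m)                       # challenge derived from the hash
    s = (r * x + k) % Q
    return r, s


def verify(y, m, sig):
    r, s = sig
    I = pow(G, s, P) * pow(y, -r, P) % P   # recompute I from the signature
    return H(I, m) == r


x, y = keygen()
sig = sign(x, b"hello")
print(verify(y, b"hello", sig))       # True
```

With a group this small, forgeries succeed by chance with noticeable probability, so the example only demonstrates correctness, not security.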
