Permutation argument

Given that gates in halo2 circuits operate "locally" (on cells in the current row or defined relative rows), it is common to need to copy a value from some arbitrary cell into the current row for use in a gate. This is performed with an equality constraint, which enforces that the source and destination cells contain the same value.

We implement these equality constraints by constructing a permutation that represents the constraints, and then using a permutation argument within the proof to enforce them.

Notation

A permutation is a one-to-one and onto mapping of a set onto itself. A permutation can be factored uniquely into a composition of cycles (up to ordering of cycles, and rotation of each cycle).

We sometimes use cycle notation to write permutations. Let $(a b c)$ denote a cycle where $a$ maps to $b,$ $b$ maps to $c,$ and $c$ maps to $a$ (with the obvious generalization to arbitrary-sized cycles). Writing two or more cycles next to each other denotes a composition of the corresponding permutations. For example, $(a b) (c d)$ denotes the permutation that maps $a$ to $b,$ $b$ to $a,$ $c$ to $d,$ and $d$ to $c .$

Constructing the permutation

Goal

We want to construct a permutation in which each equality-constraint set of variables forms a cycle. For example, suppose that we have a circuit that defines the following equality constraints:

$a \equiv b$
$a \equiv c$
$d \equiv e$

From this we have the equality-constraint sets $\{a, b, c\}$ and $\{d, e\} .$ We want to construct the permutation:

$(a b c) (d e)$

which defines the mapping of $[a, b, c, d, e]$ to $[b, c, a, e, d] .$
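As a minimal Python sketch (the helper name `cycles_to_mapping` is hypothetical, not part of halo2), cycle notation can be expanded into an explicit mapping as follows:

```python
def cycles_to_mapping(cycles):
    """Return a dict sending each element to its successor within its cycle."""
    mapping = {}
    for cycle in cycles:
        for idx, x in enumerate(cycle):
            # The last element of a cycle wraps around to the first.
            mapping[x] = cycle[(idx + 1) % len(cycle)]
    return mapping


sigma = cycles_to_mapping([("a", "b", "c"), ("d", "e")])
image = [sigma[x] for x in ["a", "b", "c", "d", "e"]]
print(image)  # ['b', 'c', 'a', 'e', 'd']
```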

Algorithm

We need to keep track of the set of cycles, which is a set of disjoint sets. Efficient data structures for this problem are known; for the sake of simplicity we choose one that is not asymptotically optimal but is easy to implement.

We represent the current state as:

an array $mapping$ for the permutation itself;
an auxiliary array $aux$ that keeps track of a distinguished element of each cycle;
another array $sizes$ that keeps track of the size of each cycle.

We have the invariant that for each element $x$ in a given cycle $C,$ $aux (x)$ points to the same element $c \in C .$ This allows us to quickly decide whether two given elements $x$ and $y$ are in the same cycle, by checking whether $aux (x) = aux (y) .$ Also, $sizes (aux (x))$ gives the size of the cycle containing $x .$ (This is guaranteed only for $sizes (aux (x)),$ not for $sizes (x) .$ )

The algorithm starts with a representation of the identity permutation: for all $x,$ we set $mapping (x) = x,$ $aux (x) = x,$ and $sizes (x) = 1.$

To add an equality constraint $left \equiv right$ :

1. Check whether $left$ and $right$ are already in the same cycle, i.e. whether $aux (left) = aux (right) .$ If so, there is nothing to do.
2. Otherwise, $left$ and $right$ belong to different cycles. Make $left$ the larger cycle and $right$ the smaller one, by swapping them iff $sizes (aux (left)) < sizes (aux (right)) .$
3. Set $sizes (aux (left)) := sizes (aux (left)) + sizes (aux (right)) .$
4. Following the mapping around the right (smaller) cycle, for each element $x$ set $aux (x) := aux (left) .$
5. Splice the smaller cycle into the larger one by swapping $mapping (left)$ with $mapping (right) .$
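The steps above can be sketched in Python as follows. This is an illustrative model only: the class name `Permutation`, the helper `cycle_of`, and the use of integer indices for cells are assumptions made for the example, not the halo2 implementation.

```python
class Permutation:
    """Toy model of the arrays mapping, aux and sizes (cells are indices 0..n-1)."""

    def __init__(self, n):
        # Identity permutation: every element is its own cycle of size 1.
        self.mapping = list(range(n))
        self.aux = list(range(n))
        self.sizes = [1] * n

    def add_equality(self, left, right):
        # Same distinguished element means same cycle: nothing to do.
        if self.aux[left] == self.aux[right]:
            return
        # Make `left` the element that lies in the larger cycle.
        if self.sizes[self.aux[left]] < self.sizes[self.aux[right]]:
            left, right = right, left
        # The merged cycle has the combined size.
        self.sizes[self.aux[left]] += self.sizes[self.aux[right]]
        # Walk the smaller cycle and relabel each of its elements.
        x = right
        while True:
            self.aux[x] = self.aux[left]
            x = self.mapping[x]
            if x == right:
                break
        # Splice the two cycles together.
        self.mapping[left], self.mapping[right] = (
            self.mapping[right],
            self.mapping[left],
        )

    def cycle_of(self, x):
        """Elements of the cycle containing x, in mapping order starting at x."""
        out, y = [x], self.mapping[x]
        while y != x:
            out.append(y)
            y = self.mapping[y]
        return out


# Cells a, b, c, d, e are indices 0..4; constraints a = b, a = c, d = e.
perm = Permutation(5)
for left, right in [(0, 1), (0, 2), (3, 4)]:
    perm.add_equality(left, right)
print(sorted(perm.cycle_of(0)), sorted(perm.cycle_of(3)))  # [0, 1, 2] [3, 4]
```

In this run the first cycle comes out as $(a c b)$ rather than $(a b c)$. Both orderings describe the same equality-constraint set, which is all the argument requires.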

For example, given two disjoint cycles $(A B C D)$ and $(E F G H)$ :

A +---> B
^       +
|       |
+       v
D <---+ C       E +---> F
                ^       +
                |       |
                +       v
                H <---+ G

After adding constraint $B \equiv E$ the above algorithm produces the cycle:

A +---> B +-------------+
^                       |
|                       |
+                       v
D <---+ C <---+ E       F
                ^       +
                |       |
                +       v
                H <---+ G

Broken alternatives

If we did not check whether $left$ and $right$ were already in the same cycle, then we could end up undoing an equality constraint. For example, if we have the following constraints:

$a \equiv b$
$b \equiv c$
$c \equiv d$
$b \equiv d$

and we tried to implement adding an equality constraint using only the final splicing step of the above algorithm (swapping $mapping (left)$ with $mapping (right)$ ), then we would end up constructing the permutation $(a b) (c d),$ rather than the correct single cycle $(a b c d) .$
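The failure can be reproduced with a short Python sketch. The function name `add_equality_swap_only` is hypothetical and deliberately implements the broken variant, which only swaps the two mapping entries:

```python
def add_equality_swap_only(mapping, left, right):
    """Broken variant: splice without first checking cycle membership."""
    mapping[left], mapping[right] = mapping[right], mapping[left]


mapping = {x: x for x in "abcd"}
for left, right in [("a", "b"), ("b", "c"), ("c", "d"), ("b", "d")]:
    add_equality_swap_only(mapping, left, right)

# The last constraint splits (a b c d) back into two cycles.
print(mapping)  # {'a': 'b', 'b': 'a', 'c': 'd', 'd': 'c'}
```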

Argument specification

We need to check a permutation of cells in $m$ columns, represented in Lagrange basis by polynomials $v_{0}, \dots, v_{m - 1} .$

We will label each cell in those $m$ columns with a unique element of $F^{\times} .$

Suppose that we have a permutation on these labels, $σ (column : i, row : j) = (column : i^{'}, row : j^{'}),$ in which the cycles correspond to equality-constraint sets.

If we consider the set of pairs $\{(label, value)\},$ then the values within each cycle are equal if and only if permuting the label in each pair by $σ$ yields the same set, i.e. $\{(σ (label), value)\} = \{(label, value)\} .$

Since the labels are distinct, set equality is the same as multiset equality, which we can check using a product argument.
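A minimal sketch of this criterion in Python, using string labels and a hypothetical helper `permuted_pairs_match` (neither is taken from halo2):

```python
def permuted_pairs_match(sigma, values):
    """True iff permuting the labels by sigma leaves the set of pairs unchanged."""
    original = {(lbl, values[lbl]) for lbl in sigma}
    permuted = {(sigma[lbl], values[lbl]) for lbl in sigma}
    return original == permuted


# sigma is the permutation (a b c) (d e).
sigma = {"a": "b", "b": "c", "c": "a", "d": "e", "e": "d"}

# Values agree within each cycle, so the sets coincide.
print(permuted_pairs_match(sigma, {"a": 7, "b": 7, "c": 7, "d": 9, "e": 9}))  # True
# c differs from a and b, so the permuted set is different.
print(permuted_pairs_match(sigma, {"a": 7, "b": 7, "c": 8, "d": 9, "e": 9}))  # False
```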

Let $ω$ be a primitive $2^{k}$-th root of unity and let $δ$ be a primitive $T$-th root of unity, where $T \cdot 2^{S} + 1 = p$ with $T$ odd and $k \leq S .$ We will use $δ^{i} \cdot ω^{j} \in F^{\times}$ as the label for the cell in the $j$ th row of the $i$ th column of the permutation argument.

We represent $σ$ by a vector of $m$ polynomials $s_{i} (X)$ such that $s_{i} (ω^{j}) = δ^{i^{'}} \cdot ω^{j^{'}} .$

Notice that the identity permutation can be represented by the vector of $m$ polynomials $ID_{i} (X)$ such that $ID_{i} (ω^{j}) = δ^{i} \cdot ω^{j} .$
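To make the labelling concrete, here is a sketch over a toy field. The prime $97 = 3 \cdot 2^{5} + 1$ (so $T = 3$ and $S = 5$), the two-column, four-row layout, and all Python names are assumptions chosen for illustration. The polynomials $s_{i}$ and $ID_{i}$ are represented only by their evaluations at the rows $ω^{j}$.

```python
# Toy parameters (assumed for illustration): 97 = 3 * 2**5 + 1, so T = 3, S = 5.
P, S, K, M = 97, 5, 2, 2
N = 2 ** K  # number of rows


def find_generator(p, prime_factors):
    """Smallest generator of the multiplicative group of F_p."""
    for g in range(2, p):
        if all(pow(g, (p - 1) // q, p) != 1 for q in prime_factors):
            return g


G = find_generator(P, [2, 3])
OMEGA = pow(G, (P - 1) // N, P)  # element of order 2**K
DELTA = pow(G, 2 ** S, P)        # element of order T = 3


def label(i, j):
    """Label delta^i * omega^j of the cell in column i, row j."""
    return pow(DELTA, i, P) * pow(OMEGA, j, P) % P


# Evaluations of ID_i at omega^0 .. omega^(N-1).
identity = [[label(i, j) for j in range(N)] for i in range(M)]

# sigma swaps the cells (column 0, row 1) and (column 1, row 3).
sigma = {(i, j): (i, j) for i in range(M) for j in range(N)}
sigma[(0, 1)], sigma[(1, 3)] = (1, 3), (0, 1)

# Evaluations of s_i: the label of the cell that sigma maps (i, j) to.
s = [[label(*sigma[(i, j)]) for j in range(N)] for i in range(M)]

all_labels = [x for column in identity for x in column]
print(len(set(all_labels)) == M * N)  # True
```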

We will use a challenge $β$ to compress each $(label, value)$ pair to $value + β \cdot label .$ Just as in the product argument we used for lookups, we also use a challenge $γ$ to randomize each term of the product.

Now given our permutation represented by $s_{0}, \dots, s_{m - 1}$ over columns represented by $v_{0}, \dots, v_{m - 1},$ we want to ensure that (with $n = 2^{k}$ rows):

$\prod_{i = 0}^{m - 1} \prod_{j = 0}^{n - 1} \left( \frac{v_{i} (ω^{j}) + β \cdot δ^{i} \cdot ω^{j} + γ}{v_{i} (ω^{j}) + β \cdot s_{i} (ω^{j}) + γ} \right) = 1$

Here $v_{i} (ω^{j}) + β \cdot δ^{i} \cdot ω^{j}$ represents the unpermuted $(label, value)$ pair, and $v_{i} (ω^{j}) + β \cdot s_{i} (ω^{j})$ represents the permuted $(σ (label), value)$ pair.
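The product check can be exercised over a toy field. In the sketch below, the prime $97 = 3 \cdot 2^{5} + 1$, the cell values, and all function names are assumptions made for illustration. Instead of dividing, it compares the multiplied-out numerator and denominator, which is equivalent to the product being $1$ whenever no denominator term vanishes. It also loops over every possible $(β, γ)$, which is feasible only because the field is tiny.

```python
from itertools import product

# Toy parameters (assumed for illustration): 97 = 3 * 2**5 + 1, so T = 3, S = 5.
P, S, K, M = 97, 5, 2, 2
N = 2 ** K  # number of rows


def find_generator(p, prime_factors):
    """Smallest generator of the multiplicative group of F_p."""
    for g in range(2, p):
        if all(pow(g, (p - 1) // q, p) != 1 for q in prime_factors):
            return g


G = find_generator(P, [2, 3])
OMEGA = pow(G, (P - 1) // N, P)  # order 2**K
DELTA = pow(G, 2 ** S, P)        # order T


def label(i, j):
    """Label delta^i * omega^j of the cell in column i, row j."""
    return pow(DELTA, i, P) * pow(OMEGA, j, P) % P


def both_sides(v, sigma, beta, gamma):
    """Numerator and denominator of the grand product, multiplied out."""
    num = den = 1
    for (i, j), (i2, j2) in sigma.items():
        num = num * (v[i][j] + beta * label(i, j) + gamma) % P
        den = den * (v[i][j] + beta * label(i2, j2) + gamma) % P
    return num, den


# Equality constraint between (column 0, row 1) and (column 1, row 3).
sigma = {(i, j): (i, j) for i in range(M) for j in range(N)}
sigma[(0, 1)], sigma[(1, 3)] = (1, 3), (0, 1)

good = [[5, 42, 7, 8], [1, 2, 3, 42]]  # the two constrained cells agree
bad = [[5, 42, 7, 8], [1, 2, 3, 43]]   # the two constrained cells differ


def holds_for_all_challenges(v):
    pairs = (both_sides(v, sigma, b, g) for b, g in product(range(P), repeat=2))
    return all(num == den for num, den in pairs)


print(holds_for_all_challenges(good))  # True
print(holds_for_all_challenges(bad))   # False
```

For the satisfying assignment the identity holds for every challenge pair. For the violating one it fails for at least one pair, and the real argument relies on a random challenge hitting such a pair with overwhelming probability.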

Let $Z_{P}$ be such that $Z_{P} (ω^{0}) = Z_{P} (ω^{n}) = 1$ and for $0 \leq j < n$ :

$Z_{P} (ω^{j + 1}) = \prod_{h = 0}^{j} \prod_{i = 0}^{m - 1} \frac{v_{i} (ω^{h}) + β \cdot δ^{i} \cdot ω^{h} + γ}{v_{i} (ω^{h}) + β \cdot s_{i} (ω^{h}) + γ} = Z_{P} (ω^{j}) \prod_{i = 0}^{m - 1} \frac{v_{i} (ω^{j}) + β \cdot δ^{i} \cdot ω^{j} + γ}{v_{i} (ω^{j}) + β \cdot s_{i} (ω^{j}) + γ}$

Then it is sufficient to enforce the rules:

$Z_{P} (ω X) \cdot \prod_{i = 0}^{m - 1} (v_{i} (X) + β \cdot s_{i} (X) + γ) - Z_{P} (X) \cdot \prod_{i = 0}^{m - 1} (v_{i} (X) + β \cdot δ^{i} \cdot X + γ) = 0$
$ℓ_{0} (X) \cdot (1 - Z_{P} (X)) = 0$
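The running product and the two rules can be checked row by row in a toy field. In this sketch the prime $97$, the cell values, and every function name are assumptions for illustration. The deterministic `pick_challenges` stands in for random challenges and simply avoids zero denominators, and the rules are evaluated only at the points $X = ω^{j}$ rather than as polynomial identities.

```python
# Toy parameters (assumed for illustration): 97 = 3 * 2**5 + 1, so T = 3, S = 5.
P, S, K, M = 97, 5, 2, 2
N = 2 ** K  # number of rows


def find_generator(p, prime_factors):
    """Smallest generator of the multiplicative group of F_p."""
    for g in range(2, p):
        if all(pow(g, (p - 1) // q, p) != 1 for q in prime_factors):
            return g


G = find_generator(P, [2, 3])
OMEGA = pow(G, (P - 1) // N, P)  # order 2**K
DELTA = pow(G, 2 ** S, P)        # order T


def label(i, j):
    return pow(DELTA, i, P) * pow(OMEGA, j, P) % P


def row_factors(v, sigma, beta, gamma, j):
    """Numerator and denominator of the per-row factor at row j."""
    num = den = 1
    for i in range(M):
        num = num * (v[i][j] + beta * label(i, j) + gamma) % P
        den = den * (v[i][j] + beta * label(*sigma[(i, j)]) + gamma) % P
    return num, den


def pick_challenges(v, sigma):
    """Stand-in for random challenges: first pair with no zero denominator."""
    for beta in range(1, P):
        for gamma in range(1, P):
            if all(row_factors(v, sigma, beta, gamma, j)[1] for j in range(N)):
                return beta, gamma


def grand_product(v, sigma, beta, gamma):
    """Z_P at omega^0 .. omega^N; the last entry is the wraparound value."""
    z = [1]
    for j in range(N):
        num, den = row_factors(v, sigma, beta, gamma, j)
        z.append(z[-1] * num * pow(den, -1, P) % P)  # pow(x, -1, P): Python 3.8+
    return z


def rules_hold(z, v, sigma, beta, gamma):
    """Check both rules at every X = omega^j, with omega^N wrapping to omega^0."""
    if z[0] != 1:
        return False
    for j in range(N):
        num, den = row_factors(v, sigma, beta, gamma, j)
        if (z[(j + 1) % N] * den - z[j] * num) % P != 0:
            return False
    return True


sigma = {(i, j): (i, j) for i in range(M) for j in range(N)}
sigma[(0, 1)], sigma[(1, 3)] = (1, 3), (0, 1)
good = [[5, 42, 7, 8], [1, 2, 3, 42]]
bad = [[5, 42, 7, 8], [1, 2, 3, 43]]

for v in (good, bad):
    beta, gamma = pick_challenges(v, sigma)
    z = grand_product(v, sigma, beta, gamma)
    # Only the first N entries form the column; z[N] must wrap around to z[0].
    print(z[N] == 1, rules_hold(z[:N], v, sigma, beta, gamma))
# prints "True True" for good, then "False False" for bad
```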

This assumes that the number of columns $m$ is such that the polynomial in the first rule above fits within the degree bound of the PLONK configuration. We will see below how to handle a larger number of columns.

The optimization used to obtain the simple representation of the identity permutation was suggested by Vitalik Buterin for PLONK, and is described at the end of section 8 of the PLONK paper. Note that the $δ^{i}$ lie in distinct cosets of the subgroup generated by $ω$ (because $δ$ has odd order while the order of $ω$ is a power of two), so the labels $δ^{i} \cdot ω^{j}$ are all distinct, provided that the number of columns that are enabled for equality is no more than $T$ , which always holds in practice for the curves used in Halo 2.

Zero-knowledge adjustment

Similarly to the lookup argument, we need an adjustment to the above argument to account for the last $t$ rows of each column being filled with random values.

We limit the number of usable rows to $u = 2^{k} - t - 1.$ We add two selectors, defined in the same way as for the lookup argument:

$q_{blind}$ is set to $1$ on the last $t$ rows, and $0$ elsewhere;
$q_{last}$ is set to $1$ only on row $u,$ and $0$ elsewhere (i.e. it is set on the row in between the usable rows and the blinding rows).

We enable the product rule from above only for the usable rows:

$(1 - (q_{last} (X) + q_{blind} (X))) \cdot \left( Z_{P} (ω X) \cdot \prod_{i = 0}^{m - 1} (v_{i} (X) + β \cdot s_{i} (X) + γ) - Z_{P} (X) \cdot \prod_{i = 0}^{m - 1} (v_{i} (X) + β \cdot δ^{i} \cdot X + γ) \right) = 0$

The rule that is enabled on row $0$ remains the same:

$ℓ_{0} (X) \cdot (1 - Z_{P} (X)) = 0$

Since we can no longer rely on the wraparound to ensure that the product $Z_{P}$ becomes $1$ again at $ω^{2^{k}},$ we would instead need to constrain $Z_{P} (ω^{u}) = 1.$ This raises the same problem that was described for the lookup argument. So we allow $Z_{P} (ω^{u})$ to be either zero or one:

$q_{last} (X) \cdot (Z_{P} (X)^{2} - Z_{P} (X)) = 0$

which gives perfect completeness and zero knowledge.
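The adjusted rules can be sketched over a toy field with $2^{3} = 8$ rows, $t = 2$ blinding rows and $u = 5$ usable rows. The prime $97$, the cell values, the seeded `random.Random` used for blinding, and all function names are assumptions for illustration. The rules are checked pointwise at each row. The stand-in `pick_challenges` avoids challenges for which any factor on a usable row vanishes, a case in which a running product of zero could occur.

```python
import random

# Toy parameters (assumed for illustration): 97 = 3 * 2**5 + 1, so T = 3, S = 5.
P, S, K, M = 97, 5, 3, 2
N = 2 ** K          # 8 rows
BLIND = 2           # t, the number of blinding rows
U = N - BLIND - 1   # u = 5: rows 0..4 are usable, row 5 carries q_last


def find_generator(p, prime_factors):
    for g in range(2, p):
        if all(pow(g, (p - 1) // q, p) != 1 for q in prime_factors):
            return g


G = find_generator(P, [2, 3])
OMEGA = pow(G, (P - 1) // N, P)  # order 2**K
DELTA = pow(G, 2 ** S, P)        # order T

q_blind = [1 if j > U else 0 for j in range(N)]
q_last = [1 if j == U else 0 for j in range(N)]


def label(i, j):
    return pow(DELTA, i, P) * pow(OMEGA, j, P) % P


def row_factors(v, sigma, beta, gamma, j):
    num = den = 1
    for i in range(M):
        num = num * (v[i][j] + beta * label(i, j) + gamma) % P
        den = den * (v[i][j] + beta * label(*sigma[(i, j)]) + gamma) % P
    return num, den


def pick_challenges(v, sigma):
    """Stand-in for random challenges: no factor on a usable row vanishes."""
    for beta in range(1, P):
        for gamma in range(1, P):
            if all(all(row_factors(v, sigma, beta, gamma, j)) for j in range(U)):
                return beta, gamma


def assign_z(v, sigma, beta, gamma, rng):
    """Running product on rows 0..u, then random values on the blinding rows."""
    z = [1]
    for j in range(U):
        num, den = row_factors(v, sigma, beta, gamma, j)
        z.append(z[-1] * num * pow(den, -1, P) % P)  # pow(x, -1, P): Python 3.8+
    return z + [rng.randrange(P) for _ in range(BLIND)]


def rules_hold(z, v, sigma, beta, gamma):
    for j in range(N):
        num, den = row_factors(v, sigma, beta, gamma, j)
        active = 1 - (q_last[j] + q_blind[j])
        product_rule = active * (z[(j + 1) % N] * den - z[j] * num)
        first_row_rule = (1 if j == 0 else 0) * (1 - z[j])
        last_row_rule = q_last[j] * (z[j] ** 2 - z[j])
        if product_rule % P or first_row_rule % P or last_row_rule % P:
            return False
    return True


rng = random.Random(0)
sigma = {(i, j): (i, j) for i in range(M) for j in range(N)}
sigma[(0, 1)], sigma[(1, 3)] = (1, 3), (0, 1)  # both cells lie in usable rows


def fill(usable):
    """Pad each column's usable rows with random values for the remaining rows."""
    return [col + [rng.randrange(P) for _ in range(N - U)] for col in usable]


def accepted(v):
    beta, gamma = pick_challenges(v, sigma)
    return rules_hold(assign_z(v, sigma, beta, gamma, rng), v, sigma, beta, gamma)


good = fill([[5, 42, 7, 8, 9], [1, 2, 3, 42, 4]])
bad = fill([[5, 42, 7, 8, 9], [1, 2, 3, 43, 4]])
print(accepted(good), accepted(bad))  # True False
```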

Spanning a large number of columns

The halo2 implementation does not in practice limit the number of columns for which equality constraints can be enabled. Therefore, it must solve the problem that the above approach might yield a product rule with a polynomial that exceeds the PLONK configuration's degree bound. The degree bound could be raised, but this would be inefficient if no other rules require a larger degree.

Instead, we split the product across $b$ sets of $m$ columns, using product columns $Z_{P, 0}, \dots, Z_{P, b - 1},$ and we use another rule to copy the product from the end of one column set to the beginning of the next.

That is, for $0 \leq a < b$ we have:

$(1 - (q_{last} (X) + q_{blind} (X))) \cdot \left( Z_{P, a} (ω X) \cdot \prod_{i = am}^{(a + 1) m - 1} (v_{i} (X) + β \cdot s_{i} (X) + γ) - Z_{P, a} (X) \cdot \prod_{i = am}^{(a + 1) m - 1} (v_{i} (X) + β \cdot δ^{i} \cdot X + γ) \right) = 0$

For simplicity this is written assuming that the number of columns enabled for equality constraints is a multiple of $m$ ; if not then the products for the last column set will have fewer than $m$ terms.

For the first column set we have:

$ℓ_{0} (X) \cdot (1 - Z_{P, 0} (X)) = 0$

For each subsequent column set, $0 < a < b,$ we use the following rule to copy $Z_{P, a - 1} (ω^{u})$ to the start of the next column set, $Z_{P, a} (ω^{0})$ :

$ℓ_{0} (X) \cdot (Z_{P, a} (X) - Z_{P, a - 1} (ω^{u} X)) = 0$

For the last column set, we allow $Z_{P, b - 1} (ω^{u})$ to be either zero or one:

$q_{last} (X) \cdot (Z_{P, b - 1} (X)^{2} - Z_{P, b - 1} (X)) = 0$

which gives perfect completeness and zero knowledge as before.
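The chaining of product columns can be sketched with three equality-enabled columns split into sets of $m = 2$, so the last set has a single column. The toy prime $97$, the cell values, and all names are assumptions for illustration. The rules are checked pointwise, and the stand-in `pick_challenges` avoids challenges that make any factor on a usable row vanish.

```python
import random

# Toy parameters (assumed for illustration): 97 = 3 * 2**5 + 1, so T = 3, S = 5.
P, S, K = 97, 5, 3
N, BLIND = 2 ** K, 2
U = N - BLIND - 1          # u = 5 usable rows
COLUMNS, M = 3, 2          # 3 columns in sets of m = 2, giving b = 2 sets
SETS = [range(a, min(a + M, COLUMNS)) for a in range(0, COLUMNS, M)]


def find_generator(p, prime_factors):
    for g in range(2, p):
        if all(pow(g, (p - 1) // q, p) != 1 for q in prime_factors):
            return g


G = find_generator(P, [2, 3])
OMEGA = pow(G, (P - 1) // N, P)  # order 2**K
DELTA = pow(G, 2 ** S, P)        # order T = 3, so 3 columns get distinct delta^i


def label(i, j):
    return pow(DELTA, i, P) * pow(OMEGA, j, P) % P


def set_factors(v, sigma, beta, gamma, cols, j):
    """Per-row numerator and denominator restricted to one column set."""
    num = den = 1
    for i in cols:
        num = num * (v[i][j] + beta * label(i, j) + gamma) % P
        den = den * (v[i][j] + beta * label(*sigma[(i, j)]) + gamma) % P
    return num, den


def pick_challenges(v, sigma):
    """Stand-in for random challenges: no factor on a usable row vanishes."""
    for beta in range(1, P):
        for gamma in range(1, P):
            if all(all(set_factors(v, sigma, beta, gamma, cols, j))
                   for cols in SETS for j in range(U)):
                return beta, gamma


def assign(v, sigma, beta, gamma, rng):
    """One product column per set; each starts where the previous one ended."""
    zs, start = [], 1
    for cols in SETS:
        z = [start]
        for j in range(U):
            num, den = set_factors(v, sigma, beta, gamma, cols, j)
            z.append(z[-1] * num * pow(den, -1, P) % P)  # Python 3.8+
        start = z[U]
        zs.append(z + [rng.randrange(P) for _ in range(BLIND)])
    return zs


def rules_hold(zs, v, sigma, beta, gamma):
    for a, cols in enumerate(SETS):
        z = zs[a]
        for j in range(U):  # product rule is active only on usable rows
            num, den = set_factors(v, sigma, beta, gamma, cols, j)
            if (z[j + 1] * den - z[j] * num) % P:
                return False
        # Row 0: start at 1, or copy the previous column's value at row u.
        if z[0] != (1 if a == 0 else zs[a - 1][U]):
            return False
    last = zs[-1][U]
    return (last * last - last) % P == 0


rng = random.Random(0)
sigma = {(i, j): (i, j) for i in range(COLUMNS) for j in range(N)}
sigma[(0, 1)], sigma[(2, 3)] = (2, 3), (0, 1)  # constraint spans both sets


def fill(usable):
    return [col + [rng.randrange(P) for _ in range(N - U)] for col in usable]


def accepted(v):
    beta, gamma = pick_challenges(v, sigma)
    return rules_hold(assign(v, sigma, beta, gamma, rng), v, sigma, beta, gamma)


good = fill([[5, 42, 7, 8, 9], [1, 2, 3, 6, 4], [10, 11, 12, 42, 13]])
bad = fill([[5, 42, 7, 8, 9], [1, 2, 3, 6, 4], [10, 11, 12, 43, 13]])
print(accepted(good), accepted(bad))  # True False
```

Because the constrained cells sit in different column sets, the first product column generally does not end at $1$. Only the last one is required to end at zero or one.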

The halo2 Book