SAFE: Sponge API for Field Elements (2023/522)

Abstract

From hashing and commitment schemes to Fiat–Shamir and encryption, hash functions are everywhere in zero-knowledge proof systems (ZKPs), and minor performance changes in “vanilla” implementations can translate into major discrepancies when the hash is processed as a circuit within the proof system. Protocol designers have resorted to a number of techniques and custom modes to optimize hash functions for ZKP settings, but so far without a single established, well-studied construction. To address this need, we define the Sponge API for Field Elements (SAFE), a unified framework for permutation-based schemes (including AEAD, Sigma, PRNGs, and so on). SAFE eliminates the performance overhead, is pluggable in any field-oriented protocol, and is suitable for any permutation algorithm. SAFE is implemented in Filecoin’s Neptune hash framework, which is our reference implementation (in Rust). SAFE is also being integrated in other prominent ZKP projects. This report specifies SAFE and describes use cases.

Among other improvements, our construction is among the first to store the protocol metadata in the sponge inner part in a provably secure way, which may be of independent interest to the sponge use cases outside of ZKP.

1. Introduction

Sponge functions [BDPV07] are the basis of permutation-based symmetric primitives’ design, as studied by Daemen et al.: hash functions, MACs, authenticated encryption schemes, PRNGs, and others. When operating in Duplex mode [BDPV11], a sponge can be seen as a stateful object that can ingest input (“absorb”) and produce output (“squeeze”) at any time and in arbitrary order.

The duplex specification sees the input and output as raw bits, and leaves application-specific encoding to the users. However, in the specifications of zero-knowledge proof systems (ZKPs), hash functions often process field elements (with respect to some canonical encoding), rather than raw bits. Because performance is critical to reducing proof generation and verification costs, dedicated “field-friendly” algebraic hash functions have been designed, most of which are sponges. These include for example Poseidon [GKR^+21], Rescue [AAB^+20], MiMC [AGR^+16], and Reinforced Concrete [LRG^+22].

A typical sponge function makes one call to the inner permutation P per r input bits (or other base units) to be hashed (absorbed) or outputted (squeezed), where r is called the rate. Despite their relative simplicity, sponge-like constructions can be and have been “misused”, or misimplemented, particularly when working with prime field arithmetic, for which constructions are less established and specifications not as rigorous as the generic one. Here we outline the most common misuse patterns we have observed, based on our experience contributing to and auditing ZKP projects (excluding trivial failures such as overwriting the full state):

Domain separation: When the input data fills the full rate, no padding or any other length-dependent separator is applied. A classical example is hashing two field elements inside a Merkle tree with r = 2 field elements, where only 1 call to P is made to reduce costs.
Cross-protocol collisions: Two protocols with one being an extension of the other start with the same state. As a result, they output the same prefix on the same inputs.
Superfluous permutations: Making an extra call to P between squeezing data out and absorbing new data.
Custom constructions: Using nested sponge calls (instead of chaining squeeze-absorb calls) in multi-round non-interactive protocols obtained via Fiat–Shamir heuristic. This unnecessarily increases the complexity, and may jeopardize the provable security guarantees.

Whereas the last two issues are just extra work for a caller, the first two may breach not only security proofs but also the concrete security of real protocols. The API presented below solves all of these and offers secure and efficient usage patterns for many use cases. It also reduces the workload of developers and makes a step towards a unified cross-platform interface.

Contribution. We propose SAFE, a generic API for sponge functions, as well as a production-ready reference implementation. SAFE comes with a security proof, which is available as a separate report. As its main features, SAFE:

Does not use any padding, thus not wasting an extra call to the sponge permutation in any circumstances.
Is independent of an underlying permutation and thus can be used with almost every design on the market (including Poseidon’s).
Eliminates a number of misuse patterns by limiting the set of operations callable on a sponge and by binding a protocol designer to a specific order of these operations.
Is provably secure in the random permutation model in a number of settings, including the overlooked but frequently required cross-protocol security.
Is among the first constructions to store the protocol’s metadata in the sponge inner part, provably losing no security.

SAFE is suitable for a variety of use cases encountered in real-world ZK proof systems, including:

Hashing with fixed-length input, as found in commitment schemes, Merkle trees, and signatures.
Fiat–Shamir transforms and other stateful oracle simulations, where the sequence and size of input values is fixed and known in advance.
Authenticated encryption (in the ZK setting, also verifiable encryption) of predetermined-length messages. A classical example is the encryption of a coin secret under the recipient’s public key in privacy-preserving cryptocurrencies [HBHW22].
Pseudo-random generation of field elements from a seed, with a reseeding mechanism.

SAFE however does not support variable-length hashing, where the length of the data to be hashed is unknown in advance. This restriction is the price of a drastic improvement in performance and simplicity, and it did not prove problematic when we surveyed ZK projects’ engineers.

2. Sponges

A (duplexed) sponge is a stateful object parameterized by a capacity c and a rate r where c + r = n is the inner state’s width. State changes are driven by three operations:

Permutation of the width-n state by P.
Injection of input data M^i (“absorption”), by chunks of up to r elements.
Extraction of output data Z^i (“squeezing”), by chunks of up to r elements.

The original sponge was defined for bitstring states, but all the security results carry over to field-element states, as long as these use a sound, fixed-size encoding. When defined over field elements, sponge parameters (c, r, n) are usually expressed in terms of field elements, which leads to the notion of arithmetic capacity.

A sponge is traditionally initialized to the full-zero state. Input data then overwrites up to r given state elements at a time, whereas the c-wide inner part is never touched nor outputted. This construction provides security up to c/2 bits in the random permutation model in the indifferentiability framework [MRH04, CDMP05]. This result means that it behaves like a random oracle and, among others, achieves this level of preimage and collision resistance provided the output size is large enough. It is widely believed that concrete instances such as SHA-3 keep the same security level.

3. The SAFE API

3.1 API Overview

We assume a sponge width of n = r + c field elements, where r is the rate and c the capacity. Here \mathbb{F} is the finite field type, thus elements of \mathbb{F}^L are vectors of L field elements. Length is the length type, an unsigned integer properly bounded counting the number of field elements. State is the type of the internal state, consisting of field elements and other variables.

A SAFE sponge object should expose the following operations to protocol designers:

START(IOPattern, DomainSeparator): This initializes the inner state of the sponge, modifying up to c/2 field elements of the state. It is done once in the lifetime of a sponge.
ABSORB(Length : L, \mathbb{F}^L : X[L]): This injects L field elements to the state from the array X, interleaving calls to the permutation. It also checks if the current call matches the IO pattern.
SQUEEZE(Length : L) \to \mathbb{F}^L: This extracts L field elements from the state, interleaving calls to the permutation. It also checks if the current call matches the IO pattern.
FINISH(Length) \to Result: This marks the end of the sponge life, preventing any further operation. In particular, the state is erased from memory. The result is ‘OK’, or an error.

The general workflow of a sponge prescribed by SAFE is then as follows:

The consumer protocol initializes the sponge: \mathsf{START}(\mathit{IO}[\,], D) where \mathit{IO} is a sequence of future calls and their respective lengths (what we call “IO pattern”) and D is a domain separator. One can also start from a precomputed state, but said state must come from a properly initialized sponge.
The protocol makes a chain of calls C_1, C_2, \ldots, C_\ell, whose input lengths and types correspond to \mathit{IO}. Each C_i is either an ABSORB or a SQUEEZE call.
The protocol closes the sponge with a FINISH() call.

The most important element of the design is IOPattern, which is a compact encoding of the pattern of ABSORB and SQUEEZE calls during the sponge lifetime. An implementation must refuse to finish the sponge usage if this pattern is violated. In particular, the output of SQUEEZE calls must not be used if the IO pattern is not followed.

Remark 1

Several IO patterns can belong to the same equivalence class, and thus lead to identical instances. This is because consecutive calls of the same type (ABSORB or SQUEEZE) are aggregated to define the initial state. An application that needs to absorb L > 1 elements in a row can thus do it one by one (with L calls to ABSORB), or with a single call including the L elements.
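Remark 1 can be illustrated with a short Python sketch. The tuple representation of calls and the helper name aggregate are illustrative choices, not part of the SAFE specification.

```python
from itertools import groupby

def aggregate(pattern):
    """Merge consecutive calls of the same kind by summing their lengths."""
    return [(kind, sum(length for _, length in calls))
            for kind, calls in groupby(pattern, key=lambda call: call[0])]

one_by_one = [("absorb", 1), ("absorb", 1), ("absorb", 1), ("squeeze", 1)]
single_call = [("absorb", 3), ("squeeze", 1)]
interleaved = [("absorb", 2), ("squeeze", 1), ("absorb", 1)]

print(aggregate(one_by_one) == aggregate(single_call))    # True
print(aggregate(interleaved) == aggregate(single_call))   # False
```

The first two patterns fall into the same equivalence class, whereas the third one interleaves a SQUEEZE and therefore defines a different instance.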

Important notes:

Dealing with non-field elements: The API assumes that the input is (represented as) field elements, however applications may need to process other data types. It is the responsibility of users to properly encode such inputs as field elements. If objects of different types are processed by multiple runs of a same instance, at the same position, then some signalling of the input type is required to avoid collisions between different elements of distinct types encoded identically (incurring a performance overhead).
Precomputed state: Multiple “forks” of a sponge can be created, by storing the state after a given number of operations, and restarting from it with distinct ABSORB calls in distinct branches. Note that all forks must follow the same sequence of calls, as specified in the START call.

3.2 Security

The security of the SAFE API is captured by the following theorem, which essentially says that the SAFE outputs are indistinguishable from those of a random oracle.

Theorem 1

Let P be a cryptographic protocol that employs random oracles R_1, R_2, \ldots, R_k and is secure in the random oracle model against adversaries that make up to 2^\lambda queries to the oracles. Then, the implementation of this protocol with each oracle R_i instantiated with the SAFE API using a field of size at least 2^{2\lambda} and a domain separator D_i (pairwise distinct) is secure against adversaries that make up to 2^\lambda queries to the underlying hash H and permutation P.

It implies that whenever SAFE is used in one or multiple (with different IO patterns or domain separators) protocols, at least c \log_2 |\mathbb{F}| bits of security is guaranteed against collision, preimage, or distinguishing attacks. Details are provided in a separate report.

3.3 From IO Patterns to Tags to Instances

Let c < n be the number of capacity elements. The SAFE sponge state consists of the following elements:

Permutation state V \in \mathbb{F}^n.
Absorb position \text{absorb\_pos} \leq n - c.
Squeeze position \text{squeeze\_pos} \leq n - c.
IO pattern expected (as defined by START).

The sponge updates itself by calling inner permutation P. It is also able to compute parameter tag T using H, a cryptographic hash function producing 256-bit digests, by default SHA3-256.

An instance is characterized by a tag derived from an IO pattern, which is a sequence of absorb phases and squeeze phases and their respective number of field elements. The tag is used as an initial value, to ensure that distinct instances behave differently. Using distinct tags for different, non-equivalent usage patterns avoids trivial collisions between input sequences of different length, where a “non-input” element is replaced by a zero element in the colliding message (this would lead to a collision because of the lack of padding). Furthermore, for applications that need to distinguish equivalent IO patterns, a domain separator can be set.
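The trivial collision mentioned above can be reproduced with a toy example. The following is a minimal sketch under loud assumptions: the prime 2^61 - 1, the parameters r = 2 and c = 1, and the function toy_permutation are illustrative stand-ins with no cryptographic strength, and the integer initial_value merely plays the role of the tag.

```python
P = 2**61 - 1          # toy prime modulus (illustrative assumption)
RATE, N = 2, 3         # rate r = 2, width n = 3, so capacity c = 1

def toy_permutation(state):
    """Invertible toy mixing, NOT a secure permutation."""
    v = list(state)
    for rnd in range(8):
        for i in range(N):
            v[i] = (v[i] + v[i - 1] ** 2 + rnd * N + i + 1) % P
    return v

def unpadded_hash(inputs, initial_value=0):
    """Absorb without padding, then output one element."""
    state = [0] * RATE + [initial_value]   # the capacity element holds the initial value
    pos = 0
    for x in inputs:
        if pos == RATE:
            state, pos = toy_permutation(state), 0
        state[pos] = (state[pos] + x) % P
        pos += 1
    return toy_permutation(state)[0]

# All-zero initial state: a missing element cannot be told apart from a zero element.
print(unpadded_hash([5]) == unpadded_hash([5, 0]))   # True

# A length-dependent initial value (stand-in for the tag) separates the two inputs.
print(unpadded_hash([5], initial_value=1) == unpadded_hash([5, 0], initial_value=2))
```

With an all-zero initial state both inputs produce the same pre-permutation state, hence the same output. Once the capacity is seeded with a value that depends on the input length, the two states differ before the permutation is applied.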

A tag is calculated from an IO pattern and a domain separator as follows:

Encode the IO pattern as a list of 32-bit words, whose MSB is set to 1 for ABSORB calls and to 0 for SQUEEZE calls. For example, an instance that does 2 ABSORB calls with 3 elements each and then does one SQUEEZE call with 3 elements is described by the three words [0x80000003, 0x80000003, 0x00000003].
Aggregate any contiguous ABSORB or SQUEEZE calls within a single call: in our example, we would replace [0x80000003, 0x80000003] with a single 0x80000006.
Serialize the list of words into a byte string and append to it the domain separator D: for example, if D is the two-byte sequence 0x4142, then the example above would yield the string (if big-endian convention is used): 0x80000006000000034142.
Hash the string obtained with the hasher H to a 256-bit tag T (truncating the hash if needed).
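The four steps above can be sketched in Python with the standard hashlib module. The function names and the tuple representation of calls are illustrative assumptions; the encoding, the big-endian serialization, and the default hasher SHA3-256 follow the text.

```python
import hashlib

ABSORB_FLAG = 0x80000000   # MSB set marks an ABSORB word

def encode_pattern(calls):
    """calls: list of ("absorb" | "squeeze", length) -> aggregated 32-bit words."""
    words = []
    for kind, length in calls:
        if not 0 < length < ABSORB_FLAG:
            raise ValueError("length must be in 1..2^31 - 1")
        flag = ABSORB_FLAG if kind == "absorb" else 0
        if words and (words[-1] & ABSORB_FLAG) == flag:
            words[-1] += length          # same call type as the previous word: merge
        else:                            # (overflow of a merged length is not handled here)
            words.append(flag | length)
    return words

def serialize(words, domain_separator):
    """Big-endian 32-bit words followed by the domain separator bytes."""
    return b"".join(w.to_bytes(4, "big") for w in words) + domain_separator

def compute_tag(calls, domain_separator=b""):
    """256-bit tag T = SHA3-256(serialized pattern || D)."""
    return hashlib.sha3_256(serialize(encode_pattern(calls), domain_separator)).digest()

calls = [("absorb", 3), ("absorb", 3), ("squeeze", 3)]
words = encode_pattern(calls)
print([hex(w) for w in words])               # ['0x80000006', '0x3']
print(serialize(words, b"\x41\x42").hex())   # 80000006000000034142
print(compute_tag(calls, b"\x41\x42").hex())
```

The second printed line reproduces the byte string of the running example; the third line is the resulting tag T.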

Given its tag string, an instance admits an arbitrary number of executions, which are in addition characterized by an input Y \in (\mathbb{F}^r)^\star. In other words, a tag is not like an execution-specific nonce; it is a characterization of the expected usage in terms of IO pattern, and of a domain separator (which can be used to create different instances/tags for a same IO pattern).

Remark 2

The 32-bit encoding restricts the number of elements absorbed or squeezed to 2^{31} - 1 per call. For applications that need to absorb or squeeze a larger number of elements, the operation must therefore be done via multiple calls, rather than a single one.

Remark 3

If the hash function used to create the tag receives field elements rather than byte strings, and can directly process 32-bit integers as field elements, then the serialization mechanism (including endianness aspects) is not needed.

3.4 Detailed API

Everything begins with START, which computes a tag from the IO pattern and domain separator. This operation is unique as it writes the tag into the inner part of the state, which requires a new security proof (Section 3.2). Each call to ABSORB or SQUEEZE both:

Writes to or reads the outer part of the permutation state and calls the permutation P.
Verifies its own parameters against the initially supplied IO pattern (“early abort” misuse detection).

When all calls are done, the FINISH operation verifies that no call is left undone.

Algorithm 1 — START and FINISH Operations

If field elements are 248 bits or more, T is converted to a field element. Otherwise T is parsed as two or more field elements (but at most c/2 elements, if c is the arithmetic capacity).

START:

Given an IO pattern \mathit{IO} (as a list of calls with the respective number of elements) and a byte string D used as domain separator, compute the tag T as described in Section 3.3.
Set the permutation state to all zeros and add T to the first \min(256, \log_2 |\mathbb{F}|) bits of the inner part of the state (with respect to the field’s addition).
Set both absorb and squeeze positions to zero: \text{absorb\_pos} = \text{squeeze\_pos} = 0.
Set the IO count to zero: \text{io\_count} = 0.
Set the IO pattern expected to \mathit{IO}[L].

FINISH:

Check that \text{io\_count} equals the length of the IO pattern expected. Return an error otherwise.
Erase the state and its variables.
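The conversion of T into field elements and the resulting initial state might be sketched as follows. Several details are assumptions of this sketch rather than statements of the specification: T is read as a big-endian integer, it is reduced modulo p when the field has at least 248 bits, smaller fields receive the low-order chunks of T (the remainder is dropped), and the inner part is taken to start at index r of the state. The primes 2^255 - 19 and 2^61 - 1 are arbitrary illustrative choices.

```python
import hashlib

def tag_to_field_elements(tag, p, capacity):
    """Turn the 256-bit tag into one or more elements of the prime field of order p."""
    t = int.from_bytes(tag, "big")             # ASSUMPTION: big-endian interpretation
    bits = p.bit_length()
    if bits >= 248:
        return [t % p]                         # ASSUMPTION: reduce modulo p
    chunk_bits = bits - 1                      # any chunk_bits-bit value is below p
    count = max(1, capacity // 2)              # at most c/2 elements
    count = min(count, -(-256 // chunk_bits))  # never more chunks than T contains
    mask = (1 << chunk_bits) - 1
    return [(t >> (i * chunk_bits)) & mask for i in range(count)]

def start_state(tag, p, rate, capacity):
    """All-zero state with the tag elements added to the inner part."""
    state = [0] * (rate + capacity)
    for i, element in enumerate(tag_to_field_elements(tag, p, capacity)):
        state[rate + i] = element              # adding to zero equals writing
    return state

tag = hashlib.sha3_256(b"example").digest()
large = start_state(tag, 2**255 - 19, rate=2, capacity=1)
small = start_state(tag, 2**61 - 1, rate=2, capacity=4)
print(len(large), len(small))                  # 3 6
```

In the large-field case a single capacity element carries the tag. In the small-field case two of the four capacity elements are used, and the remaining two stay at zero.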

Algorithm 2 — ABSORB and SQUEEZE Operations

ABSORB:

If L = 0, return.
For i = 0, 1, \ldots, L - 1:
- If \text{absorb\_pos} = (n - c), then set V = P(V) to permute the state, and set \text{absorb\_pos} = 0 to restart writing at the zero offset.
- Add X[i] to the state element at \text{absorb\_pos}.
- Do \text{absorb\_pos}\mathord{++}.
Compute the 32-bit encoding of L + 2^{31}, that is, the IO pattern word for an ABSORB call of L elements (MSB set to 1).
Verify that the word obtained is equal to the \text{io\_count}-th word of the IO pattern expected; abort upon mismatch (and erase the state).
Do \text{io\_count}\mathord{++}.
Set \text{squeeze\_pos} = (n - c), to force a permute at the start of the next SQUEEZE.

SQUEEZE:

If L = 0, return.
For i = 0, 1, \ldots, L - 1:
- If \text{squeeze\_pos} = (n - c), then set V = P(V) to permute the state, set \text{squeeze\_pos} = 0 to restart reading output at the zero offset, and set \text{absorb\_pos} = 0 to start writing at the zero offset in the next ABSORB.
- Set Y[i] = V[\text{squeeze\_pos}].
- Do \text{squeeze\_pos}\mathord{++}.
Compute the 32-bit encoding of L, that is, the IO pattern word for a SQUEEZE call of L elements (MSB set to 0).
Verify that the word obtained is equal to the \text{io\_count}-th word of the IO pattern expected; abort upon mismatch (and erase the state).
Do \text{io\_count}\mathord{++}.

Remark 4

In SQUEEZE, we do not set \text{absorb\_pos} to (n - c) (by analogy with ABSORB, which sets \text{squeeze\_pos} to (n - c)), as we may want the state to absorb at the same positions that have been squeezed, for example in authenticated encryption.
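Algorithms 1 and 2 can be put together in a compact Python sketch. Everything that is not in the text above is an assumption made for illustration: the toy prime 2^61 - 1, the parameters r = c = 2, the insecure toy_permutation standing in for P, and the reduction of the tag into a single capacity element. IO pattern words follow the encoding of Section 3.3 (MSB set for ABSORB).

```python
import hashlib

P = 2**61 - 1                 # toy prime modulus (illustrative assumption)
RATE, CAPACITY = 2, 2         # toy parameters; n = r + c
N = RATE + CAPACITY
ABSORB_FLAG = 1 << 31

def toy_permutation(state):
    """Invertible but cryptographically worthless stand-in for P."""
    v = list(state)
    for rnd in range(8):
        for i in range(N):
            v[i] = (v[i] + v[i - 1] ** 2 + rnd * N + i + 1) % P
    return v

def compute_tag(words, domain_separator):
    """Aggregate same-type words, serialize big-endian, hash with SHA3-256."""
    merged = []
    for w in words:
        if merged and ((merged[-1] ^ w) & ABSORB_FLAG) == 0:
            merged[-1] += w & (ABSORB_FLAG - 1)
        else:
            merged.append(w)
    data = b"".join(w.to_bytes(4, "big") for w in merged) + domain_separator
    return hashlib.sha3_256(data).digest()

class SafeSponge:
    def __init__(self, words, domain_separator=b""):        # START
        tag = int.from_bytes(compute_tag(words, domain_separator), "big")
        self.state = [0] * N
        self.state[RATE] = tag % P    # simplification: tag squashed into one capacity element
        self.absorb_pos = self.squeeze_pos = 0
        self.io_count = 0
        self.expected = list(words)

    def _check(self, word):
        if self.io_count >= len(self.expected) or self.expected[self.io_count] != word:
            self.state = None                                # erase the state on misuse
            raise ValueError("call does not match the IO pattern")
        self.io_count += 1

    def absorb(self, xs):                                    # ABSORB
        if len(xs) == 0:
            return
        for x in xs:
            if self.absorb_pos == RATE:
                self.state = toy_permutation(self.state)
                self.absorb_pos = 0
            self.state[self.absorb_pos] = (self.state[self.absorb_pos] + x) % P
            self.absorb_pos += 1
        self._check(ABSORB_FLAG | len(xs))
        self.squeeze_pos = RATE                              # force a permute before the next SQUEEZE

    def squeeze(self, length):                               # SQUEEZE
        if length == 0:
            return []
        out = []
        for _ in range(length):
            if self.squeeze_pos == RATE:
                self.state = toy_permutation(self.state)
                self.squeeze_pos = self.absorb_pos = 0
            out.append(self.state[self.squeeze_pos])
            self.squeeze_pos += 1
        self._check(length)
        return out

    def finish(self):                                        # FINISH
        complete = self.io_count == len(self.expected)
        self.state = None                                    # erase the state
        if not complete:
            raise ValueError("IO pattern not completed")

pattern = [ABSORB_FLAG | 3, 1]            # ABSORB(3) then SQUEEZE(1)
sponge = SafeSponge(pattern, b"demo")
sponge.absorb([1, 2, 3])
digest = sponge.squeeze(1)
sponge.finish()

bad = SafeSponge(pattern, b"demo")
try:
    bad.squeeze(1)                        # the pattern expects ABSORB(3) first
    misuse_detected = False
except ValueError:
    misuse_detected = True
print(misuse_detected)                    # True
```

The second sponge shows the “early abort” behaviour: a SQUEEZE issued where the pattern expects an ABSORB is rejected and the state is erased.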

4. SAFE Applications

SAFE among others supports the following use cases.

Algorithm 3 — SAFE for Fixed-Length Hashing

Protocol:

Select X_1, X_2, \ldots, X_L for hashing.
Get hash T.

Sponge calls via SAFE:

START(IO[2], D) with IO the encoding of the following calls, and D an arbitrary domain separator.
ABSORB(L, X[]).
T \leftarrow SQUEEZE(1).
FINISH().

If the L elements are absorbed using more than one call – for example, via ABSORB(1, X_1) followed by ABSORB(L - 1, (X_2, \ldots, X_L)) – then the resulting hash will not change. No padding is required.
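The invariance under splitting can be checked on a toy instance. This is a minimal sketch, assuming an illustrative prime 2^61 - 1, r = c = 2, an insecure toy permutation, and a tag reduced into one capacity element; run-time IO pattern checks are left out to keep the example short.

```python
import hashlib

P = 2**61 - 1          # toy prime modulus (illustrative assumption)
RATE, N = 2, 4         # rate r = 2, width n = 4, so capacity c = 2
ABSORB = 1 << 31

def permute(state):
    """Invertible toy mixing, NOT a secure permutation."""
    v = list(state)
    for rnd in range(8):
        for i in range(N):
            v[i] = (v[i] + v[i - 1] ** 2 + rnd * N + i + 1) % P
    return v

def start(calls, domain_separator):
    """START: aggregate the calls, hash them with D, seed the capacity."""
    words = []
    for kind, length in calls:
        flag = ABSORB if kind == "absorb" else 0
        if words and (words[-1] & ABSORB) == flag:
            words[-1] += length
        else:
            words.append(flag | length)
    data = b"".join(w.to_bytes(4, "big") for w in words) + domain_separator
    tag = int.from_bytes(hashlib.sha3_256(data).digest(), "big")
    return [0] * RATE + [tag % P] + [0] * (N - RATE - 1)

def safe_hash(chunks, domain_separator=b"hash-demo"):
    """Absorb each chunk with its own ABSORB call, then SQUEEZE(1)."""
    calls = [("absorb", len(c)) for c in chunks] + [("squeeze", 1)]
    state, pos = start(calls, domain_separator), 0
    for chunk in chunks:
        for x in chunk:
            if pos == RATE:
                state, pos = permute(state), 0
            state[pos] = (state[pos] + x) % P
            pos += 1
    return permute(state)[0]      # SQUEEZE(1) always permutes after an ABSORB

xs = [11, 22, 33, 44, 55]
one_call = safe_hash([xs])
split = safe_hash([xs[:1], xs[1:]])
print(one_call == split)          # True
```

Both runs aggregate to the same pattern (one ABSORB of 5 elements, one SQUEEZE of 1 element), so they start from the same tag, and the absorb position simply carries over from one ABSORB call to the next.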

Algorithm 4 — SAFE for Merkle Tree of Arity 2

Protocol:

Select child node hashes X_1, X_2.
Get parent hash T.

Sponge calls via SAFE:

START(IO[2], D).
ABSORB(2, X[]).
T \leftarrow SQUEEZE(1).
FINISH().

Algorithm 5 — SAFE for Commitment Schemes

Protocol:

Select values for commitment X_1, X_2, X_3 \in \mathbb{F}^2.
Get commitment C.

Sponge calls via SAFE:

START(IO[4], D).
ABSORB(2, X_1[]).
ABSORB(2, X_2[]).
ABSORB(2, X_3[]).
C \leftarrow SQUEEZE(1).
FINISH().

Note that the tag will be the same for committing six 1-field elements. If this difference matters for an application, a domain separator should be used.
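This note is easy to verify with the tag derivation of Section 3.3. In the sketch below, the function name tag, the tuple representation of calls, and the two domain separator strings are illustrative assumptions.

```python
import hashlib

ABSORB_FLAG = 1 << 31

def tag(calls, domain_separator=b""):
    """Aggregate the calls, serialize big-endian, append D, hash with SHA3-256."""
    words = []
    for kind, length in calls:
        flag = ABSORB_FLAG if kind == "absorb" else 0
        if words and (words[-1] & ABSORB_FLAG) == flag:
            words[-1] += length
        else:
            words.append(flag | length)
    data = b"".join(w.to_bytes(4, "big") for w in words) + domain_separator
    return hashlib.sha3_256(data).digest()

three_pairs = [("absorb", 2)] * 3 + [("squeeze", 1)]
six_singles = [("absorb", 1)] * 6 + [("squeeze", 1)]

print(tag(three_pairs) == tag(six_singles))                         # True
print(tag(three_pairs, b"pairs") == tag(six_singles, b"singles"))   # False
```

Both patterns aggregate to one ABSORB of 6 elements followed by one SQUEEZE of 1 element, so only distinct domain separators make the two instances differ.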

Algorithm 6 — SAFE for Sigma Protocols

Interactive form:

Parties agree on the common input Z \in \mathbb{F}^z.
Prover prepares and sends proof elements \pi_1 \in \mathbb{F}^{L_1} and \pi_2 \in \mathbb{F}^{L_2}.
Verifier responds with challenge c_1 \in \mathbb{F}.
Prover prepares and sends proof element \pi_3 \in \mathbb{F}^{L_3}.
Verifier responds with challenges c_2, c_3 \in \mathbb{F}.
Prover sends final proof \pi_4.

Sponge calls via SAFE:

START(IO[6], D) with IO the encoding of the following calls, and D an arbitrary domain separator.
ABSORB(z, Z).
ABSORB(L_1, \pi_1).
ABSORB(L_2, \pi_2).
c_1 \leftarrow SQUEEZE(1).
ABSORB(L_3, \pi_3).
c_2 \leftarrow SQUEEZE(1).
c_3 \leftarrow SQUEEZE(1).
FINISH().

Note that the sponge absorbs exactly those elements that the Prover sends to the Verifier. Thus a protocol designer can use a simple rule of thumb: just absorb everything that is sent out.
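The verifier's side of the non-interactive variant can be sketched as follows: it replays the same calls over the received proof elements and recomputes the challenges. The class MiniSafe, the toy prime 2^61 - 1, the toy permutation, and the domain separator are assumptions made for illustration; run-time IO pattern checks and FINISH are omitted for brevity. A prover would issue the same calls incrementally, since \pi_3 depends on c_1.

```python
import hashlib

P = 2**61 - 1            # toy prime modulus (illustrative assumption)
RATE, N = 2, 4           # rate r = 2, width n = 4, so capacity c = 2
ABSORB = 1 << 31

def permute(state):
    """Invertible toy mixing, NOT a secure permutation."""
    v = list(state)
    for rnd in range(8):
        for i in range(N):
            v[i] = (v[i] + v[i - 1] ** 2 + rnd * N + i + 1) % P
    return v

class MiniSafe:
    """SAFE-style duplex; run-time IO pattern checks omitted for brevity."""
    def __init__(self, calls, domain_separator):
        words = []
        for kind, length in calls:
            flag = ABSORB if kind == "absorb" else 0
            if words and (words[-1] & ABSORB) == flag:
                words[-1] += length
            else:
                words.append(flag | length)
        data = b"".join(w.to_bytes(4, "big") for w in words) + domain_separator
        tag = int.from_bytes(hashlib.sha3_256(data).digest(), "big")
        self.state = [0] * RATE + [tag % P] + [0] * (N - RATE - 1)
        self.absorb_pos = self.squeeze_pos = 0

    def absorb(self, xs):
        for x in xs:
            if self.absorb_pos == RATE:
                self.state, self.absorb_pos = permute(self.state), 0
            self.state[self.absorb_pos] = (self.state[self.absorb_pos] + x) % P
            self.absorb_pos += 1
        if xs:
            self.squeeze_pos = RATE

    def squeeze(self, length):
        out = []
        for _ in range(length):
            if self.squeeze_pos == RATE:
                self.state = permute(self.state)
                self.squeeze_pos = self.absorb_pos = 0
            out.append(self.state[self.squeeze_pos])
            self.squeeze_pos += 1
        return out

def derive_challenges(Z, pi1, pi2, pi3):
    """Replay the calls of Algorithm 6 and return (c_1, c_2, c_3)."""
    calls = [("absorb", len(Z)), ("absorb", len(pi1)), ("absorb", len(pi2)),
             ("squeeze", 1), ("absorb", len(pi3)), ("squeeze", 1), ("squeeze", 1)]
    sponge = MiniSafe(calls, b"sigma-demo")
    sponge.absorb(Z)
    sponge.absorb(pi1)
    sponge.absorb(pi2)
    (c1,) = sponge.squeeze(1)
    sponge.absorb(pi3)
    (c2,) = sponge.squeeze(1)
    (c3,) = sponge.squeeze(1)
    return c1, c2, c3

a = derive_challenges([1, 2], [3, 4, 5], [6], [7, 8])
b = derive_challenges([1, 2], [3, 4, 5], [6], [7, 9])
print(a[0] == b[0])      # True: c_1 is fixed before pi_3 is absorbed
```

Changing the content of \pi_3 leaves c_1 untouched, because c_1 is squeezed before \pi_3 enters the state, while c_2 and c_3 are derived from a state that already includes \pi_3.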

Algorithm 7 — SAFE for Authenticated Encryption

Protocol:

Get key K \in \mathbb{F}^k.
Get nonce N \in \mathbb{F}^m.
Encrypt data blocks D_1, D_2, \ldots, D_b, where D_i \in \mathbb{F}^{L_i}.
Get ciphertext C.

Sponge calls via SAFE:

START(IO[2b + 2], D).
ABSORB(k, K).
ABSORB(m, N).
C_1 \leftarrow SQUEEZE(L_1).
ABSORB(L_1, D_1).
C_2 \leftarrow SQUEEZE(L_2).
ABSORB(L_2, D_2).
…
C_b \leftarrow SQUEEZE(L_b).
ABSORB(L_b, D_b).
S \leftarrow SQUEEZE(1).
FINISH(). Upon success of FINISH() and of previous calls, the string (C_1 + D_1) \| (C_2 + D_2) \| \cdots \| (C_b + D_b) \| S will be the ciphertext, where “+” denotes addition in \mathbb{F}.

This construction is a simplification of the SpongeWrap mode [BDPV11]. It is most efficient when L_i \equiv 0 \pmod{r}, that is, all blocks fit the rate parameter of the sponge. This mode can be adapted to support associated data (authenticated but not encrypted), in the same vein as the SpongeWrap mode. Note that there is no padding overhead, nor are unneeded calls to P spent.
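Decryption is not spelled out above; one natural reading is that the receiver squeezes the same C_i, subtracts it from the ciphertext block to recover D_i, absorbs D_i, and finally compares the last squeezed element with S. The sketch below follows that reading. The toy prime, the toy permutation, the MiniSafe class, and the domain separator are illustrative assumptions, and IO pattern checks are omitted.

```python
import hashlib

P = 2**61 - 1            # toy prime modulus (illustrative assumption)
RATE, N = 2, 4           # rate r = 2, width n = 4, so capacity c = 2
ABSORB = 1 << 31

def permute(state):
    """Invertible toy mixing, NOT a secure permutation."""
    v = list(state)
    for rnd in range(8):
        for i in range(N):
            v[i] = (v[i] + v[i - 1] ** 2 + rnd * N + i + 1) % P
    return v

class MiniSafe:
    """SAFE-style duplex; run-time IO pattern checks omitted for brevity."""
    def __init__(self, calls, domain_separator):
        words = []
        for kind, length in calls:
            flag = ABSORB if kind == "absorb" else 0
            if words and (words[-1] & ABSORB) == flag:
                words[-1] += length
            else:
                words.append(flag | length)
        data = b"".join(w.to_bytes(4, "big") for w in words) + domain_separator
        tag = int.from_bytes(hashlib.sha3_256(data).digest(), "big")
        self.state = [0] * RATE + [tag % P] + [0] * (N - RATE - 1)
        self.absorb_pos = self.squeeze_pos = 0
    def absorb(self, xs):
        for x in xs:
            if self.absorb_pos == RATE:
                self.state, self.absorb_pos = permute(self.state), 0
            self.state[self.absorb_pos] = (self.state[self.absorb_pos] + x) % P
            self.absorb_pos += 1
        if xs:
            self.squeeze_pos = RATE
    def squeeze(self, length):
        out = []
        for _ in range(length):
            if self.squeeze_pos == RATE:
                self.state = permute(self.state)
                self.squeeze_pos = self.absorb_pos = 0
            out.append(self.state[self.squeeze_pos])
            self.squeeze_pos += 1
        return out

def open_sponge(key, nonce, lengths):
    """START with the pattern of Algorithm 7, then absorb key and nonce."""
    calls = [("absorb", len(key)), ("absorb", len(nonce))]
    for n in lengths:
        calls += [("squeeze", n), ("absorb", n)]
    sponge = MiniSafe(calls + [("squeeze", 1)], b"ae-demo")
    sponge.absorb(key)
    sponge.absorb(nonce)
    return sponge

def encrypt(key, nonce, blocks):
    sponge, out = open_sponge(key, nonce, [len(b) for b in blocks]), []
    for block in blocks:
        stream = sponge.squeeze(len(block))
        sponge.absorb(block)
        out.append([(s + d) % P for s, d in zip(stream, block)])
    return out, sponge.squeeze(1)[0]

def decrypt(key, nonce, ciphertext, auth):
    sponge, out = open_sponge(key, nonce, [len(c) for c in ciphertext]), []
    for ct in ciphertext:
        stream = sponge.squeeze(len(ct))
        out.append([(c - s) % P for c, s in zip(ct, stream)])
        sponge.absorb(out[-1])
    if sponge.squeeze(1)[0] != auth:
        raise ValueError("authentication failed")
    return out

key, nonce, blocks = [7, 8], [9], [[100, 200], [300, 400]]
ciphertext, auth = encrypt(key, nonce, blocks)
print(decrypt(key, nonce, ciphertext, auth) == blocks)    # True

tampered = [list(c) for c in ciphertext]
tampered[0][0] = (tampered[0][0] + 1) % P
try:
    decrypt(key, nonce, tampered, auth)
    rejected = False
except ValueError:
    rejected = True
```

Here every block fits the rate (L_i = r = 2), which is the efficient case noted above: each block costs exactly one call to the permutation.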

Algorithm 8 — SAFE for Stream Cipher and PRNG

Protocol:

If PRNG then: Get seed S \in \mathbb{F}^s.
Else (stream cipher): Get key K \in \mathbb{F}^k and nonce N \in \mathbb{F}^m.
Generate L stream elements C[\,].
If stream cipher then: Encrypt data D[\,] with C[\,].

Sponge calls via SAFE:

If PRNG then: ABSORB(s, S).
Else (stream cipher): ABSORB(k, K); then ABSORB(m, N).
C_1 \leftarrow SQUEEZE(L_1).
C_2 \leftarrow SQUEEZE(L_2).
…
C_b \leftarrow SQUEEZE(L_b).
FINISH(). For the stream cipher case, the plaintext D_1, \ldots, D_b with D_i consisting of L_i field elements is encrypted to (C_1 + D_1) \| (C_2 + D_2) \| \cdots \| (C_b + D_b).
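Since this mode only absorbs first and squeezes afterwards, it can be sketched with a single function. The function keystream, the toy prime 2^61 - 1, the toy permutation, and the domain separators are illustrative assumptions; the START step is included in the function, following the general workflow of Section 3.1.

```python
import hashlib

P = 2**61 - 1            # toy prime modulus (illustrative assumption)
RATE, N = 2, 4           # rate r = 2, width n = 4, so capacity c = 2
ABSORB_FLAG = 1 << 31

def permute(state):
    """Invertible toy mixing, NOT a secure permutation."""
    v = list(state)
    for rnd in range(8):
        for i in range(N):
            v[i] = (v[i] + v[i - 1] ** 2 + rnd * N + i + 1) % P
    return v

def keystream(inputs, lengths, domain_separator):
    """ABSORB every list in inputs, then SQUEEZE blocks of the given lengths."""
    absorbed = [x for chunk in inputs for x in chunk]
    words = [ABSORB_FLAG | len(absorbed), sum(lengths)]     # aggregated IO pattern
    data = b"".join(w.to_bytes(4, "big") for w in words) + domain_separator
    tag = int.from_bytes(hashlib.sha3_256(data).digest(), "big")
    state = [0] * RATE + [tag % P] + [0] * (N - RATE - 1)   # START
    pos = 0
    for x in absorbed:                                      # ABSORB calls
        if pos == RATE:
            state, pos = permute(state), 0
        state[pos] = (state[pos] + x) % P
        pos += 1
    blocks, pos = [], RATE                                  # force a permute before squeezing
    for length in lengths:                                  # SQUEEZE calls
        block = []
        for _ in range(length):
            if pos == RATE:
                state, pos = permute(state), 0
            block.append(state[pos])
            pos += 1
        blocks.append(block)
    return blocks

# Stream cipher: add the keystream to encrypt, subtract it to decrypt.
key, nonce = [7, 8], [9]
data_blocks = [[100, 200], [300, 400, 500]]
stream = keystream([key, nonce], [len(b) for b in data_blocks], b"stream-demo")
ciphertext = [[(s + d) % P for s, d in zip(sb, db)] for sb, db in zip(stream, data_blocks)]
recovered = [[(c - s) % P for c, s in zip(cb, sb)] for cb, sb in zip(ciphertext, stream)]
print(recovered == data_blocks)      # True

# PRNG: absorb a seed, squeeze field elements.
random_elements = keystream([[12345]], [4], b"prng-demo")[0]
```

Because consecutive SQUEEZE calls are aggregated and the squeeze position carries over between them, squeezing blocks of 2 and 3 elements yields the same elements as squeezing 5 at once.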

References

[AAB^+20] Abdelrahaman Aly, Tomer Ashur, Eli Ben-Sasson, Siemen Dhooghe, and Alan Szepieniec. Design of Symmetric-Key Primitives for Advanced Cryptographic Protocols. IACR ToSC, (3):1–45, 2020.
[AGR^+16] Martin R. Albrecht, Lorenzo Grassi, Christian Rechberger, Arnab Roy, and Tyge Tiessen. MiMC: Efficient Encryption and Cryptographic Hashing with Minimal Multiplicative Complexity. In ASIACRYPT, 2016.
[BDPV07] Guido Bertoni, Joan Daemen, Michael Peeters, and Gilles Van Assche. Sponge functions. Ecrypt Hash Workshop, May 2007.
[BDPV11] Guido Bertoni, Joan Daemen, Michael Peeters, and Gilles Van Assche. Duplexing the sponge: Single-pass authenticated encryption and other applications. In SAC, 2011.
[CDMP05] Jean-Sebastien Coron, Yevgeniy Dodis, Cecile Malinaud, and Prashant Puniya. Merkle–Damgard revisited: How to construct a hash function. In CRYPTO, 2005.
[GKR^+21] Lorenzo Grassi, Dmitry Khovratovich, Arnab Roy, Christian Rechberger, and Markus Schofnegger. Poseidon: A new hash function for zero-knowledge proof systems. USENIX Security, 2021.
[HBHW22] Daira Hopwood, Sean Bowe, Taylor Hornby, and Nathan Wilcox. Zcash protocol specification, 2022.
[LRG^+22] Reinhard Luftenegger, Christian Rechberger, Lorenzo Grassi, Markus Schofnegger, Roman Walch, and Dmitry Khovratovich. Reinforced Concrete: A fast hash function for verifiable computation. In ACM CCS, 2022.
[MRH04] Ueli M. Maurer, Renato Renner, and Clemens Holenstein. Indifferentiability, impossibility results on reductions, and applications to the random oracle methodology. In TCC, 2004.
