cryptography Cryptography, or cryptology (from grc, , translit=kryptós "hidden, secret"; and ''graphein'', "to write", or ''-logia'', "study", respectively), is the practice and study of techniques for secure communication in the presence of adve ...

, an accumulator is a one way membership

hash function A hash function is any function that can be used to map data of arbitrary size to fixed-size values. The values returned by a hash function are called ''hash values'', ''hash codes'', ''digests'', or simply ''hashes''. The values are usually ...

. It allows users to certify that potential candidates are a member of a certain set without revealing the individual members of the set. This concept was formally introduced by Josh Benaloh and Michael de Mare in 1993.

Formal definitions

There are several formal definitions which have been proposed in the literature. This section lists them by proposer, in roughly chronological order.

Benaloh and de Mare (1993)

Benaloh and de Mare define a one-way hash function as a family of functions

h_: X_\times Y_\to Z_

which satisfy the following three properties: # For all

\ell\in\mathbb, x\in X_, y\in Y_

, one can compute

h_(x,y)

in time

\text(\ell, , x, , , y, )

. (Here the "poly" symbol refers to an unspecified, but fixed, polynomial.) # No probabilistic polynomial-time algorithm will, for sufficiently large

\ell

, map the inputs

\ell\in\mathbb, (x, y)\in X_\times Y_, y'\in Y_

, find a value

x'\in X_

such that

h_(x, y) = h_(x', y')

with more than negligible probability. # For all

\ell\in\mathbb, x\in X_, y_1, y_2\in Y_

, one has

h(h(x, y_1), y_2) = h(h(x, y_2), y_1)

. (A function that satisfies this property is called ''quasi-commutative''.) (With the first two properties, one recovers the normal definition of a cryptographic hash function.) From such a function, one defines the "accumulated hash" of a set

\

and starting value

x

w.r.t. a value

z

to be

h(h(\cdots h(h(x, y_1), y_2),\dots, y_), y_m)

. The result, does not depend on the order of elements

y_1,y_2,...,y_n

because

h

is quasi-commutative. If

y_1,y_2,...,y_n

belong to some users of a cryptosystem, then everyone can compute the accumulated value

z.

Also, the user of

y_i

can compute the partial accumulated value

z_i

( y_1,...,y_,y_,...,y_n )

. Then,

h(z_i,y_i)=z.

So the

i -

user can provide the pair

(z_i,y_i)

to any other part, in order to authenticate

y_i

Barić and Pfitzmann (1997)

The basic functionality of a quasi-commutative hash function is not immediate from the definition. To fix this, Barić and Pfitzmann defined a slightly more general definition, which is the notion of an ''accumulator scheme'' as consisting of the following components: # Gen: a probabilistic algorithm that takes in two parameters

\lambda, N

(the security parameter and the number of values that can be securely accumulated, respectively), and returns an appropriate key

k

. # Eval: a probabilistic algorithm that takes in a key

k

and accumulation set

Y:=\

, where

N'\leq N

, and returning an accumulated value

z

and auxiliary information

aux

. We insist that Eval ''must'' be deterministic for

z

. # Wit: a probabilistic algorithm that takes in a key

k

, a value

y

, an accumulated value

z

of some set

Y

, and some auxiliary information

aux

, and returns either a witness

w

or the special symbol

\bot

. We insist that, if

y\in L

, that Wit returns a witness, and that Wit otherwise returns

\bot

. # Ver: a ''deterministic'' algorithm that takes in a key

k

, a value

y

, a witness

w

, and an accumulated value

z

, and returns a Yes/No value. We insist that if

w

was generated from running Wit on a tuple

(k, y, z, aux)

, where

z, aux

were generated from running Eval on some

k, L

, and where

L

was chosen arbitrarily and

k

was chosen from running Gen, that Ver always return Yes. It is relatively easy to see that one can define an accumulator scheme from any quasi-commutative hash function, using the technique shown above.

Camenisch and Lysyanskaya (2002)

One observes that, for many applications, the set of accumulated values will change many times. Naïvely, one could completely redo the accumulator calculation every time; however, this may be inefficient, especially if our set is very large and the change is very small. To formalize this intuition, Camenish and Lysyanskaya defined a ''dynamic accumulator scheme'' to consist of the 4 components of an ordinary accumulator scheme, plus three more: # Add: a (possibly probabilistic) algorithm that takes in a key

k

, an accumulated value

z

, and another value to accumulate

y

, and returns a new accumulated value

z'

and auxiliary information

aux

. We insist that if

z

was generated by accumulating some set

L

, then

z'

must be as if it were generated by accumulating the set

L\cup\

. # Del: a (possibly probabilistic) algorithm that takes in a key

k

, an accumulated value

z

, and another value to accumulate

y

, and returns a new accumulated value

z'

and auxiliary information

aux

. We insist that if

z

was generated by accumulating some set

L

, then

z'

must be as if it were generated by accumulating the set

L\backslash\

. # Upd: a deterministic algorithm that takes in the key

k

, a value

y

, a witness

w

, the accumulated value

z

, and auxiliary information

aux

, and returns a new witness

w'

. We insist that if

k

was generated by Gen,

y

is part of a set

L

w

is a witness for

y

being a member of

L

, and

z

is an accumulated value for

L

, and

aux

was generated by running Add or Del, then

w'

will be a witness for

y

being a member of the new set. Fazio and Nicolosi note that since Add, Del, and Upd can be simulated by rerunning Eval and Wit, this definition does not add any fundamentally new functionality.

Examples

One example is

multiplication Multiplication (often denoted by the cross symbol , by the mid-line dot operator , by juxtaposition, or, on computers, by an asterisk ) is one of the four elementary mathematical operations of arithmetic, with the other ones being ad ...

over large

prime number A prime number (or a prime) is a natural number greater than 1 that is not a product of two smaller natural numbers. A natural number greater than 1 that is not prime is called a composite number. For example, 5 is prime because the only way ...

s. This is a cryptographic accumulator, since it takes superpolynomial time to

factor Factor, a Latin word meaning "who/which acts", may refer to: Commerce * Factor (agent), a person who acts for, notably a mercantile and colonial agent * Factor (Scotland), a person or firm managing a Scottish estate * Factors of production, ...

a composite number (at least according to conjecture), but it takes only a small amount of time (polynomial in size) to divide a prime into an integer to check if it is one of the factors and/or to factor it out. New members may be added or subtracted to the set of factors by multiplying or factoring out the number respectively. In this system, two accumulators that have accumulated a single shared prime can have it trivially discovered by calculating their GCD, even without prior knowledge of the prime (which would otherwise require prime factorization of the accumulator to discover). More practical accumulators use a

quasi-commutative In mathematics, the quasi-commutative property is an extension or generalization of the general commutative property. This property is used in specific applications with various definitions. Applied to matrices Two matrices p and q are said to h ...

hash function, so that the size of the accumulator does not grow with the number of members. For example, Benaloh and de Mare propose a cryptographic accumulator inspired by

RSA RSA may refer to: Organizations Academia and education * Rabbinical Seminary of America, a yeshiva in New York City *Regional Science Association International (formerly the Regional Science Association), a US-based learned society *Renaissance S ...

: the quasi-commutative function

h(x, y) := x^y\pmod

for some composite number

n

. They recommend to choose

n

to be a ''rigid'' integer (i.e. the product of two

safe prime In number theory, a prime number ''p'' is a if 2''p'' + 1 is also prime. The number 2''p'' + 1 associated with a Sophie Germain prime is called a . For example, 11 is a Sophie Germain prime and 2 × 11 + ...

s). Barić and Pfitzmann proposed a variant where

y

was restricted to be prime and at most

n/4

(this constant is very close to

\phi(n)

, but does not leak information about the prime factorization of

n

David Naccache David Naccache is a cryptographer, currently a professor at the École normale supérieure and a member of its Computer Laboratory. He was previously a professor at Panthéon-Assas University. Biography He received his Ph.D. in 1995 from the ...

observed in 1993 that

e_(x, y) := x^y c^ \pmod

is quasi-commutative for all constants

c, n

, generalizing the previous RSA-inspired cryptographic accumulator. Naccache also noted that the Dickson polynomials are quasi-commutative in the degree, but it is unknown whether this family of functions is one-way. In 1996, Nyberg constructed an accumulator which is provably information-theoretically secure in the

random oracle model In cryptography, a random oracle is an oracle (a theoretical black box) that responds to every ''unique query'' with a (truly) random response chosen uniformly from its output domain. If a query is repeated, it responds the same way every time tha ...

. Choosing some upper limit

N = 2^d

for the number of items that can be securely accumulated and

\lambda

the security parameter, define the constant

\ell:\approx\frac\lambda N\log_2(N)

to be an integer multiple of

d

(so that one can write

\ell = rd

) and let

H:\^* \to \^

be some cryptographically secure hash function. Choose a key

k

as a random

r

-bit bitstring. Then, to accumulate using Nyberg's scheme, use the quasi-commutative hash function

h(x, y):= x\odot\alpha_r(H(y))

, where

\odot

is the

bitwise and In computer programming, a bitwise operation operates on a bit string, a bit array or a binary numeral (considered as a bit string) at the level of its individual bits. It is a fast and simple action, basic to the higher-level arithmetic operat ...

operation and

\alpha_r:\^\to \^r

is the function that interprets its input as a sequence of

d

-bit bitstrings of length

r

, replaces every all-zero bitstring with a single 0 and every other bitstring with a 1, and outputs the result.

Applications

Haber and Stornetta showed in 1990 that accumulators can be used to

timestamp A timestamp is a sequence of characters or encoded information identifying when a certain event occurred, usually giving date and time of day, sometimes accurate to a small fraction of a second. Timestamps do not have to be based on some absolut ...

documents through cryptographic chaining. (This concept anticipates the modern notion of a cryptographic

blockchain A blockchain is a type of distributed ledger technology (DLT) that consists of growing lists of records, called ''blocks'', that are securely linked together using cryptography. Each block contains a cryptographic hash of the previous block, ...

.) Benaloh and de Mare proposed an alternative scheme in 1991 based on discretizing time into rounds. Benaloh and de Mare showed that accumulators can be used so that a large group of people can recognize each other at a later time (which Fazio and Nicolosi call an "ID Escrow" situation). Each person selects a

y

representing their identity, and the group collectively selects a public accumulator

h

and a secret

x

. Then, the group publishes or saves the hash function and the accumulated hash of all the group's identities w.r.t the secret

x

and public accumulator; simultaneously, each member of the group keeps both its identity value

y

and the accumulated hash of all the group's identities ''except that of the member''. (If the large group of people do not trust each other, or if the accumulator has a cryptographic trapdoor as in the case of the RSA-inspired accumulator, then they can compute the accumulated hashes by

secure multiparty computation Secure multi-party computation (also known as secure computation, multi-party computation (MPC) or privacy-preserving computation) is a subfield of cryptography with the goal of creating methods for parties to jointly compute a function over their ...

.) To verify that a claimed member did indeed belong to the group later, they present their identity and personal accumulated hash (or a

zero-knowledge proof In cryptography, a zero-knowledge proof or zero-knowledge protocol is a method by which one party (the prover) can prove to another party (the verifier) that a given statement is true while the prover avoids conveying any additional information a ...

thereof); by accumulating the identity of the claimed member and checking it against the accumulated hash of the entire group, anyone can verify a member of the group. With a dynamic accumulator scheme, it is additionally easy to add or remove members afterward. Cryptographic accumulators can also be used to construct other cryptographically secure

data structure In computer science, a data structure is a data organization, management, and storage format that is usually chosen for Efficiency, efficient Data access, access to data. More precisely, a data structure is a collection of data values, the rel ...

s: * Barić and Pfitzmann show that one can construct fail-stop signatures with only constant space by exploiting the compression property. * Goodrich et al. constructed a size-oblivious, efficient, dynamic authenticated dictionary (which allows untrusted directories to give cryptographically verifiable answers to membership queries). *Papamanthou et al. constructed a cryptographically secure

hash table In computing, a hash table, also known as hash map, is a data structure that implements an associative array or dictionary. It is an abstract data type that maps keys to values. A hash table uses a hash function to compute an ''index'', ...

, whose functionality can be authenticated when stored remotely. The concept has received renewed interest due to the Zerocoin add on to

bitcoin Bitcoin ( abbrev