In cryptography, an accumulator is a one-way membership hash function. It allows users to certify that potential candidates are members of a certain set without revealing the individual members of the set. This concept was formally introduced by Josh Benaloh and Michael de Mare in 1993.
Formal definitions
There are several formal definitions which have been proposed in the literature. This section lists them by proposer, in roughly chronological order.
Benaloh and de Mare (1993)
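Benaloh and de Mare define a one-way hash function as a family of functions $h_\ell : X_\ell \times Y_\ell \to Z_\ell$ which satisfy the following three properties:
# For all $x \in X_\ell$ and $y \in Y_\ell$, one can compute $h_\ell(x, y)$ in time $\mathrm{poly}(\ell, |x|, |y|)$. (Here the "poly" symbol refers to an unspecified, but fixed, polynomial.)
# No probabilistic polynomial-time algorithm will, for sufficiently large $\ell$, when given inputs $(x, y) \in X_\ell \times Y_\ell$ and $y' \in Y_\ell$, find a value $x' \in X_\ell$ such that $h_\ell(x, y) = h_\ell(x', y')$ with more than negligible probability.
# For all $x \in X_\ell$ and $y_1, y_2 \in Y_\ell$, one has $h_\ell(h_\ell(x, y_1), y_2) = h_\ell(h_\ell(x, y_2), y_1)$. (A function that satisfies this property is called ''quasi-commutative''.)
(With the first two properties, one recovers the normal definition of a cryptographic hash function.)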
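From such a function $h$, one defines the "accumulated hash" of a set $\{y_1, \dots, y_m\}$ with respect to a starting value $x$ to be $z = h(h(\cdots h(h(x, y_1), y_2) \cdots, y_{m-1}), y_m)$. The result does not depend on the order of the elements $y_1, y_2, \dots, y_m$ because $h$ is quasi-commutative.
If $y_1, y_2, \dots, y_m$ belong to some users of a cryptosystem, then everyone can compute the accumulated value $z$. Also, the user holding $y_i$ can compute the partial accumulated value $z_i$ of $(y_1, \dots, y_{i-1}, y_{i+1}, \dots, y_m)$. Then $h(z_i, y_i) = z$, so the $i$-th user can provide the pair $(z_i, y_i)$ to any other party in order to authenticate $y_i$.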
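The accumulated hash and the partial accumulated value can be sketched in a few lines of Python. This is only an illustration: the stand-in function `h` (modular exponentiation), the toy modulus `N_MOD`, the starting value `X0`, and the helper names `accumulate` and `partial` are assumptions made here, not part of Benaloh and de Mare's definition, and the parameters are far too small to be secure.

```python
from functools import reduce

# Toy parameters, assumed for illustration only (far too small to be secure).
N_MOD = 23 * 47   # product of two small safe primes
X0 = 3            # starting value x

def h(x: int, y: int) -> int:
    """Stand-in quasi-commutative function: modular exponentiation."""
    return pow(x, y, N_MOD)

def accumulate(x: int, ys: list) -> int:
    """Fold h over the values: h(...h(h(x, y_1), y_2)..., y_m)."""
    return reduce(h, ys, x)

def partial(x: int, ys: list, i: int) -> int:
    """Partial accumulated value z_i: accumulate everything except ys[i]."""
    return accumulate(x, ys[:i] + ys[i + 1:])

ys = [5, 7, 11, 13]
z = accumulate(X0, ys)
z_reordered = accumulate(X0, list(reversed(ys)))  # same value, different order

z_2 = partial(X0, ys, 2)            # kept by the user who owns ys[2]
authenticated = h(z_2, ys[2]) == z  # the pair (z_i, y_i) checks out against z
```

Any other quasi-commutative function could be substituted for `h` without changing `accumulate` or `partial`.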
Barić and Pfitzmann (1997)
The basic functionality of a quasi-commutative hash function is not immediate from the definition. To fix this, Barić and Pfitzmann gave a slightly more general definition, the notion of an ''accumulator scheme'', which consists of the following components:
# Gen: a probabilistic algorithm that takes in two parameters $\lambda, N$ (the security parameter and the number of values that can be securely accumulated, respectively), and returns an appropriate key $k$.
# Eval: a probabilistic algorithm that takes in a key $k$ and an accumulation set $Y$, where $|Y| \leq N$, and returns an accumulated value $z$ and auxiliary information $aux$. We insist that Eval ''must'' be deterministic for $z$.
# Wit: a probabilistic algorithm that takes in a key $k$, a value $y$, an accumulated value $z$ of some set $Y$, and some auxiliary information $aux$, and returns either a witness $w$ or the special symbol $\bot$. We insist that if $y \in Y$, then Wit returns a witness, and that Wit otherwise returns $\bot$.
# Ver: a ''deterministic'' algorithm that takes in a key $k$, a value $y$, a witness $w$, and an accumulated value $z$, and returns a Yes/No value. We insist that if $w$ was generated by running Wit on a tuple $(k, y, z, aux)$, where $z, aux$ were generated by running Eval on some $k, Y$, and where $Y$ was chosen arbitrarily and $k$ was generated by running Gen, then Ver always returns Yes.
It is relatively easy to see that one can define an accumulator scheme from any quasi-commutative hash function, using the technique shown above.
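One way to picture this is the following minimal sketch of the four algorithms. It assumes modular exponentiation as the quasi-commutative function and a fixed toy modulus; the function names (`gen`, `eval_acc`, `wit`, `ver`), the dictionary layout of the key, and the choice of the accumulated set itself as auxiliary information are illustrative assumptions, not taken from Barić and Pfitzmann.

```python
import secrets
from functools import reduce

def gen(lam: int, n_max: int) -> dict:
    """Gen: return a key. Toy modulus; a real Gen would size it from lam."""
    n = 23 * 47                        # assumed toy modulus, NOT secure
    x = secrets.randbelow(n - 3) + 2   # random start value in [2, n - 2]
    return {"n": n, "x": x, "n_max": n_max}

def _fold(key: dict, values) -> int:
    """Accumulate values with h(x, y) = x**y mod n, starting from key['x']."""
    return reduce(lambda acc, y: pow(acc, y, key["n"]), sorted(values), key["x"])

def eval_acc(key: dict, values: set):
    """Eval: return (z, aux); z is deterministic given the key and the set."""
    if len(values) > key["n_max"]:
        raise ValueError("too many values for this key")
    return _fold(key, values), frozenset(values)

def wit(key: dict, y: int, z: int, aux: frozenset):
    """Wit: witness for y, or None (standing in for the symbol ⊥)."""
    if y not in aux:
        return None
    return _fold(key, aux - {y})

def ver(key: dict, y: int, w: int, z: int) -> bool:
    """Ver: accept iff folding y into the witness reproduces z."""
    return pow(w, y, key["n"]) == z

key = gen(128, 10)
z, aux = eval_acc(key, {5, 7, 11})
w = wit(key, 7, z, aux)
```

Here `ver(key, 7, w, z)` accepts, while `wit(key, 13, z, aux)` returns `None` because 13 was never accumulated.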
Camenisch and Lysyanskaya (2002)
One observes that, for many applications, the set of accumulated values will change many times. Naïvely, one could completely redo the accumulator calculation every time; however, this may be inefficient, especially if the set is very large and the change is very small. To formalize this intuition, Camenisch and Lysyanskaya defined a ''dynamic accumulator scheme'' to consist of the four components of an ordinary accumulator scheme, plus three more:
# Add: a (possibly probabilistic) algorithm that takes in a key $k$, an accumulated value $z$, and another value to accumulate $y$, and returns a new accumulated value $z'$ and auxiliary information $aux$. We insist that if $z$ was generated by accumulating some set $Y$, then $z'$ must be as if it were generated by accumulating the set $Y \cup \{y\}$.
# Del: a (possibly probabilistic) algorithm that takes in a key $k$, an accumulated value $z$, and a previously accumulated value $y$ to remove, and returns a new accumulated value $z'$ and auxiliary information $aux$. We insist that if $z$ was generated by accumulating some set $Y$, then $z'$ must be as if it were generated by accumulating the set $Y \setminus \{y\}$.
# Upd: a deterministic algorithm that takes in the key $k$, a value $y$, a witness $w$, the accumulated value $z$, and auxiliary information $aux$, and returns a new witness $w'$. We insist that if $k$ was generated by Gen, $y$ is part of a set $Y$, $w$ is a witness for $y$ being a member of $Y$, $z$ is an accumulated value for $Y$, and $aux$ was generated by running Add or Del, then $w'$ will be a witness for $y$ being a member of the new set.
Fazio and Nicolosi note that since Add, Del, and Upd can be simulated by rerunning Eval and Wit, this definition does not add any fundamentally new functionality.
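To make the three extra algorithms concrete, here is a hedged sketch for an exponentiation-based accumulator over a toy modulus. Everything in it is an assumption chosen for illustration: the primes `P` and `Q`, the function names, the encoding of the auxiliary information as an `(operation, value)` pair, and the convention that `upd` receives the accumulated value ''after'' the change. Deletion uses the factorization of the modulus as a trapdoor, and the witness update after a deletion relies on the accumulated values being distinct primes.

```python
P, Q = 23, 47                 # toy safe primes (assumed; NOT secure)
N_MOD = P * Q
PHI = (P - 1) * (Q - 1)       # trapdoor, known only to whoever runs Del

def add(z: int, y: int):
    """Add: fold a new prime y into z; aux records what changed."""
    return pow(z, y, N_MOD), ("add", y)

def delete(z: int, y: int):
    """Del: remove y via the trapdoor (needs gcd(y, PHI) == 1)."""
    return pow(z, pow(y, -1, PHI), N_MOD), ("del", y)

def ext_gcd(a: int, b: int):
    """Return (g, s, t) with s*a + t*b == g == gcd(a, b)."""
    if b == 0:
        return a, 1, 0
    g, s, t = ext_gcd(b, a % b)
    return g, t, s - (a // b) * t

def upd(y: int, w: int, z_new: int, aux):
    """Upd: refresh the witness w of y after the change described by aux."""
    op, other = aux
    if op == "add":
        return pow(w, other, N_MOD)
    # Deletion: pick a, b with a*y + b*other == 1, then w' = w^b * z_new^a.
    g, a, b = ext_gcd(y, other)
    if g != 1:
        raise ValueError("values must be coprime")
    return (pow(w, b, N_MOD) * pow(z_new, a, N_MOD)) % N_MOD

x = 3
z = x
for y in (5, 7, 13):
    z, _ = add(z, y)
w7 = pow(x, 5 * 13, N_MOD)    # witness for 7: everything except 7

z, aux = add(z, 17)
w7 = upd(7, w7, z, aux)       # still a witness for 7 after adding 17
z, aux = delete(z, 5)
w7 = upd(7, w7, z, aux)       # and after deleting 5
```

The negative exponents passed to `pow` require Python 3.8 or later and a base that is invertible modulo `N_MOD`, which holds here because the start value is coprime to the modulus.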
Examples
One example is multiplication over large prime numbers. This is a cryptographic accumulator, since it takes superpolynomial time to factor a composite number (at least according to conjecture), but it takes only a small amount of time (polynomial in the size of the numbers) to divide a prime into an integer to check whether it is one of the factors and/or to factor it out. Members may be added to or removed from the set of factors by multiplying by or dividing out the number, respectively. In this system, two accumulators that have accumulated a single shared prime can have it trivially discovered by calculating their GCD, even without prior knowledge of the prime (which would otherwise require prime factorization of the accumulator to discover).
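The operations described in this paragraph can be sketched as follows. The function names and the small primes are assumptions chosen for readability; a real instance would use primes large enough that factoring the product is infeasible.

```python
from math import gcd, prod

def accumulate(primes) -> int:
    """The accumulator is simply the product of the member primes."""
    return prod(primes)

def is_member(acc: int, p: int) -> bool:
    """Membership check: does the prime p divide the accumulator?"""
    return acc % p == 0

def add(acc: int, p: int) -> int:
    """Add a member by multiplying it in."""
    return acc * p

def remove(acc: int, p: int) -> int:
    """Remove a member by dividing it out."""
    if acc % p != 0:
        raise ValueError("p is not a member")
    return acc // p

a = accumulate([101, 103, 107])
b = accumulate([103, 109, 113])
shared = gcd(a, b)   # the single shared prime, found without factoring: 103
```

Note that the accumulator grows with every member added.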
More practical accumulators use a quasi-commutative hash function, so that the size of the accumulator does not grow with the number of members. For example, Benaloh and de Mare propose a cryptographic accumulator inspired by RSA: the quasi-commutative function $h(x, y) := x^y \bmod n$ for some composite number $n$. They recommend choosing $n$ to be a ''rigid'' integer (i.e. the product of two safe primes).
Barić and Pfitzmann proposed a variant where each accumulated element $y$ was restricted to be prime and at most $n/4$ (this constant is very close to $\varphi(n)/4$, but does not leak information about the prime factorization of $n$).
David Naccache observed in 1993 that $e_{n,c}(x, y) := x^y c^{y-1} \bmod n$ is quasi-commutative for all constants $c, n$, generalizing the previous RSA-inspired cryptographic accumulator. Naccache also noted that the Dickson polynomials are quasi-commutative in the degree, but it is unknown whether this family of functions is one-way.
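The quasi-commutativity of Naccache's function can be checked by direct expansion. The derivation below takes the function to be $e_{n,c}(x, y) = x^y c^{y-1} \bmod n$ and is a worked verification added here, not a quotation of Naccache's argument.

```latex
\begin{aligned}
e_{n,c}\bigl(e_{n,c}(x, y_1),\, y_2\bigr)
  &\equiv \left(x^{y_1} c^{\,y_1 - 1}\right)^{y_2} c^{\,y_2 - 1} \\
  &\equiv x^{y_1 y_2}\, c^{\,y_1 y_2 - y_2 + y_2 - 1} \\
  &\equiv x^{y_1 y_2}\, c^{\,y_1 y_2 - 1} \pmod{n}
\end{aligned}
```

The final expression is symmetric in $y_1$ and $y_2$, so swapping the order of accumulation gives the same value. Setting $c = 1$ recovers $x^{y_1 y_2} \bmod n$, the RSA-inspired case.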
In 1996, Nyberg constructed an accumulator which is provably information-theoretically secure in the random oracle model. Choosing some upper limit $N = 2^d$ for the number of items that can be securely accumulated and $\lambda$ the security parameter, define the constant $\ell \approx \frac{e}{\log_2 e} \lambda N \log_2 N$ to be an integer multiple of $d$ (so that one can write $\ell = rd$) and let $H : \{0,1\}^* \to \{0,1\}^\ell$ be some cryptographically secure hash function. Choose a key $k$ as a random $r$-bit bitstring. Then, to accumulate using Nyberg's scheme, use the quasi-commutative hash function $h(x, y) := x \odot \alpha_r(H(y))$, where $\odot$ is the bitwise and operation and $\alpha_r : \{0,1\}^\ell \to \{0,1\}^r$ is the function that interprets its input as a sequence of $d$-bit bitstrings of length $r$, replaces every all-zero bitstring with a single 0 and every other bitstring with a 1, and outputs the result.
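A rough sketch of this construction follows. The parameter values `D` and `R` are toy choices that are not derived from any security level, SHAKE-256 is an assumed stand-in for the hash function, and the names `H`, `alpha`, `h`, and `verify` are illustrative. Membership is checked by testing that folding the item into the accumulated value leaves it unchanged; as with any such scheme, a non-member can pass by chance with some probability.

```python
import hashlib
import secrets

D = 3          # at most N = 2**D items (toy value)
R = 64         # accumulator length in bits (toy value)
ELL = R * D    # hash output length in bits

def H(y: bytes) -> int:
    """ELL-bit hash of y; SHAKE-256 is an assumed stand-in."""
    return int.from_bytes(hashlib.shake_256(y).digest(ELL // 8), "big")

def alpha(v: int) -> int:
    """Map each D-bit block of v to one bit: 0 if the block is all zeros, else 1."""
    out = 0
    mask = (1 << D) - 1
    for i in range(R):
        if (v >> (i * D)) & mask:
            out |= 1 << i
    return out

def h(x: int, y: bytes) -> int:
    """Quasi-commutative step: bitwise AND with the compressed hash of y."""
    return x & alpha(H(y))

def verify(z: int, y: bytes) -> bool:
    """y passes if every zero bit of alpha(H(y)) is also zero in z."""
    return h(z, y) == z

k = secrets.randbits(R)          # key: random R-bit string
items = [b"alice", b"bob", b"carol"]
z = k
for item in items:
    z = h(z, item)
```

Because bitwise AND is commutative and associative, the accumulated value does not depend on the order in which the items are folded in.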
Applications
Haber and Stornetta showed in 1990 that accumulators can be used to timestamp documents through cryptographic chaining. (This concept anticipates the modern notion of a cryptographic blockchain.)
Benaloh and de Mare proposed an alternative scheme in 1991 based on discretizing time into rounds.
Benaloh and de Mare showed that accumulators can be used so that a large group of people can recognize each other at a later time (which Fazio and Nicolosi call an "ID Escrow" situation). Each person selects a $y$ representing their identity, and the group collectively selects a public accumulator $h$ and a secret $x$. Then, the group publishes or saves the hash function and the accumulated hash of all the group's identities w.r.t. the secret $x$ and public accumulator; simultaneously, each member of the group keeps both its identity value $y$ and the accumulated hash of all the group's identities ''except that of the member''. (If the large group of people do not trust each other, or if the accumulator has a cryptographic trapdoor as in the case of the RSA-inspired accumulator, then they can compute the accumulated hashes by secure multiparty computation.) To verify later that a claimed member did indeed belong to the group, the member presents their identity and personal accumulated hash (or a zero-knowledge proof thereof); by accumulating the identity of the claimed member into the personal accumulated hash and checking the result against the accumulated hash of the entire group, anyone can verify membership.
With a dynamic accumulator scheme, it is additionally easy to add or remove members afterward.
Cryptographic accumulators can also be used to construct other cryptographically secure data structures:
* Barić and Pfitzmann show that one can construct fail-stop signatures with only constant space by exploiting the compression property.
* Goodrich et al. constructed a size-oblivious, efficient, dynamic authenticated dictionary (which allows untrusted directories to give cryptographically verifiable answers to membership queries).
* Papamanthou et al. constructed a cryptographically secure hash table, whose functionality can be authenticated when stored remotely.
The concept has received renewed interest due to the Zerocoin add-on to