In
cryptography
Cryptography, or cryptology (from "hidden, secret"; and ''graphein'', "to write", or ''-logy, -logia'', "study", respectively), is the practice and study of techniques for secure communication in the presence of Adversary (cryptography), ...
, black-box obfuscation was a proposed
cryptographic primitive Cryptographic primitives are well-established, low-level cryptography, cryptographic algorithms that are frequently used to build cryptographic protocols for computer security systems. These routines include, but are not limited to, one-way hash fun ...
which would allow a
computer program
A computer program is a sequence or set of instructions in a programming language for a computer to Execution (computing), execute. It is one component of software, which also includes software documentation, documentation and other intangibl ...
to be
obfuscated in a way such that it was impossible to determine anything about it except its input and output behavior. Black-box obfuscation has been proven to be impossible, even in principle.
Impossibility
The unobfuscatable programs
Barak et al. constructed a family of unobfuscatable programs, for which an efficient attacker can always learn more from ''any'' obfuscated code than from black-box access.
Broadly, they start by engineering a special pair of programs that cannot be obfuscated together. For some randomly selected strings
of a fixed, pre-determined length
, define one program to be one that computes
and the other program to one that computes
(Here,
interprets its input as the code for a
Turing machine
A Turing machine is a mathematical model of computation describing an abstract machine that manipulates symbols on a strip of tape according to a table of rules. Despite the model's simplicity, it is capable of implementing any computer algori ...
. The second condition in the definition of
is to prevent the function from being
uncomputable.)
If an efficient attacker only has black-box access, Barak et al. argued, then the attacker only has an exponentially small chance of guessing the password
, and so cannot distinguish the pair of programs from a pair where
is replaced by some program
that always outputs "0". However, if the attacker has access to any obfuscated implementations
of
, then the attacker will find
with probability 1, whereas the attacker will always find
unless
(which should happen with negligible probability). This means that the attacker can always distinguish the pair
from the pair
with obfuscated code access, but not black-box access. Since ''no'' obfuscator can prevent this attack, Barak et al. conclude that no black-box obfuscator for pairs of programs exists.
To conclude the argument, Barak et al. define a third program to implement the functionality of the two previous:
Since equivalently efficient implementations of
can be recovered from one of
by hardwiring the value of
, Barak et al. conclude that
cannot be obfuscated either, which concludes their argument.
Impossible variants of black-box obfuscation and other types of unobfuscatable programs
In their paper, Barak et al. also prove the following (conditional to appropriate
cryptographic assumptions):
* There are unobfuscatable
circuits.
* There is no black-box ''approximate'' obfuscator.
* There are unobfuscatable, secure, probabilistic
private-key cryptosystems.
* There are unobfuscatable, secure, deterministic
digital signature schemes.
* There are unobfuscatable, secure, deterministic
message authentication schemes.
* There are unobfuscatable, secure
pseudorandom functions.
* For many protocols that are secure in the
random oracle model
In cryptography, a random oracle is an oracle (a theoretical black box) that responds to every ''unique query'' with a (truly) random response chosen uniformly from its output domain. If a query is repeated, it responds the same way every tim ...
, the protocol becomes insecure if the random oracle is replaced with an artificial
cryptographic hash function
A cryptographic hash function (CHF) is a hash algorithm (a map (mathematics), map of an arbitrary binary string to a binary string with a fixed size of n bits) that has special properties desirable for a cryptography, cryptographic application: ...
; in particular,
Fiat-Shamir schemes can be attacked.
* There are unobfuscatable circuits in
TC0 (that is, constant-depth threshold circuits).
* There are unobfuscatable sampling
algorithm
In mathematics and computer science, an algorithm () is a finite sequence of Rigour#Mathematics, mathematically rigorous instructions, typically used to solve a class of specific Computational problem, problems or to perform a computation. Algo ...
s (in fact, these cannot be obfuscated approximately).
* There is no secure software
watermarking scheme.
Weaker variants
In their original paper exploring black-box obfuscation, Barak et al. defined two weaker notions of cryptographic obfuscation which they did not rule out:
indistinguishability obfuscation and extractability obfuscation (which they called "differing-inputs obfuscation".) Informally, an indistinguishability obfuscator should convert input programs with the same functionality into output programs such that the outputs cannot be efficiently related to the inputs by a bounded attacker, and an extractability obfuscator should be an obfuscator such that if the efficient attacker ''could'' relate the outputs to the inputs for any two programs, then the attacker could also produce an input such that the two programs being obfuscated produce different outputs. (Note that an extractability obfuscator is necessarily an indistinguishability obfuscator.)
, a candidate implementation of
indistinguishability obfuscation is under investigation.
In 2013, Boyle et al. explored several candidate implementations of extractability obfuscation.
References
{{reflist
Software obfuscation
Cryptographic primitives
Unsolvable puzzles