Gap-Hamming Problem
   HOME

TheInfoList



OR:

In
communication complexity In theoretical computer science, communication complexity studies the amount of communication required to solve a problem when the input to the problem is distributed among two or more parties. The study of communication complexity was first intro ...
, the gap-Hamming problem asks, if
Alice and Bob Alice and Bob are fictional characters commonly used as placeholders in discussions about cryptography, cryptographic systems and Cryptographic protocol, protocols, and in other science and engineering literature where there are several partici ...
are each given a (potentially different) string, what is the minimal number of bits that they need to exchange in order for Alice to approximately compute the
Hamming distance In information theory, the Hamming distance between two String (computer science), strings or vectors of equal length is the number of positions at which the corresponding symbols are different. In other words, it measures the minimum number ...
between their strings. The solution to the problem roughly states that, if Alice and Bob are each given a string, then any
communication protocol A communication protocol is a system of rules that allows two or more entities of a communications system to transmit information via any variation of a physical quantity. The protocol defines the rules, syntax, semantics (computer science), sem ...
used to compute the Hamming distance between their strings does (asymptotically) no better than Bob sending his whole string to Alice. More specifically, if Alice and Bob are each given n-bit strings, there exists no communication protocol that lets Alice compute the hamming distance between their strings to within \pm\sqrt using less than \Omega(n) bits. The gap-Hamming problem has applications to proving lower bounds for many streaming algorithms, including moment frequency estimation and entropy estimation.


Formal statement

In this problem, Alice and Bob each receive a string, x \in \^n and y \in \^n, respectively, while Alice is required to compute the (partial) function, : \operatorname(x, y) = \begin +1 & D_H(x, y) \ge \frac + \sqrt\\ -1 & D_H(x, y) \le \frac - \sqrt\\ * & \text, \end using the least amount of communication possible. Here, * indicates that Alice can return either of \pm 1 and D_H(x, y) is the
Hamming distance In information theory, the Hamming distance between two String (computer science), strings or vectors of equal length is the number of positions at which the corresponding symbols are different. In other words, it measures the minimum number ...
between x and y. In other words, Alice needs to return whether Bob's string is significantly similar or significantly different from hers while minimizing the number of bits she exchanges with Bob. The problem's solution states that computing \operatorname requires at least \Omega(n) communication. In particular, it requires \Omega(n) communication even when x and y are chosen uniformly at random from \^n.


History

The gap-Hamming problem was originally proposed by Indyk and Woodruff in the early 2000's, who initially proved a linear lower bound on the ''one-way'' communication complexity of the problem (where Alice is only allowed to receive data from Bob) and conjectured a linear lower bound in the general case. The question of the infinite-round case (in which Alice and Bob are allowed to exchange as many messages as desired) remained open until Chakrabarti and Regev proved, via an anti-concentration argument, that the general problem also has linear lower bound complexity, thus settling the original question completely. This result was followed by a series of other papers that sought to simplify or find new approaches to proving the desired lower bound, notably first by Vidick and later by Sherstov, and, recently, with an information-theoretic approach by Hadar, Liu, Polyanskiy, and Shayevitz.


References

{{Reflist Computational complexity theory Quantum complexity theory Communication