classical cryptography In cryptography, a classical cipher is a type of cipher that was used historically but for the most part, has fallen into disuse. In contrast to modern cryptographic algorithms, most classical ciphers can be practically computed and solved by hand ...

, the Hill cipher is a polygraphic substitution cipher based on

linear algebra Linear algebra is the branch of mathematics concerning linear equations such as :a_1x_1+\cdots +a_nx_n=b, linear maps such as :(x_1, \ldots, x_n) \mapsto a_1x_1+\cdots +a_nx_n, and their representations in vector spaces and through matrix (mathemat ...

. Invented by Lester S. Hill in 1929, it was the first polygraphic cipher in which it was practical (though barely) to operate on more than three symbols at once. The following discussion assumes an elementary knowledge of matrices.

Encryption

Each letter is represented by a number

modulo In computing and mathematics, the modulo operation returns the remainder or signed remainder of a division, after one number is divided by another, the latter being called the '' modulus'' of the operation. Given two positive numbers and , mo ...

26. Though this is not an essential feature of the cipher, this simple scheme is often used: To encrypt a message, each block of ''n'' letters (considered as an ''n''-component vector) is multiplied by an

invertible In mathematics, the concept of an inverse element generalises the concepts of opposite () and reciprocal () of numbers. Given an operation denoted here , and an identity element denoted , if , one says that is a left inverse of , and that ...

''n'' × ''n'' matrix, against modulus 26. To decrypt the message, each block is multiplied by the inverse of the matrix used for encryption. The matrix used for encryption is the cipher key, and it should be chosen randomly from the set of invertible ''n'' × ''n'' matrices (

26). The cipher can, of course, be adapted to an alphabet with any number of letters; all arithmetic just needs to be done modulo the number of letters instead of modulo 26. Consider the message 'ACT', and the key below (or GYBNQKURP in letters): :

\begin 6 & 24 & 1 \\ 13 & 16 & 10 \\ 20 & 17 & 15 \end

Since 'A' is 0, 'C' is 2 and 'T' is 19, the message is the vector: :

\begin 0 \\ 2 \\ 19 \end

Thus the enciphered vector is given by: :

\begin 6 & 24 & 1 \\ 13 & 16 & 10 \\ 20 & 17 & 15 \end \begin 0 \\ 2 \\ 19 \end = \begin 67 \\ 222 \\ 319 \end \equiv \begin 15 \\ 14 \\ 7 \end \pmod

which corresponds to a

ciphertext In cryptography, ciphertext or cyphertext is the result of encryption performed on plaintext using an algorithm, called a cipher. Ciphertext is also known as encrypted or encoded information because it contains a form of the original plaintext ...

of 'POH'. Now, suppose that our message is instead 'CAT', or: :

\begin 2 \\ 0 \\ 19 \end

This time, the enciphered vector is given by: :

\begin 6 & 24 & 1 \\ 13 & 16 & 10 \\ 20 & 17 & 15 \end \begin 2 \\ 0 \\ 19 \end = \begin 31 \\ 216 \\ 325 \end \equiv \begin 5 \\ 8 \\ 13 \end \pmod

which corresponds to a ciphertext of 'FIN'. Every letter has changed. The Hill cipher has achieved Shannon's

diffusion Diffusion is the net movement of anything (for example, atoms, ions, molecules, energy) generally from a region of higher concentration to a region of lower concentration. Diffusion is driven by a gradient in Gibbs free energy or chemical p ...

, and an ''n''-dimensional Hill cipher can diffuse fully across ''n'' symbols at once.

Decryption

In order to decrypt, we turn the ciphertext back into a vector, then simply multiply by the inverse matrix of the key matrix (IFKVIVVMI in letters). We find that,

26, the inverse of the matrix used in the previous example is: :

\begin 6 & 24 & 1 \\ 13 & 16 & 10 \\ 20 & 17 & 15 \end^ \pmod\equiv \begin 8 & 5 & 10 \\ 21 & 8 & 21 \\ 21 & 12 & 8 \end

Taking the previous example ciphertext of 'POH', we get: :

\begin 8 & 5 & 10 \\ 21 & 8 & 21 \\ 21 & 12 & 8 \end \begin 15 \\ 14 \\ 7 \end = \begin 260 \\ 574 \\ 539 \end \equiv \begin 0 \\ 2 \\ 19 \end \pmod

which gets us back to 'ACT', as expected. One complication exists in picking the encrypting matrix: # Not all matrices have an inverse. The matrix will have an inverse if and only if its

determinant In mathematics, the determinant is a Scalar (mathematics), scalar-valued function (mathematics), function of the entries of a square matrix. The determinant of a matrix is commonly denoted , , or . Its value characterizes some properties of the ...

is inversible modulo n, where n is the modular base. Thus, if we work modulo 26 as above, the determinant must be nonzero, and must not be divisible by 2 or 13. If the determinant is 0, or has common factors with the modular base, then the matrix cannot be used in the Hill cipher, and another matrix must be chosen (otherwise it will not be possible to decrypt). Fortunately, matrices which satisfy the conditions to be used in the Hill cipher are fairly common. For our example key matrix: :

\begin 6 & 24& 1 \\ 13 & 16 & 10 \\ 20 & 17 & 15 \end = 6(16\cdot15-10\cdot17)-24(13\cdot15-10\cdot20)+1(13\cdot17-16\cdot20) = 441 \equiv 25 \pmod

So, modulo 26, the determinant is 25. Since

25=5^2

and

26=2 \times 13

, 25 has no common factors with 26, and this matrix can be used for the Hill cipher. The risk of the determinant having common factors with the modulus can be eliminated by making the modulus prime. Consequently, a useful variant of the Hill cipher adds 3 extra symbols (such as a space, a period and a question mark) to increase the modulus to 29.

Example

Let :

K= \begin 3 & 3 \\ 2 & 5 \end

be the key and suppose the plaintext message is 'HELP'. Then this plaintext is represented by two pairs :

HELP \to \begin H \\ E \end , \begin L \\ P \end \to \begin 7 \\ 4 \end , \begin 11 \\ 15 \end

Then we compute :

\begin 3 & 3 \\ 2 & 5 \end \begin 7 \\ 4 \end \equiv \begin 7 \\ 8 \end \pmod,

and :

\begin 3 & 3 \\ 2 & 5 \end \begin 11 \\ 15 \end \equiv \begin 0 \\ 19 \end\pmod

and continue encryption as follows: :

\begin 7 \\ 8 \end, \begin 0 \\ 19 \end \to \begin H \\ I \end, \begin A \\ T \end

The matrix ''K'' is invertible, hence

K^

exists such that

KK^=K^K=I_2

. The inverse of ''K'' can be computed by using the

formula In science, a formula is a concise way of expressing information symbolically, as in a mathematical formula or a ''chemical formula''. The informal use of the term ''formula'' in science refers to the general construct of a relationship betwe ...

\begin a & b \\ c & d \end^=(ad-bc)^\begin d & -b \\ -c & a \end

This formula still holds after a modular reduction if a

modular multiplicative inverse In mathematics, particularly in the area of arithmetic, a modular multiplicative inverse of an integer is an integer such that the product is congruent to 1 with respect to the modulus .. In the standard notation of modular arithmetic this cong ...

is used to compute Hence in this case, we compute :

K^ \equiv 9^ \begin 5 & 23 \\ 24 & 3 \end \equiv 3 \begin 5 & 23 \\ 24 & 3 \end \equiv \begin 15 & 17 \\ 20 & 9 \end\pmod

HIAT \to \begin H \\ I \end, \begin A \\ T \end \to \begin 7 \\ 8 \end, \begin 0 \\ 19 \end

Then we compute :

\begin 15 & 17 \\ 20 & 9 \end\begin 7 \\ 8 \end = \begin 241 \\ 212 \end \equiv \begin 7 \\ 4 \end\pmod,

and :

\begin 15 & 17 \\ 20 & 9 \end\begin 0 \\ 19 \end = \begin 323 \\ 171 \end \equiv \begin 11 \\ 15 \end\pmod

Therefore, :

\begin 7 \\ 4 \end, \begin 11 \\ 15 \end \to \begin H \\ E \end, \begin L \\ P \end \to HELP

Security

The basic Hill cipher is vulnerable to a known-plaintext attack because it is completely

linear In mathematics, the term ''linear'' is used in two distinct senses for two different properties: * linearity of a '' function'' (or '' mapping''); * linearity of a '' polynomial''. An example of a linear function is the function defined by f(x) ...

. An opponent who intercepts

n^2

plaintext/ciphertext character pairs can set up a linear system which can (usually) be easily solved; if it happens that this system is indeterminate, it is only necessary to add a few more plaintext/ciphertext pairs. Calculating this solution by standard linear algebra algorithms then takes very little time. While matrix multiplication alone does not result in a secure cipher it is still a useful step when combined with other non-linear operations, because matrix multiplication can provide

. For example, an appropriately chosen matrix can guarantee that small differences before the matrix multiplication will result in large differences after the matrix multiplication. Indeed, some modern ciphers use a matrix multiplication step to provide diffusion. For example, the MixColumns step in AES is a matrix multiplication. The function ''g'' in Twofish is a combination of non-linear S-boxes with a carefully chosen matrix multiplication (MDS).

Key space size

The key space is the set of all possible keys. The key space size is the number of possible keys. The effective key size, in number of bits, is the binary logarithm of the key space size. There are

26^

matrices of dimension ''n'' × ''n''. Thus

\log_2(26^)

or about

4.7n^2

is an upper bound on the key size of the Hill cipher using ''n'' × ''n'' matrices. This is only an upper bound because not every matrix is invertible and thus usable as a key. The number of invertible matrices can be computed via the

Chinese Remainder Theorem In mathematics, the Chinese remainder theorem states that if one knows the remainders of the Euclidean division of an integer ''n'' by several integers, then one can determine uniquely the remainder of the division of ''n'' by the product of thes ...

. I.e., a matrix is invertible modulo 26 if and only if it is invertible both modulo 2 and modulo 13. The number of invertible ''n'' × ''n'' matrices modulo 2 is equal to the order of the

general linear group In mathematics, the general linear group of degree n is the set of n\times n invertible matrices, together with the operation of ordinary matrix multiplication. This forms a group, because the product of two invertible matrices is again inve ...

GL(n,Z₂). It is :

2^(1-1/2)(1-1/2^2)\cdots(1-1/2^n).

Equally, the number of invertible matrices modulo 13 (i.e. the order of GL(n,Z₁₃)) is :

13^(1-1/13)(1-1/13^2)\cdots(1-1/13^n).

The number of invertible matrices modulo 26 is the product of those two numbers. Hence it is :

26^(1-1/2)(1-1/2^2)\cdots(1-1/2^n)(1-1/13)(1-1/13^2)\cdots(1-1/13^n).

Additionally it seems to be prudent to avoid too many zeroes in the key matrix, since they reduce diffusion. The net effect is that the effective keyspace of a basic Hill cipher is about

4.64n^2 - 1.7

. For a 5 × 5 Hill cipher, that is about 114 bits. Of course, key search is not the most efficient known attack.

Mechanical implementation

When operating on 2 symbols at once, a Hill cipher offers no particular advantage over Playfair or the bifid cipher, and in fact is weaker than either, and slightly more laborious to operate by pencil-and-paper. As the dimension increases, the cipher rapidly becomes infeasible for a human to operate by hand. A Hill cipher of dimension 6 was implemented mechanically. Hill and a partner were awarded a

patent A patent is a type of intellectual property that gives its owner the legal right to exclude others from making, using, or selling an invention for a limited period of time in exchange for publishing an sufficiency of disclosure, enabling discl ...

() for this device, which performed a 6 × 6 matrix multiplication modulo 26 using a system of gears and chains. Unfortunately the gearing arrangements (and thus the key) were fixed for any given machine, so triple encryption was recommended for security: a secret nonlinear step, followed by the wide diffusive step from the machine, followed by a third secret nonlinear step. (The much later Even–Mansour cipher also uses an unkeyed diffusive middle step). Such a combination was actually very powerful for 1929, and indicates that Hill apparently understood the concepts of a meet-in-the-middle attack as well as confusion and diffusion. Unfortunately, his machine did not sell.

References

* Lester S. Hill, Cryptography in an Algebraic Alphabet, ''The American Mathematical Monthly'' Vol.36, June–July 1929, pp. 306–312.
PDF
* Lester S. Hill, Concerning Certain Linear Transformation Apparatus of Cryptography, ''The American Mathematical Monthly'' Vol.38, 1931, pp. 135–154. * Jeffrey Overbey, William Traves, and Jerzy Wojdylo, On the Keyspace of the Hill Cipher, '' Cryptologia'', Vol.29, No.1, January 2005, pp59–72.
CiteSeerX

PDF

External links

*
Hill Cipher Web App
implements the Hill cipher and shows the matrices involved *
Hill Cipher Explained
illustrates the linear algebra behind the Hill Cipher *
Hill's Cipher Calculator
outlines the Hill Cipher with a Web page {{DEFAULTSORT:Hill Cipher Classical ciphers