Proof that π is irrational
   HOME

TheInfoList



OR:

In the 1760s,
Johann Heinrich Lambert Johann Heinrich Lambert (; ; 26 or 28 August 1728 – 25 September 1777) was a polymath from the Republic of Mulhouse, at that time allied to the Switzerland, Swiss Confederacy, who made important contributions to the subjects of mathematics, phys ...
was the first to prove that the number is
irrational Irrationality is cognition, thinking, talking, or acting without rationality. Irrationality often has a negative connotation, as thinking and actions that are less useful or more illogical than other more rational alternatives. The concept of ...
, meaning it cannot be expressed as a fraction a/b, where a and b are both
integer An integer is the number zero (0), a positive natural number (1, 2, 3, ...), or the negation of a positive natural number (−1, −2, −3, ...). The negations or additive inverses of the positive natural numbers are referred to as negative in ...
s. In the 19th century,
Charles Hermite Charles Hermite () FRS FRSE MIAS (24 December 1822 – 14 January 1901) was a French mathematician who did research concerning number theory, quadratic forms, invariant theory, orthogonal polynomials, elliptic functions, and algebra. Hermite p ...
found a proof that requires no prerequisite knowledge beyond basic
calculus Calculus is the mathematics, mathematical study of continuous change, in the same way that geometry is the study of shape, and algebra is the study of generalizations of arithmetic operations. Originally called infinitesimal calculus or "the ...
. Three simplifications of Hermite's proof are due to Mary Cartwright, Ivan Niven, and
Nicolas Bourbaki Nicolas Bourbaki () is the collective pseudonym of a group of mathematicians, predominantly French alumni of the École normale supérieure (Paris), École normale supérieure (ENS). Founded in 1934–1935, the Bourbaki group originally intende ...
. Another proof, which is a simplification of Lambert's proof, is due to
Miklós Laczkovich Miklós Laczkovich (born 21 February 1948) is a Hungarian mathematician mainly noted for his work on real analysis and geometric measure theory. His most famous result is the solution of Tarski's circle-squaring problem in 1989.Ruthen, R. (1989 ...
. Many of these are proofs by contradiction. In 1882,
Ferdinand von Lindemann Carl Louis Ferdinand von Lindemann (12 April 1852 – 6 March 1939) was a German mathematician, noted for his proof, published in 1882, that (pi) is a transcendental number, meaning it is not a root of any polynomial with rational coefficien ...
proved that \pi is not just irrational, but transcendental as well.


Lambert's proof

In 1761,
Johann Heinrich Lambert Johann Heinrich Lambert (; ; 26 or 28 August 1728 – 25 September 1777) was a polymath from the Republic of Mulhouse, at that time allied to the Switzerland, Swiss Confederacy, who made important contributions to the subjects of mathematics, phys ...
proved that \pi is irrational by first showing that this
continued fraction A continued fraction is a mathematical expression that can be written as a fraction with a denominator that is a sum that contains another simple or continued fraction. Depending on whether this iteration terminates with a simple fraction or not, ...
expansion holds: :\tan(x) = \cfrac. Then Lambert proved that if x is non-zero and rational, then this expression must be irrational. Since \tan\tfrac\pi4 =1, it follows that \tfrac\pi4 is irrational, and thus \pi is also irrational. A simplification of Lambert's proof is given
below Below may refer to: *Earth *Ground (disambiguation) *Soil *Floor * Bottom (disambiguation) *Less than *Temperatures below freezing *Hell or underworld People with the surname * Ernst von Below (1863–1955), German World War I general * Fred Belo ...
.


Hermite's proof

Written in 1873, this proof uses the characterization of \pi as the smallest positive number whose half is a
zero 0 (zero) is a number representing an empty quantity. Adding (or subtracting) 0 to any number leaves that number unchanged; in mathematical terminology, 0 is the additive identity of the integers, rational numbers, real numbers, and compl ...
of the
cosine In mathematics, sine and cosine are trigonometric functions of an angle. The sine and cosine of an acute angle are defined in the context of a right triangle: for the specified angle, its sine is the ratio of the length of the side opposite that ...
function and it actually proves that \pi^2 is irrational. As in many proofs of irrationality, it is a
proof by contradiction In logic, proof by contradiction is a form of proof that establishes the truth or the validity of a proposition by showing that assuming the proposition to be false leads to a contradiction. Although it is quite freely used in mathematical pr ...
. Consider the sequences of
real function In mathematical analysis, and applications in geometry, applied mathematics, engineering, and natural sciences, a function of a real variable is a function whose domain is the real numbers \mathbb, or a subset of \mathbb that contains an inter ...
s A_n and U_n for n \in \N_0 defined by: : \begin A_0(x) &= \sin(x), && A_(x) =\int_0^xyA_n(y)\,dy \\ ptU_0(x) &= \fracx, && U_(x) =-\fracx \end Using induction we can prove that :\begin A_n(x) &=\frac-\frac+\frac\mp\cdots \\ ptU_n(x) &=\frac1-\frac+\frac\mp\cdots \end and therefore we have: :U_n(x)=\frac.\, So : \begin \frac & =U_(x)=-\fracx=-\frac1x\frac \left(\frac\right) \\ pt& = -\frac \left( \frac \right ) \\ pt& = \frac \end which is equivalent to :A_(x)=(2n+1)A_n(x)-x^2A_(x).\, Using the definition of the sequence and employing induction we can show that :A_n(x) = P_n(x^2) \sin(x) + x Q_n(x^2) \cos(x),\, where P_n and Q_n are polynomial functions with integer coefficients and the degree of P_n is smaller than or equal to \bigl\lfloor \tfrac12n\bigr\rfloor. In particular, A_n\bigl(\tfrac12\pi\bigr) = P_n\bigl(\tfrac14\pi^2\bigr). Hermite also gave a closed expression for the function A_n, namely :A_n(x)=\frac\int_0^1(1-z^2)^n\cos(xz)\,\mathrmz.\, He did not justify this assertion, but it can be proved easily. First of all, this assertion is equivalent to :\frac\int_0^1(1-z^2)^n\cos(x z)\,\mathrmz=\frac=U_n(x). Proceeding by induction, take n = 0. :\int_0^1\cos(xz)\,\mathrmz=\fracx=U_0(x) and, for the inductive step, consider any
natural number In mathematics, the natural numbers are the numbers 0, 1, 2, 3, and so on, possibly excluding 0. Some start counting with 0, defining the natural numbers as the non-negative integers , while others start with 1, defining them as the positive in ...
n. If :\frac\int_0^1(1-z^2)^n\cos(xz)\,\mathrmz=U_n(x), then, using
integration by parts In calculus, and more generally in mathematical analysis, integration by parts or partial integration is a process that finds the integral of a product of functions in terms of the integral of the product of their derivative and antiderivati ...
and Leibniz's rule, one gets :\begin &\frac \int_0^1\left(1-z^2\right)^\cos(xz)\,\mathrmz \\ &\qquad=\frac\Biggl(\,\overbrace^ \ +\, \int_0^12(n+1)\left(1-z^2\right)^nz \fracx\,\mathrmz\Biggr)\\ pt&\qquad= \frac1x\cdot\frac1\int_0^1\left(1-z^2\right)^nz\sin(xz)\,\mathrmz\\ pt&\qquad= -\frac1x\cdot\frac\left(\frac1\int_0^1(1-z^2)^n\cos(xz)\,\mathrmz\right) \\ pt&\qquad= -\fracx \\ pt&\qquad= U_(x). \end If \tfrac14\pi^2 = p/q, with p and q in \N, then, since the coefficients of P_n are integers and its degree is smaller than or equal to \bigl\lfloor \tfrac12n\bigr\rfloor, q^P_n\bigl(\tfrac14\pi^2\bigr) is some integer N. In other words, :N=q^\bigl(\tfrac12\pi\bigr) =q^\frac\left(\dfrac pq \right)^\int_0^1(1-z^2)^n \cos \left(\tfrac12\pi z \right)\,\mathrmz. But this number is clearly greater than 0. On the other hand, the limit of this quantity as n goes to infinity is zero, and so, if n is large enough, N < 1. Thereby, a contradiction is reached. Hermite did not present his proof as an end in itself but as an afterthought within his search for a proof of the transcendence of \pi. He discussed the recurrence relations to motivate and to obtain a convenient integral representation. Once this integral representation is obtained, there are various ways to present a succinct and self-contained proof starting from the integral (as in Cartwright's, Bourbaki's or Niven's presentations), which Hermite could easily see (as he did in his proof of the transcendence of e). Moreover, Hermite's proof is closer to Lambert's proof than it seems. In fact, A_n(x) is the "residue" (or "remainder") of Lambert's continued fraction for \tan x.


Cartwright's proof

Harold Jeffreys Sir Harold Jeffreys, FRS (22 April 1891 – 18 March 1989) was a British geophysicist who made significant contributions to mathematics and statistics. His book, ''Theory of Probability'', which was first published in 1939, played an importan ...
wrote that this proof was set as an example in an exam at
Cambridge University The University of Cambridge is a Public university, public collegiate university, collegiate research university in Cambridge, England. Founded in 1209, the University of Cambridge is the List of oldest universities in continuous operation, wo ...
in 1945 by Mary Cartwright, but that she had not traced its origin. It still remains on the 4th problem sheet today for the Analysis IA course at Cambridge University. Consider the integrals :I_n(x)=\int_^1(1 - z^2)^n\cos(xz)\,dz, where n is a non-negative integer. Two integrations by parts give the
recurrence relation In mathematics, a recurrence relation is an equation according to which the nth term of a sequence of numbers is equal to some combination of the previous terms. Often, only k previous terms of the sequence appear in the equation, for a parameter ...
:x^2I_n(x)=2n(2n-1)I_(x)-4n(n-1)I_(x). \qquad (n \geq 2) If :J_n(x)=x^I_n(x), then this becomes :J_n(x)=2n(2n-1)J_(x)-4n(n-1)x^2J_(x). Furthermore, J_0(x) = 2 \sin x and J_1(x) = -4x\cos x + 4\sin x. Hence for all n \in \Z_+, :J_n(x)=x^I_n(x)=n!\bigl(P_n(x)\sin(x)+Q_n(x)\cos(x)\bigr), where P_n(x) and Q_n(x) are
polynomial In mathematics, a polynomial is a Expression (mathematics), mathematical expression consisting of indeterminate (variable), indeterminates (also called variable (mathematics), variables) and coefficients, that involves only the operations of addit ...
s of degree \leq n, and with
integer An integer is the number zero (0), a positive natural number (1, 2, 3, ...), or the negation of a positive natural number (−1, −2, −3, ...). The negations or additive inverses of the positive natural numbers are referred to as negative in ...
coefficients (depending on Take x = \tfrac12\pi, and suppose if possible that \tfrac12\pi = a/b where a and b are natural numbers (i.e., assume that \pi is rational). Then : \fracI_n\bigl(\tfrac12\pi\bigr) = P_n\bigl(\tfrac12\pi\bigr)b^. The right side is an integer. But 0 < I_n \bigl(\tfrac12\pi\bigr) < 2 since the interval 1,1/math> has length 2 and the function being integrated takes only values between 0 and 1. On the other hand, : \frac \to 0 \quad \textn \to \infty. Hence, for sufficiently large n : 0 < \frac < 1, that is, we could find an integer between 0 and 1. That is the contradiction that follows from the assumption that \pi is rational. This proof is similar to Hermite's proof. Indeed, :\begin J_n(x)&=x^\int_^1 (1 - z^2)^n \cos(xz)\,dz\\ pt &=2x^\int_0^1 (1 - z^2)^n \cos(xz)\,dz\\ pt &=2^n!A_n(x). \end However, it is clearly simpler. This is achieved by omitting the inductive definition of the functions A_n and taking as a starting point their expression as an integral.


Niven's proof

This proof uses the characterization of \pi as the smallest positive
zero 0 (zero) is a number representing an empty quantity. Adding (or subtracting) 0 to any number leaves that number unchanged; in mathematical terminology, 0 is the additive identity of the integers, rational numbers, real numbers, and compl ...
of the
sine In mathematics, sine and cosine are trigonometric functions of an angle. The sine and cosine of an acute angle are defined in the context of a right triangle: for the specified angle, its sine is the ratio of the length of the side opposite th ...
function. Suppose that \pi is rational, i.e. \pi = a/b for some integers a and b which may be taken
without loss of generality ''Without loss of generality'' (often abbreviated to WOLOG, WLOG or w.l.o.g.; less commonly stated as ''without any loss of generality'' or ''with no loss of generality'') is a frequently used expression in mathematics. The term is used to indicat ...
to both be positive. Given any positive integer n, we define the polynomial function: : f(x) = \frac and, for each x \in \R let :F(x) = f(x)-f''(x)+f^(x)+\cdots+(-1)^n f^(x). Claim 1: F(0) + F(\pi) is an integer. Proof: Expanding f as a sum of monomials, the coefficient of x^k is a number of the form c_k /n! where c_k is an integer, which is 0 if k < n. Therefore, f^(0) is 0 when k < n and it is equal to (k! / n!) c_k if in each case, f^(0) is an integer and therefore F(0) is an integer. On the other hand, f(\pi-x) = f(x) and so (-1)^kf^(\pi-x) = f^(x) for each non-negative integer k. In particular, (-1)^kf^(\pi) = f^(0). Therefore, f^(\pi) is also an integer and so F(\pi) is an integer (in fact, it is easy to see that Since F(0) and F(\pi) are integers, so is their sum. Claim 2: : \int_0^\pi f(x)\sin(x)\,dx=F(0)+F(\pi) Proof: Since f^ is the zero polynomial, we have : F'' + F = f. The derivatives of the
sine In mathematics, sine and cosine are trigonometric functions of an angle. The sine and cosine of an acute angle are defined in the context of a right triangle: for the specified angle, its sine is the ratio of the length of the side opposite th ...
and
cosine In mathematics, sine and cosine are trigonometric functions of an angle. The sine and cosine of an acute angle are defined in the context of a right triangle: for the specified angle, its sine is the ratio of the length of the side opposite that ...
function are given by sin' = cos and cos' = −sin. Hence the
product rule In calculus, the product rule (or Leibniz rule or Leibniz product rule) is a formula used to find the derivatives of products of two or more functions. For two functions, it may be stated in Lagrange's notation as (u \cdot v)' = u ' \cdot v ...
implies : (F'\cdot\sin - F\cdot\cos)' = f\cdot\sin By the
fundamental theorem of calculus The fundamental theorem of calculus is a theorem that links the concept of derivative, differentiating a function (mathematics), function (calculating its slopes, or rate of change at every point on its domain) with the concept of integral, inte ...
: \left. \int_0^\pi f(x)\sin(x)\,dx= \bigl(F'(x)\sin x - F(x)\cos x\bigr) \_0^\pi. Since \sin 0 = \sin \pi = 0 and \cos 0 = - \cos \pi = 1 (here we use the above-mentioned characterization of \pi as a zero of the sine function), Claim 2 follows. Conclusion: Since f(x) > 0 and \sin x > 0 for 0 < x < \pi (because \pi is the ''smallest'' positive zero of the sine function), Claims 1 and 2 show that F(0) + F(\pi) is a ''positive'' integer. Since 0 \leq x(a - bx) \leq \pi a and 0 \leq \sin x \leq 1 for 0 \leq x \leq \pi, we have, by the original definition of f, :\int_0^\pi f(x)\sin(x)\,dx\le\pi\frac which is smaller than 1 for large n, hence F(0) + F(\pi) < 1 for these n, by Claim 2. This is impossible for the positive integer F(0) + F(\pi). This shows that the original assumption that \pi is rational leads to a contradiction, which concludes the proof. The above proof is a polished version, which is kept as simple as possible concerning the prerequisites, of an analysis of the formula :\int_0^\pi f(x)\sin(x)\,dx = \sum_^n (-1)^j \left (f^(\pi)+f^(0)\right )+(-1)^\int_0^\pi f^(x)\sin(x)\,dx, which is obtained by 2n + 2 integrations by parts. Claim 2 essentially establishes this formula, where the use of F hides the iterated integration by parts. The last integral vanishes because f^ is the zero polynomial. Claim 1 shows that the remaining sum is an integer. Niven's proof is closer to Cartwright's (and therefore Hermite's) proof than it appears at first sight. In fact, :\begin J_n(x)&=x^\int_^1(1-z^2)^n\cos(xz)\,dz\\ &=\int_^1\left (x^2-(xz)^2\right )^nx\cos(xz)\,dz. \end Therefore, the substitution xz = y turns this integral into :\int_^x(x^2-y^2)^n\cos(y)\,dy. In particular, :\begin J_n\left(\frac\pi2\right)&=\int_^\left(\frac4-y^2\right)^n\cos(y)\,dy\\ pt&=\int_0^\pi\left(\frac4-\left(y-\frac\pi2\right)^2\right)^n\cos\left(y-\frac\pi2\right)\,dy\\ pt&=\int_0^\pi y^n(\pi-y)^n\sin(y)\,dy\\ pt&=\frac\int_0^\pi f(x)\sin(x)\,dx. \end Another connection between the proofs lies in the fact that Hermite already mentions that if f is a polynomial function and :F=f-f^+f^\mp\cdots, then :\int f(x)\sin(x)\,dx=F'(x)\sin(x)-F(x)\cos(x)+C, from which it follows that :\int_0^\pi f(x)\sin(x)\,dx=F(\pi)+F(0).


Bourbaki's proof

Bourbaki Bourbaki(s) may refer to : Persons and science * Charles-Denis Bourbaki (1816–1897), French general, son of Constantin Denis Bourbaki * Colonel Constantin Denis Bourbaki (1787–1827), officer in the Greek War of Independence and serving in the ...
's proof is outlined as an exercise in his
calculus Calculus is the mathematics, mathematical study of continuous change, in the same way that geometry is the study of shape, and algebra is the study of generalizations of arithmetic operations. Originally called infinitesimal calculus or "the ...
treatise. For each natural number ''b'' and each non-negative integer n, define :A_n(b)=b^n\int_0^\pi\frac\sin(x)\,dx. Since A_n(b) is the integral of a function defined on ,\pi/math> that takes the value 0 at 0 and \pi and which is greater than 0 otherwise, A_n(b) > 0. Besides, for each natural number b, A_n(b) < 1 if n is large enough, because : x(\pi-x) \le \left(\frac\pi2\right)^2 and therefore :A_n(b)\le\pi b^n \frac \left(\frac\pi2\right)^ = \pi \frac. On the other hand, repeated integration by parts allows us to deduce that, if a and b are natural numbers such that \pi = a/b and f is the polynomial function from , \pi/math> into \R defined by : f(x)=\frac, then: :\begin A_n(b) &= \int_0^\pi f(x)\sin(x)\,dx \\ pt &= \Big
Big Big or BIG may refer to: * Big, of great size or degree Film and television * Big (film), ''Big'' (film), a 1988 fantasy-comedy film starring Tom Hanks * ''Big'', a 2023 Taiwanese children's film starring Van Fan and Chie Tanaka * ''Big!'', a ...
^ \,- \Big \Big^ + \cdots \\ pt &\ \qquad \pm \Big f^(x) \cos(x) \Big^ \,\pm \int_0^\pi f^(x)\cos(x)\,dx. \end This last integral is 0, since f^ is the null function (because f is a polynomial function of degree Since each function f^ (with takes integer values at 0 and \pi and since the same thing happens with the sine and the cosine functions, this proves that A_n(b) is an integer. Since it is also greater than 0, it must be a natural number. But it was also proved that A_n(b) < 1 if n is large enough, thereby reaching a
contradiction In traditional logic, a contradiction involves a proposition conflicting either with itself or established fact. It is often used as a tool to detect disingenuous beliefs and bias. Illustrating a general tendency in applied logic, Aristotle's ...
. This proof is quite close to Niven's proof, the main difference between them being the way of proving that the numbers A_n(b) are integers.


Laczkovich's proof

Miklós Laczkovich Miklós Laczkovich (born 21 February 1948) is a Hungarian mathematician mainly noted for his work on real analysis and geometric measure theory. His most famous result is the solution of Tarski's circle-squaring problem in 1989.Ruthen, R. (1989 ...
's proof is a simplification of Lambert's original proof. He considers the functions :f_k(x) = 1 - \frack+\frac-\frac + \cdots \quad (k\notin\). These functions are clearly defined for any real number x. Besides :f_(x) = \cos(2x), :f_(x) = \frac. Claim 1: The following
recurrence relation In mathematics, a recurrence relation is an equation according to which the nth term of a sequence of numbers is equal to some combination of the previous terms. Often, only k previous terms of the sequence appear in the equation, for a parameter ...
holds for any real number :\fracf_(x)=f_(x)-f_k(x). Proof: This can be proved by comparing the coefficients of the powers of x. Claim 2: For each real number x, :\lim_f_k(x)=1. Proof: In fact, the sequence x^/n! is bounded (since it converges to and if C is an upper bound and if k > 1, then :\left, f_k(x)-1\\leqslant\sum_^\infty\frac C=C\frac=\frac C. Claim 3: If x \neq 0, x^2 is rational, and k\in\Q\smallsetminus\ then :f_k(x)\neq0 \quad \text \quad \frac\notin\Q. Proof: Otherwise, there would be a number y \neq 0 and integers a and b such that f_k(x) = ay and f_(x) = by. To see why, take y = f_(x), a = 0, and b = 1 if otherwise, choose integers a and b such that f_(x) / f_k(x) = b/a and define y = f_k(x)/a = f_(x)/b. In each case, y cannot be 0, because otherwise it would follow from claim 1 that each f_(x) (n \in \N) would be 0, which would contradict claim 2. Now, take a natural number c such that all three numbers bc/k, ck/x^2, and c/x^2 are integers and consider the sequence :g_n=\beginf_k(x) & n=0\\ \dfracf_(x) & n \neq 0 \end Then :g_0=f_k(x)=ay\in\Z y \quad \text \quad g_1=\frac ckf_(x)=\fracky\in\Z y. On the other hand, it follows from claim 1 that :\begin g_&=\frac\cdot\fracf_(x)\\ pt& =\fracf_(x)-\fracf_(x)\\ pt&=\fracg_-\fracg_n\\ pt&=\left(\frac+\frac cn\right)g_-\fracg_n, \end which is a linear combination of g_ and g_n with integer coefficients. Therefore, each g_n is an integer multiple of y. Besides, it follows from claim 2 that each g_n is greater than 0 (and therefore that if n is large enough and that the sequence of all g_n converges to 0. But a sequence of numbers greater than or equal to , y, cannot converge to 0. Since f_(\tfrac14\pi) = \cos \tfrac12\pi = 0, it follows from claim 3 that \tfrac1\pi^2 is irrational and therefore that \pi is irrational. On the other hand, since :\tan x=\frac=x\frac, another consequence of Claim 3 is that, if x \in \Q \smallsetminus \, then \tan x is irrational. Laczkovich's proof is really about the
hypergeometric function In mathematics, the Gaussian or ordinary hypergeometric function 2''F''1(''a'',''b'';''c'';''z'') is a special function represented by the hypergeometric series, that includes many other special functions as specific or limiting cases. It is ...
. In fact, f_k(x) = _0F_1 (k - x^2) and
Gauss Johann Carl Friedrich Gauss (; ; ; 30 April 177723 February 1855) was a German mathematician, astronomer, Geodesy, geodesist, and physicist, who contributed to many fields in mathematics and science. He was director of the Göttingen Observat ...
found a continued fraction expansion of the hypergeometric function using its
functional equation In mathematics, a functional equation is, in the broadest meaning, an equation in which one or several functions appear as unknowns. So, differential equations and integral equations are functional equations. However, a more restricted meaning ...
. This allowed Laczkovich to find a new and simpler proof of the fact that the tangent function has the continued fraction expansion that Lambert had discovered. Laczkovich's result can also be expressed in Bessel functions of the first kind J_\nu(x). In fact, \Gamma(k)J_(2x) = x^f_k(x) (where \Gamma is the
gamma function In mathematics, the gamma function (represented by Γ, capital Greek alphabet, Greek letter gamma) is the most common extension of the factorial function to complex numbers. Derived by Daniel Bernoulli, the gamma function \Gamma(z) is defined ...
). So Laczkovich's result is equivalent to: If x \neq 0, x^2 is rational, and k\in\Q\smallsetminus\ then :\frac\notin\Q.


See also

* Proof that e is irrational * Proof that is transcendental


References

{{DEFAULTSORT:Proof That Pi Is Irrational Pi Article proofs Irrational numbers