mathematics Mathematics is a field of study that discovers and organizes methods, Mathematical theory, theories and theorems that are developed and Mathematical proof, proved for the needs of empirical sciences and mathematics itself. There are many ar ...

, auxiliary functions are an important construction in

transcendental number theory Transcendental number theory is a branch of number theory that investigates transcendental numbers (numbers that are not solutions of any polynomial equation with rational coefficients), in both qualitative and quantitative ways. Transcendenc ...

. They are functions that appear in most proofs in this area of mathematics and that have specific, desirable properties, such as taking the value zero for many arguments, or having a zero of high

order Order, ORDER or Orders may refer to: * A socio-political or established or existing order, e.g. World order, Ancien Regime, Pax Britannica * Categorization, the process in which ideas and objects are recognized, differentiated, and understood ...

at some point.

Definition

Auxiliary functions are not a rigorously defined kind of function, rather they are functions which are either explicitly constructed or at least shown to exist and which provide a contradiction to some assumed hypothesis, or otherwise prove the result in question. Creating a function during the course of a proof in order to prove the result is not a technique exclusive to transcendence theory, but the term "auxiliary function" usually refers to the functions created in this area.

Explicit functions

Liouville's transcendence criterion

Because of the naming convention mentioned above, auxiliary functions can be dated back to their source simply by looking at the earliest results in transcendence theory. One of these first results was Liouville's proof that

transcendental numbers In mathematics, a transcendental number is a real or complex number that is not algebraic: that is, not the root of a non-zero polynomial with integer (or, equivalently, rational) coefficients. The best-known transcendental numbers are and . T ...

exist when he showed that the so called

Liouville number In number theory, a Liouville number is a real number x with the property that, for every positive integer n, there exists a pair of integers (p,q) with q>1 such that :0<\left, x-\frac\<\frac. The inequality implies that Liouville numbers po ...

s were transcendental. He did this by discovering a transcendence criterion which these numbers satisfied. To derive this criterion he started with a general

algebraic number In mathematics, an algebraic number is a number that is a root of a function, root of a non-zero polynomial in one variable with integer (or, equivalently, Rational number, rational) coefficients. For example, the golden ratio (1 + \sqrt)/2 is ...

α and found some property that this number would necessarily satisfy. The auxiliary function he used in the course of proving this criterion was simply the minimal polynomial of α, which is the irreducible polynomial ''f'' with integer coefficients such that ''f''(α) = 0. This function can be used to estimate how well the algebraic number α can be

estimated Estimation (or estimating) is the process of finding an estimate or approximation, which is a value that is usable for some purpose even if input data may be incomplete, uncertain, or unstable. The value is nonetheless usable because it is de ...

rational number In mathematics, a rational number is a number that can be expressed as the quotient or fraction of two integers, a numerator and a non-zero denominator . For example, is a rational number, as is every integer (for example, The set of all ...

s ''p''/''q''. Specifically if α has degree ''d'' at least two then he showed that :

\left, f\left(\frac\right)\\geq\frac,

and also, using the

mean value theorem In mathematics, the mean value theorem (or Lagrange's mean value theorem) states, roughly, that for a given planar arc (geometry), arc between two endpoints, there is at least one point at which the tangent to the arc is parallel to the secant lin ...

, that there is some constant depending on α, say ''c''(α), such that :

\left, f\left(\frac\right)\ \leq c(\alpha)\left, \alpha-\frac\.

Combining these results gives a property that the algebraic number must satisfy; therefore any number not satisfying this criterion must be transcendental. The auxiliary function in Liouville's work is very simple, merely a polynomial that vanishes at a given algebraic number. This kind of property is usually the one that auxiliary functions satisfy. They either vanish or become very small at particular points, which is usually combined with the assumption that they do not vanish or can't be too small to derive a result.

Fourier's proof of the irrationality of ''e''

Another simple, early occurrence is in Fourier's proof of the irrationality of ''e'', though the notation used usually disguises this fact. Fourier's proof used the

power series In mathematics, a power series (in one variable) is an infinite series of the form \sum_^\infty a_n \left(x - c\right)^n = a_0 + a_1 (x - c) + a_2 (x - c)^2 + \dots where ''a_n'' represents the coefficient of the ''n''th term and ''c'' is a co ...

of the exponential function: :

e^x=\sum_^ \frac.

By truncating this power series after, say, ''N'' + 1 terms we get a polynomial with rational coefficients of degree ''N'' which is in some sense "close" to the function ''e''^''x''. Specifically if we look at the auxiliary function defined by the remainder: :

R(x)=e^x-\sum_^ \frac

then this function—an

exponential polynomial In mathematics, exponential polynomials are functions on fields, rings, or abelian groups that take the form of polynomials in a variable and an exponential function. Definition In fields An exponential polynomial generally has both a variabl ...

—should take small values for ''x'' close to zero. If ''e'' is a rational number then by letting ''x'' = 1 in the above formula we see that ''R''(1) is also a rational number. However, Fourier proved that ''R''(1) could not be rational by eliminating every possible denominator. Thus ''e'' cannot be rational.

Hermite's proof of the irrationality of ''e''^''r''

Hermite extended the work of Fourier by approximating the function ''e''^''x'' not with a polynomial but with a

rational function In mathematics, a rational function is any function that can be defined by a rational fraction, which is an algebraic fraction such that both the numerator and the denominator are polynomials. The coefficients of the polynomials need not be ...

, that is a quotient of two polynomials. In particular he chose polynomials ''A''(''x'') and ''B''(''x'') such that the auxiliary function ''R'' defined by :

R(x)=B(x)e^x-A(x)

could be made as small as he wanted around ''x'' = 0. But if ''e''^''r'' were rational then ''R''(''r'') would have to be rational with a particular denominator, yet Hermite could make ''R''(''r'') too small to have such a denominator, hence a contradiction.

Hermite's proof of the transcendence of ''e''

To prove that ''e'' was in fact transcendental, Hermite took his work one step further by approximating not just the function ''e''^''x'', but also the functions ''e''^''kx'' for integers ''k'' = 1,...,''m'', where he assumed ''e'' was algebraic with degree ''m''. By approximating ''e''^''kx'' by rational functions with integer coefficients and with the same denominator, say ''A''_''k''(''x'') / ''B''(''x''), he could define auxiliary functions ''R''_''k''(''x'') by :

R_k(x)=B(x)e^-A_k(x).

For his contradiction Hermite supposed that ''e'' satisfied the polynomial equation with integer coefficients ''a''₀ + ''a''₁''e'' + ... + ''a''_''m''''e''^''m'' = 0. Multiplying this expression through by ''B''(1) he noticed that it implied :

R=a_0+a_1 R_1(1) + \cdots +a_m R_m(1)=a_1 A_1(1)+ \cdots +a_m A_m(1).

The right hand side is an integer and so, by estimating the auxiliary functions and proving that 0 < , ''R'', < 1 he derived the necessary contradiction.

Auxiliary functions from the pigeonhole principle

The auxiliary functions sketched above can all be explicitly calculated and worked with. A breakthrough by

Axel Thue Axel Thue (; 19 February 1863 – 7 March 1922) was a Norwegian mathematician, known for his original work in diophantine approximation and combinatorics. Work Thue published his first important paper in 1909. He stated in 1914 the so-called w ...

and

Carl Ludwig Siegel Carl Ludwig Siegel (31 December 1896 – 4 April 1981) was a German mathematician specialising in analytic number theory. He is known for, amongst other things, his contributions to the Thue–Siegel–Roth theorem in Diophantine approximation, ...

in the twentieth century was the realisation that these functions don't necessarily need to be explicitly known – it can be enough to know they exist and have certain properties. Using the

Pigeonhole Principle In mathematics, the pigeonhole principle states that if items are put into containers, with , then at least one container must contain more than one item. For example, of three gloves, at least two must be right-handed or at least two must be l ...

Thue, and later Siegel, managed to prove the existence of auxiliary functions which, for example, took the value zero at many different points, or took high order zeros at a smaller collection of points. Moreover they proved it was possible to construct such functions without making the functions too large. Their auxiliary functions were not explicit functions, then, but by knowing that a certain function with certain properties existed, they used its properties to simplify the transcendence proofs of the nineteenth century and give several new results. This method was picked up on and used by several other mathematicians, including Alexander Gelfond and Theodor Schneider who used it independently to prove the Gelfond–Schneider theorem. Alan Baker also used the method in the 1960s for his work on linear forms in logarithms and ultimately

Baker's theorem In transcendental number theory, a mathematical discipline, Baker's theorem gives a lower bound for the absolute value of linear combinations of logarithms of algebraic numbers. Nearly fifteen years earlier, Alexander Gelfond had considered the pr ...

. Another example of the use of this method from the 1960s is outlined below.

Auxiliary polynomial theorem

Let β equal the cube root of ''b/a'' in the equation ''ax''³ + ''bx''³ = ''c'' and assume ''m'' is an integer that satisfies ''m'' + 1 > 2''n''/3 ≥ ''m'' ≥ 3 where ''n'' is a positive integer. Then there exists :

F(X,Y) = P(X) + Y*Q(X)

such that :

\sum_^ u_i X^i = P(X),

\sum_^ v_i X^i = Q(X).

The auxiliary polynomial theorem states :

\max_ \le 2b^.

A theorem of Lang

In the 1960s

Serge Lang Serge Lang (; May 19, 1927 – September 12, 2005) was a French-American mathematician and activist who taught at Yale University for most of his career. He is known for his work in number theory and for his mathematics textbooks, including the i ...

proved a result using this non-explicit form of auxiliary functions. The theorem implies both the Hermite–Lindemann and Gelfond–Schneider theorems. The theorem deals with a

number field In mathematics, an algebraic number field (or simply number field) is an extension field K of the field of rational numbers such that the field extension K / \mathbb has finite degree (and hence is an algebraic field extension). Thus K is a ...

''K'' and meromorphic functions ''f''₁,...,''f''_''N'' of

at most ''ρ'', at least two of which are algebraically independent, and such that if we differentiate any of these functions then the result is a polynomial in all of the functions. Under these hypotheses the theorem states that if there are ''m'' distinct

complex number In mathematics, a complex number is an element of a number system that extends the real numbers with a specific element denoted , called the imaginary unit and satisfying the equation i^= -1; every complex number can be expressed in the for ...

s ω₁,...,ω_''m'' such that ''f''_''i'' (ω_''j'') is in ''K'' for all combinations of ''i'' and ''j'', then ''m'' is bounded by :

m\leq 20\rho :\mathbb

To prove the result Lang took two algebraically independent functions from ''f''₁,...,''f''_''N'', say ''f'' and ''g'', and then created an auxiliary function which was simply a polynomial ''F'' in ''f'' and ''g''. This auxiliary function could not be explicitly stated since ''f'' and ''g'' are not explicitly known. But using Siegel's lemma Lang showed how to make ''F'' in such a way that it vanished to a high order at the ''m'' complex numbers ω₁,...,ω_''m''. Because of this high order vanishing it can be shown that a high-order derivative of ''F'' takes a value of small size one of the ω_''i''s, "size" here referring to an algebraic property of a number. Using the maximum modulus principle Lang also found a separate way to estimate the absolute values of derivatives of ''F'', and using standard results comparing the size of a number and its absolute value he showed that these estimates were contradicted unless the claimed bound on ''m'' holds.

Interpolation determinants

After the myriad of successes gleaned from using existent but not explicit auxiliary functions, in the 1990s Michel Laurent introduced the idea of interpolation determinants. These are alternants – determinants of matrices of the form :

\mathcal=\left(\varphi_i(\zeta_j)\right)_

where φ_''i'' are a set of functions interpolated at a set of points ζ_''j''. Since a determinant is just a polynomial in the entries of a matrix, these auxiliary functions succumb to study by analytic means. A problem with the method was the need to choose a basis before the matrix could be worked with. A development by Jean-Benoît Bost removed this problem with the use of

Arakelov theory In mathematics, Arakelov theory (or Arakelov geometry) is an approach to Diophantine geometry, named for Suren Arakelov. It is used to study Diophantine equations in higher dimensions. Background The main motivation behind Arakelov geometry is tha ...

, and research in this area is ongoing. The example below gives an idea of the flavour of this approach.

A proof of the Hermite–Lindemann theorem

One of the simpler applications of this method is a proof of the real version of the Hermite–Lindemann theorem. That is, if α is a non-zero, real algebraic number, then ''e''^α is transcendental. First we let ''k'' be some natural number and ''n'' be a large multiple of ''k''. The interpolation determinant considered is the determinant Δ of the ''n''⁴×''n''⁴ matrix :

\left(\^\Big, _\right).

The rows of this matrix are indexed by 1 ≤ ''i''₁ ≤ ''n''⁴/''k'' and 1 ≤ ''i''₂ ≤ ''k'', while the columns are indexed by 1 ≤ ''j''₁ ≤ ''n''³ and 1 ≤ ''j''₂ ≤ ''n''. So the functions in our matrix are monomials in ''x'' and ''e''^''x'' and their derivatives, and we are interpolating at the ''k'' points 0,α,2α,...,(''k'' − 1)α. Assuming that ''e''^α is algebraic we can form the number field Q(α,''e''^α) of degree ''m'' over Q, and then multiply Δ by a suitable

denominator A fraction (from , "broken") represents a part of a whole or, more generally, any number of equal parts. When spoken in everyday English, a fraction describes how many parts of a certain size there are, for example, one-half, eight-fifths, thre ...

as well as all its images under the embeddings of the field Q(α,''e''^α) into C. For algebraic reasons this product is necessarily an integer, and using arguments relating to

Wronskian In mathematics, the Wronskian of ''n'' differentiable functions is the determinant formed with the functions and their derivatives up to order . It was introduced in 1812 by the Polish mathematician Józef Wroński, and is used in the study of ...

s it can be shown that it is non-zero, so its absolute value is an integer Ω ≥ 1. Using a version of the

for matrices it is possible to get an analytic bound on Ω as well, and in fact using big-O notation we have :

\Omega=O\left(\exp\left(\left(\frac-\frac\right)n^8\log n\right)\right).

The number ''m'' is fixed by the degree of the field Q(α,''e''^α), but ''k'' is the number of points we are interpolating at, and so we can increase it at will. And once ''k'' > 2(''m'' + 1)/3 we will have Ω → 0, eventually contradicting the established condition Ω ≥ 1. Thus ''e''^α cannot be algebraic after all.Adapted from Pila (1993).

Notes

References

* * * * * * * * * * * * * {{DEFAULTSORT:Auxiliary Function Number theory Diophantine approximation