
In
calculus
Calculus, originally called infinitesimal calculus or "the calculus of infinitesimals", is the mathematics, mathematical study of continuous change, in the same way that geometry is the study of shape, and algebra is the study of generalizati ...
, the inverse function rule is a
formula that expresses the
derivative
In mathematics, the derivative of a function of a real variable measures the sensitivity to change of the function value (output value) with respect to a change in its argument (input value). Derivatives are a fundamental tool of calculus. ...
of the
inverse
Inverse or invert may refer to:
Science and mathematics
* Inverse (logic), a type of conditional sentence which is an immediate inference made from another conditional sentence
* Additive inverse (negation), the inverse of a number that, when ad ...
of a
bijective
In mathematics, a bijection, also known as a bijective function, one-to-one correspondence, or invertible function, is a function between the elements of two sets, where each element of one set is paired with exactly one element of the other ...
and
differentiable function
In mathematics, a differentiable function of one real variable is a function whose derivative exists at each point in its domain. In other words, the graph of a differentiable function has a non- vertical tangent line at each interior point in ...
in terms of the derivative of . More precisely, if the inverse of
is denoted as
, where
if and only if
, then the inverse function rule is, in
Lagrange's notation
In differential calculus, there is no single uniform notation for differentiation. Instead, various notations for the derivative of a function or variable have been proposed by various mathematicians. The usefulness of each notation varies wit ...
,
:
.
This formula holds in general whenever
is
continuous and
injective
In mathematics, an injective function (also known as injection, or one-to-one function) is a function that maps distinct elements of its domain to distinct elements; that is, implies . (Equivalently, implies in the equivalent contraposi ...
on an interval , with
being differentiable at
(
) and where
. The same formula is also equivalent to the expression
:
where
denotes the unary derivative operator (on the space of functions) and
denotes
function composition
In mathematics, function composition is an operation that takes two functions and , and produces a function such that . In this operation, the function is applied to the result of applying the function to . That is, the functions and ...
.
Geometrically, a function and inverse function have
graphs that are
reflections, in the line
. This reflection operation turns the
gradient
In vector calculus, the gradient of a scalar-valued differentiable function of several variables is the vector field (or vector-valued function) \nabla f whose value at a point p is the "direction and rate of fastest increase". If the gr ...
of any line into its
reciprocal.
Assuming that
has an inverse in a
neighbourhood
A neighbourhood (British English, Irish English, Australian English and Canadian English) or neighborhood (American English; American and British English spelling differences, see spelling differences) is a geographically localised community ...
of
and that its derivative at that point is non-zero, its inverse is guaranteed to be differentiable at
and have a derivative given by the above formula.
The inverse function rule may also be expressed in
Leibniz's notation
In calculus, Leibniz's notation, named in honor of the 17th-century German philosopher and mathematician Gottfried Wilhelm Leibniz, uses the symbols and to represent infinitely small (or infinitesimal) increments of and , respectively, just ...
. As that notation suggests,
:
This relation is obtained by differentiating the equation
in terms of and applying the
chain rule
In calculus, the chain rule is a formula that expresses the derivative of the Function composition, composition of two differentiable functions and in terms of the derivatives of and . More precisely, if h=f\circ g is the function such that h(x) ...
, yielding that:
:
considering that the derivative of with respect to ' is 1.
Derivation
Let
be an invertible (bijective) function, let
be in the domain of
, and let
be in the codomain of
. Since f is a bijective function,
is in the range of
. This also means that
is in the domain of
, and that
is in the codomain of
. Since
is an invertible function, we know that
. The inverse function rule can be obtained by taking the derivative of this equation.
:
The right side is equal to 1 and the chain rule can be applied to the left side:
:
Rearranging then gives
:
Rather than using
as the variable, we can rewrite this equation using
as the input for
, and we get the following:
:
Examples
*
(for positive ) has inverse
.
:
:
At
, however, there is a problem: the graph of the square root function becomes vertical, corresponding to a horizontal tangent for the square function.
*
(for real ) has inverse
(for positive
)
:
:
Additional properties
*
Integrating this relationship gives
::
:This is only useful if the integral exists. In particular we need
to be non-zero across the range of integration.
:It follows that a function that has a
continuous derivative has an inverse in a
neighbourhood
A neighbourhood (British English, Irish English, Australian English and Canadian English) or neighborhood (American English; American and British English spelling differences, see spelling differences) is a geographically localised community ...
of every point where the derivative is non-zero. This need not be true if the derivative is not continuous.
* Another very interesting and useful property is the following:
::
:Where
denotes the antiderivative of
.
* The inverse of the derivative of f(x) is also of interest, as it is used in showing the convexity of the
Legendre transform.
Let
then we have, assuming
:
This can be shown using the previous notation
. Then we have:
:
Therefore:
:
By induction, we can generalize this result for any integer
, with
, the nth derivative of f(x), and
, assuming
:
:
Higher derivatives
The
chain rule
In calculus, the chain rule is a formula that expresses the derivative of the Function composition, composition of two differentiable functions and in terms of the derivatives of and . More precisely, if h=f\circ g is the function such that h(x) ...
given above is obtained by differentiating the identity
with respect to . One can continue the same process for higher derivatives. Differentiating the identity twice with respect to ', one obtains
:
that is simplified further by the chain rule as
:
Replacing the first derivative, using the identity obtained earlier, we get
:
Similarly for the third derivative:
:
or using the formula for the second derivative,
:
These formulas are generalized by the
Faà di Bruno's formula
Faà di Bruno's formula is an identity in mathematics generalizing the chain rule to higher derivatives. It is named after , although he was not the first to state or prove the formula. In 1800, more than 50 years before Faà di Bruno, the French ...
.
These formulas can also be written using Lagrange's notation. If ' and ' are inverses, then
:
Example
*
has the inverse
. Using the formula for the second derivative of the inverse function,
:
so that
:
,
which agrees with the direct calculation.
See also
*
*
*
*
*
*
*
*
*
*
References
*
{{Calculus topics
Articles containing proofs
Differentiation rules
Inverse functions
Theorems in analysis
Theorems in calculus