mathematics Mathematics is a field of study that discovers and organizes methods, Mathematical theory, theories and theorems that are developed and Mathematical proof, proved for the needs of empirical sciences and mathematics itself. There are many ar ...

and its applications, a Sturm–Liouville problem is a second-order linear

ordinary differential equation In mathematics, an ordinary differential equation (ODE) is a differential equation (DE) dependent on only a single independent variable (mathematics), variable. As with any other DE, its unknown(s) consists of one (or more) Function (mathematic ...

of the form

+ q(x)y = -\lambda w(x) y

for given functions

p(x)

q(x)

and

w(x)

, together with some boundary conditions at extreme values of

x

. The goals of a given Sturm–Liouville problem are: * To find the for which there exists a non-trivial solution to the problem. Such values are called the ''

eigenvalue In linear algebra, an eigenvector ( ) or characteristic vector is a vector that has its direction unchanged (or reversed) by a given linear transformation. More precisely, an eigenvector \mathbf v of a linear transformation T is scaled by a ...

s'' of the problem. * For each eigenvalue , to find the corresponding solution

y = y(x)

of the problem. Such functions

y

are called the '' eigenfunctions'' associated to each . Sturm–Liouville theory is the general study of Sturm–Liouville problems. In particular, for a "regular" Sturm–Liouville problem, it can be shown that there are an infinite number of eigenvalues each with a unique eigenfunction, and that these eigenfunctions form an orthonormal basis of a certain Hilbert space of functions. This theory is important in

applied mathematics Applied mathematics is the application of mathematics, mathematical methods by different fields such as physics, engineering, medicine, biology, finance, business, computer science, and Industrial sector, industry. Thus, applied mathematics is a ...

, where Sturm–Liouville problems occur very frequently, particularly when dealing with separable linear

partial differential equation In mathematics, a partial differential equation (PDE) is an equation which involves a multivariable function and one or more of its partial derivatives. The function is often thought of as an "unknown" that solves the equation, similar to ho ...

s. For example, in

quantum mechanics Quantum mechanics is the fundamental physical Scientific theory, theory that describes the behavior of matter and of light; its unusual characteristics typically occur at and below the scale of atoms. Reprinted, Addison-Wesley, 1989, It is ...

, the one-dimensional time-independent

Schrödinger equation The Schrödinger equation is a partial differential equation that governs the wave function of a non-relativistic quantum-mechanical system. Its discovery was a significant landmark in the development of quantum mechanics. It is named after E ...

is a Sturm–Liouville problem. Sturm–Liouville theory is named after Jacques Charles François Sturm (1803–1855) and Joseph Liouville (1809–1882), who developed the theory.

Main results

The main results in Sturm–Liouville theory apply to a Sturm–Liouville problem on a finite interval

, b /math> that is "regular". The problem is said to be ''regular'' if:
* the coefficient functions p, q, w and the derivative p' are all continuous on, b /math>;
* p(x) > 0 and w(x) > 0 for all x \in, b /math>;
* the problem has separated boundary conditions of the form


The function w = w(x), sometimes denoted r = r(x), is called the ''weight'' or ''density'' function.

The goals of a Sturm–Liouville problem are:
* to find the eigenvalues: those  for which there exists a

non-trivial In mathematics, the adjective trivial is often used to refer to a claim or a case which can be readily obtained from context, or a particularly simple object possessing a given structure (e.g., group, topological space). The noun triviality usual ...

solution; * for each eigenvalue , to find the corresponding eigenfunction

y = y(x)

. For a regular Sturm–Liouville problem, a function

y = y(x)

is called a ''solution'' if it is continuously differentiable and satisfies the equation () at every

x \in (a, b)

. In the case of more general

p, q, w

, the solutions must be understood in a weak sense. The terms eigenvalue and eigenvector are used because the solutions correspond to the

eigenvalues In linear algebra, an eigenvector ( ) or characteristic vector is a vector that has its direction unchanged (or reversed) by a given linear transformation. More precisely, an eigenvector \mathbf v of a linear transformation T is scaled by a ...

and eigenfunctions of a Hermitian

differential operator In mathematics, a differential operator is an operator defined as a function of the differentiation operator. It is helpful, as a matter of notation first, to consider differentiation as an abstract operation that accepts a function and retur ...

in an appropriate

Hilbert space In mathematics, a Hilbert space is a real number, real or complex number, complex inner product space that is also a complete metric space with respect to the metric induced by the inner product. It generalizes the notion of Euclidean space. The ...

of functions with inner product defined using the weight function. Sturm–Liouville theory studies the existence and asymptotic behavior of the eigenvalues, the corresponding qualitative theory of the eigenfunctions and their completeness in the function space. The main result of Sturm–Liouville theory states that, for any regular Sturm–Liouville problem: * The eigenvalues

\lambda_1, \lambda_2, \dots

are real and can be numbered so that

\lambda_1 < \lambda_2 < \cdots < \lambda_n < \cdots \to \infty.

* Corresponding to each eigenvalue

\lambda_n

is a unique (up to constant multiple) eigenfunction

y_n = y_n(x)

with exactly

n - 1

zeros in

, b /math>, called the th ''fundamental solution''.
* The normalized eigenfunctions y_n form an

orthonormal basis In mathematics, particularly linear algebra, an orthonormal basis for an inner product space V with finite Dimension (linear algebra), dimension is a Basis (linear algebra), basis for V whose vectors are orthonormal, that is, they are all unit vec ...

under the ''w''-weighted inner product in the

w(x)\,\mathrmx\big)

; that is,

\langle y_n, y_m\rangle = \int_a^b y_n(x) y_m(x) w(x)\,\mathrmx = \delta_,

where

\delta_

is the

Kronecker delta In mathematics, the Kronecker delta (named after Leopold Kronecker) is a function of two variables, usually just non-negative integers. The function is 1 if the variables are equal, and 0 otherwise: \delta_ = \begin 0 &\text i \neq j, \\ 1 &\ ...

Reduction to Sturm–Liouville form

The differential equation () is said to be in Sturm–Liouville form or self-adjoint form. All second-order linear homogenous

s can be recast in the form on the left-hand side of () by multiplying both sides of the equation by an appropriate integrating factor (although the same is not true of second-order

s, or if is a

vector Vector most often refers to: * Euclidean vector, a quantity with a magnitude and a direction * Disease vector, an agent that carries and transmits an infectious pathogen into another living organism Vector may also refer to: Mathematics a ...

). Some examples are below.

Bessel equation

x^2y'' + xy' + \left(x^2-\nu^2\right)y = 0

which can be written in Sturm–Liouville form (first by dividing through by , then by collapsing the first two terms on the left into one term) as

\left(xy'\right)'+ \left (x-\frac x \right )y=0.

Legendre equation

\left(1-x^2\right)y''-2xy'+\nu(\nu+1)y=0

which can be put into Sturm–Liouville form, since , so the Legendre equation is equivalent to

\left (\left(1-x^2\right)y' \right )'+\nu(\nu+1)y=0

Example using an integrating factor

x^3y''-xy'+2y=0

Divide throughout by :

y''-\fracy'+\fracy=0

Multiplying throughout by an integrating factor of

\mu(x) =\exp\left(\int -\frac\right)=e^,

gives

e^y''-\frac y'+ \frac y = 0

which can be put into Sturm–Liouville form since

\frac e^ = -\frac

so the differential equation is equivalent to

\left (e^y' \right )'+\frac y = 0.

Integrating factor for general second-order homogenous equation

P(x)y'' + Q(x)y' + R(x)y=0

Multiplying through by the integrating factor

\mu(x) = \frac 1  \exp \left(\int \frac \, dx\right),

and then collecting gives the Sturm–Liouville form:

\frac \left(\mu(x)P(x)y'\right) + \mu(x)R(x)y = 0,

or, explicitly:

\frac \left(\exp\left (\int \frac \,dx\right)y' \right )+\frac \exp \left(\int \frac\, dx\right) y = 0.

Sturm–Liouville equations as self-adjoint differential operators

The mapping defined by:

q(x)u \right)

can be viewed as a

linear operator In mathematics, and more specifically in linear algebra, a linear map (also called a linear mapping, linear transformation, vector space homomorphism, or in some contexts linear function) is a mapping V \to W between two vector spaces that pr ...

mapping a function to another function , and it can be studied in the context of

functional analysis Functional analysis is a branch of mathematical analysis, the core of which is formed by the study of vector spaces endowed with some kind of limit-related structure (for example, Inner product space#Definition, inner product, Norm (mathematics ...

. In fact, equation () can be written as

Lu  = \lambda u.

This is precisely the

problem; that is, one seeks eigenvalues and the corresponding eigenvectors of the operator. The proper setting for this problem is the

w(x)\,dx)

with scalar product

\langle f, g\rangle = \int_a^b \overline g(x)w(x)\, dx.

In this space is defined on sufficiently smooth functions which satisfy the above regular boundary conditions. Moreover, is a

self-adjoint In mathematics, an element of a *-algebra is called self-adjoint if it is the same as its adjoint (i.e. a = a^*). Definition Let \mathcal be a *-algebra. An element a \in \mathcal is called self-adjoint if The set of self-adjoint elements ...

operator:

\langle L f, g \rangle = \langle f, L g \rangle .

This can be seen formally by using

integration by parts In calculus, and more generally in mathematical analysis, integration by parts or partial integration is a process that finds the integral of a product of functions in terms of the integral of the product of their derivative and antiderivati ...

twice, where the boundary terms vanish by virtue of the boundary conditions. It then follows that the eigenvalues of a Sturm–Liouville operator are real and that eigenfunctions of corresponding to different eigenvalues are orthogonal. However, this operator is unbounded and hence existence of an orthonormal basis of eigenfunctions is not evident. To overcome this problem, one looks at the resolvent

\left (L - z\right)^, \qquad z \in \Reals,

where is not an eigenvalue. Then, computing the resolvent amounts to solving a nonhomogeneous equation, which can be done using the variation of parameters formula. This shows that the resolvent is an integral operator with a continuous symmetric kernel (the Green's function of the problem). As a consequence of the

Arzelà–Ascoli theorem The Arzelà–Ascoli theorem is a fundamental result of mathematical analysis giving necessary and sufficient conditions to decide whether every sequence of a given family of real-valued continuous functions defined on a closed and bounded inte ...

, this integral operator is compact and existence of a sequence of eigenvalues which converge to 0 and eigenfunctions which form an orthonormal basis follows from the spectral theorem for compact operators. Finally, note that

\left(L-z\right)^ u = \alpha u, \qquad L u = \left(z+\alpha^\right) u,

are equivalent, so we may take

\lambda = z+\alpha^

with the same eigenfunctions. If the interval is unbounded, or if the coefficients have singularities at the boundary points, one calls singular. In this case, the spectrum no longer consists of eigenvalues alone and can contain a continuous component. There is still an associated eigenfunction expansion (similar to Fourier series versus Fourier transform). This is important in

, since the one-dimensional time-independent

is a special case of a Sturm–Liouville equation.

Application to inhomogeneous second-order boundary value problems

Consider a general inhomogeneous second-order linear differential equation

P(x)y'' + Q(x)y' +R(x)y = f(x)

for given functions

P(x), Q(x), R(x),f(x)

. As before, this can be reduced to the Sturm–Liouville form

Ly = f

: writing a general Sturm–Liouville operator as:

Lu = \fracu'' + \fracu' + \fracu,

one solves the system:

p = Pw,\quad p' = Qw,\quad q = Rw.

It suffices to solve the first two equations, which amounts to solving , or

w' = \fracw:= \alpha w.

A solution is:

w = \exp\left(\int\alpha \, dx\right), \quad p = P \exp\left(\int\alpha \, dx\right), \quad q = R \exp\left(\int\alpha \, dx\right).

Given this transformation, one is left to solve:

Ly = f.

In general, if initial conditions at some point are specified, for example and , a second order differential equation can be solved using ordinary methods and the Picard–Lindelöf theorem ensures that the differential equation has a unique solution in a neighbourhood of the point where the initial conditions have been specified. But if in place of specifying initial values at a ''single point'', it is desired to specify values at ''two'' different points (so-called boundary values), e.g. and , the problem turns out to be much more difficult. Notice that by adding a suitable known differentiable function to , whose values at and satisfy the desired boundary conditions, and injecting inside the proposed differential equation, it can be assumed without loss of generality that the boundary conditions are of the form and . Here, the Sturm–Liouville theory comes in play: indeed, a large class of functions can be expanded in terms of a series of orthonormal eigenfunctions of the associated Liouville operator with corresponding eigenvalues :

f(x) = \sum_i \alpha_i u_i(x), \quad \alpha_i \in .

Then a solution to the proposed equation is evidently:

y = \sum_i \frac u_i.

This solution will be valid only over the open interval , and may fail at the boundaries.

Example: Fourier series

Consider the Sturm–Liouville problem: for the unknowns are and . For boundary conditions, we take for example:

u(0) = u(\pi) = 0.

Observe that if is any integer, then the function

u_k(x) = \sin kx

is a solution with eigenvalue . We know that the solutions of a Sturm–Liouville problem form an orthogonal basis, and we know from

Fourier series A Fourier series () is an Series expansion, expansion of a periodic function into a sum of trigonometric functions. The Fourier series is an example of a trigonometric series. By expressing a function as a sum of sines and cosines, many problems ...

that this set of sinusoidal functions is an orthogonal basis. Since orthogonal bases are always maximal (by definition) we conclude that the Sturm–Liouville problem in this case has no other eigenvectors. Given the preceding, let us now solve the inhomogeneous problem

L  y  =x, \qquad x\in(0,\pi)

with the same boundary conditions

y(0) = y(\pi) = 0

. In this case, we must expand as a Fourier series. The reader may check, either by integrating or by consulting a table of Fourier transforms, that we thus obtain

L  y  = \sum_^\infty -2\frac k \sin kx.

This particular Fourier series is troublesome because of its poor convergence properties. It is not clear ''a priori'' whether the series converges pointwise. Because of Fourier analysis, since the Fourier coefficients are " square-summable", the Fourier series converges in which is all we need for this particular theory to function. We mention for the interested reader that in this case we may rely on a result which says that Fourier series converge at every point of differentiability, and at jump points (the function ''x'', considered as a periodic function, has a jump at ) converges to the average of the left and right limits (see convergence of Fourier series). Therefore, by using formula (), we obtain the solution:

y=\sum_^\infty 2\frac\sin kx= \tfrac 1 6 (x^3 -\pi^2 x).

In this case, we could have found the answer using antidifferentiation, but this is no longer useful in most cases when the differential equation is in many variables.

Application to partial differential equations

Normal modes

Certain

s can be solved with the help of Sturm–Liouville theory. Suppose we are interested in the vibrational modes of a thin membrane, held in a rectangular frame, , . The equation of motion for the vertical membrane's displacement, is given by the

wave equation The wave equation is a second-order linear partial differential equation for the description of waves or standing wave fields such as mechanical waves (e.g. water waves, sound waves and seismic waves) or electromagnetic waves (including light ...

\frac+\frac = \frac 1  \frac.

The method of

separation of variables In mathematics, separation of variables (also known as the Fourier method) is any of several methods for solving ordinary differential equation, ordinary and partial differential equations, in which algebra allows one to rewrite an equation so tha ...

suggests looking first for solutions of the simple form . For such a function the partial differential equation becomes . Since the three terms of this equation are functions of separately, they must be constants. For example, the first term gives for a constant . The boundary conditions ("held in a rectangular frame") are when , or , and define the simplest possible Sturm–Liouville eigenvalue problems as in the example, yielding the "normal mode solutions" for with harmonic time dependence,

W_(x,y,t) = A_ \sin\left(\frac\right) \sin\left(\frac\right)\cos\left(\omega_t\right)

where and are non-zero

integer An integer is the number zero (0), a positive natural number (1, 2, 3, ...), or the negation of a positive natural number (−1, −2, −3, ...). The negations or additive inverses of the positive natural numbers are referred to as negative in ...

s, are arbitrary constants, and

\omega^2_ = c^2 \left(\frac+\frac\right).

The functions form a basis for the

of (generalized) solutions of the wave equation; that is, an arbitrary solution can be decomposed into a sum of these modes, which vibrate at their individual frequencies . This representation may require a convergent infinite sum.

Second-order linear equation

Consider a linear second-order differential equation in one spatial dimension and first-order in time of the form:

f(x) \frac + g(x) \frac + h(x) u= \frac + k(t) u,

u(a,t)=u(b,t)=0, \qquad u(x,0)=s(x).

Separating variables, we assume that

u(x,t) = X(x) T(t).

Then our above partial differential equation may be written as:

\frac = \frac

where

\hat=f(x) \frac+g(x) \frac+h(x), \qquad \hat = \frac + k(t).

Since, by definition, and are independent of time and and are independent of position , then both sides of the above equation must be equal to a constant:

\hat X(x) =\lambda X(x),\qquad 
X(a)=X(b)=0,\qquad
\hat T(t) =\lambda T(t).

The first of these equations must be solved as a Sturm–Liouville problem in terms of the eigenfunctions and eigenvalues . The second of these equations can be analytically solved once the eigenvalues are known.

\frac T_n (t)= \bigl(\lambda_n -k(t)\bigr) T_n (t)

T_n (t) = a_n \exp \left(\lambda_n t -\int_0^t k(\tau) \, d\tau\right)

u(x,t) =\sum_n a_n X_n (x) \exp \left(\lambda_n t -\int_0^t k(\tau) \, d\tau\right)

a_n =\frac

where

\bigl\langle y(x),z(x)\bigr\rangle = \int_a^b y(x) z(x) w(x) \, dx,

w(x)= \frac.

Representation of solutions and numerical calculation

The Sturm–Liouville differential equation () with boundary conditions may be solved analytically, which can be exact or provide an approximation, by the Rayleigh–Ritz method, or by the matrix-variational method of Gerck et al. Numerically, a variety of methods are also available. In difficult cases, one may need to carry out the intermediate calculations to several hundred decimal places of accuracy in order to obtain the eigenvalues correctly to a few decimal places. * Shooting methods *

Finite difference method In numerical analysis, finite-difference methods (FDM) are a class of numerical techniques for solving differential equations by approximating Derivative, derivatives with Finite difference approximation, finite differences. Both the spatial doma ...

* Spectral parameter

power series In mathematics, a power series (in one variable) is an infinite series of the form \sum_^\infty a_n \left(x - c\right)^n = a_0 + a_1 (x - c) + a_2 (x - c)^2 + \dots where ''a_n'' represents the coefficient of the ''n''th term and ''c'' is a co ...

method

Shooting methods

Shooting methods proceed by guessing a value of , solving an initial value problem defined by the boundary conditions at one endpoint, say, , of the interval , comparing the value this solution takes at the other endpoint with the other desired boundary condition, and finally increasing or decreasing as necessary to correct the original value. This strategy is not applicable for locating complex eigenvalues.

Spectral parameter power series method

The spectral parameter power series (SPPS) method makes use of a generalization of the following fact about homogeneous second-order linear ordinary differential equations: if is a solution of equation () that does not vanish at any point of , then the function

y(x) \int_a^x \frac

is a solution of the same equation and is linearly independent from . Further, all solutions are linear combinations of these two solutions. In the SPPS algorithm, one must begin with an arbitrary value (often ; it does not need to be an eigenvalue) and any solution of () with which does not vanish on . (Discussion below of ways to find appropriate and .) Two sequences of functions , on , referred to as ''iterated integrals'', are defined recursively as follows. First when , they are taken to be identically equal to 1 on . To obtain the next functions they are multiplied alternately by and and integrated, specifically, for : The resulting iterated integrals are now applied as coefficients in the following two power series in ''λ'':

u_0 = y_0 \sum_^\infty \left (\lambda-\lambda_0^* \right )^k \tilde X^,

u_1 = y_0 \sum_^\infty \left (\lambda-\lambda_0^* \right )^k X^.

Then for any (real or complex), and are linearly independent solutions of the corresponding equation (). (The functions and take part in this construction through their influence on the choice of .) Next one chooses coefficients and so that the combination satisfies the first boundary condition (). This is simple to do since and , for . The values of and provide the values of and and the derivatives and , so the second boundary condition () becomes an equation in a power series in . For numerical work one may truncate this series to a finite number of terms, producing a calculable polynomial in whose roots are approximations of the sought-after eigenvalues. When , this reduces to the original construction described above for a solution linearly independent to a given one. The representations () and () also have theoretical applications in Sturm–Liouville theory.

Construction of a nonvanishing solution

The SPPS method can, itself, be used to find a starting solution . Consider the equation ; i.e., , , and are replaced in () by 0, , and respectively. Then the constant function 1 is a nonvanishing solution corresponding to the eigenvalue . While there is no guarantee that or will not vanish, the complex function will never vanish because two linearly-independent solutions of a regular Sturm–Liouville equation cannot vanish simultaneously as a consequence of the Sturm separation theorem. This trick gives a solution of () for the value . In practice if () has real coefficients, the solutions based on will have very small imaginary parts which must be discarded.

Main results

Reduction to Sturm–Liouville form

Bessel equation

Legendre equation

Example using an integrating factor

Integrating factor for general second-order homogenous equation

Sturm–Liouville equations as self-adjoint differential operators

Application to inhomogeneous second-order boundary value problems

Example: Fourier series

Application to partial differential equations

Normal modes

Second-order linear equation

Representation of solutions and numerical calculation

Shooting methods

Spectral parameter power series method

Construction of a nonvanishing solution

See also

References

Further reading