In
statistical modeling (especially
process modeling), polynomial functions and rational functions are sometimes used as an empirical technique for
curve fitting
Curve fitting is the process of constructing a curve, or mathematical function, that has the best fit to a series of data points, possibly subject to constraints. Curve fitting can involve either interpolation, where an exact fit to the data i ...
.
Polynomial function models
A
polynomial function is one that has the form
:
where ''n'' is a non-negative
integer
An integer is the number zero (), a positive natural number (, , , etc.) or a negative integer with a minus sign ( −1, −2, −3, etc.). The negative numbers are the additive inverses of the corresponding positive numbers. In the language ...
that defines the degree of the polynomial. A polynomial with a degree of 0 is simply a
constant function
In mathematics, a constant function is a function whose (output) value is the same for every input value. For example, the function is a constant function because the value of is 4 regardless of the input value (see image).
Basic properti ...
; with a degree of 1 is a
line
Line most often refers to:
* Line (geometry), object with zero thickness and curvature that stretches to infinity
* Telephone line, a single-user circuit on a telephone communication system
Line, lines, The Line, or LINE may also refer to:
Art ...
; with a degree of 2 is a
quadratic
In mathematics, the term quadratic describes something that pertains to squares, to the operation of squaring, to terms of the second degree, or equations or formulas that involve such terms. ''Quadratus'' is Latin for ''square''.
Mathematics ...
; with a degree of 3 is a
cubic
Cubic may refer to:
Science and mathematics
* Cube (algebra), "cubic" measurement
* Cube, a three-dimensional solid object bounded by six square faces, facets or sides, with three meeting at each vertex
** Cubic crystal system, a crystal system w ...
, and so on.
Historically, polynomial models are among the most frequently used empirical models for
curve fitting
Curve fitting is the process of constructing a curve, or mathematical function, that has the best fit to a series of data points, possibly subject to constraints. Curve fitting can involve either interpolation, where an exact fit to the data i ...
.
Advantages
These models are popular for the following reasons.
#Polynomial models have a simple form.
#Polynomial models have well known and understood properties.
#Polynomial models have moderate flexibility of shapes.
#Polynomial models are a closed family.
Changes of location and
scale
Scale or scales may refer to:
Mathematics
* Scale (descriptive set theory), an object defined on a set of points
* Scale (ratio), the ratio of a linear dimension of a model to the corresponding dimension of the original
* Scale factor, a number ...
in the raw data result in a polynomial model being mapped to a polynomial model. That is, polynomial models are not dependent on the underlying
metric.
#Polynomial models are computationally easy to use.
Disadvantages
However, polynomial models also have the following limitations.
#Polynomial models have poor
interpolatory properties. High-degree polynomials are notorious for
oscillations between exact-fit values.
#Polynomial models have poor
extrapolatory properties. Polynomials may provide good fits within the range of data, but they will frequently deteriorate rapidly outside the range of the data.
#Polynomial models have poor
asymptotic
In analytic geometry, an asymptote () of a curve is a line such that the distance between the curve and the line approaches zero as one or both of the ''x'' or ''y'' coordinates tends to infinity. In projective geometry and related contexts, ...
properties. By their nature, polynomials have a finite response for finite ''x'' values and have an infinite response if and only if the ''x'' value is infinite. Thus polynomials may not model asymptotic phenomena very well.
#While no procedure is immune to the
bias
Bias is a disproportionate weight ''in favor of'' or ''against'' an idea or thing, usually in a way that is closed-minded, prejudicial, or unfair. Biases can be innate or learned. People may develop biases for or against an individual, a group ...
-
variance
In probability theory and statistics, variance is the expectation of the squared deviation of a random variable from its population mean or sample mean. Variance is a measure of dispersion, meaning it is a measure of how far a set of number ...
tradeoff, polynomial models exhibit a particularly poor tradeoff between shape and degree. In order to model data with a complicated structure, the degree of the model must be high, indicating that the associated number of
parameter
A parameter (), generally, is any characteristic that can help in defining or classifying a particular system (meaning an event, project, object, situation, etc.). That is, a parameter is an element of a system that is useful, or critical, when ...
s to be
estimated will also be high. This can result in highly unstable models.
When modeling via polynomial functions is inadequate due to any of the limitations above, the use of rational functions for modeling may give a better fit.
Rational function models
A
rational function
In mathematics, a rational function is any function that can be defined by a rational fraction, which is an algebraic fraction such that both the numerator and the denominator are polynomials. The coefficients of the polynomials need not be ...
is simply the ratio of two polynomial functions.
:
with ''n'' denoting a non-negative integer that defines the degree of the numerator and ''m'' denoting a non-negative integer that defines the degree of the denominator. For fitting rational function models, the constant term in the denominator is usually set to 1. Rational functions are typically identified by the degrees of the numerator and denominator. For example, a quadratic for the numerator and a cubic for the denominator is identified as a quadratic/cubic rational function. The rational function model is a generalization of the polynomial model: rational function models contain polynomial models as a subset (i.e., the case when the denominator is a constant).
Advantages
Rational function models have the following advantages:
#Rational function models have a moderately simple form.
#Rational function models are a closed family. As with polynomial models, this means that rational function models are not dependent on the underlying metric.
#Rational function models can take on an extremely wide range of shapes, accommodating a much wider range of shapes than does the polynomial family.
#Rational function models have better interpolatory properties than polynomial models. Rational functions are typically smoother and less oscillatory than polynomial models.
#Rational functions have excellent extrapolatory powers. Rational functions can typically be tailored to model the function not only within the domain of the data, but also so as to be in agreement with theoretical/asymptotic behavior outside the domain of interest.
#Rational function models have excellent asymptotic properties. Rational functions can be either finite or infinite for finite values, or finite or infinite for infinite ''x'' values. Thus, rational functions can easily be incorporated into a rational function model.
#Rational function models can often be used to model complicated structure with a fairly low degree in both the numerator and denominator. This in turn means that fewer coefficients will be required compared to the polynomial model.
#Rational function models are moderately easy to handle computationally. Although they are
nonlinear models, rational function models are particularly easy nonlinear models to fit.
#One common difficulty in fitting nonlinear models is finding adequate starting values. A major advantage of rational function models is the ability to compute starting values using a
linear least squares fit. To do this, ''p'' points are chosen from the data set, with ''p'' denoting the number of parameters in the rational model. For example, given the linear/quadratic model
:::
::one would need to select four representative points, and perform a linear fit on the model
:::
::which is derived from the previous equation by clearing the denominator. Here, the ''x'' and ''y'' contain the subset of points, not the full data set. The estimated coefficients from this linear fit are used as the starting values for fitting the nonlinear model to the full data set.
::This type of fit, with the response variable appearing on both sides of the function, should only be used to obtain starting values for the nonlinear fit. The statistical properties of fits like this are not well understood.
::The subset of points should be selected over the range of the data. It is not critical which points are selected, although obvious outliers should be avoided.
Disadvantages
Rational function models have the following disadvantages:
#The properties of the rational function family are not as well known to engineers and scientists as are those of the polynomial family. The literature on the rational function family is also more limited. Because the properties of the family are often not well understood, it can be difficult to answer the following modeling question: ''Given that data has a certain shape, what values should be chosen for the degree of the numerator and the degree on the denominator?''
#Unconstrained rational function fitting can, at times, result in undesired vertical
asymptote
In analytic geometry, an asymptote () of a curve is a line such that the distance between the curve and the line approaches zero as one or both of the ''x'' or ''y'' coordinates tends to infinity. In projective geometry and related contexts, ...
s due to roots in the denominator polynomial. The range of ''x'' values affected by the function "blowing up" may be quite narrow, but such asymptotes, when they occur, are a nuisance for local interpolation in the neighborhood of the asymptote point. These asymptotes are easy to detect by a simple plot of the fitted function over the range of the data. These nuisance asymptotes occur occasionally and unpredictably, but practitioners argue that the gain in flexibility of shapes is well worth the chance that they may occur, and that such asymptotes should not discourage choosing rational function models for empirical modeling.
See also
*
Response surface methodology
In statistics, response surface methodology (RSM) explores the relationships between several explanatory variables and one or more response variables. The method was introduced by George E. P. Box and K. B. Wilson in 1951. The main idea of RSM ...
*
Pade Approximant
Bibliography
*
* Box, G. E. P. and Draper, Norman. 2007. ''Response Surfaces, Mixtures, and Ridge Analyses'', Second Edition
f ''Empirical Model-Building and Response Surfaces'', 1987 Wiley.
*
* R. H. Hardin and
N. J. A. Sloane
__NOTOC__
Neil James Alexander Sloane (born October 10, 1939) is a British-American mathematician. His major contributions are in the fields of combinatorics, error-correcting codes, and sphere packing. Sloane is best known for being the creator a ...
"A New Approach to the Construction of Optimal Designs", ''Journal of Statistical Planning and Inference'', vol. 37, 1993, pp. 339-369* R. H. Hardin and
N. J. A. Sloane
__NOTOC__
Neil James Alexander Sloane (born October 10, 1939) is a British-American mathematician. His major contributions are in the fields of combinatorics, error-correcting codes, and sphere packing. Sloane is best known for being the creator a ...
"Computer-Generated Minimal (and Larger) Response Surface Designs: (I) The Sphere"* R. H. Hardin and
N. J. A. Sloane
__NOTOC__
Neil James Alexander Sloane (born October 10, 1939) is a British-American mathematician. His major contributions are in the fields of combinatorics, error-correcting codes, and sphere packing. Sloane is best known for being the creator a ...
"Computer-Generated Minimal (and Larger) Response Surface Designs: (II) The Cube"*
**
**
* (Modeling with rational functions)
Historical
*
*
*
*
External links
{{NIST-PD
Regression models
Interpolation
Statistical ratios