In
probability theory
Probability theory or probability calculus is the branch of mathematics concerned with probability. Although there are several different probability interpretations, probability theory treats the concept in a rigorous mathematical manner by expre ...
and
statistics
Statistics (from German language, German: ', "description of a State (polity), state, a country") is the discipline that concerns the collection, organization, analysis, interpretation, and presentation of data. In applying statistics to a s ...
, the ''F''-distribution or ''F''-ratio, also known as Snedecor's ''F'' distribution or the Fisher–Snedecor distribution (after
Ronald Fisher
Sir Ronald Aylmer Fisher (17 February 1890 – 29 July 1962) was a British polymath who was active as a mathematician, statistician, biologist, geneticist, and academic. For his work in statistics, he has been described as "a genius who a ...
and
George W. Snedecor), is a
continuous probability distribution
In probability theory and statistics, a probability distribution is a Function (mathematics), function that gives the probabilities of occurrence of possible events for an Experiment (probability theory), experiment. It is a mathematical descri ...
that arises frequently as the
null distribution
Null may refer to:
Science, technology, and mathematics Astronomy
*Nuller, an optical tool using interferometry to block certain sources of light Computing
*Null (SQL) (or NULL), a special marker and keyword in SQL indicating that a data value do ...
of a
test statistic
Test statistic is a quantity derived from the sample for statistical hypothesis testing.Berger, R. L.; Casella, G. (2001). ''Statistical Inference'', Duxbury Press, Second Edition (p.374) A hypothesis test is typically specified in terms of a tes ...
, most notably in the
analysis of variance
Analysis of variance (ANOVA) is a family of statistical methods used to compare the Mean, means of two or more groups by analyzing variance. Specifically, ANOVA compares the amount of variation ''between'' the group means to the amount of variati ...
(ANOVA) and other
''F''-tests.
Definitions
The ''F''-distribution with ''d''
1 and ''d''
2 degrees of freedom is the distribution of
where
and
are
independent
Independent or Independents may refer to:
Arts, entertainment, and media Artist groups
* Independents (artist group), a group of modernist painters based in Pennsylvania, United States
* Independentes (English: Independents), a Portuguese artist ...
random variable
A random variable (also called random quantity, aleatory variable, or stochastic variable) is a Mathematics, mathematical formalization of a quantity or object which depends on randomness, random events. The term 'random variable' in its mathema ...
s with
chi-square distribution The term chi-square, chi-squared, or \chi^2 has various uses in statistics
Statistics (from German language, German: ', "description of a State (polity), state, a country") is the discipline that concerns the collection, organization, analys ...
s with respective degrees of freedom
and
.
It can be shown to follow that the
probability density function
In probability theory, a probability density function (PDF), density function, or density of an absolutely continuous random variable, is a Function (mathematics), function whose value at any given sample (or point) in the sample space (the s ...
(pdf) for ''X'' is given by
for
real
Real may refer to:
Currencies
* Argentine real
* Brazilian real (R$)
* Central American Republic real
* Mexican real
* Portuguese real
* Spanish real
* Spanish colonial real
Nature and science
* Reality, the state of things as they exist, rathe ...
''x'' > 0. Here
is the
beta function
In mathematics, the beta function, also called the Euler integral of the first kind, is a special function that is closely related to the gamma function and to binomial coefficients. It is defined by the integral
: \Beta(z_1,z_2) = \int_0^1 t^ ...
. In many applications, the parameters ''d''
1 and ''d''
2 are
positive integer
In mathematics, the natural numbers are the numbers 0, 1, 2, 3, and so on, possibly excluding 0. Some start counting with 0, defining the natural numbers as the non-negative integers , while others start with 1, defining them as the positiv ...
s, but the distribution is well-defined for positive real values of these parameters.
The
cumulative distribution function
In probability theory and statistics, the cumulative distribution function (CDF) of a real-valued random variable X, or just distribution function of X, evaluated at x, is the probability that X will take a value less than or equal to x.
Ever ...
is
where ''I'' is the
regularized incomplete beta function
In mathematics, the beta function, also called the Euler integral of the first kind, is a special function that is closely related to the gamma function and to binomial coefficients. It is defined by the integral
: \Beta(z_1,z_2) = \int_0^1 t^( ...
.
Properties
The expectation, variance, and other details about the F(''d''
1, ''d''
2) are given in the sidebox; for ''d''
2 > 8, the
excess kurtosis
In probability theory and statistics, kurtosis (from , ''kyrtos'' or ''kurtos'', meaning "curved, arching") refers to the degree of “tailedness” in the probability distribution of a real-valued random variable. Similar to skewness, kurtosi ...
is
The ''k''-th moment of an F(''d''
1, ''d''
2) distribution exists and is finite only when 2''k'' < ''d''
2 and it is equal to
The ''F''-distribution is a particular
parametrization of the
beta prime distribution
In probability theory and statistics, the beta prime distribution (also known as inverted beta distribution or beta distribution of the second kindJohnson et al (1995), p 248) is an absolutely continuous probability distribution. If p\in ,1/math ...
, which is also called the beta distribution of the second kind.
The
characteristic function In mathematics, the term "characteristic function" can refer to any of several distinct concepts:
* The indicator function of a subset, that is the function
\mathbf_A\colon X \to \,
which for a given subset ''A'' of ''X'', has value 1 at points ...
is listed incorrectly in many standard references (e.g.,
). The correct expression is
where ''U''(''a'', ''b'', ''z'') is the
confluent hypergeometric function
In mathematics, a confluent hypergeometric function is a solution of a confluent hypergeometric equation, which is a degenerate form of a hypergeometric differential equation where two of the three regular singularities merge into an irregular s ...
of the second kind.
Related distributions
Relation to the chi-squared distribution
In instances where the ''F''-distribution is used, for example in the
analysis of variance
Analysis of variance (ANOVA) is a family of statistical methods used to compare the Mean, means of two or more groups by analyzing variance. Specifically, ANOVA compares the amount of variation ''between'' the group means to the amount of variati ...
, independence of
and
(defined above) might be demonstrated by applying
Cochran's theorem
In statistics, Cochran's theorem, devised by William G. Cochran, is a theorem used to justify results relating to the probability distributions of statistics that are used in the analysis of variance.
Examples
Sample mean and sample variance ...
.
Equivalently, since the
chi-squared distribution
In probability theory and statistics, the \chi^2-distribution with k Degrees of freedom (statistics), degrees of freedom is the distribution of a sum of the squares of k Independence (probability theory), independent standard normal random vari ...
is the sum of squares of
independent
Independent or Independents may refer to:
Arts, entertainment, and media Artist groups
* Independents (artist group), a group of modernist painters based in Pennsylvania, United States
* Independentes (English: Independents), a Portuguese artist ...
standard normal
In probability theory and statistics, a normal distribution or Gaussian distribution is a type of continuous probability distribution for a real-valued random variable. The general form of its probability density function is
f(x) = \frac e^ ...
random variables, the random variable of the ''F''-distribution may also be written
where
and
,
is the sum of squares of
random variables from normal distribution
and
is the sum of squares of
random variables from normal distribution
.
In a
frequentist
Frequentist inference is a type of statistical inference based in frequentist probability, which treats “probability” in equivalent terms to “frequency” and draws conclusions from sample-data by means of emphasizing the frequency or pro ...
context, a scaled ''F''-distribution therefore gives the probability
, with the ''F''-distribution itself, without any scaling, applying where
is being taken equal to
. This is the context in which the ''F''-distribution most generally appears in
''F''-tests: where the null hypothesis is that two independent normal variances are equal, and the observed sums of some appropriately selected squares are then examined to see whether their ratio is significantly incompatible with this null hypothesis.
The quantity
has the same distribution in Bayesian statistics, if an uninformative rescaling-invariant
Jeffreys prior
In Bayesian statistics, the Jeffreys prior is a non-informative prior distribution for a parameter space. Named after Sir Harold Jeffreys, its density function is proportional to the square root of the determinant of the Fisher information matri ...
is taken for the
prior probabilities
A prior probability distribution of an uncertain quantity, simply called the prior, is its assumed probability distribution before some evidence is taken into account. For example, the prior could be the probability distribution representing the ...
of
and
. In this context, a scaled ''F''-distribution thus gives the posterior probability
, where the observed sums
and
are now taken as known.
In general
*If
and
(
Chi squared distribution
In probability theory and statistics, the \chi^2-distribution with k degrees of freedom is the distribution of a sum of the squares of k independent standard normal random variables.
The chi-squared distribution \chi^2_k is a special case of ...
) are
independent
Independent or Independents may refer to:
Arts, entertainment, and media Artist groups
* Independents (artist group), a group of modernist painters based in Pennsylvania, United States
* Independentes (English: Independents), a Portuguese artist ...
, then
*If
(
Gamma distribution
In probability theory and statistics, the gamma distribution is a versatile two-parameter family of continuous probability distributions. The exponential distribution, Erlang distribution, and chi-squared distribution are special cases of the g ...
) are independent, then
*If
(
Beta distribution
In probability theory and statistics, the beta distribution is a family of continuous probability distributions defined on the interval , 1
The comma is a punctuation mark that appears in several variants in different languages. Some typefaces render it as a small line, slightly curved or straight, but inclined from the vertical; others give it the appearance of a miniature fille ...
or (0, 1) in terms of two positive Statistical parameter, parameters, denoted by ''alpha'' (''α'') an ...
) then
*Equivalently, if
, then
.
*If
, then
has a
beta prime distribution
In probability theory and statistics, the beta prime distribution (also known as inverted beta distribution or beta distribution of the second kindJohnson et al (1995), p 248) is an absolutely continuous probability distribution. If p\in ,1/math ...
:
.
*If
then
has the
chi-squared distribution
In probability theory and statistics, the \chi^2-distribution with k Degrees of freedom (statistics), degrees of freedom is the distribution of a sum of the squares of k Independence (probability theory), independent standard normal random vari ...
*
is equivalent to the scaled
Hotelling's T-squared distribution
In statistics, particularly in hypothesis testing, the Hotelling's ''T''-squared distribution (''T''2), proposed by Harold Hotelling, is a multivariate probability distribution that is tightly related to the ''F''-distribution and is most nota ...
.
*If
then
.
*If
—
Student's t-distribution
In probability theory and statistics, Student's distribution (or simply the distribution) t_\nu is a continuous probability distribution that generalizes the Normal distribution#Standard normal distribution, standard normal distribu ...
— then:
*''F''-distribution is a special case of type 6
Pearson distribution
The Pearson distribution is a family of continuous probability distributions. It was first published by Karl Pearson in 1895 and subsequently extended by him in 1901 and 1916 in a series of articles on biostatistics.
History
The Pearson syste ...
*If
and
are independent, with
Laplace(''μ'', ''b'') then
*If
then
(
Fisher's z-distribution
Fisher's ''z''-distribution is the statistical distribution of half the logarithm of an ''F''-distribution variate:
: z = \frac 1 2 \log F
It was first described by Ronald Fisher in a paper delivered at the International Mathematical Con ...
)
*The
noncentral ''F''-distribution simplifies to the ''F''-distribution if
.
*The doubly
noncentral ''F''-distribution simplifies to the ''F''-distribution if
*If
is the quantile ''p'' for
and
is the quantile
for
, then
* ''F''-distribution is an instance of
ratio distributions
*
W-distribution
is a unique parametrization of F-distribution.
See also
*
Beta prime distribution
In probability theory and statistics, the beta prime distribution (also known as inverted beta distribution or beta distribution of the second kindJohnson et al (1995), p 248) is an absolutely continuous probability distribution. If p\in ,1/math ...
*
Chi-square distribution The term chi-square, chi-squared, or \chi^2 has various uses in statistics
Statistics (from German language, German: ', "description of a State (polity), state, a country") is the discipline that concerns the collection, organization, analys ...
*
Chow test
The Chow test (), proposed by econometrician Gregory Chow in 1960, is a statistical test of whether the true coefficients in two linear regressions on different data sets are equal. In econometrics, it is most commonly used in time series analysis ...
*
Gamma distribution
In probability theory and statistics, the gamma distribution is a versatile two-parameter family of continuous probability distributions. The exponential distribution, Erlang distribution, and chi-squared distribution are special cases of the g ...
*
Hotelling's T-squared distribution
In statistics, particularly in hypothesis testing, the Hotelling's ''T''-squared distribution (''T''2), proposed by Harold Hotelling, is a multivariate probability distribution that is tightly related to the ''F''-distribution and is most nota ...
*
Wilks' lambda distribution
In statistics, Wilks' lambda distribution (named for Samuel S. Wilks), is a probability distribution used in multivariate hypothesis testing, especially with regard to the likelihood-ratio test and multivariate analysis of variance (MANOVA).
...
*
Wishart distribution
In statistics, the Wishart distribution is a generalization of the gamma distribution to multiple dimensions. It is named in honor of John Wishart (statistician), John Wishart, who first formulated the distribution in 1928. Other names include Wi ...
*
Modified half-normal distribution
In probability theory and statistics, the modified half-normal distribution (MHN) is a three-parameter family of continuous probability distributions supported on the positive part of the real line. It can be viewed as a generalization of multiple ...
with the pdf on
is given as
, where
denotes the
Fox–Wright Psi function.
References
External links
Table of critical values of the ''F''-distributionEarliest Uses of Some of the Words of Mathematics: entry on ''F''-distribution contains a brief history
{{DEFAULTSORT:F-distribution
Continuous distributions
Analysis of variance