Harmonic Mean
   HOME

TheInfoList



OR:

In
mathematics Mathematics is a field of study that discovers and organizes methods, Mathematical theory, theories and theorems that are developed and Mathematical proof, proved for the needs of empirical sciences and mathematics itself. There are many ar ...
, the harmonic mean is a kind of
average In colloquial, ordinary language, an average is a single number or value that best represents a set of data. The type of average taken as most typically representative of a list of numbers is the arithmetic mean the sum of the numbers divided by ...
, one of the Pythagorean means. It is the most appropriate average for
ratio In mathematics, a ratio () shows how many times one number contains another. For example, if there are eight oranges and six lemons in a bowl of fruit, then the ratio of oranges to lemons is eight to six (that is, 8:6, which is equivalent to the ...
s and
rates Rate or rates may refer to: Finance * Rate (company), an American residential mortgage company formerly known as Guaranteed Rate * Rates (tax), a type of taxation system in the United Kingdom used to fund local government * Exchange rate, rate ...
such as speeds, and is normally only used for positive arguments. The harmonic mean is the reciprocal of the
arithmetic mean In mathematics and statistics, the arithmetic mean ( ), arithmetic average, or just the ''mean'' or ''average'' is the sum of a collection of numbers divided by the count of numbers in the collection. The collection is often a set of results fr ...
of the reciprocals of the numbers, that is, the generalized f-mean with f(x) = \frac. For example, the harmonic mean of 1, 4, and 4 is :\left(\frac\right)^ = \frac = \frac = 2\,.


Definition

The harmonic mean ''H'' of the positive
real number In mathematics, a real number is a number that can be used to measure a continuous one- dimensional quantity such as a duration or temperature. Here, ''continuous'' means that pairs of values can have arbitrarily small differences. Every re ...
s x_1, x_2, \ldots, x_n is :H(x_1, x_2, \ldots, x_n) = \frac = \frac. It is the reciprocal of the
arithmetic mean In mathematics and statistics, the arithmetic mean ( ), arithmetic average, or just the ''mean'' or ''average'' is the sum of a collection of numbers divided by the count of numbers in the collection. The collection is often a set of results fr ...
of the reciprocals, and vice versa: :\begin H(x_1, x_2, \ldots, x_n) &= \frac, \\ A(x_1, x_2, \ldots, x_n) &= \frac, \end where the arithmetic mean is A(x_1, x_2, \ldots, x_n) = \tfrac1n \sum_^n x_i. The harmonic mean is a Schur-concave function, and is greater than or equal to the minimum of its arguments: for positive arguments, \min(x_1 \ldots x_n) \le H(x_1 \ldots x_n) \le n \min(x_1 \ldots x_n). Thus, the harmonic mean cannot be made
arbitrarily large In mathematics, the phrases arbitrarily large, arbitrarily small and arbitrarily long are used in statements to make clear the fact that an object is large, small, or long with little limitation or restraint, respectively. The use of "arbitrarily" o ...
by changing some values to bigger ones (while having at least one value unchanged). The harmonic mean is also concave for positive arguments, an even stronger property than Schur-concavity.


Relationship with other means

For all ''positive'' data sets ''containing at least one pair of nonequal values'', the harmonic mean is always the least of the three Pythagorean means, while the
arithmetic mean In mathematics and statistics, the arithmetic mean ( ), arithmetic average, or just the ''mean'' or ''average'' is the sum of a collection of numbers divided by the count of numbers in the collection. The collection is often a set of results fr ...
is always the greatest of the three and the
geometric mean In mathematics, the geometric mean is a mean or average which indicates a central tendency of a finite collection of positive real numbers by using the product of their values (as opposed to the arithmetic mean which uses their sum). The geometri ...
is always in between. (If all values in a nonempty data set are equal, the three means are always equal.) It is the special case ''M''−1 of the power mean: H\left(x_1, x_2, \ldots, x_n\right) = M_\left(x_1, x_2, \ldots, x_n\right) = \frac Since the harmonic mean of a list of numbers tends strongly toward the least elements of the list, it tends (compared to the arithmetic mean) to mitigate the impact of large outliers and aggravate the impact of small ones. The arithmetic mean is often mistakenly used in places calling for the harmonic mean. In the speed example below for instance, the arithmetic mean of 40 is incorrect, and too big. The harmonic mean is related to the other Pythagorean means, as seen in the equation below. This can be seen by interpreting the denominator to be the arithmetic mean of the product of numbers ''n'' times but each time omitting the ''j''-th term. That is, for the first term, we multiply all ''n'' numbers except the first; for the second, we multiply all ''n'' numbers except the second; and so on. The numerator, excluding the ''n'', which goes with the arithmetic mean, is the geometric mean to the power ''n''. Thus the ''n''-th harmonic mean is related to the ''n''-th geometric and arithmetic means. The general formula is H\left(x_1, \ldots, x_n\right) = \frac = \frac . If a set of non-identical numbers is subjected to a
mean-preserving spread In probability and statistics, a mean-preserving spread (MPS) is a change from one probability distribution A to another probability distribution B, where B is formed by spreading out one or more portions of A's probability density function or proba ...
— that is, two or more elements of the set are "spread apart" from each other while leaving the arithmetic mean unchanged — then the harmonic mean always decreases.


Harmonic mean of two or three numbers


Two numbers

For the special case of just two numbers, x_1 and x_2, the harmonic mean can be written as: :H = \frac \qquad or \qquad \frac = \frac. (Note that the harmonic mean is undefined if x_1 + x_2 = 0, i.e. x_1 = -x_2.) In this special case, the harmonic mean is related to the
arithmetic mean In mathematics and statistics, the arithmetic mean ( ), arithmetic average, or just the ''mean'' or ''average'' is the sum of a collection of numbers divided by the count of numbers in the collection. The collection is often a set of results fr ...
A = \frac and the
geometric mean In mathematics, the geometric mean is a mean or average which indicates a central tendency of a finite collection of positive real numbers by using the product of their values (as opposed to the arithmetic mean which uses their sum). The geometri ...
G = \sqrt, by :H = \frac = G\left(\frac\right). Since \tfrac \le 1 by the
inequality of arithmetic and geometric means Inequality may refer to: * Inequality (mathematics), a relation between two quantities when they are different. * Economic inequality, difference in economic well-being between population groups ** Income inequality, an unequal distribution of in ...
, this shows for the ''n'' = 2 case that ''H'' ≤ ''G'' (a property that in fact holds for all ''n''). It also follows that G = \sqrt, meaning the two numbers' geometric mean equals the geometric mean of their arithmetic and harmonic means.


Three numbers

For the special case of three numbers, x_1, x_2 and x_3, the harmonic mean can be written as: :H = \frac. Three positive numbers ''H'', ''G'', and ''A'' are respectively the harmonic, geometric, and arithmetic means of three positive numbers
if and only if In logic and related fields such as mathematics and philosophy, "if and only if" (often shortened as "iff") is paraphrased by the biconditional, a logical connective between statements. The biconditional is true in two cases, where either bo ...
''Inequalities proposed in "
Crux Mathematicorum ''Crux Mathematicorum'' is a scientific journal of mathematics published by the Canadian Mathematical Society. It contains mathematical problems for secondary school and undergraduate students. Its editor-in-chief is Kseniya Garaschuk. The journ ...
"'', .
the following inequality holds :\frac + \frac + 1 \le \frac3 \left(1 + \frac\right)^2.


Weighted harmonic mean

If a set of weights w_1, ..., w_n is associated to the data set x_1, ..., x_n, the weighted harmonic mean is defined by Ferger F (1931) The nature and use of the harmonic mean. Journal of the American Statistical Association 26(173) 36-40 : H = \frac = \left( \frac \right)^. The unweighted harmonic mean can be regarded as the special case where all of the weights are equal.


Examples


In analytic number theory


Prime number theory

The
prime number theorem In mathematics, the prime number theorem (PNT) describes the asymptotic analysis, asymptotic distribution of the prime numbers among the positive integers. It formalizes the intuitive idea that primes become less common as they become larger by p ...
states that the number of primes less than or equal to n is asymptotically equal to the harmonic mean of the first n
natural number In mathematics, the natural numbers are the numbers 0, 1, 2, 3, and so on, possibly excluding 0. Some start counting with 0, defining the natural numbers as the non-negative integers , while others start with 1, defining them as the positive in ...
s.


In physics


Average speed

In many situations involving rates and
ratio In mathematics, a ratio () shows how many times one number contains another. For example, if there are eight oranges and six lemons in a bowl of fruit, then the ratio of oranges to lemons is eight to six (that is, 8:6, which is equivalent to the ...
s, the harmonic mean provides the correct
average In colloquial, ordinary language, an average is a single number or value that best represents a set of data. The type of average taken as most typically representative of a list of numbers is the arithmetic mean the sum of the numbers divided by ...
. For instance, if a vehicle travels a certain distance ''d'' outbound at a speed ''x'' (e.g. 60 km/h) and returns the same distance at a speed ''y'' (e.g. 20 km/h), then its average speed is the harmonic mean of ''x'' and ''y'' (30 km/h), not the arithmetic mean (40 km/h). The total travel time is the same as if it had traveled the whole distance at that average speed. This can be proven as follows: Average speed for the entire journey = However, if the vehicle travels for a certain amount of ''time'' at a speed ''x'' and then the same amount of time at a speed ''y'', then its average speed is the
arithmetic mean In mathematics and statistics, the arithmetic mean ( ), arithmetic average, or just the ''mean'' or ''average'' is the sum of a collection of numbers divided by the count of numbers in the collection. The collection is often a set of results fr ...
of ''x'' and ''y'', which in the above example is 40 km/h. Average speed for the entire journey The same principle applies to more than two segments: given a series of sub-trips at different speeds, if each sub-trip covers the same ''distance'', then the average speed is the ''harmonic'' mean of all the sub-trip speeds; and if each sub-trip takes the same amount of ''time'', then the average speed is the ''arithmetic'' mean of all the sub-trip speeds. (If neither is the case, then a weighted harmonic mean or
weighted arithmetic mean The weighted arithmetic mean is similar to an ordinary arithmetic mean (the most common type of average), except that instead of each of the data points contributing equally to the final average, some data points contribute more than others. Th ...
is needed. For the arithmetic mean, the speed of each portion of the trip is weighted by the duration of that portion, while for the harmonic mean, the corresponding weight is the distance. In both cases, the resulting formula reduces to dividing the total distance by the total time.) However, one may avoid the use of the harmonic mean for the case of "weighting by distance". Pose the problem as finding "slowness" of the trip where "slowness" (in hours per kilometre) is the inverse of speed. When trip slowness is found, invert it so as to find the "true" average trip speed. For each trip segment i, the slowness si = 1/speedi. Then take the weighted
arithmetic mean In mathematics and statistics, the arithmetic mean ( ), arithmetic average, or just the ''mean'' or ''average'' is the sum of a collection of numbers divided by the count of numbers in the collection. The collection is often a set of results fr ...
of the si's weighted by their respective distances (optionally with the weights normalized so they sum to 1 by dividing them by trip length). This gives the true average slowness (in time per kilometre). It turns out that this procedure, which can be done with no knowledge of the harmonic mean, amounts to the same mathematical operations as one would use in solving this problem by using the harmonic mean. Thus it illustrates why the harmonic mean works in this case.


Density

Similarly, if one wishes to estimate the density of an
alloy An alloy is a mixture of chemical elements of which in most cases at least one is a metal, metallic element, although it is also sometimes used for mixtures of elements; herein only metallic alloys are described. Metallic alloys often have prop ...
given the densities of its constituent elements and their mass fractions (or, equivalently, percentages by mass), then the predicted density of the alloy (exclusive of typically minor volume changes due to atom packing effects) is the weighted harmonic mean of the individual densities, weighted by mass, rather than the weighted arithmetic mean as one might at first expect. To use the weighted arithmetic mean, the densities would have to be weighted by volume. Applying dimensional analysis to the problem while labeling the mass units by element and making sure that only like element-masses cancel makes this clear.


Electricity

If one connects two electrical
resistor A resistor is a passive two-terminal electronic component that implements electrical resistance as a circuit element. In electronic circuits, resistors are used to reduce current flow, adjust signal levels, to divide voltages, bias active e ...
s in parallel, one having resistance ''x'' (e.g., 60  Ω) and one having resistance ''y'' (e.g., 40 Î©), then the effect is the same as if one had used two resistors with the same resistance, both equal to the harmonic mean of ''x'' and ''y'' (48 Î©): the equivalent resistance, in either case, is 24 Î© (one-half of the harmonic mean). This same principle applies to
capacitor In electrical engineering, a capacitor is a device that stores electrical energy by accumulating electric charges on two closely spaced surfaces that are insulated from each other. The capacitor was originally known as the condenser, a term st ...
s in series or to
inductor An inductor, also called a coil, choke, or reactor, is a Passivity (engineering), passive two-terminal electronic component, electrical component that stores energy in a magnetic field when an electric current flows through it. An inductor typic ...
s in parallel. However, if one connects the resistors in series, then the average resistance is the arithmetic mean of ''x'' and ''y'' (50 Î©), with total resistance equal to twice this, the sum of ''x'' and ''y'' (100 Î©). This principle applies to
capacitor In electrical engineering, a capacitor is a device that stores electrical energy by accumulating electric charges on two closely spaced surfaces that are insulated from each other. The capacitor was originally known as the condenser, a term st ...
s in parallel or to
inductor An inductor, also called a coil, choke, or reactor, is a Passivity (engineering), passive two-terminal electronic component, electrical component that stores energy in a magnetic field when an electric current flows through it. An inductor typic ...
s in series. As with the previous example, the same principle applies when more than two resistors, capacitors or inductors are connected, provided that all are in parallel or all are in series. The "conductivity effective mass" of a semiconductor is also defined as the harmonic mean of the effective masses along the three crystallographic directions.


Optics

As for other
optic equation In number theory, the optic equation is an equation that requires the sum of the multiplicative inverse, reciprocals of two positive integers and to equal the reciprocal of a third positive integer :Dickson, L. E., ''History of the Theory of N ...
s, the thin lens equation = + can be rewritten such that the focal length ''f'' is one-half of the harmonic mean of the distances of the subject ''u'' and object ''v'' from the lens. Two thin lenses of focal length ''f''1 and ''f''2 in series is equivalent to two thin lenses of focal length ''f''hm, their harmonic mean, in series. Expressed as
optical power In optics, optical power (also referred to as dioptric power, refractive power, focal power, focusing power, or convergence power) is the degree to which a lens, mirror, or other optical system converges or diverges light. It is equal to the ...
, two thin lenses of optical powers ''P''1 and ''P''2 in series is equivalent to two thin lenses of optical power ''P''am, their arithmetic mean, in series.


In finance

The weighted harmonic mean is the preferable method for averaging multiples, such as the
price–earnings ratio The price–earnings ratio, also known as P/E ratio, P/E, or PER, is the ratio of a company's share (stock) price to the company's earnings per share. The ratio is used for valuing companies and to find out whether they are overvalued or unde ...
(P/E). If these ratios are averaged using a weighted arithmetic mean, high data points are given greater weights than low data points. The weighted harmonic mean, on the other hand, correctly weights each data point. The simple weighted arithmetic mean when applied to non-price normalized ratios such as the P/E is biased upwards and cannot be numerically justified, since it is based on equalized earnings; just as vehicles speeds cannot be averaged for a roundtrip journey (see above).


In geometry

In any
triangle A triangle is a polygon with three corners and three sides, one of the basic shapes in geometry. The corners, also called ''vertices'', are zero-dimensional points while the sides connecting them, also called ''edges'', are one-dimension ...
, the radius of the
incircle In geometry, the incircle or inscribed circle of a triangle is the largest circle that can be contained in the triangle; it touches (is tangent to) the three sides. The center of the incircle is a triangle center called the triangle's incenter ...
is one-third of the harmonic mean of the
altitudes Altitude is a distance measurement, usually in the vertical or "up" direction, between a reference datum and a point or object. The exact definition and reference datum varies according to the context (e.g., aviation, geometry, geographical s ...
. For any point P on the minor arc BC of the
circumcircle In geometry, the circumscribed circle or circumcircle of a triangle is a circle that passes through all three vertex (geometry), vertices. The center of this circle is called the circumcenter of the triangle, and its radius is called the circumrad ...
of an
equilateral triangle An equilateral triangle is a triangle in which all three sides have the same length, and all three angles are equal. Because of these properties, the equilateral triangle is a regular polygon, occasionally known as the regular triangle. It is the ...
ABC, with distances ''q'' and ''t'' from B and C respectively, and with the intersection of PA and BC being at a distance ''y'' from point P, we have that ''y'' is half the harmonic mean of ''q'' and ''t''. In a
right triangle A right triangle or right-angled triangle, sometimes called an orthogonal triangle or rectangular triangle, is a triangle in which two sides are perpendicular, forming a right angle ( turn or 90 degrees). The side opposite to the right angle i ...
with legs ''a'' and ''b'' and
altitude Altitude is a distance measurement, usually in the vertical or "up" direction, between a reference datum (geodesy), datum and a point or object. The exact definition and reference datum varies according to the context (e.g., aviation, geometr ...
''h'' from the
hypotenuse In geometry, a hypotenuse is the side of a right triangle opposite to the right angle. It is the longest side of any such triangle; the two other shorter sides of such a triangle are called '' catheti'' or ''legs''. Every rectangle can be divided ...
to the right angle, is half the harmonic mean of and . Let ''t'' and ''s'' (''t'' > ''s'') be the sides of the two inscribed squares in a right triangle with hypotenuse ''c''. Then equals half the harmonic mean of and . Let a
trapezoid In geometry, a trapezoid () in North American English, or trapezium () in British English, is a quadrilateral that has at least one pair of parallel sides. The parallel sides are called the ''bases'' of the trapezoid. The other two sides are ...
have vertices A, B, C, and D in sequence and have parallel sides AB and CD. Let E be the intersection of the
diagonal In geometry, a diagonal is a line segment joining two vertices of a polygon or polyhedron, when those vertices are not on the same edge. Informally, any sloping line is called diagonal. The word ''diagonal'' derives from the ancient Greek Î ...
s, and let F be on side DA and G be on side BC such that FEG is parallel to AB and CD. Then FG is the harmonic mean of AB and DC. (This is provable using similar triangles.) One application of this trapezoid result is in the crossed ladders problem, where two ladders lie oppositely across an alley, each with feet at the base of one sidewall, with one leaning against a wall at height ''A'' and the other leaning against the opposite wall at height ''B'', as shown. The ladders cross at a height of ''h'' above the alley floor. Then ''h'' is half the harmonic mean of ''A'' and ''B''. This result still holds if the walls are slanted but still parallel and the "heights" ''A'', ''B'', and ''h'' are measured as distances from the floor along lines parallel to the walls. This can be proved easily using the area formula of a trapezoid and area addition formula. In an
ellipse In mathematics, an ellipse is a plane curve surrounding two focus (geometry), focal points, such that for all points on the curve, the sum of the two distances to the focal points is a constant. It generalizes a circle, which is the special ty ...
, the semi-latus rectum (the distance from a focus to the ellipse along a line parallel to the minor axis) is the harmonic mean of the maximum and minimum distances of the ellipse from a focus.


In other sciences

In
computer science Computer science is the study of computation, information, and automation. Computer science spans Theoretical computer science, theoretical disciplines (such as algorithms, theory of computation, and information theory) to Applied science, ...
, specifically
information retrieval Information retrieval (IR) in computing and information science is the task of identifying and retrieving information system resources that are relevant to an Information needs, information need. The information need can be specified in the form ...
and
machine learning Machine learning (ML) is a field of study in artificial intelligence concerned with the development and study of Computational statistics, statistical algorithms that can learn from data and generalise to unseen data, and thus perform Task ( ...
, the harmonic mean of the precision (true positives per predicted positive) and the recall (true positives per real positive) is often used as an aggregated performance score for the evaluation of algorithms and systems: the F-score (or F-measure). This is used in information retrieval because only the positive class is of
relevance Relevance is the connection between topics that makes one useful for dealing with the other. Relevance is studied in many different fields, including cognitive science, logic, and library and information science. Epistemology studies it in gener ...
, while number of negatives, in general, is large and unknown. It is thus a trade-off as to whether the correct positive predictions should be measured in relation to the number of predicted positives or the number of real positives, so it is measured versus a putative number of positives that is an arithmetic mean of the two possible denominators. A consequence arises from basic algebra in problems where people or systems work together. As an example, if a gas-powered pump can drain a pool in 4 hours and a battery-powered pump can drain the same pool in 6 hours, then it will take both pumps , which is equal to 2.4 hours, to drain the pool together. This is one-half of the harmonic mean of 6 and 4: . That is, the appropriate average for the two types of pump is the harmonic mean, and with one pair of pumps (two pumps), it takes half this harmonic mean time, while with two pairs of pumps (four pumps) it would take a quarter of this harmonic mean time. In
hydrology Hydrology () is the scientific study of the movement, distribution, and management of water on Earth and other planets, including the water cycle, water resources, and drainage basin sustainability. A practitioner of hydrology is called a hydro ...
, the harmonic mean is similarly used to average
hydraulic conductivity In science and engineering, hydraulic conductivity (, in SI units of meters per second), is a property of porous materials, soils and Rock (geology), rocks, that describes the ease with which a fluid (usually water) can move through the porosity, ...
values for a flow that is perpendicular to layers (e.g., geologic or soil) - flow parallel to layers uses the arithmetic mean. This apparent difference in averaging is explained by the fact that hydrology uses conductivity, which is the inverse of resistivity. In
sabermetrics Sabermetrics (originally SABRmetrics) is the original or blanket term for sports analytics in the US, the empirical analysis of baseball, especially the development of advanced metrics based on baseball statistics that measure in-game activity ...
, a baseball player's Power–speed number is the harmonic mean of their
home run In baseball, a home run (abbreviated HR) is scored when the Baseball (ball), ball is hit in such a way that the batting (baseball), batter is able to circle the bases and reach home plate safe (baseball), safely in one play without any error ( ...
and
stolen base In baseball, a stolen base occurs when a runner advances to a base unaided by other actions and the official scorer rules that the advance should be credited to the action of the runner. The umpires determine whether the runner is safe or out ...
totals. In
population genetics Population genetics is a subfield of genetics that deals with genetic differences within and among populations, and is a part of evolutionary biology. Studies in this branch of biology examine such phenomena as Adaptation (biology), adaptation, s ...
, the harmonic mean is used when calculating the effects of fluctuations in the census population size on the effective population size. The harmonic mean takes into account the fact that events such as population
bottleneck Bottleneck may refer to: * the narrowed portion (neck) of a bottle Science and technology * Bottleneck (engineering), where the performance of an entire system is limited by a single component * Bottleneck (network), in a communication network * ...
increase the rate genetic drift and reduce the amount of genetic variation in the population. This is a result of the fact that following a bottleneck very few individuals contribute to the
gene pool The gene pool is the set of all genes, or genetic information, in any population, usually of a particular species. Description A large gene pool indicates extensive genetic diversity, which is associated with robust populations that can survi ...
limiting the genetic variation present in the population for many generations to come. When considering
fuel economy in automobiles The fuel economy of an automobile relates to the distance traveled by a vehicle and the amount of fuel consumed. Consumption can be expressed in terms of the volume of fuel to travel a distance, or the distance traveled per unit volume of fue ...
two measures are commonly used – miles per gallon (mpg), and litres per 100 km. As the dimensions of these quantities are the inverse of each other (one is distance per volume, the other volume per distance) when taking the mean value of the fuel economy of a range of cars one measure will produce the harmonic mean of the other – i.e., converting the mean value of fuel economy expressed in litres per 100 km to miles per gallon will produce the harmonic mean of the fuel economy expressed in miles per gallon. For calculating the average fuel consumption of a fleet of vehicles from the individual fuel consumptions, the harmonic mean should be used if the fleet uses miles per gallon, whereas the arithmetic mean should be used if the fleet uses litres per 100 km. In the USA the CAFE standards (the federal automobile fuel consumption standards) make use of the harmonic mean. In
chemistry Chemistry is the scientific study of the properties and behavior of matter. It is a physical science within the natural sciences that studies the chemical elements that make up matter and chemical compound, compounds made of atoms, molecules a ...
and
nuclear physics Nuclear physics is the field of physics that studies atomic nuclei and their constituents and interactions, in addition to the study of other forms of nuclear matter. Nuclear physics should not be confused with atomic physics, which studies th ...
the average mass per particle of a mixture consisting of different species (e.g., molecules or isotopes) is given by the harmonic mean of the individual species' masses weighted by their respective mass fraction.


Beta distribution

The harmonic mean of a
beta distribution In probability theory and statistics, the beta distribution is a family of continuous probability distributions defined on the interval
, 1 The comma is a punctuation mark that appears in several variants in different languages. Some typefaces render it as a small line, slightly curved or straight, but inclined from the vertical; others give it the appearance of a miniature fille ...
or (0, 1) in terms of two positive Statistical parameter, parameters, denoted by ''alpha'' (''α'') an ...
with shape parameters ''α'' and ''β'' is: :H = \frac \text \alpha > 1 \, \, \& \, \, \beta > 0 The harmonic mean with ''α'' < 1 is undefined because its defining expression is not bounded in
, 1 The comma is a punctuation mark that appears in several variants in different languages. Some typefaces render it as a small line, slightly curved or straight, but inclined from the vertical; others give it the appearance of a miniature fille ...
Letting ''α'' = ''β'' : H = \frac showing that for ''α'' = ''β'' the harmonic mean ranges from 0 for ''α'' = ''β'' = 1, to 1/2 for ''α'' = ''β'' → ∞. The following are the limits with one parameter finite (non-zero) and the other parameter approaching these limits: :\begin \lim_ H &= \text \\ \lim_ H &= \lim_ H = 0 \\ \lim_ H &= \lim_ H = 1 \end With the geometric mean the harmonic mean may be useful in maximum likelihood estimation in the four parameter case. A second harmonic mean (''H''1 − X) also exists for this distribution :H_ = \frac \text \beta > 1 \, \, \& \, \, \alpha > 0 This harmonic mean with ''β'' < 1 is undefined because its defining expression is not bounded in 0, 1 Letting ''α'' = ''β'' in the above expression :H_ = \frac showing that for ''α'' = ''β'' the harmonic mean ranges from 0, for ''α'' = ''β'' = 1, to 1/2, for ''α'' = ''β'' → ∞. The following are the limits with one parameter finite (non zero) and the other approaching these limits: : \begin \lim_ H_ &= \text \\ \lim_ H_ &= \lim_ H_ = 0 \\ \lim_ H_ &= \lim_ H_ = 1 \end Although both harmonic means are asymmetric, when ''α'' = ''β'' the two means are equal.


Lognormal distribution

The harmonic mean ( ''H'' ) of the
lognormal distribution In probability theory, a log-normal (or lognormal) distribution is a continuous probability distribution of a random variable whose logarithm is normal distribution, normally distributed. Thus, if the random variable is log-normally distributed ...
of a random variable ''X'' isAitchison J, Brown JAC (1969). The lognormal distribution with special reference to its uses in economics. Cambridge University Press, New York : H = \exp \left( \mu - \frac \sigma^2 \right), where ''μ'' and ''σ''2 are the parameters of the distribution, i.e. the mean and variance of the distribution of the natural logarithm of ''X''. The harmonic and arithmetic means of the distribution are related by : \frac = 1 + C_v^2 \, , where ''C''v and ''μ''* are the
coefficient of variation In probability theory and statistics, the coefficient of variation (CV), also known as normalized root-mean-square deviation (NRMSD), percent RMS, and relative standard deviation (RSD), is a standardized measure of dispersion of a probability ...
and the mean of the distribution respectively.. The geometric (''G''), arithmetic and harmonic means of the distribution are related byRossman LA (1990) Design stream flows based on harmonic means. J Hydr Eng ASCE 116(7) 946–950 : H \mu^* = G^2.


Pareto distribution

The harmonic mean of type 1
Pareto distribution The Pareto distribution, named after the Italian civil engineer, economist, and sociologist Vilfredo Pareto, is a power-law probability distribution that is used in description of social, quality control, scientific, geophysical, actuarial scien ...
isJohnson NL, Kotz S, Balakrishnan N (1994) Continuous univariate distributions Vol 1. Wiley Series in Probability and Statistics. : H = k \left( 1 + \frac \right) where ''k'' is the scale parameter and ''α'' is the shape parameter.


Statistics

For a random sample, the harmonic mean is calculated as above. Both the
mean A mean is a quantity representing the "center" of a collection of numbers and is intermediate to the extreme values of the set of numbers. There are several kinds of means (or "measures of central tendency") in mathematics, especially in statist ...
and the
variance In probability theory and statistics, variance is the expected value of the squared deviation from the mean of a random variable. The standard deviation (SD) is obtained as the square root of the variance. Variance is a measure of dispersion ...
may be infinite (if it includes at least one term of the form 1/0).


Sample distributions of mean and variance

The mean of the sample ''m'' is asymptotically distributed normally with variance ''s''2. :s^2 = \frac The variance of the mean itself isZelen M (1972) Length-biased sampling and biomedical problems. In: Biometric Society Meeting, Dallas, Texas : \operatorname\left(\frac\right) = \frac where ''m'' is the arithmetic mean of the reciprocals, ''x'' are the variates, ''n'' is the population size and ''E'' is the expectation operator.


Delta method

Assuming that the variance is not infinite and that the
central limit theorem In probability theory, the central limit theorem (CLT) states that, under appropriate conditions, the Probability distribution, distribution of a normalized version of the sample mean converges to a Normal distribution#Standard normal distributi ...
applies to the sample then using the
delta method In statistics, the delta method is a method of deriving the asymptotic distribution of a random variable. It is applicable when the random variable being considered can be defined as a differentiable function of a random variable which is Asymptoti ...
, the variance is : \operatorname(H) = \frac\frac where ''H'' is the harmonic mean, ''m'' is the arithmetic mean of the reciprocals : m = \frac \sum. ''s''2 is the variance of the reciprocals of the data : s^2 = \operatorname\left( \frac \right) and ''n'' is the number of data points in the sample.


Jackknife method

A jackknife method of estimating the variance is possible if the mean is known.Lam FC (1985) Estimate of variance for harmonic mean half lives. J Pharm Sci 74(2) 229-231 This method is the usual 'delete 1' rather than the 'delete m' version. This method first requires the computation of the mean of the sample (''m'') : m = \frac where ''x'' are the sample values. A series of value ''wi'' is then computed where : w_i = \frac. The mean (''h'') of the ''w''i is then taken: : h = \frac \sum The variance of the mean is : \frac \sum^2. Significance testing and confidence intervals for the mean can then be estimated with the t test.


Size biased sampling

Assume a random variate has a distribution ''f''( ''x'' ). Assume also that the likelihood of a variate being chosen is proportional to its value. This is known as length based or size biased sampling. Let ''μ'' be the mean of the population. Then the
probability density function In probability theory, a probability density function (PDF), density function, or density of an absolutely continuous random variable, is a Function (mathematics), function whose value at any given sample (or point) in the sample space (the s ...
''f''*( ''x'' ) of the size biased population is : f^*(x) = \frac The expectation of this length biased distribution E*( ''x'' ) is : \operatorname^*(x) = \mu \left 1 + \frac \right/math> where ''σ''2 is the variance. The expectation of the harmonic mean is the same as the non-length biased version E( ''x'' ) : E^*( x^ ) = E( x )^ The problem of length biased sampling arises in a number of areas including textile manufactureCox DR (1969) Some sampling problems in technology. In: New developments in survey sampling. U.L. Johnson, H Smith eds. New York: Wiley Interscience pedigree analysisDavidov O, Zelen M (2001) Referent sampling, family history and relative risk: the role of length-biased sampling. Biostat 2(2): 173-181 and survival analysisZelen M, Feinleib M (1969) On the theory of screening for chronic diseases. Biometrika 56: 601-614 Akman ''et al.'' have developed a test for the detection of length based bias in samples.Akman O, Gamage J, Jannot J, Juliano S, Thurman A, Whitman D (2007) A simple test for detection of length-biased sampling. J Biostats 1 (2) 189-195


Shifted variables

If ''X'' is a positive random variable and ''q'' > 0 then for all ''ε'' > 0Chuen-Teck See, Chen J (2008) Convex functions of random variables. J Inequal Pure Appl Math 9 (3) Art 80 : \operatorname \left frac\right< \operatorname \left(\frac\right) .


Moments

Assuming that ''X'' and E(''X'') are > 0 then : \operatorname\left \frac \right\ge \frac This follows from Jensen's inequality. Gurland has shown thatGurland J (1967) An inequality satisfied by the expectation of the reciprocal of a random variable. The American Statistician. 21 (2) 24 for a distribution that takes only positive values, for any ''n'' > 0 : \operatorname \left(X^\right) \ge \frac . Under some conditionsSung SH (2010) On inverse moments for a class of nonnegative random variables. J Inequal Applic : \operatorname(a + X)^ \sim \operatorname\left(a + X^\right) where ~ means approximately equal to.


Sampling properties

Assuming that the variates (''x'') are drawn from a lognormal distribution there are several possible estimators for ''H'': : \begin H_1 &= \frac \\ H_2 &= \frac \\ H_3 &= \exp \left(m - \frac s^2 \right) \end where : m = \frac \sum \log_e (x) : s^2 = \frac \sum \left(\log_e (x) - m\right)^2 Of these ''H''3 is probably the best estimator for samples of 25 or more.Stedinger JR (1980) Fitting lognormal distributions to hydrologic data. Water Resour Res 16(3) 481–490


Bias and variance estimators

A first order approximation to the
bias Bias is a disproportionate weight ''in favor of'' or ''against'' an idea or thing, usually in a way that is inaccurate, closed-minded, prejudicial, or unfair. Biases can be innate or learned. People may develop biases for or against an individ ...
and variance of ''H''1 areLimbrunner JF, Vogel RM, Brown LC (2000) Estimation of harmonic mean of a lognormal variable. J Hydrol Eng 5(1) 59-66 : \begin \operatorname\left H_1 \right&= \frac \\ \operatorname\left H_1 \right&= \frac \end where ''C''v is the coefficient of variation. Similarly a first order approximation to the bias and variance of ''H''3 are : \begin \frac \left 1 + \frac \right\\ \frac \left 1 + \frac \right\end In numerical experiments ''H''3 is generally a superior estimator of the harmonic mean than ''H''1. ''H''2 produces estimates that are largely similar to ''H''1.


Notes

The Environmental Protection Agency recommends the use of the harmonic mean in setting maximum toxin levels in water.EPA (1991) Technical support document for water quality-based toxics control. EPA/505/2-90-001. Office of Water In geophysical reservoir engineering studies, the harmonic mean is widely used.Muskat M (1937) The flow of homogeneous fluids through porous media. McGraw-Hill, New York


See also

*
Rate (mathematics) In mathematics, a rate is the quotient of two quantities, often represented as a fraction. If the divisor (or fraction denominator) in the rate is equal to one expressed as a single unit, and if it is assumed that this quantity can be changed s ...
* Contraharmonic mean *
Generalized mean In mathematics, generalised means (or power mean or Hölder mean from Otto Hölder) are a family of functions for aggregating sets of numbers. These include as special cases the Pythagorean means (arithmetic mean, arithmetic, geometric mean, ge ...
*
Weighted mean The weighted arithmetic mean is similar to an ordinary arithmetic mean (the most common type of average), except that instead of each of the data points contributing equally to the final average, some data points contribute more than others. The ...
* Weighted geometric mean * HM-GM-AM-QM inequalities * Harmonic mean p-value *
Harmonic number In mathematics, the -th harmonic number is the sum of the reciprocals of the first natural numbers: H_n= 1+\frac+\frac+\cdots+\frac =\sum_^n \frac. Starting from , the sequence of harmonic numbers begins: 1, \frac, \frac, \frac, \frac, \dot ...
*
Parallel (operator) The parallel operator \, (pronounced "parallel", following the parallel lines notation from geometry; also known as reduced sum, parallel sum or parallel addition) is a binary operation which is used as a shorthand in electrical engineering, ...
, whose result is half of the harmonic mean *
Mediant (mathematics) In mathematics, the mediant of two vulgar fraction, fractions, generally made up of four positive integers : \frac \quad and \quad \frac \quad is defined as \quad \frac. That is to say, the numerator and denominator of the mediant are the sums ...
, a fraction which lies between two fractions


Notes


References


External links

*
Averages, Arithmetic and Harmonic Means
at
cut-the-knot Alexander Bogomolny (January 4, 1948 July 7, 2018) was a Soviet Union, Soviet-born Israeli Americans, Israeli-American mathematician. He was Professor Emeritus of Mathematics at the University of Iowa, and formerly research fellow at the Moscow ...
{{DEFAULTSORT:Harmonic Mean Means