History of statistics
   HOME

TheInfoList



OR:

Statistics, in the modern sense of the word, began evolving in the 18th century in response to the novel needs of industrializing sovereign states. In early times, the meaning was restricted to information about states, particularly
demographics Demography () is the statistical study of populations, especially human beings. Demographic analysis examines and measures the dimensions and dynamics of populations; it can cover whole societies or groups defined by criteria such as ed ...
such as population. This was later extended to include all collections of information of all types, and later still it was extended to include the analysis and interpretation of such data. In modern terms, "statistics" means both sets of collected information, as in
national accounts National accounts or national account systems (NAS) are the implementation of complete and consistent accounting techniques for measuring the economic activity of a nation. These include detailed underlying measures that rely on double-entry ...
and
temperature record The global temperature record shows the fluctuations of the temperature of the atmosphere and the oceans through various spans of time. There are numerous estimates of temperatures since the end of the Pleistocene glaciation, particularly dur ...
, and analytical work which requires statistical inference. Statistical activities are often associated with models expressed using probabilities, hence the connection with probability theory. The large requirements of data processing have made statistics a key application of computing. A number of statistical concepts have an important impact on a wide range of sciences. These include the
design of experiments The design of experiments (DOE, DOX, or experimental design) is the design of any task that aims to describe and explain the variation of information under conditions that are hypothesized to reflect the variation. The term is generally associ ...
and approaches to statistical inference such as Bayesian inference, each of which can be considered to have their own sequence in the development of the ideas underlying modern statistics.


Introduction

By the 18th century, the term " statistics" designated the systematic collection of demographic and
economic An economy is an area of the production, distribution and trade, as well as consumption of goods and services. In general, it is defined as a social domain that emphasize the practices, discourses, and material expressions associated with the ...
data by states. For at least two millennia, these data were mainly tabulations of human and material resources that might be taxed or put to military use. In the early 19th century, collection intensified, and the meaning of "statistics" broadened to include the discipline concerned with the collection, summary, and analysis of data. Today, data is collected and statistics are computed and widely distributed in government, business, most of the sciences and sports, and even for many pastimes. Electronic computers have expedited more elaborate statistical computation even as they have facilitated the collection and aggregation of data. A single data analyst may have available a set of data-files with millions of records, each with dozens or hundreds of separate measurements. These were collected over time from computer activity (for example, a stock exchange) or from computerized sensors, point-of-sale registers, and so on. Computers then produce simple, accurate summaries, and allow more tedious analyses, such as those that require inverting a large matrix or perform hundreds of steps of iteration, that would never be attempted by hand. Faster computing has allowed statisticians to develop "computer-intensive" methods which may look at all permutations, or use randomization to look at 10,000 permutations of a problem, to estimate answers that are not easy to quantify by theory alone. The term "
mathematical statistics Mathematical statistics is the application of probability theory, a branch of mathematics, to statistics, as opposed to techniques for collecting statistical data. Specific mathematical techniques which are used for this include mathematical an ...
" designates the mathematical theories of
probability Probability is the branch of mathematics concerning numerical descriptions of how likely an event is to occur, or how likely it is that a proposition is true. The probability of an event is a number between 0 and 1, where, roughly speakin ...
and statistical inference, which are used in statistical practice. The relation between statistics and probability theory developed rather late, however. In the 19th century, statistics increasingly used
probability theory Probability theory is the branch of mathematics concerned with probability. Although there are several different probability interpretations, probability theory treats the concept in a rigorous mathematical manner by expressing it through a set ...
, whose initial results were found in the 17th and 18th centuries, particularly in the analysis of
games of chance A game of chance is in contrast with a game of skill. It is a game whose outcome is strongly influenced by some randomizing device. Common devices used include dice, spinning tops, playing cards, roulette wheels, or numbered balls drawn from ...
(gambling). By 1800, astronomy used probability models and statistical theories, particularly the
method of least squares The method of least squares is a standard approach in regression analysis to approximate the solution of overdetermined systems (sets of equations in which there are more equations than unknowns) by minimizing the sum of the squares of the res ...
. Early probability theory and statistics was systematized in the 19th century and statistical reasoning and probability models were used by social scientists to advance the new sciences of experimental psychology and
sociology Sociology is a social science that focuses on society, human social behavior, patterns of social relationships, social interaction, and aspects of culture associated with everyday life. It uses various methods of empirical investigation an ...
, and by physical scientists in
thermodynamics Thermodynamics is a branch of physics that deals with heat, work, and temperature, and their relation to energy, entropy, and the physical properties of matter and radiation. The behavior of these quantities is governed by the four laws of th ...
and statistical mechanics. The development of statistical reasoning was closely associated with the development of
inductive logic Inductive reasoning is a method of reasoning in which a general principle is derived from a body of observations. It consists of making broad generalizations based on specific observations. Inductive reasoning is distinct from ''deductive'' rea ...
and the
scientific method The scientific method is an empirical method for acquiring knowledge that has characterized the development of science since at least the 17th century (with notable practitioners in previous centuries; see the article history of scientific ...
, which are concerns that move statisticians away from the narrower area of mathematical statistics. Much of the theoretical work was readily available by the time computers were available to exploit them. By the 1970s, Johnson and Kotz produced a four-volume ''Compendium on Statistical Distributions'' (1st ed., 1969-1972), which is still an invaluable resource. Applied statistics can be regarded as not a field of mathematics but an autonomous
mathematical science The mathematical sciences are a group of areas of study that includes, in addition to mathematics, those academic disciplines that are primarily mathematical in nature but may not be universally considered subfields of mathematics proper. Statisti ...
, like
computer science Computer science is the study of computation, automation, and information. Computer science spans theoretical disciplines (such as algorithms, theory of computation, information theory, and automation) to practical disciplines (includi ...
and
operations research Operations research ( en-GB, operational research) (U.S. Air Force Specialty Code: Operations Analysis), often shortened to the initialism OR, is a discipline that deals with the development and application of analytical methods to improve decis ...
. Unlike mathematics, statistics had its origins in
public administration Public Administration (a form of governance) or Public Policy and Administration (an academic discipline) is the implementation of public policy, administration of government establishment (public governance), management of non-profit est ...
. Applications arose early in
demography Demography () is the statistical study of populations, especially human beings. Demographic analysis examines and measures the dimensions and dynamics of populations; it can cover whole societies or groups defined by criteria such as edu ...
and
economics Economics () is the social science that studies the production, distribution, and consumption of goods and services. Economics focuses on the behaviour and interactions of economic agents and how economies work. Microeconomics analyzes ...
; large areas of micro- and macro-economics today are "statistics" with an emphasis on time-series analyses. With its emphasis on learning from data and making best predictions, statistics also has been shaped by areas of academic research including psychological testing, medicine and
epidemiology Epidemiology is the study and analysis of the distribution (who, when, and where), patterns and determinants of health and disease conditions in a defined population. It is a cornerstone of public health, and shapes policy decisions and evide ...
. The ideas of statistical testing have considerable overlap with
decision science Decision theory (or the theory of choice; not to be confused with choice theory) is a branch of applied probability theory concerned with the theory of making decisions based on assigning probabilities to various factors and assigning numerical ...
. With its concerns with searching and effectively presenting
data In the pursuit of knowledge, data (; ) is a collection of discrete Value_(semiotics), values that convey information, describing quantity, qualitative property, quality, fact, statistics, other basic units of meaning, or simply sequences of sy ...
, statistics has overlap with
information science Information science (also known as information studies) is an academic field which is primarily concerned with analysis, collection, classification, manipulation, storage, retrieval, movement, dissemination, and protection of informatio ...
and
computer science Computer science is the study of computation, automation, and information. Computer science spans theoretical disciplines (such as algorithms, theory of computation, information theory, and automation) to practical disciplines (includi ...
.


Etymology

:''Look up '' statistics'' in
Wiktionary Wiktionary ( , , rhyming with "dictionary") is a multilingual, web-based project to create a free content dictionary of terms (including words, phrases, proverbs, linguistic reconstructions, etc.) in all natural languages and in a numbe ...
, the free dictionary.'' The term ''statistics'' is ultimately derived from the
New Latin New Latin (also called Neo-Latin or Modern Latin) is the revival of Literary Latin used in original, scholarly, and scientific works since about 1500. Modern scholarly and technical nomenclature, such as in zoological and botanical taxonomy ...
''statisticum collegium'' ("council of state") and the
Italian Italian(s) may refer to: * Anything of, from, or related to the people of Italy over the centuries ** Italians, an ethnic group or simply a citizen of the Italian Republic or Italian Kingdom ** Italian language, a Romance language *** Regional Ita ...
word ''statista'' ("statesman" or "
politician A politician is a person active in party politics, or a person holding or seeking an elected office in government. Politicians propose, support, reject and create laws that govern the land and by an extension of its people. Broadly speaking ...
"). The
German German(s) may refer to: * Germany (of or related to) ** Germania (historical use) * Germans, citizens of Germany, people of German ancestry, or native speakers of the German language ** For citizens of Germany, see also German nationality law **Ge ...
''Statistik'', first introduced by Gottfried Achenwall (1749), originally designated the analysis of
data In the pursuit of knowledge, data (; ) is a collection of discrete Value_(semiotics), values that convey information, describing quantity, qualitative property, quality, fact, statistics, other basic units of meaning, or simply sequences of sy ...
about the
state State may refer to: Arts, entertainment, and media Literature * ''State Magazine'', a monthly magazine published by the U.S. Department of State * ''The State'' (newspaper), a daily newspaper in Columbia, South Carolina, United States * ''Our S ...
, signifying the "science of state" (then called ''political arithmetic'' in English). It acquired the meaning of the collection and classification of data generally in the early 19th century. It was introduced into English in 1791 by
Sir John Sinclair Sir John Sinclair of Ulbster, 1st Baronet, (10 May 1754 – 21 December 1835), was a British politician, a writer on both finance and agriculture, and was one of the first people to use the word '' statistics'' in the English language, in ...
when he published the first of 21 volumes titled ''
Statistical Account of Scotland The ''Statistical Accounts of Scotland'' are a series of documentary publications, related in subject matter though published at different times, covering life in Scotland in the 18th, 19th and 20th centuries. The ''Old (or First) Statistical Ac ...
''. Thus, the original principal purpose of ''Statistik'' was data to be used by governmental and (often centralized) administrative bodies. The collection of data about states and localities continues, largely through national and international statistical services. In particular, censuses provide frequently updated information about the
population Population typically refers to the number of people in a single area, whether it be a city or town, region, country, continent, or the world. Governments typically quantify the size of the resident population within their jurisdiction using a ...
. The first book to have 'statistics' in its title was "Contributions to Vital Statistics" (1845) by Francis GP Neison, actuary to the Medical Invalid and General Life Office. ]


Origins in probability theory

Basic forms of statistics have been used since the beginning of civilization. Early empires often collated censuses of the population or recorded the trade in various commodities. The
Han Dynasty The Han dynasty (, ; ) was an imperial dynasty of China (202 BC – 9 AD, 25–220 AD), established by Liu Bang (Emperor Gao) and ruled by the House of Liu. The dynasty was preceded by the short-lived Qin dynasty (221–207 BC) and a warr ...
and the
Roman Empire The Roman Empire ( la, Imperium Romanum ; grc-gre, Βασιλεία τῶν Ῥωμαίων, Basileía tôn Rhōmaíōn) was the post- Republican period of ancient Rome. As a polity, it included large territorial holdings around the Mediter ...
were some of the first states to extensively gather data on the size of the empire's population, geographical area and wealth. The use of statistical methods dates back to at least the 5th century BCE. The historian
Thucydides Thucydides (; grc, , }; BC) was an Athenian historian and general. His '' History of the Peloponnesian War'' recounts the fifth-century BC war between Sparta and Athens until the year 411 BC. Thucydides has been dubbed the father of " scienti ...
in his ''
History of the Peloponnesian War The ''History of the Peloponnesian War'' is a historical account of the Peloponnesian War (431–404 BC), which was fought between the Peloponnesian League (led by Sparta) and the Delian League (led by Athens). It was written by Thucydides, an ...
'' describes how the Athenians calculated the height of the wall of
Platea Plataea or Plataia (; grc, Πλάταια), also Plataeae or Plataiai (; grc, Πλαταιαί), was an ancient city, located in Greece in southeastern Boeotia, south of Thebes.Mish, Frederick C., Editor in Chief. “Plataea.” '' Webst ...
by counting the number of bricks in an unplastered section of the wall sufficiently near them to be able to count them. The count was repeated several times by a number of soldiers. The most frequent value (in modern terminology - the
mode Mode ( la, modus meaning "manner, tune, measure, due measure, rhythm, melody") may refer to: Arts and entertainment * '' MO''D''E (magazine)'', a defunct U.S. women's fashion magazine * ''Mode'' magazine, a fictional fashion magazine which is ...
) so determined was taken to be the most likely value of the number of bricks. Multiplying this value by the height of the bricks used in the wall allowed the Athenians to determine the height of the ladders necessary to scale the walls. The
Trial of the Pyx The Trial of the Pyx () is a judicial ceremony in the United Kingdom to ensure that newly minted coins from the Royal Mint conform to their required dimensional and fineness specifications. Although coin quality is now tested throughout the year ...
is a test of the purity of the coinage of the Royal Mint which has been held on a regular basis since the 12th century. The Trial itself is based on statistical sampling methods. After minting a series of coins - originally from ten pounds of silver - a single coin was placed in the Pyx - a box in
Westminster Abbey Westminster Abbey, formally titled the Collegiate Church of Saint Peter at Westminster, is an historic, mainly Gothic church in the City of Westminster, London, England, just to the west of the Palace of Westminster. It is one of the Unite ...
. After a given period - now once a year - the coins are removed and weighed. A sample of coins removed from the box are then tested for purity. The ''
Nuova Cronica The ''Nuova Cronica'' (also: ''Nova Cronica'') or '' New Chronicles'' is a 14th-century history of Florence created in a year-by-year linear format and written by the Italian banker and official Giovanni Villani (c. 1276 or 1280–1348). T ...
'', a 14th-century
history of Florence Florence ( it, Firenze) weathered the decline of the Western Roman Empire to emerge as a financial hub of Europe, home to several banks including that of the politically powerful Medici family. The city's wealth supported the development of ...
by the Florentine banker and official
Giovanni Villani Giovanni Villani (; 1276 or 1280 – 1348)Bartlett (1992), 35. was an Italian banker, official, diplomat and chronicler from Florence who wrote the ''Nuova Cronica'' (''New Chronicles'') on the history of Florence. He was a leading statesman of ...
, includes much statistical information on population, ordinances, commerce and trade, education, and religious facilities and has been described as the first introduction of statistics as a positive element in history,Villani, Giovanni. Encyclopædia Britannica.
Encyclopædia Britannica 2006 Ultimate Reference Suite DVD An encyclopedia (American English) or encyclopædia (British English) is a reference work or compendium providing summaries of knowledge either general or special to a particular field or discipline. Encyclopedias are divided into articles ...
. Retrieved on 2008-03-04.
though neither the term nor the concept of statistics as a specific field yet existed. The arithmetic
mean There are several kinds of mean in mathematics, especially in statistics. Each mean serves to summarize a given group of data, often to better understand the overall value (magnitude and sign) of a given data set. For a data set, the '' ari ...
, although a concept known to the Greeks, was not generalised to more than two values until the 16th century. The invention of the decimal system by
Simon Stevin Simon Stevin (; 1548–1620), sometimes called Stevinus, was a Flemish mathematician, scientist and music theorist. He made various contributions in many areas of science and engineering, both theoretical and practical. He also translated vario ...
in 1585 seems likely to have facilitated these calculations. This method was first adopted in astronomy by
Tycho Brahe Tycho Brahe ( ; born Tyge Ottesen Brahe; generally called Tycho (14 December 154624 October 1601) was a Danish astronomer, known for his comprehensive astronomical observations, generally considered to be the most accurate of his time. He was ...
who was attempting to reduce the errors in his estimates of the locations of various celestial bodies. The idea of the median originated in Edward Wright's book on navigation (''Certaine Errors in Navigation'') in 1599 in a section concerning the determination of location with a compass. Wright felt that this value was the most likely to be the correct value in a series of observations. The birth of statistics is often dated to 1662, when
John Graunt John Graunt (24 April 1620 – 18 April 1674) has been regarded as the founder of demography. Graunt was one of the first demographers, and perhaps the first epidemiologist, though by profession he was a haberdasher. He was bankrupted later in li ...
, along with
William Petty Sir William Petty FRS (26 May 1623 – 16 December 1687) was an English economist, physician, scientist and philosopher. He first became prominent serving Oliver Cromwell and the Commonwealth in Ireland. He developed efficient methods to s ...
, developed early human statistical and
census A census is the procedure of systematically acquiring, recording and calculating information about the members of a given population. This term is used mostly in connection with national population and housing censuses; other common censuses in ...
methods that provided a framework for modern
demography Demography () is the statistical study of populations, especially human beings. Demographic analysis examines and measures the dimensions and dynamics of populations; it can cover whole societies or groups defined by criteria such as edu ...
. He produced the first
life table In actuarial science and demography, a life table (also called a mortality table or actuarial table) is a table which shows, for each age, what the probability is that a person of that age will die before their next birthday ("probability of death ...
, giving probabilities of survival to each age. His book ''Natural and Political Observations Made upon the Bills of Mortality'' used analysis of the mortality rolls to make the first statistically based estimation of the population of
London London is the capital and List of urban areas in the United Kingdom, largest city of England and the United Kingdom, with a population of just under 9 million. It stands on the River Thames in south-east England at the head of a estuary dow ...
. He knew that there were around 13,000 funerals per year in London and that three people died per eleven families per year. He estimated from the parish records that the average family size was 8 and calculated that the population of London was about 384,000; this is the first known use of a
ratio estimator The ratio estimator is a statistical parameter and is defined to be the ratio of means of two random variables. Ratio estimates are biased and corrections must be made when they are used in experimental or survey work. The ratio estimates are asymm ...
. Laplace in 1802 estimated the population of France with a similar method; see for details. Although the original scope of statistics was limited to data useful for governance, the approach was extended to many fields of a scientific or commercial nature during the 19th century. The mathematical foundations for the subject heavily drew on the new
probability theory Probability theory is the branch of mathematics concerned with probability. Although there are several different probability interpretations, probability theory treats the concept in a rigorous mathematical manner by expressing it through a set ...
, pioneered in the 16th century by
Gerolamo Cardano Gerolamo Cardano (; also Girolamo or Geronimo; french: link=no, Jérôme Cardan; la, Hieronymus Cardanus; 24 September 1501– 21 September 1576) was an Italian polymath, whose interests and proficiencies ranged through those of mathematician, ...
,
Pierre de Fermat Pierre de Fermat (; between 31 October and 6 December 1607 – 12 January 1665) was a French mathematician who is given credit for early developments that led to infinitesimal calculus, including his technique of adequality. In particular, he ...
and Blaise Pascal. Christiaan Huygens (1657) gave the earliest known scientific treatment of the subject.
Jakob Bernoulli Jacob Bernoulli (also known as James or Jacques; – 16 August 1705) was one of the many prominent mathematicians in the Bernoulli family. He was an early proponent of Leibnizian calculus and sided with Gottfried Wilhelm Leibniz during the Leib ...
's ''
Ars Conjectandi (Latin for "The Art of Conjecturing") is a book on combinatorics and mathematical probability written by Jacob Bernoulli and published in 1713, eight years after his death, by his nephew, Niklaus Bernoulli. The seminal work consolidated, apa ...
'' (posthumous, 1713) and
Abraham de Moivre Abraham de Moivre FRS (; 26 May 166727 November 1754) was a French mathematician known for de Moivre's formula, a formula that links complex numbers and trigonometry, and for his work on the normal distribution and probability theory. He moved ...
's ''
The Doctrine of Chances ''The Doctrine of Chances'' was the first textbook on probability theory, written by 18th-century French mathematician Abraham de Moivre and first published in 1718.. De Moivre wrote in English because he resided in England at the time, having ...
'' (1718) treated the subject as a branch of mathematics. In his book Bernoulli introduced the idea of representing complete certainty as one and probability as a number between zero and one. A key early application of statistics in the 18th century was to the
human sex ratio In anthropology and demography, the human sex ratio is the ratio of males to females in a population. Like most sexual species, the sex ratio in humans is close to 1:1. In humans, the natural ratio at birth between males and females is sligh ...
at birth. John Arbuthnot studied this question in 1710. Arbuthnot examined birth records in London for each of the 82 years from 1629 to 1710. In every year, the number of males born in London exceeded the number of females. Considering more male or more female births as equally likely, the probability of the observed outcome is 0.5^82, or about 1 in 4,8360,0000,0000,0000,0000,0000; in modern terms, the ''p''-value. This is vanishingly small, leading Arbuthnot that this was not due to chance, but to divine providence: "From whence it follows, that it is Art, not Chance, that governs." This is and other work by Arbuthnot is credited as "the first use of
significance test A statistical hypothesis test is a method of statistical inference used to decide whether the data at hand sufficiently support a particular hypothesis. Hypothesis testing allows us to make probabilistic statements about population parameters. ...
s" the first example of reasoning about statistical significance and moral certainty, and "… perhaps the first published report of a
nonparametric test Nonparametric statistics is the branch of statistics that is not based solely on parametrized families of probability distributions (common examples of parameters are the mean and variance). Nonparametric statistics is based on either being distri ...
…", specifically the
sign test The sign test is a statistical method to test for consistent differences between pairs of observations, such as the weight of subjects before and after treatment. Given pairs of observations (such as weight pre- and post-treatment) for each subject ...
; see details at . The formal study of
theory of errors In statistics, propagation of uncertainty (or propagation of error) is the effect of variables' uncertainties (or errors, more specifically random errors) on the uncertainty of a function based on them. When the variables are the values of exp ...
may be traced back to
Roger Cotes Roger Cotes (10 July 1682 – 5 June 1716) was an English mathematician, known for working closely with Isaac Newton by proofreading the second edition of his famous book, the '' Principia'', before publication. He also invented the quadratur ...
' ''Opera Miscellanea'' (posthumous, 1722), but a memoir prepared by
Thomas Simpson Thomas Simpson FRS (20 August 1710 – 14 May 1761) was a British mathematician and inventor known for the eponymous Simpson's rule to approximate definite integrals. The attribution, as often in mathematics, can be debated: this rule had been ...
in 1755 (printed 1756) first applied the theory to the discussion of errors of observation. The reprint (1757) of this memoir lays down the axioms that positive and negative errors are equally probable, and that there are certain assignable limits within which all errors may be supposed to fall; continuous errors are discussed and a probability curve is given. Simpson discussed several possible distributions of error. He first considered the uniform distribution and then the discrete symmetric
triangular distribution In probability theory and statistics, the triangular distribution is a continuous probability distribution with lower limit ''a'', upper limit ''b'' and mode ''c'', where ''a'' < ''b'' and ''a'' ≤ ''c'' ≤ ''b''. ...
followed by the continuous symmetric triangle distribution.
Tobias Mayer Tobias Mayer (17 February 172320 February 1762) was a German astronomer famous for his studies of the Moon. He was born at Marbach, in Württemberg, and brought up at Esslingen in poor circumstances. A self-taught mathematician, he earned a l ...
, in his study of the libration of the
moon The Moon is Earth's only natural satellite. It is the fifth largest satellite in the Solar System and the largest and most massive relative to its parent planet, with a diameter about one-quarter that of Earth (comparable to the width of ...
(''Kosmographische Nachrichten'', Nuremberg, 1750), invented the first formal method for estimating the unknown quantities by generalized the averaging of observations under identical circumstances to the averaging of groups of similar equations.
Roger Joseph Boscovich Roger Joseph Boscovich ( hr, Ruđer Josip Bošković; ; it, Ruggiero Giuseppe Boscovich; la, Rogerius (Iosephus) Boscovicius; sr, Руђер Јосип Бошковић; 18 May 1711 – 13 February 1787) was a physicist, astronomer, ...
in 1755 based in his work on the shape of the earth proposed in his book ''De Litteraria expeditione per pontificiam ditionem ad dimetiendos duos meridiani gradus a PP. Maire et Boscovicli'' that the true value of a series of observations would be that which minimises the sum of absolute errors. In modern terminology this value is the median. The first example of what later became known as the normal curve was studied by
Abraham de Moivre Abraham de Moivre FRS (; 26 May 166727 November 1754) was a French mathematician known for de Moivre's formula, a formula that links complex numbers and trigonometry, and for his work on the normal distribution and probability theory. He moved ...
who plotted this curve on November 12, 1733.de Moivre, A. (1738) The doctrine of chances. Woodfall de Moivre was studying the number of heads that occurred when a 'fair' coin was tossed. In 1761
Thomas Bayes Thomas Bayes ( ; 1701 7 April 1761) was an English statistician, philosopher and Presbyterian minister who is known for formulating a specific case of the theorem that bears his name: Bayes' theorem. Bayes never published what would become his ...
proved Bayes' theorem and in 1765
Joseph Priestley Joseph Priestley (; 24 March 1733 – 6 February 1804) was an English chemist, natural philosopher, separatist theologian, grammarian, multi-subject educator, and liberal political theorist. He published over 150 works, and conducted exp ...
invented the first
timeline A timeline is a display of a list of events in chronological order. It is typically a graphic design showing a long bar labelled with dates paralleling it, and usually contemporaneous events. Timelines can use any suitable scale represen ...
charts.
Johann Heinrich Lambert Johann Heinrich Lambert (, ''Jean-Henri Lambert'' in French; 26 or 28 August 1728 – 25 September 1777) was a polymath from the Republic of Mulhouse, generally referred to as either Swiss or French, who made important contributions to the subject ...
in his 1765 book ''Anlage zur Architectonic'' proposed the
semicircle In mathematics (and more specifically geometry), a semicircle is a one-dimensional locus of points that forms half of a circle. The full arc of a semicircle always measures 180° (equivalently, radians, or a half-turn). It has only one line o ...
as a distribution of errors: : f(x) = \frac \sqrt with -1 < ''x'' < 1.
Pierre-Simon Laplace Pierre-Simon, marquis de Laplace (; ; 23 March 1749 – 5 March 1827) was a French scholar and polymath whose work was important to the development of engineering, mathematics, statistics, physics, astronomy, and philosophy. He summarized ...
(1774) made the first attempt to deduce a rule for the combination of observations from the principles of the theory of probabilities. He represented the law of probability of errors by a curve and deduced a formula for the mean of three observations. Laplace in 1774 noted that the frequency of an error could be expressed as an exponential function of its magnitude once its sign was disregarded.Wilson, Edwin Bidwell (1923) "First and second laws of error", ''
Journal of the American Statistical Association The ''Journal of the American Statistical Association (JASA)'' is the primary journal published by the American Statistical Association, the main professional body for statisticians in the United States. It is published four times a year in Mar ...
'', 18 (143), 841-851
This distribution is now known as the Laplace distribution. Lagrange proposed a
parabolic fractal distribution In probability and statistics, the parabolic fractal distribution is a type of discrete probability distribution in which the logarithm of the frequency or size of entities in a population is a quadratic polynomial of the logarithm of the rank (wi ...
of errors in 1776. Laplace in 1778 published his second law of errors wherein he noted that the frequency of an error was proportional to the exponential of the square of its magnitude. This was subsequently rediscovered by
Gauss Johann Carl Friedrich Gauss (; german: Gauß ; la, Carolus Fridericus Gauss; 30 April 177723 February 1855) was a German mathematician and physicist who made significant contributions to many fields in mathematics and science. Sometimes refer ...
(possibly in 1795) and is now best known as the
normal distribution In statistics, a normal distribution or Gaussian distribution is a type of continuous probability distribution for a real-valued random variable. The general form of its probability density function is : f(x) = \frac e^ The parameter \mu ...
which is of central importance in statistics.Havil J (2003) ''Gamma: Exploring Euler's Constant''. Princeton, NJ: Princeton University Press, p. 157 This distribution was first referred to as the ''normal'' distribution by C. S. Peirce in 1873 who was studying measurement errors when an object was dropped onto a wooden base. C. S. Peirce (1873) Theory of errors of observations. Report of the Superintendent US Coast Survey, Washington, Government Printing Office. Appendix no. 21: 200-224 He chose the term ''normal'' because of its frequent occurrence in naturally occurring variables. Lagrange also suggested in 1781 two other distributions for errors - a
raised cosine distribution In probability theory and statistics, the raised cosine distribution is a continuous probability distribution supported on the interval mu-s,\mu+s/math>. The probability density function (PDF) is :f(x;\mu,s)=\frac \left +\cos\left(\frac\,\pi\rig ...
and a
logarithmic distribution In probability and statistics, the logarithmic distribution (also known as the logarithmic series distribution or the log-series distribution) is a discrete probability distribution derived from the Maclaurin series expansion : -\ln(1-p) = p ...
. Laplace gave (1781) a formula for the law of facility of error (a term due to
Joseph Louis Lagrange Joseph-Louis Lagrange (born Giuseppe Luigi LagrangiaDaniel Bernoulli (1778) introduced the principle of the maximum product of the probabilities of a system of concurrent errors. In 1786
William Playfair William Playfair (22 September 1759 – 11 February 1823), a Scottish engineer and political economist, served as a secret agent on behalf of Great Britain during its war with France. The founder of graphical methods of statistics, Playfai ...
(1759-1823) introduced the idea of graphical representation into statistics. He invented the
line chart A line chart or line graph or curve chart is a type of chart which displays information as a series of data points called 'markers' connected by straight line segments. It is a basic type of chart common in many fields. It is similar to a ...
,
bar chart A bar chart or bar graph is a chart or graph that presents categorical data with rectangular bars with heights or lengths proportional to the values that they represent. The bars can be plotted vertically or horizontally. A vertical bar chart i ...
and histogram and incorporated them into his works on
economics Economics () is the social science that studies the production, distribution, and consumption of goods and services. Economics focuses on the behaviour and interactions of economic agents and how economies work. Microeconomics analyzes ...
, the ''Commercial and Political Atlas''. This was followed in 1795 by his invention of the pie chart and circle chart which he used to display the evolution of England's imports and exports. These latter charts came to general attention when he published examples in his ''Statistical Breviary'' in 1801. Laplace, in an investigation of the motions of Saturn and
Jupiter Jupiter is the fifth planet from the Sun and the largest in the Solar System. It is a gas giant with a mass more than two and a half times that of all the other planets in the Solar System combined, but slightly less than one-thousandth t ...
in 1787, generalized Mayer's method by using different linear combinations of a single group of equations. In 1791
Sir John Sinclair Sir John Sinclair of Ulbster, 1st Baronet, (10 May 1754 – 21 December 1835), was a British politician, a writer on both finance and agriculture, and was one of the first people to use the word '' statistics'' in the English language, in ...
introduced the term 'statistics' into English in his Statistical Accounts of Scotland. In 1802 Laplace estimated the population of France to be 28,328,612.Cochran W.G. (1978) "Laplace's ratio estimators". pp 3-10. In David H.A., (ed). ''Contributions to Survey Sampling and Applied Statistics: papers in honor of H. O. Hartley''. Academic Press, New York He calculated this figure using the number of births in the previous year and census data for three communities. The census data of these communities showed that they had 2,037,615 persons and that the number of births were 71,866. Assuming that these samples were representative of France, Laplace produced his estimate for the entire population. The
method of least squares The method of least squares is a standard approach in regression analysis to approximate the solution of overdetermined systems (sets of equations in which there are more equations than unknowns) by minimizing the sum of the squares of the res ...
, which was used to minimize errors in data measurement, was published independently by
Adrien-Marie Legendre Adrien-Marie Legendre (; ; 18 September 1752 – 9 January 1833) was a French mathematician who made numerous contributions to mathematics. Well-known and important concepts such as the Legendre polynomials and Legendre transformation are name ...
(1805),
Robert Adrain Robert Adrain (30 September 1775 – 10 August 1843) was an Irish political exile who won renown as a mathematician in the United States. He left Ireland after leading republican insurgents in the Rebellion of 1798, and settled in New Jersey an ...
(1808), and
Carl Friedrich Gauss Johann Carl Friedrich Gauss (; german: Gauß ; la, Carolus Fridericus Gauss; 30 April 177723 February 1855) was a German mathematician and physicist who made significant contributions to many fields in mathematics and science. Sometimes refer ...
(1809). Gauss had used the method in his famous 1801 prediction of the location of the
dwarf planet A dwarf planet is a small planetary-mass object that is in direct orbit of the Sun, smaller than any of the eight classical planets but still a world in its own right. The prototypical dwarf planet is Pluto. The interest of dwarf planets to ...
Ceres Ceres most commonly refers to: * Ceres (dwarf planet), the largest asteroid * Ceres (mythology), the Roman goddess of agriculture Ceres may also refer to: Places Brazil * Ceres, Goiás, Brazil * Ceres Microregion, in north-central Goiás ...
. The observations that Gauss based his calculations on were made by the Italian monk Piazzi. The method of least squares was preceded by the use a median regression slope. This method minimizing the sum of the absolute deviances. A method of estimating this slope was invented by
Roger Joseph Boscovich Roger Joseph Boscovich ( hr, Ruđer Josip Bošković; ; it, Ruggiero Giuseppe Boscovich; la, Rogerius (Iosephus) Boscovicius; sr, Руђер Јосип Бошковић; 18 May 1711 – 13 February 1787) was a physicist, astronomer, ...
in 1760 which he applied to astronomy. The term ''probable error'' (''der wahrscheinliche Fehler'') - the median deviation from the mean - was introduced in 1815 by the German astronomer Frederik Wilhelm Bessel. Antoine Augustin Cournot in 1843 was the first to use the term ''median'' (''valeur médiane'') for the value that divides a probability distribution into two equal halves. Other contributors to the theory of errors were Ellis (1844),
De Morgan De Morgan or de Morgan is a surname, and may refer to: * Augustus De Morgan (1806–1871), British mathematician and logician. ** De Morgan's laws (or De Morgan's theorem), a set of rules from propositional logic. ** The De Morgan Medal, a trien ...
(1864), Glaisher (1872), and
Giovanni Schiaparelli Giovanni Virginio Schiaparelli ( , also , ; 14 March 1835 – 4 July 1910) was an Italian astronomer and science historian. Biography He studied at the University of Turin, graduating in 1854, and later did research at Berlin Observatory, ...
(1875). Peters's (1856) formula for r, the "probable error" of a single observation was widely used and inspired early
robust statistics Robust statistics are statistics with good performance for data drawn from a wide range of probability distributions, especially for distributions that are not normal. Robust statistical methods have been developed for many common problems, su ...
(resistant to outliers: see
Peirce's criterion In robust statistics, Peirce's criterion is a rule for eliminating outliers from data sets, which was devised by Benjamin Peirce. Outliers removed by Peirce's criterion The problem of outliers In data sets containing real-numbered measurements, ...
). In the 19th century authors on
statistical theory The theory of statistics provides a basis for the whole range of techniques, in both study design and data analysis, that are used within applications of statistics. The theory covers approaches to statistical-decision problems and to statistica ...
included Laplace, S. Lacroix (1816), Littrow (1833),
Dedekind Julius Wilhelm Richard Dedekind (6 October 1831 – 12 February 1916) was a German mathematician who made important contributions to number theory, abstract algebra (particularly ring theory), and the axiomatic foundations of arithmetic. His ...
(1860), Helmert (1872), Laurent (1873), Liagre, Didion,
De Morgan De Morgan or de Morgan is a surname, and may refer to: * Augustus De Morgan (1806–1871), British mathematician and logician. ** De Morgan's laws (or De Morgan's theorem), a set of rules from propositional logic. ** The De Morgan Medal, a trien ...
and Boole.
Gustav Theodor Fechner Gustav Theodor Fechner (; ; 19 April 1801 – 18 November 1887) was a German physicist, philosopher, and experimental psychologist. A pioneer in experimental psychology and founder of psychophysics (techniques for measuring the mind), he inspired ...
used the median (''Centralwerth'') in sociological and psychological phenomena.Keynes, JM (1921) A treatise on probability. Pt II Ch XVII §5 (p 201) It had earlier been used only in astronomy and related fields. Francis Galton used the English term ''median'' for the first time in 1881 having earlier used the terms ''middle-most value'' in 1869 and the ''medium'' in 1880.Galton F (1881) Report of the Anthropometric Committee pp 245-260. Report of the 51st Meeting of the British Association for the Advancement of Science
Adolphe Quetelet Lambert Adolphe Jacques Quetelet FRSF or FRSE (; 22 February 1796 – 17 February 1874) was a Belgian astronomer, mathematician, statistician and sociologist who founded and directed the Brussels Observatory and was influential in intro ...
(1796–1874), another important founder of statistics, introduced the notion of the "average man" (''l'homme moyen'') as a means of understanding complex social phenomena such as
crime rates Crime statistics refer to systematic, quantitative results about crime, as opposed to crime news or anecdotes. Notably, crime statistics can be the result of two rather different processes: * scientific research, such as criminological studies, vi ...
, marriage rates, and suicide rates. The first tests of the normal distribution were invented by the German statistician Wilhelm Lexis in the 1870s. The only data sets available to him that he was able to show were normally distributed were birth rates.


Development of modern statistics

Although the origins of statistical theory lie in the 18th-century advances in probability, the modern field of statistics only emerged in the late-19th and early-20th century in three stages. The first wave, at the turn of the century, was led by the work of Francis Galton and Karl Pearson, who transformed statistics into a rigorous mathematical discipline used for analysis, not just in science, but in industry and politics as well. The second wave of the 1910s and 20s was initiated by
William Sealy Gosset William Sealy Gosset (13 June 1876 – 16 October 1937) was an English statistician, chemist and brewer who served as Head Brewer of Guinness and Head Experimental Brewer of Guinness and was a pioneer of modern statistics. He pioneered small s ...
, and reached its culmination in the insights of
Ronald Fisher Sir Ronald Aylmer Fisher (17 February 1890 – 29 July 1962) was a British polymath who was active as a mathematician, statistician, biologist, geneticist, and academic. For his work in statistics, he has been described as "a genius who ...
. This involved the development of better
design of experiments The design of experiments (DOE, DOX, or experimental design) is the design of any task that aims to describe and explain the variation of information under conditions that are hypothesized to reflect the variation. The term is generally associ ...
models, hypothesis testing and techniques for use with small data samples. The final wave, which mainly saw the refinement and expansion of earlier developments, emerged from the collaborative work between Egon Pearson and
Jerzy Neyman Jerzy Neyman (April 16, 1894 – August 5, 1981; born Jerzy Spława-Neyman; ) was a Polish mathematician and statistician who spent the first part of his professional career at various institutions in Warsaw, Poland and then at University Colleg ...
in the 1930s. Today, statistical methods are applied in all fields that involve decision making, for making accurate inferences from a collated body of data and for making decisions in the face of uncertainty based on statistical methodology. The first statistical bodies were established in the early 19th century. The
Royal Statistical Society The Royal Statistical Society (RSS) is an established statistical society. It has three main roles: a British learned society for statistics, a professional body for statisticians and a charity which promotes statistics for the public good. ...
was founded in 1834 and
Florence Nightingale Florence Nightingale (; 12 May 1820 – 13 August 1910) was an English social reformer, statistician and the founder of modern nursing. Nightingale came to prominence while serving as a manager and trainer of nurses during the Crimean War, i ...
, its first female member, pioneered the application of statistical analysis to health problems for the furtherance of epidemiological understanding and public health practice. However, the methods then used would not be considered as modern statistics today. The
Oxford Oxford () is a city in England. It is the county town and only city of Oxfordshire. In 2020, its population was estimated at 151,584. It is north-west of London, south-east of Birmingham and north-east of Bristol. The city is home to the ...
scholar
Francis Ysidro Edgeworth Francis Ysidro Edgeworth (8 February 1845 – 13 February 1926) was an Anglo-Irish philosopher and political economist who made significant contributions to the methods of statistics during the 1880s. From 1891 onward, he was appointed th ...
's book, ''Metretike: or The Method of Measuring Probability and Utility'' (1887) dealt with probability as the basis of inductive reasoning, and his later works focused on the 'philosophy of chance'. His first paper on statistics (1883) explored the law of error (
normal distribution In statistics, a normal distribution or Gaussian distribution is a type of continuous probability distribution for a real-valued random variable. The general form of its probability density function is : f(x) = \frac e^ The parameter \mu ...
), and his ''Methods of Statistics'' (1885) introduced an early version of the
t distribution The phrase "T distribution" may refer to * Student's t-distribution in univariate probability theory, * Hotelling's T-square distribution in multivariate statistics. * Multivariate Student distribution In statistics, the multivariate ''t''-dis ...
, the Edgeworth expansion, the Edgeworth series, the method of variate transformation and the asymptotic theory of maximum likelihood estimates. The Norwegian Anders Nicolai Kiær introduced the concept of stratified sampling in 1895.Bellhouse DR (1988) A brief history of random sampling methods. Handbook of statistics. Vol 6 pp 1-14 Elsevier
Arthur Lyon Bowley Sir Arthur Lyon Bowley, FBA (6 November 1869 – 21 January 1957) was an English statistician and economist who worked on economic statistics and pioneered the use of sampling techniques in social surveys. Early life Bowley's father, James Wil ...
introduced new methods of data sampling in 1906 when working on social statistics. Although statistical surveys of social conditions had started with Charles Booth's "Life and Labour of the People in London" (1889-1903) and
Seebohm Rowntree Benjamin Seebohm Rowntree, CH (7 July 1871 – 7 October 1954) was an English sociological researcher, social reformer and industrialist. He is known in particular for his three York studies of poverty conducted in 1899, 1935, and 1951. The fi ...
's "Poverty, A Study of Town Life" (1901), Bowley's, key innovation consisted of the use of
random sampling In statistics, quality assurance, and survey methodology, sampling is the selection of a subset (a statistical sample) of individuals from within a statistical population to estimate characteristics of the whole population. Statisticians attemp ...
techniques. His efforts culminated in his ''New Survey of London Life and Labour''. Francis Galton is credited as one of the principal founders of statistical theory. His contributions to the field included introducing the concepts of standard deviation, correlation, regression and the application of these methods to the study of the variety of human characteristics - height, weight, eyelash length among others. He found that many of these could be fitted to a normal curve distribution. Galton submitted a paper to ''Nature'' in 1907 on the usefulness of the median. He examined the accuracy of 787 guesses of the weight of an ox at a country fair. The actual weight was 1208 pounds: the median guess was 1198. The guesses were markedly non-normally distributed (cf.
Wisdom of the Crowd The wisdom of the crowd is the collective opinion of a diverse independent group of individuals rather than that of a single expert. This process, while not new to the Information Age, has been pushed into the mainstream spotlight by social infor ...
). Galton's publication of ''Natural Inheritance'' in 1889 sparked the interest of a brilliant mathematician, Karl Pearson, then working at
University College London , mottoeng = Let all come who by merit deserve the most reward , established = , type = Public research university , endowment = £143 million (2020) , budget = ...
, and he went on to found the discipline of mathematical statistics. He emphasised the statistical foundation of scientific laws and promoted its study and his laboratory attracted students from around the world attracted by his new methods of analysis, including
Udny Yule George Udny Yule FRS (18 February 1871 – 26 June 1951), usually known as Udny Yule, was a British statistician, particularly known for the Yule distribution. Personal life Yule was born at Beech Hill, a house in Morham near Haddingto ...
. His work grew to encompass the fields of
biology Biology is the scientific study of life. It is a natural science with a broad scope but has several unifying themes that tie it together as a single, coherent field. For instance, all organisms are made up of cells that process hereditary i ...
,
epidemiology Epidemiology is the study and analysis of the distribution (who, when, and where), patterns and determinants of health and disease conditions in a defined population. It is a cornerstone of public health, and shapes policy decisions and evide ...
, anthropometry,
medicine Medicine is the science and practice of caring for a patient, managing the diagnosis, prognosis, prevention, treatment, palliation of their injury or disease, and promoting their health. Medicine encompasses a variety of health care pr ...
and social
history History (derived ) is the systematic study and the documentation of the human activity. The time period of event before the invention of writing systems is considered prehistory. "History" is an umbrella term comprising past events as well ...
. In 1901, with
Walter Weldon Walter Weldon FRS FRSE (31 October 183220 September 1885) was a 19th-century English industrial chemist and journalist. He was President of the Society of Chemical Industry 1883/84. Life He was born in Loughborough on 31 October 1832, the son ...
, founder of
biometry Biostatistics (also known as biometry) are the development and application of statistical methods to a wide range of topics in biology. It encompasses the design of biological experiments, the collection and analysis of data from those experimen ...
, and Galton, he founded the journal '' Biometrika'' as the first journal of mathematical statistics and biometry. His work, and that of Galton's, underpins many of the 'classical' statistical methods which are in common use today, including the
Correlation coefficient A correlation coefficient is a numerical measure of some type of correlation, meaning a statistical relationship between two variables. The variables may be two columns of a given data set of observations, often called a sample, or two components ...
, defined as a product-moment; the method of moments for the fitting of distributions to samples; Pearson's system of continuous curves that forms the basis of the now conventional continuous probability distributions; Chi distance a precursor and special case of the Mahalanobis distance and P-value, defined as the probability measure of the complement of the ball with the hypothesized value as center point and chi distance as radius. He also introduced the term 'standard deviation'. He also founded the statistical hypothesis testing theory, Pearson's chi-squared test and principal component analysis. In 1911 he founded the world's first university statistics department at
University College London , mottoeng = Let all come who by merit deserve the most reward , established = , type = Public research university , endowment = £143 million (2020) , budget = ...
. The second wave of mathematical statistics was pioneered by
Ronald Fisher Sir Ronald Aylmer Fisher (17 February 1890 – 29 July 1962) was a British polymath who was active as a mathematician, statistician, biologist, geneticist, and academic. For his work in statistics, he has been described as "a genius who ...
who wrote two textbooks, '' Statistical Methods for Research Workers'', published in 1925 and ''
The Design of Experiments ''The Design of Experiments'' is a 1935 book by the English statistician Ronald Fisher about the design of experiments and is considered a foundational work in experimental design. Among other contributions, the book introduced the concept of th ...
'' in 1935, that were to define the academic discipline in universities around the world. He also systematized previous results, putting them on a firm mathematical footing. In his 1918 seminal paper ''
The Correlation between Relatives on the Supposition of Mendelian Inheritance #REDIRECT The Correlation between Relatives on the Supposition of Mendelian Inheritance {{R from other capitalisation ...
'', the first use to use the statistical term,
variance In probability theory and statistics, variance is the expectation of the squared deviation of a random variable from its population mean or sample mean. Variance is a measure of dispersion, meaning it is a measure of how far a set of numbe ...
. In 1919, at
Rothamsted Experimental Station Rothamsted Research, previously known as the Rothamsted Experimental Station and then the Institute of Arable Crops Research, is one of the oldest agricultural research institutions in the world, having been founded in 1843. It is located at Har ...
he started a major study of the extensive collections of data recorded over many years. This resulted in a series of reports under the general title ''Studies in Crop Variation.'' In 1930 he published ''
The Genetical Theory of Natural Selection ''The Genetical Theory of Natural Selection'' is a book by Ronald Fisher which combines Mendelian genetics with Charles Darwin's theory of natural selection, with Fisher being the first to argue that "Mendelism therefore validates Darwinism" and ...
'' where he applied statistics to
evolution Evolution is change in the heritable characteristics of biological populations over successive generations. These characteristics are the expressions of genes, which are passed on from parent to offspring during reproduction. Variation ...
. Over the next seven years, he pioneered the principles of the
design of experiments The design of experiments (DOE, DOX, or experimental design) is the design of any task that aims to describe and explain the variation of information under conditions that are hypothesized to reflect the variation. The term is generally associ ...
(see below) and elaborated his studies of analysis of variance. He furthered his studies of the statistics of small samples. Perhaps even more important, he began his systematic approach of the analysis of real data as the springboard for the development of new statistical methods. He developed computational algorithms for analyzing data from his balanced experimental designs. In 1925, this work resulted in the publication of his first book, '' Statistical Methods for Research Workers''. This book went through many editions and translations in later years, and it became the standard reference work for scientists in many disciplines. In 1935, this book was followed by ''
The Design of Experiments ''The Design of Experiments'' is a 1935 book by the English statistician Ronald Fisher about the design of experiments and is considered a foundational work in experimental design. Among other contributions, the book introduced the concept of th ...
'', which was also widely used. In addition to analysis of variance, Fisher named and promoted the method of
maximum likelihood In statistics, maximum likelihood estimation (MLE) is a method of estimating the parameters of an assumed probability distribution, given some observed data. This is achieved by maximizing a likelihood function so that, under the assumed stat ...
estimation. Fisher also originated the concepts of sufficiency,
ancillary statistic An ancillary statistic is a measure of a sample whose distribution (or whose pmf or pdf) does not depend on the parameters of the model. An ancillary statistic is a pivotal quantity that is also a statistic. Ancillary statistics can be used to c ...
s, Fisher's linear discriminator and
Fisher information In mathematical statistics, the Fisher information (sometimes simply called information) is a way of measuring the amount of information that an observable random variable ''X'' carries about an unknown parameter ''θ'' of a distribution that model ...
. His article ''On a distribution yielding the error functions of several well known statistics'' (1924) presented Pearson's chi-squared test and
William Sealy Gosset William Sealy Gosset (13 June 1876 – 16 October 1937) was an English statistician, chemist and brewer who served as Head Brewer of Guinness and Head Experimental Brewer of Guinness and was a pioneer of modern statistics. He pioneered small s ...
's t in the same framework as the Gaussian distribution, and his own parameter in the analysis of variance
Fisher's z-distribution Fisher's ''z''-distribution is the statistical distribution of half the logarithm of an ''F''-distribution variate: : z = \frac 1 2 \log F It was first described by Ronald Fisher in a paper delivered at the International Mathematical Cong ...
(more commonly used decades later in the form of the
F distribution In probability theory and statistics, the ''F''-distribution or F-ratio, also known as Snedecor's ''F'' distribution or the Fisher–Snedecor distribution (after Ronald Fisher and George W. Snedecor) is a continuous probability distribution t ...
). The 5% level of significance appears to have been introduced by Fisher in 1925.Fisher RA (1925) Statistical methods for research workers, Edinburgh: Oliver & Boyd Fisher stated that deviations exceeding twice the standard deviation are regarded as significant. Before this deviations exceeding three times the probable error were considered significant. For a symmetrical distribution the probable error is half the interquartile range. For a normal distribution the probable error is approximately 2/3 the standard deviation. It appears that Fisher's 5% criterion was rooted in previous practice. Other important contributions at this time included
Charles Spearman Charles Edward Spearman, FRS (10 September 1863 – 17 September 1945) was an English psychologist known for work in statistics, as a pioneer of factor analysis, and for Spearman's rank correlation coefficient. He also did seminal work on mod ...
's rank correlation coefficient that was a useful extension of the Pearson correlation coefficient.
William Sealy Gosset William Sealy Gosset (13 June 1876 – 16 October 1937) was an English statistician, chemist and brewer who served as Head Brewer of Guinness and Head Experimental Brewer of Guinness and was a pioneer of modern statistics. He pioneered small s ...
, the English statistician better known under his pseudonym of ''Student'', introduced Student's t-distribution, a continuous probability distribution useful in situations where the sample size is small and population standard deviation is unknown. Egon Pearson (Karl's son) and
Jerzy Neyman Jerzy Neyman (April 16, 1894 – August 5, 1981; born Jerzy Spława-Neyman; ) was a Polish mathematician and statistician who spent the first part of his professional career at various institutions in Warsaw, Poland and then at University Colleg ...
introduced the concepts of " Type II" error, power of a test and confidence intervals.
Jerzy Neyman Jerzy Neyman (April 16, 1894 – August 5, 1981; born Jerzy Spława-Neyman; ) was a Polish mathematician and statistician who spent the first part of his professional career at various institutions in Warsaw, Poland and then at University Colleg ...
in 1934 showed that stratified random sampling was in general a better method of estimation than purposive (quota) sampling.


Design of experiments

In 1747, while serving as surgeon on HM Bark ''Salisbury'',
James Lind James Lind (4 October 1716 – 13 July 1794) was a Scottish doctor. He was a pioneer of naval hygiene in the Royal Navy. By conducting one of the first ever clinical trials, he developed the theory that citrus fruits cured scurvy. Lind ...
carried out a controlled experiment to develop a cure for
scurvy Scurvy is a disease resulting from a lack of vitamin C (ascorbic acid). Early symptoms of deficiency include weakness, feeling tired and sore arms and legs. Without treatment, decreased red blood cells, gum disease, changes to hair, and bleeding ...
. In this study his subjects' cases "were as similar as I could have them", that is he provided strict entry requirements to reduce extraneous variation. The men were paired, which provided blocking. From a modern perspective, the main thing that is missing is randomized allocation of subjects to treatments. Lind is today often described as a one-factor-at-a-time experimenter. Similar one-factor-at-a-time (OFAT) experimentation was performed at the
Rothamsted Research Station Rothamsted Research, previously known as the Rothamsted Experimental Station and then the Institute of Arable Crops Research, is one of the oldest agricultural research institutions in the world, having been founded in 1843. It is located at Har ...
in the 1840s by Sir
John Lawes Sir John Bennet Lawes, 1st Baronet, Royal Society, FRS (28 December 1814 – 31 August 1900) was an English entrepreneur and agricultural scientist. He founded an experimental farm at his home at Rothamsted Manor that eventually became Rothams ...
to determine the optimal inorganic fertilizer for use on wheat. A theory of statistical inference was developed by
Charles S. Peirce Charles Sanders Peirce ( ; September 10, 1839 – April 19, 1914) was an American philosopher, logician, mathematician and scientist who is sometimes known as "the father of pragmatism". Educated as a chemist and employed as a scientist for t ...
in " Illustrations of the Logic of Science" (1877–1878) and " A Theory of Probable Inference" (1883), two publications that emphasized the importance of randomization-based inference in statistics. In another study, Peirce randomly assigned volunteers to a blinded, repeated-measures design to evaluate their ability to discriminate weights. Peirce's experiment inspired other researchers in psychology and education, which developed a research tradition of randomized experiments in laboratories and specialized textbooks in the 1800s. Peirce also contributed the first English-language publication on an
optimal design In the design of experiments, optimal designs (or optimum designs) are a class of design of experiments, experimental designs that are Optimization (mathematics), optimal with respect to some statistical theory, statistical objective function, ...
for regression- models in 1876. A pioneering
optimal design In the design of experiments, optimal designs (or optimum designs) are a class of design of experiments, experimental designs that are Optimization (mathematics), optimal with respect to some statistical theory, statistical objective function, ...
for
polynomial regression In statistics, polynomial regression is a form of regression analysis in which the relationship between the independent variable ''x'' and the dependent variable ''y'' is modelled as an ''n''th degree polynomial in ''x''. Polynomial regression fi ...
was suggested by
Gergonne Joseph Diez Gergonne (19 June 1771 at Nancy, France – 4 May 1859 at Montpellier, France) was a French mathematician and logician. Life In 1791, Gergonne enlisted in the French army as a captain. That army was undergoing rapid expansion becau ...
in 1815. In 1918
Kirstine Smith Kirstine Smith (April 12, 1878 – November 11, 1939) was a Danish statistician. She is credited with the creation of the field of optimal design of experiments. Background Smith grew up in the town of Nykøbing Mors, Denmark. In 1903, she grad ...
published optimal designs for polynomials of degree six (and less). The use of a sequence of experiments, where the design of each may depend on the results of previous experiments, including the possible decision to stop experimenting, was pioneered by
Abraham Wald Abraham Wald (; hu, Wald Ábrahám, yi, אברהם וואַלד;  – ) was a Jewish Hungarian mathematician who contributed to decision theory, geometry, and econometrics and founded the field of statistical sequential analysis. One ...
in the context of sequential tests of statistical hypotheses. Surveys are available of optimal sequential designs, Chernoff, H. (1972) ''Sequential Analysis and Optimal Design'',
SIAM Thailand ( ), historically known as Siam () and officially the Kingdom of Thailand, is a country in Southeast Asia, located at the centre of the Indochinese Peninsula, spanning , with a population of almost 70 million. The country is bo ...
Monograph.
and of adaptive designs. One specific type of sequential design is the "two-armed bandit", generalized to the
multi-armed bandit In probability theory and machine learning, the multi-armed bandit problem (sometimes called the ''K''- or ''N''-armed bandit problem) is a problem in which a fixed limited set of resources must be allocated between competing (alternative) choices ...
, on which early work was done by
Herbert Robbins Herbert Ellis Robbins (January 12, 1915 – February 12, 2001) was an American mathematician and statistician. He did research in topology, measure theory, statistics, and a variety of other fields. He was the co-author, with Richard Co ...
in 1952. The term "design of experiments" (DOE) derives from early statistical work performed by
Sir Ronald Fisher Sir Ronald Aylmer Fisher (17 February 1890 – 29 July 1962) was a British polymath who was active as a mathematician, statistician, biologist, geneticist, and academic. For his work in statistics, he has been described as "a genius who a ...
. He was described by
Anders Hald Anders Hjorth Hald (3 July 1913 – 11 November 2007) was a Danish statistician. He was a professor at the University of Copenhagen from 1960 to 1982. While a professor, he did research in industrial quality control and other areas, and also auth ...
as "a genius who almost single-handedly created the foundations for modern statistical science." Fisher initiated the principles of
design of experiments The design of experiments (DOE, DOX, or experimental design) is the design of any task that aims to describe and explain the variation of information under conditions that are hypothesized to reflect the variation. The term is generally associ ...
and elaborated on his studies of "
analysis of variance Analysis of variance (ANOVA) is a collection of statistical models and their associated estimation procedures (such as the "variation" among and between groups) used to analyze the differences among means. ANOVA was developed by the statistician ...
". Perhaps even more important, Fisher began his systematic approach to the analysis of real data as the springboard for the development of new statistical methods. He began to pay particular attention to the labour involved in the necessary computations performed by hand, and developed methods that were as practical as they were founded in rigour. In 1925, this work culminated in the publication of his first book, '' Statistical Methods for Research Workers''. This went into many editions and translations in later years, and became a standard reference work for scientists in many disciplines. A methodology for designing experiments was proposed by Ronald A. Fisher, in his innovative book ''
The Design of Experiments ''The Design of Experiments'' is a 1935 book by the English statistician Ronald Fisher about the design of experiments and is considered a foundational work in experimental design. Among other contributions, the book introduced the concept of th ...
'' (1935) which also became a standard. As an example, he described how to test the
hypothesis A hypothesis (plural hypotheses) is a proposed explanation for a phenomenon. For a hypothesis to be a scientific hypothesis, the scientific method requires that one can test it. Scientists generally base scientific hypotheses on previous obse ...
that a certain lady could distinguish by flavour alone whether the milk or the tea was first placed in the cup. While this sounds like a frivolous application, it allowed him to illustrate the most important ideas of experimental design: see
Lady tasting tea In the design of experiments in statistics, the lady tasting tea is a randomized experiment devised by Ronald Fisher and reported in his book ''The Design of Experiments'' (1935). The experiment is the original exposition of Fisher's notion of ...
. Agricultural science advances served to meet the combination of larger city populations and fewer farms. But for crop scientists to take due account of widely differing geographical growing climates and needs, it was important to differentiate local growing conditions. To extrapolate experiments on local crops to a national scale, they had to extend crop sample testing economically to overall populations. As statistical methods advanced (primarily the efficacy of designed experiments instead of one-factor-at-a-time experimentation), representative factorial design of experiments began to enable the meaningful extension, by inference, of experimental sampling results to the population as a whole. But it was hard to decide how representative was the crop sample chosen. Factorial design methodology showed how to estimate and correct for any random variation within the sample and also in the data collection procedures.


Bayesian statistics

The term ''Bayesian'' refers to
Thomas Bayes Thomas Bayes ( ; 1701 7 April 1761) was an English statistician, philosopher and Presbyterian minister who is known for formulating a specific case of the theorem that bears his name: Bayes' theorem. Bayes never published what would become his ...
(1702–1761), who proved that probabilistic limits could be placed on an unknown event. However it was
Pierre-Simon Laplace Pierre-Simon, marquis de Laplace (; ; 23 March 1749 – 5 March 1827) was a French scholar and polymath whose work was important to the development of engineering, mathematics, statistics, physics, astronomy, and philosophy. He summarized ...
(1749–1827) who introduced (as principle VI) what is now called Bayes' theorem and applied it to
celestial mechanics Celestial mechanics is the branch of astronomy that deals with the motions of objects in outer space. Historically, celestial mechanics applies principles of physics (classical mechanics) to astronomical objects, such as stars and planets, to ...
, medical statistics,
reliability Reliability, reliable, or unreliable may refer to: Science, technology, and mathematics Computing * Data reliability (disambiguation), a property of some disk arrays in computer storage * High availability * Reliability (computer networking), a ...
, and
jurisprudence Jurisprudence, or legal theory, is the theoretical study of the propriety of law. Scholars of jurisprudence seek to explain the nature of law in its most general form and they also seek to achieve a deeper understanding of legal reasoning a ...
.Stigler (1986, Chapter 3: Inverse Probability) When insufficient knowledge was available to specify an informed prior, Laplace used
uniform A uniform is a variety of clothing worn by members of an organization while participating in that organization's activity. Modern uniforms are most often worn by armed forces and paramilitary organizations such as police, emergency services, ...
priors, according to his "
principle of insufficient reason The principle of indifference (also called principle of insufficient reason) is a rule for assigning epistemic probabilities. The principle of indifference states that in the absence of any relevant evidence, agents should distribute their cre ...
". Laplace assumed uniform priors for mathematical simplicity rather than for philosophical reasons. Laplace also introduced primitive versions of
conjugate prior In Bayesian probability theory, if the posterior distribution p(\theta \mid x) is in the same probability distribution family as the prior probability distribution p(\theta), the prior and posterior are then called conjugate distributions, and th ...
s and the
theorem In mathematics, a theorem is a statement that has been proved, or can be proved. The ''proof'' of a theorem is a logical argument that uses the inference rules of a deductive system to establish that the theorem is a logical consequence of t ...
of
von Mises Mises or von Mises may refer to: * Ludwig von Mises, an Austrian-American economist of the Austrian School, older brother of Richard von Mises ** Mises Institute, or the Ludwig von Mises Institute for Austrian Economics, named after Ludwig von ...
and Bernstein, according to which the posteriors corresponding to initially differing priors ultimately agree, as the number of observations increases. This early Bayesian inference, which used uniform priors following Laplace's
principle of insufficient reason The principle of indifference (also called principle of insufficient reason) is a rule for assigning epistemic probabilities. The principle of indifference states that in the absence of any relevant evidence, agents should distribute their cre ...
, was called "
inverse probability In probability theory, inverse probability is an obsolete term for the probability distribution of an unobserved variable. Today, the problem of determining an unobserved variable (by whatever method) is called inferential statistics, the method o ...
" (because it infers backwards from observations to parameters, or from effects to causesStephen. E. Fienberg, (2006
When did Bayesian Inference become "Bayesian"?
''Bayesian Analysis'', 1 (1), 1–40. See page 5.
). After the 1920s,
inverse probability In probability theory, inverse probability is an obsolete term for the probability distribution of an unobserved variable. Today, the problem of determining an unobserved variable (by whatever method) is called inferential statistics, the method o ...
was largely supplanted by a collection of methods that were developed by Ronald A. Fisher,
Jerzy Neyman Jerzy Neyman (April 16, 1894 – August 5, 1981; born Jerzy Spława-Neyman; ) was a Polish mathematician and statistician who spent the first part of his professional career at various institutions in Warsaw, Poland and then at University Colleg ...
and Egon Pearson. Their methods came to be called
frequentist statistics Frequentist inference is a type of statistical inference based in frequentist probability, which treats “probability” in equivalent terms to “frequency” and draws conclusions from sample-data by means of emphasizing the frequency or pr ...
. Fisher rejected the Bayesian view, writing that "the theory of inverse probability is founded upon an error, and must be wholly rejected". At the end of his life, however, Fisher expressed greater respect for the essay of Bayes, which Fisher believed to have anticipated his own, fiducial approach to probability; Fisher still maintained that Laplace's views on probability were "fallacious rubbish". Neyman started out as a "quasi-Bayesian", but subsequently developed confidence intervals (a key method in frequentist statistics) because "the whole theory would look nicer if it were built from the start without reference to Bayesianism and priors". The word ''Bayesian'' appeared around 1950, and by the 1960s it became the term preferred by those dissatisfied with the limitations of frequentist statistics.Jeff Miller
"Earliest Known Uses of Some of the Words of Mathematics (B)"
"The term Bayesian entered circulation around 1950. R. A. Fisher used it in the notes he wrote to accompany the papers in his Contributions to Mathematical Statistics (1950). Fisher thought Bayes's argument was all but extinct for the only recent work to take it seriously was
Harold Jeffreys Sir Harold Jeffreys, FRS (22 April 1891 – 18 March 1989) was a British mathematician, statistician, geophysicist, and astronomer. His book, ''Theory of Probability'', which was first published in 1939, played an important role in the revival ...
's Theory of Probability (1939). In 1951 L. J. Savage, reviewing Wald's Statistical Decisions Functions, referred to "modern, or unBayesian, statistical theory" ("The Theory of Statistical Decision," ''Journal of the American Statistical Association'', 46, p. 58.). Soon after, however, Savage changed from being an unBayesian to being a Bayesian."
In the 20th century, the ideas of Laplace were further developed in two different directions, giving rise to ''objective'' and ''subjective'' currents in Bayesian practice. In the objectivist stream, the statistical analysis depends on only the model assumed and the data analysed. No subjective decisions need to be involved. In contrast, "subjectivist" statisticians deny the possibility of fully objective analysis for the general case. In the further development of Laplace's ideas, subjective ideas predate objectivist positions. The idea that 'probability' should be interpreted as 'subjective degree of belief in a proposition' was proposed, for example, by
John Maynard Keynes John Maynard Keynes, 1st Baron Keynes, ( ; 5 June 1883 – 21 April 1946), was an English economist whose ideas fundamentally changed the theory and practice of macroeconomics and the economic policies of governments. Originally trained in ...
in the early 1920s. This idea was taken further by
Bruno de Finetti Bruno de Finetti (13 June 1906 – 20 July 1985) was an Italian probabilist statistician and actuary, noted for the "operational subjective" conception of probability. The classic exposition of his distinctive theory is the 1937 "La prévision: ...
in Italy (''Fondamenti Logici del Ragionamento Probabilistico'', 1930) and Frank Ramsey in Cambridge (''The Foundations of Mathematics'', 1931). The approach was devised to solve problems with the frequentist definition of probability but also with the earlier, objectivist approach of Laplace. The subjective Bayesian methods were further developed and popularized in the 1950s by L.J. Savage. Objective Bayesian inference was further developed by
Harold Jeffreys Sir Harold Jeffreys, FRS (22 April 1891 – 18 March 1989) was a British mathematician, statistician, geophysicist, and astronomer. His book, ''Theory of Probability'', which was first published in 1939, played an important role in the revival ...
at the
University of Cambridge The University of Cambridge is a public collegiate research university in Cambridge, England. Founded in 1209 and granted a royal charter by Henry III in 1231, Cambridge is the world's third oldest surviving university and one of its most pr ...
. His seminal book "Theory of probability" first appeared in 1939 and played an important role in the revival of the Bayesian view of probability. In 1957,
Edwin Jaynes Edwin Thompson Jaynes (July 5, 1922 – April 30, 1998) was the Wayman Crow Distinguished Professor of Physics at Washington University in St. Louis. He wrote extensively on statistical mechanics and on foundations of probability and statisti ...
promoted the concept of maximum entropy for constructing priors, which is an important principle in the formulation of objective methods, mainly for discrete problems. In 1965,
Dennis Lindley Dennis Victor Lindley (25 July 1923 – 14 December 2013) was an English statistician, decision theorist and leading advocate of Bayesian statistics. Biography Lindley grew up in the south-west London suburb of Surbiton. He was an only child an ...
's 2-volume work "Introduction to Probability and Statistics from a Bayesian Viewpoint" brought Bayesian methods to a wide audience. In 1979, José-Miguel Bernardo introduced reference analysis, which offers a general applicable framework for objective analysis. Other well-known proponents of Bayesian probability theory include I.J. Good, B.O. Koopman,
Howard Raiffa Howard Raiffa (; January 24, 1924 – July 8, 2016) was an American academic who was the Frank P. Ramsey Professor (Emeritus) of Managerial Economics, a joint chair held by the Business School and Harvard Kennedy School at Harvard University. He w ...
,
Robert Schlaifer Robert Osher Schlaifer (13 September 1914 – 24 July 1994) was a pioneer of Bayesian decision theory. At the time of his death he was William Ziegler Professor of Business Administration Emeritus of the Harvard Business School. In 1961 he was ...
and
Alan Turing Alan Mathison Turing (; 23 June 1912 – 7 June 1954) was an English mathematician, computer scientist, logician, cryptanalyst, philosopher, and theoretical biologist. Turing was highly influential in the development of theoretical co ...
. In the 1980s, there was a dramatic growth in research and applications of Bayesian methods, mostly attributed to the discovery of
Markov chain Monte Carlo In statistics, Markov chain Monte Carlo (MCMC) methods comprise a class of algorithms for sampling from a probability distribution. By constructing a Markov chain that has the desired distribution as its equilibrium distribution, one can obtain ...
methods, which removed many of the
computational problem In theoretical computer science, a computational problem is a problem that may be solved by an algorithm. For example, the problem of factoring :"Given a positive integer ''n'', find a nontrivial prime factor of ''n''." is a computational probl ...
s, and an increasing interest in nonstandard, complex applications. Despite growth of Bayesian research, most undergraduate teaching is still based on frequentist statistics. Nonetheless, Bayesian methods are widely accepted and used, such as for example in the field of
machine learning Machine learning (ML) is a field of inquiry devoted to understanding and building methods that 'learn', that is, methods that leverage data to improve performance on some set of tasks. It is seen as a part of artificial intelligence. Machine ...
.Bishop, C.M. (2007) ''Pattern Recognition and Machine Learning''. Springer


Important contributors to statistics

*
Thomas Bayes Thomas Bayes ( ; 1701 7 April 1761) was an English statistician, philosopher and Presbyterian minister who is known for formulating a specific case of the theorem that bears his name: Bayes' theorem. Bayes never published what would become his ...
* George E. P. Box *
Pafnuty Chebyshev Pafnuty Lvovich Chebyshev ( rus, Пафну́тий Льво́вич Чебышёв, p=pɐfˈnutʲɪj ˈlʲvovʲɪtɕ tɕɪbɨˈʂof) ( – ) was a Russian mathematician and considered to be the founding father of Russian mathematics. Chebyshe ...
*
David R. Cox Sir David Roxbee Cox (15 July 1924 – 18 January 2022) was a British statistician and educator. His wide-ranging contributions to the field of statistics included introducing logistic regression, the proportional hazards model and the Cox pro ...
* Gertrude Cox * Harald Cramér *
Francis Ysidro Edgeworth Francis Ysidro Edgeworth (8 February 1845 – 13 February 1926) was an Anglo-Irish philosopher and political economist who made significant contributions to the methods of statistics during the 1880s. From 1891 onward, he was appointed th ...
*
Bradley Efron Bradley Efron (; born May 24, 1938) is an American statistician. Efron has been president of the American Statistical Association (2004) and of the Institute of Mathematical Statistics (1987–1988).Cochran, J. (1 September 2015), "ASA Lead ...
*
Bruno de Finetti Bruno de Finetti (13 June 1906 – 20 July 1985) was an Italian probabilist statistician and actuary, noted for the "operational subjective" conception of probability. The classic exposition of his distinctive theory is the 1937 "La prévision: ...
* Ronald A. Fisher * Francis Galton *
Carl Friedrich Gauss Johann Carl Friedrich Gauss (; german: Gauß ; la, Carolus Fridericus Gauss; 30 April 177723 February 1855) was a German mathematician and physicist who made significant contributions to many fields in mathematics and science. Sometimes refer ...
*
William Sealey Gosset William Sealy Gosset (13 June 1876 – 16 October 1937) was an English statistician, chemist and brewer who served as Head Brewer of Guinness and Head Experimental Brewer of Guinness and was a pioneer of modern statistics. He pioneered small s ...
("Student") *
Al-Kindi Abū Yūsuf Yaʻqūb ibn ʼIsḥāq aṣ-Ṣabbāḥ al-Kindī (; ar, أبو يوسف يعقوب بن إسحاق الصبّاح الكندي; la, Alkindus; c. 801–873 AD) was an Arab Muslim philosopher, polymath, mathematician, physician ...
*
Andrey Kolmogorov Andrey Nikolaevich Kolmogorov ( rus, Андре́й Никола́евич Колмого́ров, p=ɐnˈdrʲej nʲɪkɐˈlajɪvʲɪtɕ kəlmɐˈɡorəf, a=Ru-Andrey Nikolaevich Kolmogorov.ogg, 25 April 1903 – 20 October 1987) was a Sovi ...
*
Pierre-Simon Laplace Pierre-Simon, marquis de Laplace (; ; 23 March 1749 – 5 March 1827) was a French scholar and polymath whose work was important to the development of engineering, mathematics, statistics, physics, astronomy, and philosophy. He summarized ...
* Erich L. Lehmann *
Aleksandr Lyapunov Aleksandr Mikhailovich Lyapunov (russian: Алекса́ндр Миха́йлович Ляпуно́в, ; – 3 November 1918) was a Russian mathematician, mechanician and physicist. His surname is variously romanized as Ljapunov, Liapunov, Lia ...
*
Anil Kumar Gain Anil Kumar Gain FRSS FCPS (1 February 1919 – 7 February 1978) (also spelt Anil Kumar Gayen) was an Indian mathematician and statistician best known for his works on the Pearson product-moment correlation coefficient in the field of applied ...
*
Prasanta Chandra Mahalanobis Prasanta Chandra Mahalanobis OBE, FNA, FASc, FRS (29 June 1893– 28 June 1972) was an Indian scientist and statistician. He is best remembered for the Mahalanobis distance, a statistical measure, and for being one of the members of the fir ...
*
Abraham De Moivre Abraham de Moivre FRS (; 26 May 166727 November 1754) was a French mathematician known for de Moivre's formula, a formula that links complex numbers and trigonometry, and for his work on the normal distribution and probability theory. He moved ...
*
Jerzy Neyman Jerzy Neyman (April 16, 1894 – August 5, 1981; born Jerzy Spława-Neyman; ) was a Polish mathematician and statistician who spent the first part of his professional career at various institutions in Warsaw, Poland and then at University Colleg ...
*
Florence Nightingale Florence Nightingale (; 12 May 1820 – 13 August 1910) was an English social reformer, statistician and the founder of modern nursing. Nightingale came to prominence while serving as a manager and trainer of nurses during the Crimean War, i ...
* Blaise Pascal * Karl Pearson *
Charles S. Peirce Charles Sanders Peirce ( ; September 10, 1839 – April 19, 1914) was an American philosopher, logician, mathematician and scientist who is sometimes known as "the father of pragmatism". Educated as a chemist and employed as a scientist for t ...
*
Adolphe Quetelet Lambert Adolphe Jacques Quetelet FRSF or FRSE (; 22 February 1796 – 17 February 1874) was a Belgian astronomer, mathematician, statistician and sociologist who founded and directed the Brussels Observatory and was influential in intro ...
* C. R. Rao * Walter A. Shewhart *
Charles Spearman Charles Edward Spearman, FRS (10 September 1863 – 17 September 1945) was an English psychologist known for work in statistics, as a pioneer of factor analysis, and for Spearman's rank correlation coefficient. He also did seminal work on mod ...
* Charles Stein *
Thorvald N. Thiele Thorvald Nicolai Thiele (24 December 1838 – 26 September 1910) was a Danish astronomer and director of the Copenhagen Observatory. He was also an actuary and mathematician, most notable for his work in statistics, interpolation and the three- ...
* John Tukey *
Abraham Wald Abraham Wald (; hu, Wald Ábrahám, yi, אברהם וואַלד;  – ) was a Jewish Hungarian mathematician who contributed to decision theory, geometry, and econometrics and founded the field of statistical sequential analysis. One ...


References


Bibliography

*
Revised version, 2002
* * * Kotz, S., Johnson, N.L. (1992,1992,1997). ''Breakthroughs in Statistics'', Vols I, II, III. Springer , , * * Salsburg, David (2001). '' The Lady Tasting Tea: How Statistics Revolutionized Science in the Twentieth Century''. * * Stigler, Stephen M. (1999) ''Statistics on the Table: The History of Statistical Concepts and Methods''. Harvard University Press. *


External links


JEHPS: Recent publications in the history of probability and statistics


* ttp://www.economics.soton.ac.uk/staff/aldrich/Figures.htm Figures from the History of Probability and Statistics (Univ. of Southampton)
Materials for the History of Statistics (Univ. of York)


* ttp://jeff560.tripod.com/stat.html Earliest Uses of Symbols in Probability and Statisticso
Earliest Uses of Various Mathematical Symbols
{{DEFAULTSORT:History Of Statistics Statistics