The look-elsewhere effect is a
phenomenon in the statistical analysis of
scientific experiments where an apparently
statistically significant
In statistical hypothesis testing, a result has statistical significance when it is very unlikely to have occurred given the null hypothesis (simply by chance alone). More precisely, a study's defined significance level, denoted by \alpha, is the p ...
observation may have actually arisen by chance because of the sheer size of the
parameter space to be searched.
Once the possibility of look-elsewhere error in an analysis is acknowledged, it can be compensated for by careful application of standard mathematical techniques.
More generally known in statistics as the
problem of multiple comparisons, the term gained some media attention in 2011, in the context of the search for the
Higgs boson
The Higgs boson, sometimes called the Higgs particle, is an elementary particle in the Standard Model of particle physics produced by the quantum excitation of the Higgs field,
one of the fields in particle physics theory. In the Stand ...
at the
Large Hadron Collider
The Large Hadron Collider (LHC) is the world's largest and highest-energy particle collider. It was built by the European Organization for Nuclear Research (CERN) between 1998 and 2008 in collaboration with over 10,000 scientists and hundred ...
.
Use
Many statistical tests deliver a
p-value
In null-hypothesis significance testing, the ''p''-value is the probability of obtaining test results at least as extreme as the result actually observed, under the assumption that the null hypothesis is correct. A very small ''p''-value means ...
, the probability that a given result could be obtained by chance, assuming the hypothesis one seeks to prove is in fact false. When asking "does ''X'' affect ''Y''?", it is common to vary ''X'' and see if there is significant variation in ''Y'' as a result. If this p-value is less than some predetermined
statistical significance
In statistical hypothesis testing, a result has statistical significance when it is very unlikely to have occurred given the null hypothesis (simply by chance alone). More precisely, a study's defined significance level, denoted by \alpha, is the p ...
threshold ''α'', one considers the result "significant".
However, if one is performing multiple tests ("looking elsewhere" if the first test fails) then a ''p'' value of 1/''n'' is expected to occur once per ''n'' tests. For example, when there is no real effect, an event with ''p'' < 0.05 will still occur once, on average, for each 20 tests performed. In order to compensate for this, you could divide your threshold ''α'' by the number of tests ''n'', so a result is significant when ''p'' < ''α''/''n''. Or, equivalently, multiply the observed ''p'' value by the number of tests (significant when ''np'' < ''α'').
This is a simplified case; the number ''n'' is actually the number of
degrees of freedom
Degrees of freedom (often abbreviated df or DOF) refers to the number of independent variables or parameters of a thermodynamic system. In various scientific fields, the word "freedom" is used to describe the limits to which physical movement or ...
in the tests, or the number of effectively independent tests. If they are not fully independent, the number may be lower than the number of tests.
The look-elsewhere effect is a frequent cause of "significance inflation" when the number of independent tests ''n'' is underestimated because failed tests are not published. One paper may fail to mention alternative hypotheses considered, or a paper producing no result may simply not be published at all, leading to journals dominated by statistical outliers.
Examples
* A Swedish study in 1992 tried to determine whether or not power lines caused some kind of poor health effects. The researchers surveyed everyone living within 300 m of high-voltage power lines over a 25-year period and looked for statistically significant increases in rates of over 800 ailments. The study found that the incidence of childhood leukemia was four times higher among those that lived closest to the power lines, and it spurred calls to action by the Swedish government. The problem with the conclusion, however, was that they failed to compensate for the look-elsewhere effect; in any collection of 800 random samples, it is likely that at least one will be at least 3 standard deviations above the expected value, by chance alone. Subsequent studies failed to show any links between power lines and childhood leukemia, neither in causation nor even in correlation.

* The
Bible Code
The Bible code ( he, הצופן התנ"כי, ), also known as the Torah code, is a purported set of encoded words within a Hebrew text of the Torah that, according to proponents, has predicted significant historical events. The statistical like ...
phenomenon purports to find atypical significant groupings of words predicting future events hidden in text of the
Hebrew Bible
The Hebrew Bible or Tanakh (;["Tanach"](_blank)
''Random House Webster's Unabridged Dictionary''. Hebrew: ''Tān ...
taken as a raw sequence of unspaced letters and arranged into various grids of different proportions. However, as an article in ''
Skeptical Inquirer
''Skeptical Inquirer'' is a bimonthly American general-audience magazine published by the Committee for Skeptical Inquiry (CSI) with the subtitle: ''The Magazine for Science and Reason''.
Mission statement and goals
Daniel Loxton, writing in ...
'' demonstrated,
this amounts to generating vast numbers of grids to examine for patterns or groupings by dividing the full text string into widths of from a few to hundreds of thousands of letters wide, repeating the width for subsequent rows. Each one of those many grids can then in turn be searched further for a wide range of words of interest by skipping in intervals, forward or backward, of an arbitrary x letters in the text (or x+1, x+2, etc.), in a massive
cross product
In mathematics, the cross product or vector product (occasionally directed area product, to emphasize its geometric significance) is a binary operation on two vectors in a three-dimensional oriented Euclidean vector space (named here E), and i ...
of
parameter
A parameter (), generally, is any characteristic that can help in defining or classifying a particular system (meaning an event, project, object, situation, etc.). That is, a parameter is an element of a system that is useful, or critical, when ...
ized possibilities, and an associated coincident word of interest can be any nearby string in an arbitrary skip of x+k or y+k letters, forward or backward, such that the permutational volumes become enormous. Thus, setting aside related questions like
confirmation bias
Confirmation bias is the tendency to search for, interpret, favor, and recall information in a way that confirms or supports one's prior beliefs or values. People display this bias when they select information that supports their views, ignorin ...
, even if no groupings of interest or significance were found in the first grid, the next iteration can be tried by computer and so on until "miraculous" or "improbable" groupings are finally arrived at. This is tantamount in effect to, upon dealing oneself an uninteresting
poker
Poker is a family of comparing card games in which players wager over which hand is best according to that specific game's rules. It is played worldwide, however in some places the rules may vary. While the earliest known form of the game w ...
hand, continuing to do so in whatever great quantities necessary until one obtains a
straight flush
In poker, players form sets of five playing cards, called ''hands'', according to the rules of the game. Each hand has a rank, which is compared against the ranks of other hands participating in the showdown to decide who wins the pot. In hi ...
,
royal flush, or even many such events in sequence, and calling the deck inspired for enabling such a result. The Skeptical Inquirer author was thus able to achieve identical effects simply by applying the same search algorithms both to the English language
King James Bible
The King James Version (KJV), also the King James Bible (KJB) and the Authorized Version, is an English translation of the Christian Bible for the Church of England, which was commissioned in 1604 and published in 1611, by sponsorship of K ...
text in place of the allegedly divinely inspired Hebrew version, and then just as effectively to the mundane and arbitrary example text of the 1987
United States Supreme Court
The Supreme Court of the United States (SCOTUS) is the highest court in the federal judiciary of the United States. It has ultimate appellate jurisdiction over all U.S. federal court cases, and over state court cases that involve a point ...
decision ''
Edwards v. Aguillard
''Edwards v. Aguillard'', 482 U.S. 578 (1987), was a United States Supreme Court case concerning the constitutionality of teaching creationism. The Court considered a Louisiana law requiring that where evolutionary science was taught in publi ...
''.
* The
XKCD
''xkcd'', sometimes styled ''XKCD'', is a webcomic created in 2005 by American author Randall Munroe. The comic's tagline describes it as "a webcomic of romance, sarcasm, math, and language". Munroe states on the comic's website that the nam ...
comi
"Significant"provides a good fictional example of this problem.
See also
*
Bonferroni correction
In statistics, the Bonferroni correction is a method to counteract the multiple comparisons problem.
Background
The method is named for its use of the Bonferroni inequalities.
An extension of the method to confidence intervals was proposed by Oliv ...
*
Data dredging
Data dredging (also known as data snooping or ''p''-hacking) is the misuse of data analysis to find patterns in data that can be presented as statistically significant, thus dramatically increasing and understating the risk of false positives. ...
*
Law of truly large numbers
The law of truly large numbers (a statistical adage), attributed to Persi Diaconis and Frederick Mosteller, states that with a large enough number of independent samples, any highly implausible (i.e. unlikely in any single sample, but with cons ...
: with a sample size large enough, any outrageous thing is likely to happen
*
Littlewood's law __NOTOC__
Littlewood's law states that a person can expect to experience events with odds of one in a million (referred to as a "miracle") at the rate of about one per month. It was framed by British mathematician John Edensor Littlewood.
History
...
: any individual can expect a "miracle" to happen to them at the rate of about one per month
*
Texas sharpshooter fallacy
The Texas sharpshooter fallacy is an informal fallacy which is committed when differences in data are ignored, but similarities are overemphasized. From this reasoning, a false conclusion is inferred. This fallacy is the philosophical or rhetorical ...
*
Multiple comparisons problem
In statistics, the multiple comparisons, multiplicity or multiple testing problem occurs when one considers a set of statistical inferences simultaneously or infers a subset of parameters selected based on the observed values.
The more inferences ...
References
{{reflist, 30em
External links
XKCD comic illustrating the Look-Elsewhere effect
Multiple comparisons