Ground Truth
   HOME

TheInfoList



OR:

Ground truth is information that is known to be real or true, provided by direct observation and measurement (i.e.
empirical evidence Empirical evidence is evidence obtained through sense experience or experimental procedure. It is of central importance to the sciences and plays a role in various other fields, like epistemology and law. There is no general agreement on how the ...
) as opposed to information provided by
inference Inferences are steps in logical reasoning, moving from premises to logical consequences; etymologically, the word '' infer'' means to "carry forward". Inference is theoretically traditionally divided into deduction and induction, a distinct ...
.


Etymology

The ''
Oxford English Dictionary The ''Oxford English Dictionary'' (''OED'') is the principal historical dictionary of the English language, published by Oxford University Press (OUP), a University of Oxford publishing house. The dictionary, which published its first editio ...
'' (s.v. ''ground truth'') records the use of the word ''Groundtruth'' in the sense of 'fundamental truth' from Henry Ellison's poem "The Siberian Exile's Tale", published in 1833.


Usage

The term "ground truth" can be used as a noun, adjective, and verb. * Noun: "ground truth" (no hyphen). Example: "The ground truth is essential for training accurate models." * Adjective: "ground-truth" (hyphenated compound adjective). Example: "We need to use ground-truth data to validate the model." * Verb: "to ground-truth" or "to groundtruth" (
compound verb In linguistics, a compound verb or complex predicate is a multi-word compound that functions as a single verb. One component of the compound is a '' light verb'' or ''vector'', which carries any inflections, indicating tense, mood, or aspect ...
,). Example: "We need to ground-truth the results to ensure their accuracy."


Statistics and machine learning

"Ground truth" may be seen as a conceptual term relative to the knowledge of the truth concerning a specific question. It is the ideal expected result. This is used in
statistical model A statistical model is a mathematical model that embodies a set of statistical assumptions concerning the generation of Sample (statistics), sample data (and similar data from a larger Statistical population, population). A statistical model repre ...
s to prove or disprove
research Research is creative and systematic work undertaken to increase the stock of knowledge. It involves the collection, organization, and analysis of evidence to increase understanding of a topic, characterized by a particular attentiveness to ...
hypotheses A hypothesis (: hypotheses) is a proposed explanation for a phenomenon. A scientific method, scientific hypothesis must be based on observations and make a testable and reproducible prediction about reality, in a process beginning with an educ ...
. The term "ground truthing" refers to the process of gathering the proper objective (provable) data for this test. Compare with
gold standard A gold standard is a backed currency, monetary system in which the standard economics, economic unit of account is based on a fixed quantity of gold. The gold standard was the basis for the international monetary system from the 1870s to the ...
. For example, suppose we are testing a stereo vision system to see how well it can estimate 3D positions. The "ground truth" might be the positions given by a laser rangefinder which is known to be much more accurate than the camera system.
Bayesian spam filtering In statistics, naive (sometimes simple or idiot's) Bayes classifiers are a family of " probabilistic classifiers" which assumes that the features are conditionally independent, given the target class. In other words, a naive Bayes model assumes th ...
is a common example of supervised learning. In this system, the algorithm is manually taught the differences between spam and non-spam. This depends on the ''ground truth'' of the messages used to train the algorithm – inaccuracies in the ground truth will correlate to inaccuracies in the resulting spam/non-spam verdicts.


Remote sensing

In
remote sensing Remote sensing is the acquisition of information about an physical object, object or phenomenon without making physical contact with the object, in contrast to in situ or on-site observation. The term is applied especially to acquiring inform ...
, "ground truth" refers to information collected at the imaged location. Ground truth allows image data to be related to real features and materials on the ground. The collection of ground truth data enables calibration of remote-sensing data, and aids in the interpretation and analysis of what is being sensed. Examples include
cartography Cartography (; from , 'papyrus, sheet of paper, map'; and , 'write') is the study and practice of making and using maps. Combining science, aesthetics and technique, cartography builds on the premise that reality (or an imagined reality) can ...
,
meteorology Meteorology is the scientific study of the Earth's atmosphere and short-term atmospheric phenomena (i.e. weather), with a focus on weather forecasting. It has applications in the military, aviation, energy production, transport, agricultur ...
, analysis of aerial photographs,
satellite imagery Satellite images (also Earth observation imagery, spaceborne photography, or simply satellite photo) are images of Earth collected by imaging satellites operated by governments and businesses around the world. Satellite imaging companies sell im ...
and other techniques in which data are gathered at a distance. More specifically, ground truth may refer to a process in which "
pixel In digital imaging, a pixel (abbreviated px), pel, or picture element is the smallest addressable element in a Raster graphics, raster image, or the smallest addressable element in a dot matrix display device. In most digital display devices, p ...
s" on a
satellite A satellite or an artificial satellite is an object, typically a spacecraft, placed into orbit around a celestial body. They have a variety of uses, including communication relay, weather forecasting, navigation ( GPS), broadcasting, scient ...
image are compared to what is imaged (at the time of capture) in order to verify the contents of the "pixels" in the image (noting that the concept of "pixel" is imaging-system-dependent). In the case of a classified image, supervised classification can help to determine the accuracy of the classification by the remote sensing system which can minimize error in the classification. Ground truth is usually done on site, correlating what is known with surface observations and measurements of various properties of the features of the ground resolution cells under study in the remotely sensed digital image. The process also involves taking geographic coordinates of the ground resolution cell with GPS technology and comparing those with the coordinates of the "pixel" being studied provided by the remote sensing software to understand and analyze the location errors and how it may affect a particular study. Ground truth is important in the initial supervised classification of an image. When the identity and location of land cover types are known through a combination of field work, maps, and personal experience these areas are known as training sites. The spectral characteristics of these areas are used to train the remote sensing software using decision rules for classifying the rest of the image. These decision rules such as Maximum Likelihood Classification, Parallelopiped Classification, and Minimum Distance Classification offer different techniques to classify an image. Additional ground truth sites allow the remote sensor to establish an error matrix that validates the accuracy of the classification method used. Different classification methods may have different percentages of error for a given classification project. It is important that the remote sensor chooses a classification method that works best with the number of classifications used while providing the least amount of error. Ground truth also helps with
atmospheric correction An atmosphere () is a layer of gases that envelop an astronomical object, held in place by the gravity of the object. A planet retains an atmosphere when the gravity is great and the temperature of the atmosphere is low. A stellar atmospher ...
. Since images from satellites have to pass through the atmosphere, they can get distorted because of absorption in the atmosphere. So ground truth can help fully identify objects in satellite photos.


Errors of commission

An example of an error of commission is when a pixel reports the presence of a feature (such a tree) that, in reality, is absent (no tree is actually present). Ground truthing ensures that the error matrices have a higher accuracy percentage than would be the case if no pixels were ground-truthed. This value is the inverse of the user's accuracy, i.e. Commission Error = 1 - user's accuracy.


Errors of omission

An example of an error of omission is when pixels of a certain type, for example, maple trees, are not classified as maple trees. The process of ground-truthing helps to ensure that the pixel is classified correctly and the error matrices are more accurate. This value is the inverse of the producer's accuracy, i.e. Omission Error = 1 - producer's accuracy


Geographical information systems

In GIS the spatial data is modeled as ''field'' (like in remote sensing raster images) or as ''object'' (like in vectorial map representation). They are modeled from the real world (also named ''geographical reality''), typically by a cartographic process (illustrated).
Geographic information system A geographic information system (GIS) consists of integrated computer hardware and Geographic information system software, software that store, manage, Spatial analysis, analyze, edit, output, and Cartographic design, visualize Geographic data ...
s such as GIS, GPS, and GNSS, have become so widespread that the term "ground truth" has taken on special meaning in that context. If the location coordinates returned by a location method such as GPS are an estimate of a location, then the "ground truth" is the actual location on Earth. A smart phone might return a set of estimated location coordinates such as 43.87870,-103.45901. The ground truth being estimated by those coordinates is the tip of George Washington's nose on
Mount Rushmore The Mount Rushmore National Memorial is a National Memorial (United States), national memorial centered on a colossal sculpture carved into the granite face of Mount Rushmore (, or Six Grandfathers) in the Black Hills near Keystone, South Dak ...
. The accuracy of the estimate is the maximum distance between the location coordinates and the ground truth. We could say in this case that the estimate accuracy is 10 meters, meaning that the point on earth represented by the location coordinates is thought to be within 10 meters of George's nose—the ground truth. In slang, the coordinates indicate where we think George Washington's nose is located, and the ground truth is where it really is. In practice a smart phone or hand-held GPS unit is routinely able to estimate the ground truth within 6–10 meters. Specialized instruments can reduce GPS measurement error to under a centimeter.


Military usage

US
military slang Military slang is an array of colloquial terminology used commonly by military personnel, including slang which is unique to or originates with the armed forces. In English-speaking countries, it often takes the form of abbreviations/acronyms or d ...
uses "ground truth" to refer to the facts comprising a tactical situation—as opposed to intelligence reports, mission plans, and other descriptions reflecting the conative or policy-based projections of the industrial·military complex. The term appears in the title of the
Iraq War The Iraq War (), also referred to as the Second Gulf War, was a prolonged conflict in Iraq lasting from 2003 to 2011. It began with 2003 invasion of Iraq, the invasion by a Multi-National Force – Iraq, United States-led coalition, which ...
documentary film '' The Ground Truth'' (2006), and also in military publications, for example '' Stars and Stripes'' saying: "Stripes decided to figure out what the ground truth was in Iraq."


See also

* Baseline (science) *
Calibration In measurement technology and metrology, calibration is the comparison of measurement values delivered by a device under test with those of a calibration standard of known accuracy. Such a standard could be another measurement device of known ...
*
Foundationalism Foundationalism concerns philosophical theories of knowledge resting upon non-inferential justified belief, or some secure foundation of certainty such as a conclusion inferred from a basis of sound premises.Simon Blackburn, ''The Oxford Dict ...


References

{{Reflist


External links


Forestry Organization Remote Sensing Technology Project
(includes an example of an error matrix) Applications of computer vision Automatic identification and data capture Computational linguistics Machine learning task Satellite meteorology