Phylogenetic comparative methods (PCMs) use information on the historical relationships of lineages (
phylogenies) to test evolutionary hypotheses. The comparative method has a long history in evolutionary biology; indeed,
Charles Darwin
Charles Robert Darwin ( ; 12 February 1809 – 19 April 1882) was an English natural history#Before 1900, naturalist, geologist, and biologist, widely known for his contributions to evolutionary biology. His proposition that all speci ...
used differences and similarities between species as a major source of evidence in ''
The Origin of Species
''On the Origin of Species'' (or, more completely, ''On the Origin of Species by Means of Natural Selection, or the Preservation of Favoured Races in the Struggle for Life''),The book's full original title was ''On the Origin of Species by Me ...
''. However, the fact that closely related lineages share many traits and trait combinations as a result of the process of descent with modification means that lineages are not independent. This realization inspired the development of explicitly phylogenetic comparative methods.
Initially, these methods were primarily developed to control for phylogenetic history when testing for
adaptation;
[ however, in recent years the use of the term has broadened to include any use of phylogenies in statistical tests.] Although most studies that employ PCMs focus on extant organisms, many methods can also be applied to extinct taxa and can incorporate information from the fossil record
A fossil (from Classical Latin , ) is any preserved remains, impression, or trace of any once-living thing from a past geological age. Examples include bones, shells, exoskeletons, stone imprints of animals or microbes, objects preserved ...
.
PCMs can generally be divided into two types of approaches: those that infer the evolutionary history of some character (phenotypic
In genetics, the phenotype () is the set of observable characteristics or traits of an organism. The term covers the organism's morphology or physical form and structure, its developmental processes, its biochemical and physiological proper ...
or genetic) across a phylogeny and those that infer the process of evolutionary branching itself ( diversification rates), though there are some approaches that do both simultaneously. Typically the tree that is used in conjunction with PCMs has been estimated independently (see computational phylogenetics
Computational phylogenetics is the application of computational algorithms, methods, and programs to phylogenetic "tribe, clan, race", and wikt:γενετικός, γενετικός [] "origin, source, birth") is the study of the evolutionary hist ...
) such that both the relationships between lineages and the length of branches separating them is assumed to be known.
Applications
Phylogenetic comparative approaches can complement other ways of studying adaptation, such as studying natural populations, experimental studies, and mathematical models. Interspecific comparisons allow researchers to assess the generality of evolutionary phenomena by considering independent evolutionary events. Such an approach is particularly useful when there is little or no variation within species. And because they can be used to explicitly model evolutionary processes occurring over very long time periods, they can provide insight into macroevolutionary questions, once the exclusive domain of paleontology
Paleontology (), also spelled palaeontology or palæontology, is the scientific study of life that existed prior to, and sometimes including, the start of the Holocene epoch (roughly 11,700 years before present). It includes the study of fos ...
.
Phylogenetic comparative methods are commonly applied to such questions as:
* What is the slope
In mathematics, the slope or gradient of a line is a number that describes both the ''direction'' and the ''steepness'' of the line. Slope is often denoted by the letter ''m''; there is no clear answer to the question why the letter ''m'' is used ...
of an allometric scaling relationship?
→ ''Example: how does brain mass vary in relation to body mass?''
* Do different clades
A clade (), also known as a monophyletic group or natural group, is a group of organisms that are monophyletic – that is, composed of a common ancestor and all its lineal descendants – on a phylogenetic tree. Rather than the English term ...
of organisms differ with respect to some phenotypic
In genetics, the phenotype () is the set of observable characteristics or traits of an organism. The term covers the organism's morphology or physical form and structure, its developmental processes, its biochemical and physiological proper ...
trait?
→ ''Example: do canids
Canidae (; from Latin, ''canis'', "dog") is a family (biology), biological family of dog-like carnivorans, colloquially referred to as dogs, and constitutes a clade. A member of this family is also called a canid (). There are three subfamily, ...
have larger hearts than felids?''
* Do groups of species that share a behavioral
Behavior (American English) or behaviour (British English) is the range of actions and mannerisms made by individuals, organisms, systems or artificial entities in some environment. These systems can include other systems or organisms as we ...
or ecological feature (e.g., social system
In sociology, a social system is the patterned network of relationships constituting a coherent whole that exist between individuals, groups, and institutions. It is the formal structure of role and status that can form in a small, stable group. ...
, diet) differ in average phenotype?
→ ''Example: do carnivores have larger home range
A home range is the area in which an animal lives and moves on a periodic basis. It is related to the concept of an animal's territory which is the area that is actively defended. The concept of a home range was introduced by W. H. Burt in 1943. H ...
s than herbivores?''
* What was the ancestral
An ancestor, also known as a forefather, fore-elder or a forebear, is a parent or ( recursively) the parent of an antecedent (i.e., a grandparent, great-grandparent, great-great-grandparent and so forth). ''Ancestor'' is "any person from w ...
state of a trait?
→ ''Example: where did endothermy evolve in the lineage that led to mammals?''
→ ''Example: where, when, and why did placenta
The placenta is a temporary embryonic and later fetal organ (anatomy), organ that begins embryonic development, developing from the blastocyst shortly after implantation (embryology), implantation. It plays critical roles in facilitating nutrien ...
s and viviparity
Among animals, viviparity is development of the embryo inside the body of the parent. This is opposed to oviparity which is a reproductive mode in which females lay developing eggs that complete their development and hatch externally from the m ...
evolve?''
* Does a trait exhibit significant phylogenetic signal in a particular group of organisms? Do certain types of traits tend to "follow phylogeny" more than others?
→ ''Example: are behavioral traits more labile during evolution?''
* Do species differences in life history traits trade-off, as in the so-called fast-slow continuum?
→ ''Example: why do small-bodied species have shorter life spans than their larger relatives?''
Phylogenetically independent contrasts
Felsenstein proposed the first general statistical method in 1985 for incorporating phylogenetic information, i.e., the first that could use any arbitrary topology
In mathematics, topology (from the Greek words , and ) is concerned with the properties of a geometric object that are preserved under continuous deformations, such as stretching, twisting, crumpling, and bending; that is, without closing ho ...
(branching order) and a specified set of branch lengths. The method is now recognized as an algorithm
In mathematics and computer science, an algorithm () is a finite sequence of rigorous instructions, typically used to solve a class of specific problems or to perform a computation. Algorithms are used as specifications for performing ...
that implements a special case of what are termed phylogenetic generalized least-squares models. The logic
Logic is the study of correct reasoning. It includes both formal and informal logic. Formal logic is the science of deductively valid inferences or of logical truths. It is a formal science investigating how conclusions follow from premis ...
of the method is to use phylogenetic information (and an assumed Brownian motion
Brownian motion, or pedesis (from grc, πήδησις "leaping"), is the random motion of particles suspended in a medium (a liquid or a gas).
This pattern of motion typically consists of random fluctuations in a particle's position insi ...
like model of trait evolution) to transform the original tip data (mean values for a set of species) into values that are statistically independent and identically distributed.
The algorithm involves computing values at internal nodes as an intermediate step, but they are generally not used for inferences by themselves. An exception occurs for the basal (root) node, which can be interpreted as an estimate of the ancestral value for the entire tree (assuming that no directional evolutionary trends .g., Cope's rule">Cope's_rule.html" ;"title=".g., Cope's rule">.g., Cope's rulehave occurred) or as a phylogenetically weighted estimate of the mean for the entire set of tip species (terminal taxa). The value at the root is equivalent to that obtained from the "squared-change parsimony" algorithm and is also the maximum likelihood estimate under Brownian motion. The independent contrasts algebra can also be used to compute a standard error or confidence interval.
Phylogenetic generalized least squares (PGLS)
Probably the most commonly used PCM is phylogenetic generalized least squares (PGLS). This approach is used to test whether there is a relationship between two (or more) variables while accounting for the fact that lineage are not independent. The method is a special case of generalized least squares
In statistics, generalized least squares (GLS) is a technique for estimating the unknown parameters in a linear regression model when there is a certain degree of correlation between the residuals in a regression model. In these cases, ordin ...
(GLS) and as such the PGLS estimator is also unbiased, consistent
In classical deductive logic, a consistent theory is one that does not lead to a logical contradiction. The lack of contradiction can be defined in either semantic or syntactic terms. The semantic definition states that a theory is consisten ...
, efficient, and asymptotically normal. In many statistical situations where GLS (or, ordinary least squares
In statistics, ordinary least squares (OLS) is a type of linear least squares method for choosing the unknown parameters in a linear regression model (with fixed level-one effects of a linear function of a set of explanatory variables) by the ...
LS is used residual errors ''ε'' are assumed to be independent and identically distributed random variables that are assumed to be normal
:
whereas in PGLS the errors are assumed to be distributed as
:
where ''V'' is a matrix of expected variance and covariance of the residuals given an evolutionary model and a phylogenetic tree. Therefore, it is the structure of residuals and not the variables themselves that show phylogenetic signal
Phylogenetic signal is an evolutionary and ecological term, that describes the tendency or the pattern of related biological species to resemble each other more than any other species that is randomly picked from the same phylogenetic tree.
C ...
. This has long been a source of confusion in the scientific literature. A number of models have been proposed for the structure of ''V'' such as Brownian motion
Brownian motion, or pedesis (from grc, πήδησις "leaping"), is the random motion of particles suspended in a medium (a liquid or a gas).
This pattern of motion typically consists of random fluctuations in a particle's position insi ...
Ornstein-Uhlenbeck, and Pagel's λ model. (When a Brownian motion model is used, PGLS is identical to the independent contrasts estimator.). In PGLS, the parameters of the evolutionary model are typically co-estimated with the regression parameters.
PGLS can only be applied to questions where the dependent variable
Dependent and independent variables are variables in mathematical modeling, statistical modeling and experimental sciences. Dependent variables receive this name because, in an experiment, their values are studied under the supposition or dema ...
is continuously distributed; however, the phylogenetic tree can also be incorporated into the residual distribution of generalized linear models, making it possible to generalize the approach to a broader set of distributions for the response.
Phylogenetically informed Monte Carlo computer simulations
Martins and Garland proposed in 1991 that one way to account for phylogenetic relations when conducting statistical analyses was to use computer simulations to create many data sets that are consistent with the null hypothesis under test (e.g., no correlation between two traits, no difference between two ecologically defined groups of species) but that mimic evolution along the relevant phylogenetic tree. If such data sets (typically 1,000 or more) are analyzed with the same statistical procedure that is used to analyze a real data set, then results for the simulated data sets can be used to create phylogenetically correct (or "PC") null distributions of the test statistic (e.g., a correlation coefficient, t, F). Such simulation approaches can also be combined with such methods as phylogenetically independent contrasts or PGLS (see above).
See also
* Allometry
Allometry is the study of the relationship of body size to shape, anatomy, physiology and finally behaviour, first outlined by Otto Snell in 1892, by D'Arcy Thompson in 1917 in ''On Growth and Form'' and by Julian Huxley in 1932.
Overview
Allom ...
* Behavioral ecology
Behavioral ecology, also spelled behavioural ecology, is the study of the evolutionary basis for animal behavior due to ecological pressures. Behavioral ecology emerged from ethology after Niko Tinbergen outlined four questions to address w ...
* Biodiversity
Biodiversity or biological diversity is the variety and variability of life on Earth. Biodiversity is a measure of variation at the genetic ('' genetic variability''), species ('' species diversity''), and ecosystem ('' ecosystem diversity' ...
* Bioinformatics
Bioinformatics () is an interdisciplinary field that develops methods and software tools for understanding biological data, in particular when the data sets are large and complex. As an interdisciplinary field of science, bioinformatics combin ...
* Cladistics
Cladistics (; ) is an approach to biological classification in which organisms are categorized in groups (" clades") based on hypotheses of most recent common ancestry. The evidence for hypothesized relationships is typically shared derived ch ...
* Comparative anatomy
Comparative anatomy is the study of similarities and differences in the anatomy of different species. It is closely related to evolutionary biology and phylogeny (the evolution of species).
The science began in the classical era, continuing in ...
* Comparative method
In linguistics, the comparative method is a technique for studying the development of languages by performing a feature-by-feature comparison of two or more languages with common descent from a shared ancestor and then extrapolating backwards ...
in linguistics
* Comparative physiology
* Computational phylogenetics
Computational phylogenetics is the application of computational algorithms, methods, and programs to phylogenetic "tribe, clan, race", and wikt:γενετικός, γενετικός [] "origin, source, birth") is the study of the evolutionary hist ...
* Disk-covering method
* Ecophysiology
* Evolutionary neurobiology
* Evolutionary physiology
* Generalized least squares (GLS)
* Generalized linear model
In statistics, a generalized linear model (GLM) is a flexible generalization of ordinary linear regression. The GLM generalizes linear regression by allowing the linear model to be related to the response variable via a ''link function'' and by ...
* Joe Felsenstein
Joseph "Joe" Felsenstein (born May 9, 1942) is a Professor Emeritus in the Departments of Genome Sciences and Biology at the University of Washington in Seattle. He is best known for his work on phylogenetic inference, and is the author of ''Inferr ...
* Mark Pagel
* Maximum likelihood
In statistics, maximum likelihood estimation (MLE) is a method of estimating the parameters of an assumed probability distribution, given some observed data. This is achieved by maximizing a likelihood function so that, under the assumed sta ...
* Maximum parsimony
* Paul H. Harvey
* Phylogenetics
In biology, phylogenetics (; from Greek φυλή/ φῦλον [] "tribe, clan, race", and wikt:γενετικός, γενετικός [] "origin, source, birth") is the study of the evolutionary history and relationships among or within groups ...
* Phylogenetic reconciliation
* Roderic D.M. Page
Roderic Dugald Morton Page (born 1962) is a New Zealand-born evolutionary biologist at the University of Glasgow, Scotland, and the author of several books. he is professor at the University of Glasgow and was editor of the journal '' Systema ...
* Sexual selection
Sexual selection is a mode of natural selection in which members of one biological sex choose mates of the other sex to mate with (intersexual selection), and compete with members of the same sex for access to members of the opposite sex (in ...
* Statistics
* Systematics
Biological systematics is the study of the diversification of living forms, both past and present, and the relationships among living things through time. Relationships are visualized as evolutionary trees (synonyms: cladograms, phylogenetic t ...
* Theodore Garland Jr.
References
Further reading
* Ackerly, D. D. 1999. Comparative plant ecology and the role of phylogenetic information. Pages 391–413 in M. C. Press, J. D. Scholes, and M. G. Braker, eds. Physiological plant ecology. The 39th symposium of the British Ecological Society held at the University of York 7–9 September 1998. Blackwell Science, Oxford, U.K.
*
*
* Brooks, D. R., and D. A. McLennan. 1991. Phylogeny, ecology, and behavior: a research program in comparative biology. Univ. Chicago Press, Chicago. 434 pp.
*
* Eggleton, P., and R. I. Vane-Wright, eds. 1994. Phylogenetics and ecology. Linnean Society Symposium Series Number 17. Academic Press, London.
* Felsenstein, J. 2004. Inferring phylogenies. Sinauer Associates, Sunderland, Mass. xx + 664 pp.
*
*
*
*
*
*
*
*
* Ives, A. R. 2018. Mixed and phylogenetic models: a conceptual introduction to correlated data. leanpub.com, 125 pp., https://leanpub.com/correlateddata
*
*
*
* Maddison, W. P., and D. R. Maddison. 1992. MacClade. Analysis of phylogeny and character evolution. Version 3. Sinauer Associates, Sunderland, Mass. 398 pp.
* Martins, E. P., ed. 1996. Phylogenies and the comparative method in animal behavior. Oxford University Press, Oxford. 415 pp.
* Erratum Am. Nat. 153:448.
*
*
*
*
* Page, R. D. M., ed. 2003. Tangled trees: phylogeny, cospeciation, and coevolution. University of Chicago Press, Chicago.
*
*
*
*
*
* Rezende, E. L., and Garland, T. Jr. 2003. Comparaciones interespecíficas y métodos estadísticos filogenéticos. Pages 79–98 in F. Bozinovic, ed. Fisiología Ecológica & Evolutiva. Teoría y casos de estudios en animales. Ediciones Universidad Católica de Chile, Santiago
PDF
*
* Ridley, M. 1983. The explanation of organic diversity: The comparative method and adaptations for mating. Clarendon, Oxford, U.K.
*
*
*
*
*
*
*
External links
* ttp://evolution.genetics.washington.edu/phylip/software.html List of phylogeny programs
Phylocomm
Phylogenetic Tools for Comparative Biology
Phylogeny of Sleep website
Tree of Life
Journals
American Naturalist
Behavioral Ecology
Evolution
Evolutionary Ecology Research
Functional Ecology
Journal of Evolutionary Biology
Philosophical Transactions of the Royal Society of London B
Systematic Biology
Software packages (incomplete list)
Analyses of Phylogenetics and Evolution
Comparative Analysis by Independent Contrasts
COMPARE
mvMorph
ouch: Ornstein-Uhlenbeck for Comparative Hypotheses
Phylocom
Laboratories
Ackerly
Bininda-Emonds
Blomberg
Butler
Grafen
Hansen
Harmon
Housworth
Ives
Losos
Mort
Nunn
Oakley
Paradis
Rambaut
Rohlf
Sanderson
{{Portal bar, Evolutionary biology
Phylogenetics