
In
genetics
Genetics is the study of genes, genetic variation, and heredity in organisms.Hartl D, Jones E (2005) It is an important branch in biology because heredity is vital to organisms' evolution. Gregor Mendel, a Moravian Augustinians, Augustinian ...
, the phenotype () is the set of observable characteristics or
traits of an
organism
An organism is any life, living thing that functions as an individual. Such a definition raises more problems than it solves, not least because the concept of an individual is also difficult. Many criteria, few of them widely accepted, have be ...
.
The term covers the organism's
morphology (physical form and structure), its
developmental processes, its biochemical and physiological properties, and its
behavior
Behavior (American English) or behaviour (British English) is the range of actions of Individual, individuals, organisms, systems or Artificial intelligence, artificial entities in some environment. These systems can include other systems or or ...
. An organism's phenotype results from two basic factors: the
expression of an organism's genetic code (its
genotype
The genotype of an organism is its complete set of genetic material. Genotype can also be used to refer to the alleles or variants an individual carries in a particular gene or genetic location. The number of alleles an individual can have in a ...
) and the influence of environmental factors. Both factors may interact, further affecting the phenotype. When two or more clearly different phenotypes exist in the same population of a species, the species is called
polymorphic. A well-documented example of polymorphism is
Labrador Retriever coloring; while the coat color depends on many genes, it is clearly seen in the environment as yellow, black, and brown.
Richard Dawkins in 1978
[ and again in his 1982 book '' The Extended Phenotype'' suggested that one can regard bird nests and other built structures such as caddisfly larva cases and ]beaver dam
A beaver dam or beaver impoundment is a dam built by beavers; it creates a pond which protects against predators such as coyotes, alligators, cougars, foxes, eagles, wolves and bears, and holds their food during winter. These structures modify th ...
s as "extended phenotypes".
Wilhelm Johannsen proposed the genotype–phenotype distinction in 1911 to make clear the difference between an organism's hereditary material and what that hereditary material produces. The distinction resembles that proposed by August Weismann (1834–1914), who distinguished between germ plasm (heredity) and somatic cells (the body). More recently in '' The Selfish Gene'' (1976), Dawkins distinguished these concepts as replicators and vehicles.
Definition
Despite its seemingly straightforward definition, the concept of the phenotype has hidden subtleties. It may seem that anything dependent on the genotype
The genotype of an organism is its complete set of genetic material. Genotype can also be used to refer to the alleles or variants an individual carries in a particular gene or genetic location. The number of alleles an individual can have in a ...
is a phenotype, including molecule
A molecule is a group of two or more atoms that are held together by Force, attractive forces known as chemical bonds; depending on context, the term may or may not include ions that satisfy this criterion. In quantum physics, organic chemi ...
s such as RNA
Ribonucleic acid (RNA) is a polymeric molecule that is essential for most biological functions, either by performing the function itself (non-coding RNA) or by forming a template for the production of proteins (messenger RNA). RNA and deoxyrib ...
and protein
Proteins are large biomolecules and macromolecules that comprise one or more long chains of amino acid residue (biochemistry), residues. Proteins perform a vast array of functions within organisms, including Enzyme catalysis, catalysing metab ...
s. Most molecules and structures coded by the genetic material are not visible in the appearance of an organism, yet they are observable (for example by Western blotting) and are thus part of the phenotype; human blood groups are an example. It may seem that this goes beyond the original intentions of the concept with its focus on the (living) organism in itself. Either way, the term phenotype includes inherent traits or characteristics that are observable or traits that can be made visible by some technical procedure.
The term "phenotype" has sometimes been incorrectly used as a shorthand for the phenotypic difference between a mutant and its wild type, which would lead to the false statement that a
"mutation has no phenotype".
Behaviors and their consequences are also phenotypes, since behaviors are observable characteristics. Behavioral phenotypes include cognitive, personality, and behavioral patterns. Some behavioral phenotypes may characterize psychiatric disorders or syndromes.
A phenome is the set of all traits expressed by a cell, tissue, organ, organism
An organism is any life, living thing that functions as an individual. Such a definition raises more problems than it solves, not least because the concept of an individual is also difficult. Many criteria, few of them widely accepted, have be ...
, or species
A species () is often defined as the largest group of organisms in which any two individuals of the appropriate sexes or mating types can produce fertile offspring, typically by sexual reproduction. It is the basic unit of Taxonomy (biology), ...
. The term was first used by Davis in 1949, "We here propose the name ''phenome'' for the sum total of extragenic, non-autoreproductive portions of the cell, whether cytoplasmic or nuclear. The phenome would be the material basis of the phenotype, just as the genome is the material basis of the genotype
The genotype of an organism is its complete set of genetic material. Genotype can also be used to refer to the alleles or variants an individual carries in a particular gene or genetic location. The number of alleles an individual can have in a ...
." Although phenome has been in use for many years, the distinction between the use of phenome and phenotype is problematic. A proposed definition for both terms as the "physical totality of all traits of an organism or of one of its subsystems" was put forth by Mahner and Kary in 1997, who argue that although scientists tend to intuitively use these and related terms in a manner that does not impede research, the terms are not well defined and usage of the terms is not consistent.
Some usages of the term suggest that the phenome of a given organism is best understood as a kind of matrix of data representing physical manifestation of phenotype. For example, discussions led by A. Varki among those who had used the term up to 2003 suggested the following definition: "The body of information describing an organism's phenotypes, under the influences of genetic and environmental factors". Another team of researchers characterize "the human phenome s/nowiki> a multidimensional search space with several neurobiological levels, spanning the proteome, cellular systems (e.g., signaling pathways), neural systems and cognitive and behavioural phenotypes." Plant biologists have begun to explore the phenome in the study of plant physiology. In 2009, a research team demonstrated the feasibility of identifying genotype–phenotype associations using electronic health records (EHRs) linked to DNA biobanks. They called this method phenome-wide association study (PheWAS).
Inspired by the evolution from genotype to genome to pan-genome, a concept of eventually exploring the relationship among pan-phenome, pan-genome, and pan- envirome was proposed in 2023.
Phenotypic variation
Phenotypic variation (due to underlying heritable genetic variation) is a fundamental prerequisite for evolution
Evolution is the change in the heritable Phenotypic trait, characteristics of biological populations over successive generations. It occurs when evolutionary processes such as natural selection and genetic drift act on genetic variation, re ...
by natural selection
Natural selection is the differential survival and reproduction of individuals due to differences in phenotype. It is a key mechanism of evolution, the change in the Heredity, heritable traits characteristic of a population over generation ...
. It is the living organism as a whole that contributes (or not) to the next generation, so natural selection affects the genetic structure of a population indirectly via the contribution of phenotypes. Without phenotypic variation, there would be no evolution by natural selection.
The interaction between genotype and phenotype has often been conceptualized by the following relationship:
:genotype (G) + environment (E) → phenotype (P)
A more nuanced version of the relationship is:
:genotype (G) + environment (E) + genotype & environment interactions (GE) → phenotype (P)
Genotypes often have much flexibility in the modification and expression of phenotypes; in many organisms these phenotypes are very different under varying environmental conditions. The plant '' Hieracium umbellatum'' is found growing in two different habitat
In ecology, habitat refers to the array of resources, biotic factors that are present in an area, such as to support the survival and reproduction of a particular species. A species' habitat can be seen as the physical manifestation of its ...
s in Sweden
Sweden, formally the Kingdom of Sweden, is a Nordic countries, Nordic country located on the Scandinavian Peninsula in Northern Europe. It borders Norway to the west and north, and Finland to the east. At , Sweden is the largest Nordic count ...
. One habitat is rocky, sea-side cliff
In geography and geology, a cliff or rock face is an area of Rock (geology), rock which has a general angle defined by the vertical, or nearly vertical. Cliffs are formed by the processes of weathering and erosion, with the effect of gravity. ...
s, where the plants are bushy with broad leaves and expanded inflorescence
In botany, an inflorescence is a group or cluster of flowers arranged on a plant's Plant stem, stem that is composed of a main branch or a system of branches. An inflorescence is categorized on the basis of the arrangement of flowers on a mai ...
s; the other is among sand dunes where the plants grow prostrate with narrow leaves and compact inflorescences. The habitats alternate along the coast of Sweden and the habitat that the seeds of ''Hieracium umbellatum'' land in, determine the phenotype that grows.
An example of random variation in '' Drosophila'' flies is the number of ommatidia, which may vary (randomly) between left and right eyes in a single individual as much as they do between different genotypes overall, or between clones raised in different environments.
The concept of phenotype can be extended to variations below the level of the gene
In biology, the word gene has two meanings. The Mendelian gene is a basic unit of heredity. The molecular gene is a sequence of nucleotides in DNA that is transcribed to produce a functional RNA. There are two types of molecular genes: protei ...
which affect an organism's fitness. For example, silent mutations that do not change the corresponding amino acid sequence of a gene may change the frequency of guanine
Guanine () (symbol G or Gua) is one of the four main nucleotide bases found in the nucleic acids DNA and RNA, the others being adenine, cytosine, and thymine ( uracil in RNA). In DNA, guanine is paired with cytosine. The guanine nucleoside ...
-cytosine
Cytosine () (symbol C or Cyt) is one of the four nucleotide bases found in DNA and RNA, along with adenine, guanine, and thymine ( uracil in RNA). It is a pyrimidine derivative, with a heterocyclic aromatic ring and two substituents attac ...
base pairs ( GC content). The base pairs have a higher thermal stability (''melting point'') than adenine- thymine, a property that might convey, among organisms living in high-temperature environments, a selective advantage on variants enriched in GC content.
The extended phenotype
Richard Dawkins described a phenotype that included all effects that a gene has on its surroundings, including other organisms, as an extended phenotype, arguing that "An animal's behavior tends to maximize the survival of the genes 'for' that behavior, whether or not those genes happen to be in the body of the particular animal performing it." For instance, an organism such as a beaver modifies its environment by building a beaver dam
A beaver dam or beaver impoundment is a dam built by beavers; it creates a pond which protects against predators such as coyotes, alligators, cougars, foxes, eagles, wolves and bears, and holds their food during winter. These structures modify th ...
; this can be considered an expression of its genes, just as its incisor teeth are—which it uses to modify its environment. Similarly, when a bird feeds a brood parasite such as a cuckoo, it is unwittingly extending its phenotype; and when genes in an orchid
Orchids are plants that belong to the family Orchidaceae (), a diverse and widespread group of flowering plants with blooms that are often colourful and fragrant. Orchids are cosmopolitan plants that are found in almost every habitat on Eart ...
affect orchid bee behavior to increase pollination, or when genes in a peacock affect the copulatory decisions of peahens, again, the phenotype is being extended. Genes are, in Dawkins's view, selected by their phenotypic effects.
Other biologists broadly agree that the extended phenotype concept is relevant, but consider that its role is largely explanatory, rather than assisting in the design of experimental tests.
Genes and phenotypes
Phenotypes are determined by an interaction of genes and the environment, but the mechanism for each gene and phenotype is different. For instance, an albino phenotype may be caused by a mutation in the gene encoding tyrosinase which is a key enzyme in melanin formation. However, exposure to UV radiation can increase melanin production, hence the environment plays a role in this phenotype as well. For most complex phenotypes the precise genetic mechanism remains unknown.
Gene expression plays a crucial role in determining the phenotypes of organisms. The level of gene expression can affect the phenotype of an organism. For example, if a gene that codes for a particular enzyme
An enzyme () is a protein that acts as a biological catalyst by accelerating chemical reactions. The molecules upon which enzymes may act are called substrate (chemistry), substrates, and the enzyme converts the substrates into different mol ...
is expressed at high levels, the organism may produce more of that enzyme and exhibit a particular trait as a result. On the other hand, if the gene is expressed at low levels, the organism may produce less of the enzyme and exhibit a different trait. Gene expression is regulated at various levels and thus each level can affect certain phenotypes, including transcriptional and post-transcriptional regulation.
Changes in the levels of gene expression can be influenced by a variety of factors, such as environmental conditions, genetic variations, and epigenetic modifications. These modifications can be influenced by environmental factors such as diet, stress, and exposure to toxins, and can have a significant impact on an individual's phenotype. Some phenotypes may be the result of changes in gene expression due to these factors, rather than changes in genotype. An experiment involving machine learning
Machine learning (ML) is a field of study in artificial intelligence concerned with the development and study of Computational statistics, statistical algorithms that can learn from data and generalise to unseen data, and thus perform Task ( ...
methods utilizing gene expressions measured from RNA sequencing found that they can contain enough signal to separate individuals in the context of phenotype prediction.
Phenome and phenomics
Although a phenotype is the ensemble of observable characteristics displayed by an organism, the word '' phenome'' is sometimes used to refer to a collection of traits, while the simultaneous study of such a collection is referred to as '' phenomics''. Phenomics is an important field of study because it can be used to figure out which genomic variants affect phenotypes which then can be used to explain things like health, disease, and evolutionary fitness. Phenomics forms a large part of the Human Genome Project.
Phenomics has applications in agriculture. For instance, genomic variations such as drought and heat resistance can be identified through phenomics to create more durable GMOs. Phenomics may be a stepping stone towards personalized medicine, particularly drug therapy. Once the phenomic database has acquired enough data, a person's phenomic information can be used to select specific drugs tailored to the individual.
Large-scale phenotyping and genetic screens
Large-scale genetic screens can identify the genes or mutation
In biology, a mutation is an alteration in the nucleic acid sequence of the genome of an organism, virus, or extrachromosomal DNA. Viral genomes contain either DNA or RNA. Mutations result from errors during DNA or viral replication, ...
s that affect the phenotype of an organism. Analyzing the phenotypes of mutant genes can also aid in determining gene function. Most genetic screens have used microorganisms, in which genes can be easily deleted. For instance, nearly all genes have been deleted in '' E. coli'' and many other bacteria
Bacteria (; : bacterium) are ubiquitous, mostly free-living organisms often consisting of one Cell (biology), biological cell. They constitute a large domain (biology), domain of Prokaryote, prokaryotic microorganisms. Typically a few micr ...
, but also in several eukaryotic model organisms such as baker's yeast and fission yeast. Among other discoveries, such studies have revealed lists of essential genes .
More recently, large-scale phenotypic screens have also been used in animals, e.g. to study lesser understood phenotypes such as behavior
Behavior (American English) or behaviour (British English) is the range of actions of Individual, individuals, organisms, systems or Artificial intelligence, artificial entities in some environment. These systems can include other systems or or ...
. In one screen, the role of mutations in mice were studied in areas including learning and memory
Memory is the faculty of the mind by which data or information is encoded, stored, and retrieved when needed. It is the retention of information over time for the purpose of influencing future action. If past events could not be remembe ...
, circadian rhythm
A circadian rhythm (), or circadian cycle, is a natural oscillation that repeats roughly every 24 hours. Circadian rhythms can refer to any process that originates within an organism (i.e., Endogeny (biology), endogenous) and responds to the env ...
icity, vision, responses to stress, and response to psychostimulants.
This experiment involves the progeny of mice treated with ENU, or N-ethyl-N-nitrosourea, which is a potent mutagen that causes point mutations. The mice were phenotypically screened for alterations in the different behavioral domains in order to find the number of putative mutants (see table for details). Putative mutants are then tested for heritability in order to help determine the inheritance pattern as well as map out the mutations. Once they have been mapped out, cloned, and identified, it can be determined whether a mutation represents a new gene or not.
These experiments show that mutations in the rhodopsin
Rhodopsin, also known as visual purple, is a protein encoded by the ''RHO'' gene and a G-protein-coupled receptor (GPCR). It is a light-sensitive receptor protein that triggers visual phototransduction in rod cells. Rhodopsin mediates dim ...
gene affected vision and can even cause retinal degeneration in mice. The same amino acid
Amino acids are organic compounds that contain both amino and carboxylic acid functional groups. Although over 500 amino acids exist in nature, by far the most important are the 22 α-amino acids incorporated into proteins. Only these 22 a ...
change causes human familial blindness, showing how phenotyping in animals can inform medical diagnostics and possibly therapy.
Evolutionary origin of phenotype
The RNA world is the hypothesized pre-cellular stage in the evolutionary history of life on earth, in which self-replicating RNA molecules proliferated prior to the evolution of DNA and proteins. The folded three-dimensional
In geometry, a three-dimensional space (3D space, 3-space or, rarely, tri-dimensional space) is a mathematical space in which three values (''coordinates'') are required to determine the position (geometry), position of a point (geometry), poi ...
physical structure of the first RNA molecule that possessed ribozyme activity promoting replication while avoiding destruction would have been the first phenotype, and the nucleotide sequence of the first self-replicating RNA molecule would have been the original genotype.[
]
See also
* Bioinformatics
Bioinformatics () is an interdisciplinary field of science that develops methods and Bioinformatics software, software tools for understanding biological data, especially when the data sets are large and complex. Bioinformatics uses biology, ...
* Ecotype
Ecotypes are organisms which belong to the same species but possess different phenotypical features as a result of environmental factors such as elevation, climate and predation. Ecotypes can be seen in wide geographical distributions and may event ...
* Endophenotype
* Genotype–phenotype distinction
* Molecular phenotyping
* Phenomics
* Phenotypic trait
* Physiome
* Physiomics
* Race and genetics
* Systems biology
* List of omics topics in biology
References
External links
Mouse Phenome Database
Human Phenotype Ontology
Europhenome: Access to raw and annotated mouse phenotype data
"Wilhelm Johannsen's Genotype-Phenotype Distinction" by E. Peirson at the Embryo Project Encyclopedia
Mouse Phenome Project
at the Jackson Laboratory
{{Authority control
Classical genetics
Polymorphism (biology)