Quasispecies model

The quasispecies model is a description of the process of the Darwinian evolution of certain self-replicating entities within the framework of physical chemistry. A quasispecies is a large group or "cloud" of related genotypes that exist in an environment of high mutation rate (at stationary state), where a large fraction of offspring are expected to contain one or more mutations relative to the parent. This is in contrast to a species, which from an evolutionary perspective is a more-or-less stable single genotype, most of the offspring of which will be genetically accurate copies.

The model is useful mainly in providing a qualitative understanding of the evolutionary processes of self-replicating macromolecules such as RNA or DNA or simple asexual organisms such as bacteria or viruses (see also viral quasispecies), and is helpful in explaining something of the early stages of the origin of life. Quantitative predictions based on this model are difficult because the parameters that serve as its input are impossible to obtain from actual biological systems. The quasispecies model was put forward by Manfred Eigen and Peter Schuster based on initial work done by Eigen.


Simplified explanation

When evolutionary biologists describe competition between species, they generally assume that each species is a single genotype whose descendants are mostly accurate copies. (Such genotypes are said to have a high reproductive ''fidelity''.) In evolutionary terms, we are interested in the behavior and fitness of that one species or genotype over time.

Some organisms or genotypes, however, may exist in circumstances of low fidelity, where most descendants contain one or more mutations. A group of such genotypes is constantly changing, so discussions of which single genotype is the most fit become meaningless. Importantly, if many closely related genotypes are only one mutation away from each other, then genotypes in the group can mutate back and forth into each other. For example, with one mutation per generation, a child of the sequence AGGT could be AGTT, and a grandchild could be AGGT again. Thus we can envision a "cloud" of related genotypes that is rapidly mutating, with sequences going back and forth among different points in the cloud. Though the proper definition is mathematical, that cloud, roughly speaking, is a quasispecies.

Quasispecies behavior arises in large populations within a certain (high) range of mutation rates.


Quasispecies, fitness, and evolutionary selection

In a species, though reproduction may be mostly accurate, periodic mutations will give rise to one or more competing genotypes. If a mutation results in greater replication and survival, the mutant genotype may out-compete the parent genotype and come to dominate the species. Thus, the individual genotypes (or species) may be seen as the units on which selection acts and biologists will often speak of a single genotype's fitness. In a quasispecies, however, mutations are ubiquitous and so the fitness of an individual genotype becomes meaningless: if one particular mutation generates a boost in reproductive success, it can't amount to much because that genotype's offspring are unlikely to be accurate copies with the same properties. Instead, what matters is the ''connectedness'' of the cloud. For example, the sequence AGGT has 12 (3+3+3+3) possible single point mutants AGGA, AGGG, and so on. If 10 of those mutants are viable genotypes that may reproduce (and some of whose offspring or grandchildren may mutate back into AGGT again), we would consider that sequence a well-connected node in the cloud. If instead only two of those mutants are viable, the rest being lethal mutations, then that sequence is poorly connected and most of its descendants will not reproduce. The analog of fitness for a quasispecies is the tendency of nearby relatives within the cloud to be well-connected, meaning that more of the mutant descendants will be viable and give rise to further descendants within the cloud. When the fitness of a single genotype becomes meaningless because of the high rate of mutations, the cloud as a whole or quasispecies becomes the natural unit of selection.
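The counting argument above can be checked with a short Python sketch. The function names and the `viable` set are hypothetical, introduced only for illustration; the sketch assumes a four-letter alphabet and considers point substitutions only.

```python
BASES = "ACGT"  # assumed four-letter alphabet

def single_point_mutants(seq, alphabet=BASES):
    """Return every sequence that differs from seq at exactly one position."""
    mutants = []
    for pos, original in enumerate(seq):
        for base in alphabet:
            if base != original:
                mutants.append(seq[:pos] + base + seq[pos + 1:])
    return mutants

def connectedness(seq, viable):
    """Count the one-step neighbours of seq that are in the viable set."""
    return sum(1 for m in single_point_mutants(seq) if m in viable)

mutants = single_point_mutants("AGGT")
print(len(mutants))  # 3 alternatives at each of 4 positions

# Hypothetical viable set: only two neighbours can reproduce,
# so AGGT would be a poorly connected node.
print(connectedness("AGGT", {"AGGA", "AGGG"}))
```

A sequence of length L over four bases always has 3L single point mutants, so connectedness here is bounded above by 12.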


Application to biological research

Quasispecies models describe the evolution of high-mutation-rate viruses such as HIV, and sometimes of single genes or molecules within the genomes of other organisms. Quasispecies models have also been proposed by Jose Fontanari and Emmanuel David Tannenbaum to model the evolution of sexual reproduction. Quasispecies behavior has also been shown in compositional replicators (based on the GARD model for abiogenesis), and the model has been suggested as a description of cell replication, which among other things requires the maintenance and evolution of the internal composition of the parent and bud.


Formal background

The model rests on four assumptions:
# The self-replicating entities can be represented as sequences composed of a small number of building blocks, for example sequences of RNA consisting of the four bases adenine, guanine, cytosine, and uracil.
# New sequences enter the system solely as the result of a copy process, either correct or erroneous, of other sequences that are already present.
# The substrates, or raw materials, necessary for ongoing replication are always present in sufficient quantity. Excess sequences are washed away in an outgoing flux.
# Sequences may decay into their building blocks. The probability of decay does not depend on the sequences' age; old sequences are just as likely to decay as young sequences.

In the quasispecies model, mutations occur through errors made in the process of copying already existing sequences. Further, selection arises because different types of sequences tend to replicate at different rates, which leads to the suppression of sequences that replicate more slowly in favor of sequences that replicate faster. However, the quasispecies model does not predict the ultimate extinction of all but the fastest replicating sequence. Although the sequences that replicate more slowly cannot sustain their abundance level by themselves, they are constantly replenished as sequences that replicate faster mutate into them. At equilibrium, removal of slowly replicating sequences due to decay or outflow is balanced by replenishing, so that even relatively slowly replicating sequences can remain present in finite abundance.

Due to the ongoing production of mutant sequences, selection does not act on single sequences, but on mutational "clouds" of closely related sequences, referred to as ''quasispecies''. In other words, the evolutionary success of a particular sequence depends not only on its own replication rate, but also on the replication rates of the mutant sequences it produces, and on the replication rates of the sequences of which it is a mutant. As a consequence, the sequence that replicates fastest may even disappear completely in selection-mutation equilibrium, in favor of more slowly replicating sequences that are part of a quasispecies with a higher average growth rate. Mutational clouds as predicted by the quasispecies model have been observed in RNA viruses and in ''in vitro'' RNA replication.

The mutation rate and the general fitness of the molecular sequences and their neighbors are crucial to the formation of a quasispecies. If the mutation rate is zero, there is no exchange by mutation, and each sequence is its own species. If the mutation rate is too high, exceeding what is known as the error threshold, the quasispecies will break down and be dispersed over the entire range of available sequences.
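As a rough illustration of mutation-selection balance and of the error threshold, the following Python sketch uses a deliberately reduced two-class system. This is an assumption made for illustration, not the full model: a fast "master" sequence copies itself with a given fidelity, all erroneous copies fall into a single slower "mutant" class, and back mutation is ignored. The function name and parameter values are hypothetical.

```python
def equilibrium_master_fraction(a_master, a_mutant, fidelity, generations=2000):
    """Iterate the two-class system and return the master's final share.

    Assumptions (illustrative only): erroneous copies of the master all
    join one mutant class, and mutants never mutate back to the master.
    """
    x = 0.5  # initial fraction of the population that is master sequence
    for _ in range(generations):
        master = a_master * fidelity * x
        mutant = a_master * (1 - fidelity) * x + a_mutant * (1 - x)
        x = master / (master + mutant)  # renormalize to a fraction
    return x

# High fidelity: the slower mutant class persists at finite abundance
# because the master keeps mutating into it.
print(equilibrium_master_fraction(2.0, 1.0, 0.9))

# Low fidelity (a_master * fidelity < a_mutant): the master is lost.
print(equilibrium_master_fraction(2.0, 1.0, 0.4))
```

Under these assumptions the fixed point is (a_master * fidelity - a_mutant) / (a_master - a_mutant) whenever that value is positive, and the master disappears otherwise. In the full model with many sequences, the population above the threshold spreads over sequence space instead of collapsing into a single mutant class.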


Mathematical description

A simple mathematical model for a quasispecies is as follows: let there be S possible sequences and let there be n_i organisms with sequence ''i''. Suppose that each of these organisms asexually gives rise to A_i offspring. Some are duplicates of their parent, having sequence ''i'', but some are mutants and have some other sequence. Let the mutation rate q_{ij} correspond to the probability that a ''j''-type parent will produce an ''i''-type organism. Then the expected number of ''i''-type offspring produced by each ''j''-type organism is w_{ij}=A_j q_{ij}, where \sum_i q_{ij}=1.

Then the total number of ''i''-type organisms after the first round of reproduction, given as n'_i, is
:n'_i=\sum_j w_{ij}n_j
Sometimes a death rate term D_i is included so that
:w_{ij}=A_j q_{ij}-D_i\delta_{ij}
where \delta_{ij} is equal to 1 when i=j and is zero otherwise. Note that the ''n''-th generation can be found by taking the ''n''-th power of W and substituting it in place of W in the above formula.

This is just a system of linear equations. The usual way to solve such a system is to first diagonalize the W matrix. Its diagonal entries will be the eigenvalues of W, and the corresponding eigenvectors are linear combinations of certain subsets of sequences. These subsets of sequences are the quasispecies. Assuming that the matrix W is a primitive matrix (irreducible and aperiodic), then after very many generations only the eigenvector with the largest eigenvalue will prevail, and it is this quasispecies that will eventually dominate. The components of this eigenvector give the relative abundance of each sequence at equilibrium.
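The iteration described above can be sketched in plain Python. The helper names and the two-sequence parameter values below are hypothetical and chosen only for illustration; the sketch builds W from A, q and an optional D, then applies it repeatedly, rescaling each generation so that only the relative abundances are tracked.

```python
def build_w(A, q, D=None):
    """w[i][j] = A[j] * q[i][j] - D[i] * delta_ij."""
    S = len(A)
    D = D or [0.0] * S
    return [[A[j] * q[i][j] - (D[i] if i == j else 0.0) for j in range(S)]
            for i in range(S)]

def next_generation(w, n):
    """One round of reproduction: n'_i = sum_j w[i][j] * n[j]."""
    return [sum(w[i][j] * n[j] for j in range(len(n))) for i in range(len(w))]

def relative_abundance(w, n0, generations):
    """Apply W repeatedly, rescaling so the entries always sum to 1."""
    n = list(n0)
    for _ in range(generations):
        n = next_generation(w, n)
        total = sum(n)
        n = [v / total for v in n]  # rescaling does not change the direction
    return n

# Hypothetical two-sequence system: sequence 0 replicates twice as fast,
# and each copy lands on the other sequence with probability 0.1.
A = [2.0, 1.0]
q = [[0.9, 0.1],
     [0.1, 0.9]]  # q[i][j]: parent j yields offspring i; columns sum to 1
w = build_w(A, q)
x = relative_abundance(w, [1.0, 1.0], 200)
print(x)                            # approximate dominant eigenvector
print(sum(next_generation(w, x)))   # approximate largest eigenvalue
```

This is power iteration, which converges to the dominant eigenvector when W is primitive. For larger systems a numerical eigenvalue routine would usually be preferable.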


Note about primitive matrices

W being primitive means that for some integer n > 0, the ''n''-th power of W is > 0, i.e. all of its entries are positive. If W is primitive, then each type can, through a sequence of mutations (i.e. powers of W), mutate into every other type after some number of generations. W is not primitive if it is periodic, where the population can perpetually cycle through different disjoint sets of compositions, or if it is reducible, where the dominant species (or quasispecies) that develops can depend on the initial population, as is the case in the simple example given below.
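The definition suggests a direct test, sketched here in Python under the assumption that W has no negative entries (so no death terms large enough to make a diagonal entry negative). The function name is hypothetical. The sketch relies on Wielandt's bound: a non-negative S-by-S matrix is primitive exactly when its power (S-1)^2 + 1 is entrywise positive, so only finitely many powers need checking, and only the pattern of zero and non-zero entries matters.

```python
def is_primitive(w):
    """Return True if some power of the non-negative matrix w is all positive."""
    S = len(w)
    # Only the zero/non-zero pattern matters for a non-negative matrix.
    pattern = [[w[i][j] > 0 for j in range(S)] for i in range(S)]
    power = [row[:] for row in pattern]
    for _ in range((S - 1) ** 2 + 1):  # Wielandt's bound on the exponent
        if all(all(row) for row in power):
            return True
        # Boolean matrix product: power <- power * pattern
        power = [[any(power[i][k] and pattern[k][j] for k in range(S))
                  for j in range(S)] for i in range(S)]
    return False

print(is_primitive([[0.9, 0.1], [0.1, 0.9]]))  # every type reaches every type
print(is_primitive([[0.0, 1.0], [1.0, 0.0]]))  # periodic: swaps forever
print(is_primitive([[1.0, 0.0], [0.0, 1.0]]))  # reducible: no exchange at all
```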


Alternative formulations

The quasispecies formulae may be expressed as a set of linear differential equations. If we consider the difference between the new state n'_i and the old state n_i to be the state change over one moment of time, then we can state that the time derivative of n_i is given by this difference, \dot{n}_i = n'_i-n_i, and we can write:
:\dot{n}_i=\sum_j w_{ij}n_j-n_i
The quasispecies equations are usually expressed in terms of concentrations x_i where
:x_i\ \stackrel{\mathrm{def}}{=}\ \frac{n_i}{\sum_j n_j}
:x'_i\ \stackrel{\mathrm{def}}{=}\ \frac{n'_i}{\sum_j n'_j}
The above equations for the quasispecies then become, for the discrete version:
:x'_i = \frac{\sum_j w_{ij}x_j}{\sum_{k,j} w_{kj}x_j}
or, for the continuum version:
:\dot{x}_i =\sum_j w_{ij}x_j-x_i\sum_{k,j}w_{kj}x_j
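The step from the equation for the counts to the continuum equation for the concentrations is not spelled out in the text. One way to fill it in, writing N for the total population (a symbol introduced here only for the derivation) and using k as a second summation index, is:

```latex
\begin{align}
x_i &= \frac{n_i}{N}, \qquad N = \sum_k n_k \\
\dot{x}_i &= \frac{\dot{n}_i}{N} - x_i\,\frac{\dot{N}}{N} \\
\frac{\dot{n}_i}{N} &= \sum_j w_{ij} x_j - x_i \\
\frac{\dot{N}}{N} &= \sum_{k,j} w_{kj} x_j - 1 \\
\dot{x}_i &= \Big(\sum_j w_{ij} x_j - x_i\Big) - x_i\Big(\sum_{k,j} w_{kj} x_j - 1\Big)
           = \sum_j w_{ij} x_j - x_i \sum_{k,j} w_{kj} x_j
\end{align}
```

The two terms in x_i cancel, which is why the continuum equation for the concentrations contains no bare -x_i term even though the equation for the counts contains -n_i.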


Simple example

The quasispecies concept can be illustrated by a simple system consisting of 4 sequences. Sequences [0,0], [0,1], [1,0], and [1,1] are numbered 1, 2, 3, and 4, respectively. Suppose the [0,0] sequence never mutates and always produces a single offspring. Suppose the other 3 sequences all produce, on average, 1-k replicas of themselves, and k of each of the other two types, where 0\le k\le 1. The W matrix is then:
:\mathbf{W}= \begin{bmatrix} 1&0&0&0\\ 0&1-k&k&k\\ 0&k&1-k&k\\ 0&k&k&1-k \end{bmatrix}
The diagonalized matrix is:
:\mathbf{W'}= \begin{bmatrix} 1-2k&0&0&0\\ 0&1-2k&0&0\\ 0&0&1&0\\ 0&0&0&1+k \end{bmatrix}
And the eigenvectors corresponding to these eigenvalues are:
:1-2k: [0,-1,0,1]
:1-2k: [0,-1,1,0]
:1: [1,0,0,0]
:1+k: [0,1,1,1]
For k > 0, only the eigenvalue 1+k is more than unity. For the ''n''-th generation, the corresponding eigenvalue will be (1+k)^n and so will increase without bound as time goes by. This eigenvalue corresponds to the eigenvector [0,1,1,1], which represents the quasispecies consisting of sequences 2, 3, and 4, which will be present in equal numbers after a very long time. Since all population numbers must be positive, the first two quasispecies are not legitimate. The third quasispecies consists of only the non-mutating sequence 1. It is seen that even though sequence 1 is the most fit in the sense that it reproduces more of itself than any other sequence, the quasispecies consisting of the other three sequences will eventually dominate (assuming that the initial population was not homogeneous of the sequence 1 type).
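The example can be checked numerically with a small Python sketch. The value k = 0.25, the initial populations, and the function names are arbitrary choices made for illustration.

```python
def example_w(k):
    """W matrix of the four-sequence example; sequence 1 never mutates."""
    return [
        [1.0, 0.0, 0.0, 0.0],
        [0.0, 1 - k, k, k],
        [0.0, k, 1 - k, k],
        [0.0, k, k, 1 - k],
    ]

def evolve(w, n0, generations):
    """Apply n' = W n the given number of times."""
    n = list(n0)
    for _ in range(generations):
        n = [sum(w[i][j] * n[j] for j in range(len(n))) for i in range(len(w))]
    return n

def fractions(n):
    """Convert population counts to relative abundances."""
    total = sum(n)
    return [v / total for v in n]

w = example_w(0.25)

# Start with sequence 1 heavily favoured and a single copy of sequence 2.
n = evolve(w, [100.0, 1.0, 0.0, 0.0], 200)
print(fractions(n))  # sequences 2, 3, 4 approach equal shares

# A population made only of sequence 1 never leaves that state.
print(evolve(w, [1.0, 0.0, 0.0, 0.0], 200))
```

The second call shows why the outcome depends on the initial population: W is reducible here, so sequence 1 and the cloud of sequences 2 to 4 never exchange members.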

