Bacillus coli
   HOME

TheInfoList



OR:

''Escherichia coli'' (),Wells, J. C. (2000) Longman Pronunciation Dictionary. Harlow ngland Pearson Education Ltd. also known as ''E. coli'' (), is a
Gram-negative Gram-negative bacteria are bacteria that do not retain the crystal violet stain used in the Gram staining method of bacterial differentiation. They are characterized by their cell envelopes, which are composed of a thin peptidoglycan cell wa ...
, facultative anaerobic,
rod-shaped A bacillus (), also called a bacilliform bacterium or often just a rod (when the context makes the sense clear), is a rod-shaped bacterium or archaeon. Bacilli are found in many different taxonomic groups of bacteria. However, the name '' Baci ...
, coliform bacterium of the genus ''
Escherichia ''Escherichia'' () is a genus of Gram-negative, non- spore-forming, facultatively anaerobic, rod-shaped bacteria from the family Enterobacteriaceae. In those species which are inhabitants of the gastrointestinal tracts of warm-blooded animals, ...
'' that is commonly found in the lower intestine of
warm-blooded Warm-blooded is an informal term referring to animal species which can maintain a body temperature higher than their environment. In particular, homeothermic species maintain a stable body temperature by regulating metabolic processes. The onl ...
organisms. Most ''E. coli'' strains are harmless, but some serotypes ( EPEC, ETEC etc.) can cause serious food poisoning in their hosts, and are occasionally responsible for food contamination incidents that prompt product recalls. Most strains do not cause disease in humans and are part of the normal microbiota of the gut; such strains are harmless or even beneficial to humans (although these strains tend to be less studied than the pathogenic ones). For example, some strains of ''E. coli'' benefit their hosts by producing vitamin K2 or by preventing the colonization of the intestine by pathogenic bacteria. These mutually beneficial relationships between ''E. coli'' and humans are a type of mutualistic biological relationship — where both the humans and the ''E. coli'' are benefitting each other. ''E. coli'' is expelled into the environment within fecal matter. The bacterium grows massively in fresh faecal matter under aerobic conditions for three days, but its numbers decline slowly afterwards. ''E. coli'' and other facultative anaerobes constitute about 0.1% of gut microbiota, and fecal–oral transmission is the major route through which pathogenic strains of the bacterium cause disease. Cells are able to survive outside the body for a limited amount of time, which makes them potential
indicator organism Indicator organisms are used as a proxy to monitor conditions in a particular environment, ecosystem, area, habitat, or consumer product. Certain bacteria, fungi and helminth eggs are being used for various purposes. Types Indicator bacteria ...
s to test environmental samples for fecal contamination. A growing body of research, though, has examined environmentally persistent ''E. coli'' which can survive for many days and grow outside a host. The bacterium can be grown and cultured easily and inexpensively in a laboratory setting, and has been intensively investigated for over 60 years. ''E. coli'' is a
chemoheterotroph A Chemotroph is an organism that obtains energy by the oxidation of electron donors in their environments. These molecules can be organic (chemoorganotrophs) or inorganic ( chemolithotrophs). The chemotroph designation is in contrast to phototro ...
whose chemically defined medium must include a source of carbon and energy. ''E. coli'' is the most widely studied
prokaryotic A prokaryote () is a single-celled organism that lacks a nucleus and other membrane-bound organelles. The word ''prokaryote'' comes from the Greek πρό (, 'before') and κάρυον (, 'nut' or 'kernel').Campbell, N. "Biology:Concepts & Connec ...
model organism, and an important species in the fields of
biotechnology Biotechnology is the integration of natural sciences and engineering sciences in order to achieve the application of organisms, cells, parts thereof and molecular analogues for products and services. The term ''biotechnology'' was first used ...
and microbiology, where it has served as the
host organism In biology and medicine, a host is a larger organism that harbours a smaller organism; whether a parasitic, a mutualistic, or a commensalist ''guest'' (symbiont). The guest is typically provided with nourishment and shelter. Examples include a ...
for the majority of work with
recombinant DNA Recombinant DNA (rDNA) molecules are DNA molecules formed by laboratory methods of genetic recombination (such as molecular cloning) that bring together genetic material from multiple sources, creating sequences that would not otherwise be fo ...
. Under favourable conditions, it takes as little as 20 minutes to reproduce.


Biology and biochemistry


Type and morphology

''E. coli'' is a
Gram-negative Gram-negative bacteria are bacteria that do not retain the crystal violet stain used in the Gram staining method of bacterial differentiation. They are characterized by their cell envelopes, which are composed of a thin peptidoglycan cell wa ...
,
facultative anaerobe A facultative anaerobic organism is an organism that makes ATP by aerobic respiration if oxygen is present, but is capable of switching to fermentation if oxygen is absent. Some examples of facultatively anaerobic bacteria are '' Staphylococc ...
, nonsporulating coliform bacterium. Cells are typically rod-shaped, and are about 2.0 μm long and 0.25–1.0 μm in diameter, with a cell volume of 0.6–0.7 μm3. Antibiotics can effectively treat ''E. coli'' infections outside the digestive tract and most intestinal infections but are not used to treat intestinal infections by one strain of these bacteria. The flagella which allow the bacteria to swim have a
peritrichous A flagellum (; ) is a hairlike appendage that protrudes from certain plant and animal sperm cells, and from a wide range of microorganisms to provide motility. Many protists with flagella are termed as flagellates. A microorganism may have f ...
arrangement. It also attaches and effaces to the microvilli of the intestines via an adhesion molecule known as
intimin Intimin is a virulence factor ( adhesin) of EPEC (''e.g.'' ''E. coli'' O127:H6) and EHEC (''e.g. E. coli'' O157:H7) '' E. coli'' strains. It is an attaching and effacing (A/E) protein, which with other virulence factors is necessary and respons ...
.


Metabolism

''E. coli'' can live on a wide variety of substrates and uses
mixed acid fermentation In biochemistry, mixed acid fermentation is the metabolic process by which a six-carbon sugar (e.g. glucose, ) is converted into a complex and variable mixture of acids. It is an anaerobic (non-oxygen-requiring) fermentation reaction that is ...
in anaerobic conditions, producing lactate,
succinate Succinic acid () is a dicarboxylic acid with the chemical formula (CH2)2(CO2H)2. The name derives from Latin ''succinum'', meaning amber. In living organisms, succinic acid takes the form of an anion, succinate, which has multiple biological ro ...
,
ethanol Ethanol (abbr. EtOH; also called ethyl alcohol, grain alcohol, drinking alcohol, or simply alcohol) is an organic compound. It is an alcohol with the chemical formula . Its formula can be also written as or (an ethyl group linked to a ...
, acetate, and
carbon dioxide Carbon dioxide ( chemical formula ) is a chemical compound made up of molecules that each have one carbon atom covalently double bonded to two oxygen atoms. It is found in the gas state at room temperature. In the air, carbon dioxide is trans ...
. Since many pathways in mixed-acid fermentation produce
hydrogen Hydrogen is the chemical element with the symbol H and atomic number 1. Hydrogen is the lightest element. At standard conditions hydrogen is a gas of diatomic molecules having the formula . It is colorless, odorless, tasteless, non-toxic ...
gas, these pathways require the levels of hydrogen to be low, as is the case when ''E. coli'' lives together with hydrogen-consuming organisms, such as methanogens or sulphate-reducing bacteria. In addition, ''E. coli''s metabolism can be rewired to solely use CO2 as the source of carbon for biomass production. In other words, this obligate heterotroph's metabolism can be altered to display autotrophic capabilities by heterologously expressing carbon fixation genes as well as formate dehydrogenase and conducting laboratory evolution experiments. This may be done by using formate to reduce electron carriers and supply the ATP required in anabolic pathways inside of these synthetic autotrophs. ''E. coli'' have three native glycolytic pathways: EMPP, EDP, and OPPP. The EMPP employs ten enzymatic steps to yield two pyruvates, two ATP, and two
NADH Nicotinamide adenine dinucleotide (NAD) is a coenzyme central to metabolism. Found in all living cells, NAD is called a dinucleotide because it consists of two nucleotides joined through their phosphate groups. One nucleotide contains an aden ...
per
glucose Glucose is a simple sugar with the molecular formula . Glucose is overall the most abundant monosaccharide, a subcategory of carbohydrates. Glucose is mainly made by plants and most algae during photosynthesis from water and carbon dioxide, u ...
molecule while OPPP serves as an oxidation route for NADPH synthesis. Although the EDP is the more thermodynamically favourable of the three pathways, ''E. coli'' do not use the EDP for
glucose metabolism Carbohydrate metabolism is the whole of the biochemical processes responsible for the metabolic formation, breakdown, and interconversion of carbohydrates in living organisms. Carbohydrates are central to many essential metabolic pathways. Pla ...
, relying mainly on the EMPP and the OPPP. The EDP mainly remains inactive except for during growth with
gluconate Gluconic acid is an organic compound with molecular formula C6H12O7 and condensed structural formula HOCH2(CHOH)4COOH. It is one of the 16 stereoisomers of 2,3,4,5,6-pentahydroxyhexanoic acid. In aqueous solution at neutral pH, gluconic acid f ...
.


Catabolite repression

When growing in the presence of a mixture of sugars, bacteria will often consume the sugars sequentially through a process known as catabolite repression. By repressing the expression of the genes involved in metabolizing the less preferred sugars, cells will usually first consume the sugar yielding the highest growth rate, followed by the sugar yielding the next highest growth rate, and so on. In doing so the cells ensure that their limited metabolic resources are being used to maximize the rate of growth. The well-used example of this with ''E. coli'' involves the growth of the bacterium on glucose and lactose, where ''E. coli'' will consume glucose before lactose. Catabolite repression has also been observed in ''E. coli'' in the presence of other non-glucose sugars, such as arabinose and
xylose Xylose ( grc, ξύλον, , "wood") is a sugar first isolated from wood, and named for it. Xylose is classified as a monosaccharide of the aldopentose type, which means that it contains five carbon atoms and includes an aldehyde functional g ...
, sorbitol, rhamnose, and
ribose Ribose is a simple sugar and carbohydrate with molecular formula C5H10O5 and the linear-form composition H−(C=O)−(CHOH)4−H. The naturally-occurring form, , is a component of the ribonucleotides from which RNA is built, and so this compo ...
. In ''E. coli'', glucose catabolite repression is regulated by the
phosphotransferase system PEP group translocation, also known as the phosphotransferase system or PTS, is a distinct method used by bacteria for sugar uptake where the source of energy is from phosphoenolpyruvate (PEP). It is known to be a multicomponent system that always i ...
, a multi-protein phosphorylation cascade that couples glucose uptake and metabolism.


Culture growth

Optimum growth of ''E. coli'' occurs at , but some laboratory strains can multiply at temperatures up to . ''E. coli'' grows in a variety of defined laboratory media, such as
lysogeny broth Lysogeny broth (LB) is a nutritionally rich medium primarily used for the growth of bacteria. Its creator, Giuseppe Bertani, intended LB to stand for lysogeny broth, but LB has also come to colloquially mean Luria broth, Lennox broth, life br ...
, or any medium that contains glucose, ammonium phosphate monobasic, sodium chloride, magnesium sulfate, potassium phosphate dibasic, and water. Growth can be driven by
aerobic Aerobic means "requiring air," in which "air" usually means oxygen. Aerobic may also refer to * Aerobic exercise, prolonged exercise of moderate intensity * Aerobics, a form of aerobic exercise * Aerobic respiration, the aerobic process of cel ...
or
anaerobic respiration Anaerobic respiration is respiration using electron acceptors other than molecular oxygen (O2). Although oxygen is not the final electron acceptor, the process still uses a respiratory electron transport chain. In aerobic organisms undergoing r ...
, using a large variety of redox pairs, including the oxidation of pyruvic acid, formic acid,
hydrogen Hydrogen is the chemical element with the symbol H and atomic number 1. Hydrogen is the lightest element. At standard conditions hydrogen is a gas of diatomic molecules having the formula . It is colorless, odorless, tasteless, non-toxic ...
, and
amino acid Amino acids are organic compounds that contain both amino and carboxylic acid functional groups. Although hundreds of amino acids exist in nature, by far the most important are the alpha-amino acids, which comprise proteins. Only 22 alpha a ...
s, and the reduction of substrates such as
oxygen Oxygen is the chemical element with the symbol O and atomic number 8. It is a member of the chalcogen group in the periodic table, a highly reactive nonmetal, and an oxidizing agent that readily forms oxides with most elements as ...
, nitrate, fumarate, dimethyl sulfoxide, and
trimethylamine N-oxide Trimethylamine ''N''-oxide (TMAO) is an organic compound with the formula (CH3)3NO. It is in the class of amine oxides. Although the anhydrous compound is known, trimethylamine ''N''-oxide is usually encountered as the dihydrate. Both the anhydr ...
. ''E. coli'' is classified as a facultative anaerobe. It uses oxygen when it is present and available. It can, however, continue to grow in the absence of oxygen using fermentation or anaerobic respiration. Respiraton type is managed in part by the arc system. The ability to continue growing in the absence of oxygen is an advantage to bacteria because their survival is increased in environments where water predominates.


Cell cycle

The bacterial cell cycle is divided into three stages. The B period occurs between the completion of cell division and the beginning of DNA replication. The C period encompasses the time it takes to replicate the chromosomal DNA. The D period refers to the stage between the conclusion of DNA replication and the end of cell division. The doubling rate of ''E. coli'' is higher when more nutrients are available. However, the length of the C and D periods do not change, even when the doubling time becomes less than the sum of the C and D periods. At the fastest growth rates, replication begins before the previous round of replication has completed, resulting in multiple replication forks along the DNA and overlapping cell cycles. The number of replication forks in fast growing ''E. coli'' typically follows 2n (n = 1, 2 or 3). This only happens if replication is initiated simultaneously from all origins of replications, and is referred to as synchronous replication. However, not all cells in a culture replicate synchronously. In this case cells do not have multiples of two
replication fork In molecular biology, DNA replication is the biological process of producing two identical replicas of DNA from one original DNA molecule. DNA replication occurs in all living organisms acting as the most essential part for biological inheritanc ...
s. Replication initiation is then referred to being asynchronous. However, asynchrony can be caused by mutations to for instance
DnaA Introduction Based on the Replicon Model, a positively active initiator molecule contacts with a particular spot on a circular chromosome called the replicator to start DNA replication. DnaA is a protein that activates initiation of DNA replica ...
or
DnaA Introduction Based on the Replicon Model, a positively active initiator molecule contacts with a particular spot on a circular chromosome called the replicator to start DNA replication. DnaA is a protein that activates initiation of DNA replica ...
initiator-associating protein
DiaA Diaa ( ar, ضياء) is an Egyptian male given name. Notable people with this name include: * Ahmed Diaa Eddine (1912–1976), Egyptian film director * Diaa al-Din Dawoud Diaa al-Din Dawoud (name also spelled ''Diya el-Din Dawud'' or ''Diaaeddin ...
.


Genetic adaptation

''E. coli'' and related bacteria possess the ability to transfer DNA via bacterial conjugation or transduction, which allows genetic material to spread horizontally through an existing population. The process of transduction, which uses the bacterial virus called a bacteriophage, is where the spread of the gene encoding for the
Shiga toxin Shiga toxins are a family of related toxins with two major groups, Stx1 and Stx2, expressed by genes considered to be part of the genome of lambdoid prophages. The toxins are named after Kiyoshi Shiga, who first described the bacterial or ...
from the ''
Shigella ''Shigella'' is a genus of bacteria that is Gram-negative, facultative anaerobic, non-spore-forming, nonmotile, rod-shaped, and genetically closely related to '' E. coli''. The genus is named after Kiyoshi Shiga, who first discovered it in 1 ...
'' bacteria to ''E. coli'' helped produce ''E. coli'' O157:H7, the Shiga toxin-producing strain of ''E. coli.''


Diversity

''E. coli'' encompasses an enormous population of bacteria that exhibit a very high degree of both genetic and phenotypic diversity. Genome sequencing of many isolates of ''E. coli'' and related bacteria shows that a taxonomic reclassification would be desirable. However, this has not been done, largely due to its medical importance, and ''E. coli'' remains one of the most diverse bacterial species: only 20% of the genes in a typical ''E. coli'' genome is shared among all strains. In fact, from the more constructive point of view, the members of genus ''Shigella'' (''S. dysenteriae'', ''S. flexneri'', ''S. boydii'', and ''S. sonnei'') should be classified as ''E. coli'' strains, a phenomenon termed
taxa in disguise In bacteriology, a taxon in disguise is a species, genus or higher unit of biological classification whose evolutionary history reveals has evolved from another unit of a similar or lower rank, making the parent unit paraphyletic. That happens whe ...
. Similarly, other strains of ''E. coli'' (e.g. the K-12 strain commonly used in
recombinant DNA Recombinant DNA (rDNA) molecules are DNA molecules formed by laboratory methods of genetic recombination (such as molecular cloning) that bring together genetic material from multiple sources, creating sequences that would not otherwise be fo ...
work) are sufficiently different that they would merit reclassification. A
strain Strain may refer to: Science and technology * Strain (biology), variants of plants, viruses or bacteria; or an inbred animal used for experimental purposes * Strain (chemistry), a chemical stress of a molecule * Strain (injury), an injury to a mu ...
is a subgroup within the species that has unique characteristics that distinguish it from other strains. These differences are often detectable only at the molecular level; however, they may result in changes to the physiology or lifecycle of the bacterium. For example, a strain may gain pathogenic capacity, the ability to use a unique carbon source, the ability to take upon a particular
ecological niche In ecology, a niche is the match of a species to a specific environmental condition. Three variants of ecological niche are described by It describes how an organism or population responds to the distribution of resources and competitors (for ...
, or the ability to resist antimicrobial agents. Different strains of ''E. coli'' are often host-specific, making it possible to determine the source of fecal contamination in environmental samples. For example, knowing which ''E. coli'' strains are present in a water sample allows researchers to make assumptions about whether the contamination originated from a human, another mammal, or a
bird Birds are a group of warm-blooded vertebrates constituting the class Aves (), characterised by feathers, toothless beaked jaws, the laying of hard-shelled eggs, a high metabolic rate, a four-chambered heart, and a strong yet lightweig ...
.


Serotypes

A common subdivision system of ''E. coli'', but not based on evolutionary relatedness, is by serotype, which is based on major surface
antigen In immunology, an antigen (Ag) is a molecule or molecular structure or any foreign particulate matter or a pollen grain that can bind to a specific antibody or T-cell receptor. The presence of antigens in the body may trigger an immune respons ...
s (O antigen: part of
lipopolysaccharide Lipopolysaccharides (LPS) are large molecules consisting of a lipid and a polysaccharide that are bacterial toxins. They are composed of an O-antigen, an outer core, and an inner core all joined by a covalent bond, and are found in the outer ...
layer; H:
flagellin Flagellin is a globular protein that arranges itself in a hollow cylinder to form the filament in a bacterial flagellum. It has a mass of about 30,000 to 60,000 daltons. Flagellin is the principal component of bacterial flagella, and is present ...
; K antigen: capsule), e.g. O157:H7). It is, however, common to cite only the serogroup, i.e. the O-antigen. At present, about 190 serogroups are known. The common laboratory strain has a mutation that prevents the formation of an O-antigen and is thus not typeable.


Genome plasticity and evolution

Like all lifeforms, new strains of ''E. coli'' evolve through the natural biological processes of
mutation In biology, a mutation is an alteration in the nucleic acid sequence of the genome of an organism, virus, or extrachromosomal DNA. Viral genomes contain either DNA or RNA. Mutations result from errors during DNA replication, DNA or viral repl ...
, gene duplication, and
horizontal gene transfer Horizontal gene transfer (HGT) or lateral gene transfer (LGT) is the movement of genetic material between unicellular and/or multicellular organisms other than by the ("vertical") transmission of DNA from parent to offspring (reproduction). H ...
; in particular, 18% of the genome of the laboratory strain MG1655 was horizontally acquired since the divergence from '' Salmonella''. ''E. coli'' K-12 and ''E. coli'' B strains are the most frequently used varieties for laboratory purposes. Some strains develop traits that can be harmful to a host animal. These
virulent Virulence is a pathogen's or microorganism's ability to cause damage to a host. In most, especially in animal systems, virulence refers to the degree of damage caused by a microbe to its host. The pathogenicity of an organism—its ability to ...
strains typically cause a bout of
diarrhea Diarrhea, also spelled diarrhoea, is the condition of having at least three loose, liquid, or watery bowel movements each day. It often lasts for a few days and can result in dehydration due to fluid loss. Signs of dehydration often begin w ...
that is often self-limiting in healthy adults but is frequently lethal to children in the developing world. More virulent strains, such as O157:H7, cause serious illness or death in the elderly, the very young, or the
immunocompromised Immunodeficiency, also known as immunocompromisation, is a state in which the immune system's ability to fight infectious diseases and cancer is compromised or entirely absent. Most cases are acquired ("secondary") due to extrinsic factors that a ...
. The genera ''
Escherichia ''Escherichia'' () is a genus of Gram-negative, non- spore-forming, facultatively anaerobic, rod-shaped bacteria from the family Enterobacteriaceae. In those species which are inhabitants of the gastrointestinal tracts of warm-blooded animals, ...
'' and '' Salmonella'' diverged around 102 million years ago (credibility interval: 57–176 mya), an event unrelated to the much earlier (see ''
Synapsid Synapsids + (, 'arch') > () "having a fused arch"; synonymous with ''theropsids'' (Greek, "beast-face") are one of the two major groups of animals that evolved from basal amniotes, the other being the sauropsids, the group that includes reptil ...
'') divergence of their hosts: the former being found in mammals and the latter in birds and reptiles. This was followed by a split of an ''Escherichia'' ancestor into five species (''E. albertii'', ''E. coli'', ''E. fergusonii'', ''E. hermannii'', and ''E. vulneris''). The last ''E. coli'' ancestor split between 20 and 30 million years ago. The long-term evolution experiments using ''E. coli'', begun by
Richard Lenski Richard Eimer Lenski (born August 13, 1956) is an American evolutionary biologist, a Hannah Distinguished Professor of Microbial Ecology at Michigan State University. He is a member of the National Academy of Sciences and a MacArthur fellow. ...
in 1988, have allowed direct observation of genome evolution over more than 65,000 generations in the laboratory. For instance, ''E. coli'' typically do not have the ability to grow aerobically with citrate as a carbon source, which is used as a diagnostic criterion with which to differentiate ''E. coli'' from other, closely, related bacteria such as '' Salmonella''. In this experiment, one population of ''E. coli'' unexpectedly evolved the ability to aerobically metabolize
citrate Citric acid is an organic compound with the chemical formula HOC(CO2H)(CH2CO2H)2. It is a colorless weak organic acid. It occurs naturally in citrus fruits. In biochemistry, it is an intermediate in the citric acid cycle, which occurs in the ...
, a major evolutionary shift with some hallmarks of microbial speciation. In the microbial world, a relationship of predation can be established similar to that observed in the animal world. Considered, it has been seen that ''E. coli'' is the prey of multiple generalist predators, such as ''
Myxococcus xanthus ''Myxococcus xanthus'' is a gram-negative, rod-shaped species of myxobacteria that exhibits various forms of self-organizing behavior in response to environmental cues. Under normal conditions with abundant food, it exists as a predatory, sapro ...
''. In this predator-prey relationship, a parallel evolution of both species is observed through genomic and phenotypic modifications, in the case of ''E. coli'' the modifications are modified in two aspects involved in their virulence such as mucoid production (excessive production of exoplasmic acid alginate ) and the suppression of the OmpT gene, producing in future generations a better adaptation of one of the species that is counteracted by the evolution of the other, following a co-evolutionary model demonstrated by the
Red Queen hypothesis The Red Queen hypothesis is a hypothesis in evolutionary biology proposed in 1973, that species must constantly adapt, evolve, and proliferate in order to survive while pitted against ever-evolving opposing species. The hypothesis was intended ...
.


Neotype strain

''E. coli'' is the type species of the genus (''Escherichia'') and in turn ''Escherichia'' is the type genus of the family Enterobacteriaceae, where the family name does not stem from the genus ''Enterobacter'' + "i" (sic.) + " aceae", but from "enterobacterium" + "aceae" (enterobacterium being not a genus, but an alternative trivial name to enteric bacterium). The original strain described by Escherich is believed to be lost, consequently a new type strain (neotype) was chosen as a representative: the neotype strain is U5/41T, also known under the deposit names DSM 30083, ATCC 11775, and NCTC 9001, which is pathogenic to chickens and has an O1:K1:H7 serotype. However, in most studies, either O157:H7, K-12 MG1655, or K-12 W3110 were used as a representative ''E. coli''. The genome of the type strain has only lately been sequenced.


Phylogeny of ''E. coli'' strains

Many strains belonging to this species have been isolated and characterised. In addition to serotype (''vide supra''), they can be classified according to their
phylogeny A phylogenetic tree (also phylogeny or evolutionary tree Felsenstein J. (2004). ''Inferring Phylogenies'' Sinauer Associates: Sunderland, MA.) is a branching diagram or a tree showing the evolutionary relationships among various biological spe ...
, i.e. the inferred evolutionary history, as shown below where the species is divided into six groups. Particularly the use of whole genome sequences yields highly supported phylogenies. Based on such data, five subspecies of ''E. coli'' were distinguished. The link between phylogenetic distance ("relatedness") and pathology is small, ''e.g.'' the O157:H7 serotype strains, which form a clade ("an exclusive group")—group E below—are all enterohaemorragic strains (EHEC), but not all EHEC strains are closely related. In fact, four different species of ''Shigella'' are nested among ''E. coli'' strains (''vide supra''), while ''E. albertii'' and ''E. fergusonii'' are outside this group. Indeed, all ''Shigella'' species were placed within a single subspecies of ''E. coli'' in a phylogenomic study that included the type strain, and for this reason an according reclassification is difficult. All commonly used research strains of ''E. coli'' belong to group A and are derived mainly from Clifton's K-12 strain (λ+ F+; O16) and to a lesser degree from d'Herelle's '' Bacillus coli'' strain (B strain) (O7).


Genomics

The first complete DNA sequence of an ''E. coli''
genome In the fields of molecular biology and genetics, a genome is all the genetic information of an organism. It consists of nucleotide sequences of DNA (or RNA in RNA viruses). The nuclear genome includes protein-coding genes and non-coding g ...
(laboratory strain K-12 derivative MG1655) was published in 1997. It is a circular DNA molecule 4.6 million base pairs in length, containing 4288 annotated protein-coding genes (organized into 2584
operons In genetics, an operon is a functioning unit of DNA containing a cluster of genes under the control of a single promoter. The genes are transcribed together into an mRNA strand and either translated together in the cytoplasm, or undergo splic ...
), seven
ribosomal RNA Ribosomal ribonucleic acid (rRNA) is a type of non-coding RNA which is the primary component of ribosomes, essential to all cells. rRNA is a ribozyme which carries out protein synthesis in ribosomes. Ribosomal RNA is transcribed from ribosom ...
(rRNA) operons, and 86 transfer RNA (tRNA) genes. Despite having been the subject of intensive genetic analysis for about 40 years, many of these genes were previously unknown. The coding density was found to be very high, with a mean distance between genes of only 118 base pairs. The genome was observed to contain a significant number of transposable genetic elements, repeat elements, cryptic
prophages A prophage is a bacteriophage (often shortened to "phage") genome that is integrated into the circular bacterial chromosome or exists as an extrachromosomal plasmid within the bacterial cell (biology), cell. Integration of prophages into the bacteri ...
, and bacteriophage remnants. More than three hundred complete genomic sequences of ''Escherichia'' and ''Shigella'' species are known. The genome sequence of the type strain of ''E. coli'' was added to this collection before 2014. Comparison of these sequences shows a remarkable amount of diversity; only about 20% of each genome represents sequences present in every one of the isolates, while around 80% of each genome can vary among isolates. Each individual genome contains between 4,000 and 5,500 genes, but the total number of different genes among all of the sequenced ''E. coli'' strains (the pangenome) exceeds 16,000. This very large variety of component genes has been interpreted to mean that two-thirds of the ''E. coli'' pangenome originated in other species and arrived through the process of horizontal gene transfer.


Gene nomenclature

Genes in ''E. coli'' are usually named in accordance with the uniform nomenclature proposed by Demerec et al. Gene names are 3-letter acronyms that derive from their function (when known) or mutant phenotype and are italicized. When multiple genes have the same acronym, the different genes are designated by a capital later that follows the acronym and is also italicized. For instance, ''recA'' is named after its role in
homologous recombination Homologous recombination is a type of genetic recombination in which genetic information is exchanged between two similar or identical molecules of double-stranded or single-stranded nucleic acids (usually DNA as in cellular organisms but may ...
plus the letter A. Functionally related genes are named ''recB'', ''recC'', ''recD'' etc. The proteins are named by uppercase acronyms, e.g.
RecA RecA is a 38 kilodalton protein essential for the repair and maintenance of DNA. A RecA structural and functional homolog has been found in every species in which one has been seriously sought and serves as an archetype for this class of homolog ...
, RecB, etc. When the genome of ''E. coli'' strain K-12 substr. MG1655 was sequenced, all known or predicted protein-coding genes were numbered (more or less) in their order on the genome and abbreviated by b numbers, such as b2819 (= ''recD''). The "b" names were created after Fred Blattner, who led the genome sequence effort. Another numbering system was introduced with the sequence of another ''E. coli'' K-12 substrain, W3110, which was sequenced in Japan and hence uses numbers starting by JW... (Japanese W3110), e.g. JW2787 (= ''recD''). Hence, ''recD'' = b2819 = JW2787. Note, however, that most databases have their own numbering system, e.g. the EcoGene database uses EG10826 for ''recD''. Finally, ECK numbers are specifically used for alleles in the MG1655 strain of ''E. coli'' K-12. Complete lists of genes and their synonyms can be obtained from databases such as EcoGene or
Uniprot UniProt is a freely accessible database of protein sequence and functional information, many entries being derived from genome sequencing projects. It contains a large amount of information about the biological function of proteins derived from ...
.


Proteomics


Proteome

The genome sequence of ''E. coli'' predicts 4288 protein-coding genes, of which 38 percent initially had no attributed function. Comparison with five other sequenced microbes reveals ubiquitous as well as narrowly distributed gene families; many families of similar genes within ''E. coli'' are also evident. The largest family of paralogous proteins contains 80 ABC transporters. The genome as a whole is strikingly organized with respect to the local direction of replication; guanines, oligonucleotides possibly related to replication and recombination, and most genes are so oriented. The genome also contains insertion sequence (IS) elements, phage remnants, and many other patches of unusual composition indicating genome plasticity through horizontal transfer. Several studies have experimentally investigated the
proteome The proteome is the entire set of proteins that is, or can be, expressed by a genome, cell, tissue, or organism at a certain time. It is the set of expressed proteins in a given type of cell or organism, at a given time, under defined conditions. ...
of ''E. coli''. By 2006, 1,627 (38%) of the predicted proteins (
open reading frames In molecular biology, open reading frames (ORFs) are defined as spans of DNA sequence between the start and stop codons. Usually, this is considered within a studied region of a prokaryotic DNA sequence, where only one of the six possible readin ...
, ORFs) had been identified experimentally. Mateus et al. 2020 detected 2,586 proteins with at least 2 peptides (60% of all proteins).


Post-translational modifications (PTMs)

Although much fewer bacterial proteins seem to have
post-translational modification Post-translational modification (PTM) is the covalent and generally enzymatic modification of proteins following protein biosynthesis. This process occurs in the endoplasmic reticulum and the golgi apparatus. Proteins are synthesized by ribos ...
s (PTMs) compared to
eukaryotic Eukaryotes () are organisms whose Cell (biology), cells have a cell nucleus, nucleus. All animals, plants, fungi, and many unicellular organisms, are Eukaryotes. They belong to the group of organisms Eukaryota or Eukarya, which is one of the ...
proteins, a substantial number of proteins are modified in ''E. coli''. For instance, Potel et al. (2018) found 227
phosphoprotein A phosphoprotein is a protein that is posttranslationally modified by the attachment of either a single phosphate group, or a complex molecule such as 5'-phospho-DNA, through a phosphate group. The target amino acid is most often serine, threonin ...
s of which 173 were phosphorylated on
histidine Histidine (symbol His or H) is an essential amino acid that is used in the biosynthesis of proteins. It contains an α-amino group (which is in the protonated –NH3+ form under biological conditions), a carboxylic acid group (which is in the d ...
. Interestingly, the majority of phosphorylated
amino acid Amino acids are organic compounds that contain both amino and carboxylic acid functional groups. Although hundreds of amino acids exist in nature, by far the most important are the alpha-amino acids, which comprise proteins. Only 22 alpha a ...
s were serine (1,220 sites) with only 246 sites on
histidine Histidine (symbol His or H) is an essential amino acid that is used in the biosynthesis of proteins. It contains an α-amino group (which is in the protonated –NH3+ form under biological conditions), a carboxylic acid group (which is in the d ...
and 501 phosphorylated theronines and 162
tyrosine -Tyrosine or tyrosine (symbol Tyr or Y) or 4-hydroxyphenylalanine is one of the 20 standard amino acids that are used by cells to synthesize proteins. It is a non-essential amino acid with a polar side group. The word "tyrosine" is from the G ...
s.


Interactome

The
interactome In molecular biology, an interactome is the whole set of molecular interactions in a particular cell. The term specifically refers to physical interactions among molecules (such as those among proteins, also known as protein–protein interactions, ...
of ''E. coli'' has been studied by
affinity purification Affinity chromatography is a method of separating a biomolecule from a mixture, based on a highly specific macromolecular binding interaction between the biomolecule and another substance. The specific type of binding interaction depends on the ...
and mass spectrometry (AP/MS) and by analyzing the binary interactions among its proteins. Protein complexes. A 2006 study purified 4,339 proteins from cultures of strain K-12 and found interacting partners for 2,667 proteins, many of which had unknown functions at the time. A 2009 study found 5,993 interactions between proteins of the same ''E. coli'' strain, though these data showed little overlap with those of the 2006 publication. Binary interactions. Rajagopala ''et al.'' (2014) have carried out systematic yeast two-hybrid screens with most ''E. coli'' proteins, and found a total of 2,234 protein-protein interactions. This study also integrated genetic interactions and protein structures and mapped 458 interactions within 227
protein complexes A protein complex or multiprotein complex is a group of two or more associated polypeptide chains. Protein complexes are distinct from multienzyme complexes, in which multiple catalytic domains are found in a single polypeptide chain. Protein ...
.


Normal microbiota

''E. coli'' belongs to a group of bacteria informally known as coliforms that are found in the gastrointestinal tract of warm-blooded animals. ''E. coli'' normally colonizes an infant's gastrointestinal tract within 40 hours of birth, arriving with food or water or from the individuals handling the child. In the bowel, ''E. coli'' adheres to the
mucus Mucus ( ) is a slippery aqueous secretion produced by, and covering, mucous membranes. It is typically produced from cells found in mucous glands, although it may also originate from mixed glands, which contain both serous and mucous cells. It ...
of the
large intestine The large intestine, also known as the large bowel, is the last part of the gastrointestinal tract and of the digestive system in tetrapods. Water is absorbed here and the remaining waste material is stored in the rectum as feces before bein ...
. It is the primary
facultative anaerobe A facultative anaerobic organism is an organism that makes ATP by aerobic respiration if oxygen is present, but is capable of switching to fermentation if oxygen is absent. Some examples of facultatively anaerobic bacteria are '' Staphylococc ...
of the human gastrointestinal tract. (
Facultative anaerobes A facultative anaerobic organism is an organism that makes ATP by aerobic respiration if oxygen is present, but is capable of switching to fermentation if oxygen is absent. Some examples of facultatively anaerobic bacteria are ''Staphylococcus' ...
are organisms that can grow in either the presence or absence of oxygen.) As long as these bacteria do not acquire genetic elements encoding for
virulence factor Virulence factors (preferably known as pathogenicity factors or effectors in plant science) are cellular structures, molecules and regulatory systems that enable microbial pathogens (bacteria, viruses, fungi, and protozoa) to achieve the following ...
s, they remain benign commensals.


Therapeutic use

Due to the low cost and speed with which it can be grown and modified in laboratory settings, ''E. coli'' is a popular expression platform for the production of
recombinant proteins Recombinant DNA (rDNA) molecules are DNA molecules formed by laboratory methods of genetic recombination (such as molecular cloning) that bring together genetic material from multiple sources, creating sequences that would not otherwise be fou ...
used in therapeutics. One advantage to using ''E. coli'' over another expression platform is that ''E. coli'' naturally does not export many proteins into the
periplasm The periplasm is a concentrated gel-like matrix in the space between the inner cytoplasmic membrane and the bacterial outer membrane called the ''periplasmic space'' in gram-negative bacteria. Using cryo-electron microscopy it has been found that ...
, making it easier to recover a protein of interest without cross-contamination. The ''E. coli'' K-12 strains and their derivatives (DH1, DH5α, MG1655, RV308 and W3110) are the strains most widely used by the biotechnology industry. Nonpathogenic ''E. coli'' strain Nissle 1917 (EcN), (Mutaflor) and ''E. coli'' O83:K24:H31 (Colinfant)) are used as
probiotic Probiotics are live microorganisms promoted with claims that they provide health benefits when consumed, generally by improving or restoring the gut microbiota. Probiotics are considered generally safe to consume, but may cause bacteria- host ...
agents in medicine, mainly for the treatment of various
gastrointestinal disease Gastrointestinal diseases (abbrev. GI diseases or GI illnesses) refer to diseases involving the gastrointestinal tract, namely the oesophagus, stomach, small intestine, large intestine and rectum, and the accessory organs of digestion, the liver, ...
s, including inflammatory bowel disease. It is thought that the EcN strain might impede the growth of opportunistic pathogens, including '' Salmonella'' and other
coliform Coliform bacteria are defined as either motile or non-motile Gram-negative non- spore forming Bacilli that possess β-galactosidase to produce acids and gases under their optimal growth temperature of 35-37°C. They can be aerobes or facultativ ...
enteropathogens, through the production of
microcin Microcins are very small bacteriocins, composed of relatively few amino acids. For this reason, they are distinct from their larger protein cousins. The classic example is microcin V, of ''Escherichia coli''. Subtilosin A is another bacterioci ...
proteins the production of
siderophore Siderophores (Greek: "iron carrier") are small, high-affinity iron- chelating compounds that are secreted by microorganisms such as bacteria and fungi. They help the organism accumulate iron. Although a widening range of siderophore functions is n ...
s.


Role in disease

Most ''E. coli'' strains do not cause disease, naturally living in the gut, but virulent strains can cause gastroenteritis, Uropathogenic Escherichia coli, urinary tract infections, Meningitis#Bacterial, neonatal meningitis, hemorrhagic colitis, and Crohn's disease. Common signs and symptoms include severe abdominal cramps, diarrhea, hemorrhagic colitis, vomiting, and sometimes fever. In rarer cases, virulent strains are also responsible for bowel necrosis (tissue death) and perforation without progressing to hemolytic-uremic syndrome, peritonitis, mastitis, sepsis, and Gram-negative pneumonia. Very young children are more susceptible to develop severe illness, such as hemolytic uremic syndrome; however, healthy individuals of all ages are at risk to the severe consequences that may arise as a result of being infected with ''E. coli''. Some strains of ''E. coli'', for example O157:H7, can produce
Shiga toxin Shiga toxins are a family of related toxins with two major groups, Stx1 and Stx2, expressed by genes considered to be part of the genome of lambdoid prophages. The toxins are named after Kiyoshi Shiga, who first described the bacterial or ...
(classified as a bioterrorism agent). The Shiga toxin causes inflammatory responses in target cells of the gut, leaving behind lesions which result in the bloody diarrhea that is a symptom of a Shigatoxigenic and verotoxigenic Escherichia coli, Shiga toxin-producing ''E. coli'' (STEC) infection. This toxin further causes premature destruction of the red blood cells, which then clog the body's filtering system, the kidneys, in some rare cases (usually in children and the elderly) causing hemolytic-uremic syndrome (HUS), which may lead to kidney failure and even death. Signs of hemolytic uremic syndrome include decreased frequency of urination, lethargy, and paleness of cheeks and inside the lower eyelids. In 25% of HUS patients, complications of nervous system occur, which in turn causes strokes. In addition, this strain causes the buildup of fluid (since the kidneys do not work), leading to edema around the lungs, legs, and arms. This increase in fluid buildup especially around the lungs impedes the functioning of the heart, causing an increase in blood pressure. Pathogenic Escherichia coli#Urinary tract infection, Uropathogenic ''E. coli'' (UPEC) is one of the main causes of urinary tract infections. It is part of the normal microbiota in the gut and can be introduced in many ways. In particular for females, the direction of wiping after defecation (wiping back to front) can lead to fecal contamination of the urogenital orifices. Anal intercourse can also introduce this bacterium into the male urethra, and in switching from anal to vaginal intercourse, the male can also introduce UPEC to the female urogenital system. Enterotoxigenic Escherichia coli, Enterotoxigenic ''E. coli'' (ETEC) is the most common cause of traveler's diarrhea, with as many as 840 million cases worldwide in developing countries each year. The bacteria, typically transmitted through contaminated food or drinking water, adheres to the Intestinal epithelium, intestinal lining, where it secretes either of two types of enterotoxins, leading to watery diarrhea. The rate and severity of infections are higher among children under the age of five, including as many as 380,000 deaths annually. In May 2011, one ''E. coli'' strain, E. coli O104:H4, O104:H4, was the subject of a 2011 E. coli O104:H4 outbreak, bacterial outbreak that began in Germany. Certain strains of ''E. coli'' are a major cause of foodborne illness. The outbreak started when several people in Germany were infected with enterohemorrhagic, enterohemorrhagic ''E. coli'' (EHEC) bacteria, leading to hemolytic-uremic syndrome (HUS), a medical emergency that requires urgent treatment. The outbreak did not only concern Germany, but also 15 other countries, including regions in North America. On 30 June 2011, the German ''Bundesinstitut für Risikobewertung (BfR)'' (Federal Institute for Risk Assessment, a federal institute within the German Federal Ministry of Food, Agriculture and Consumer Protection) announced that seeds of fenugreek from Egypt were likely the cause of the EHEC outbreak. Some studies have demonstrated an absence of E. ''coli'' in the gut flora of subjects with the metabolic disorder Phenylketonuria. It is hypothesized that the absence of these normal bacterium impairs the production of the key vitamins B2 (riboflavin) and K2 (menaquinone) - vitamins which are implicated in many physiological roles in humans such as cellular and bone metabolism - and so contributes to the disorder. Carbapenem-resistant ''E. coli'' (carbapenemase-producing ''E. coli'') that are resistant to the carbapenem class of antibiotics, considered the Drug of last resort, drugs of last resort for such infections. They are resistant because they produce an enzyme called a carbapenemase that disables the drug molecule.


Incubation period

The time between ingesting the STEC bacteria and feeling sick is called the "incubation period". The incubation period is usually 3–4 days after the exposure, but may be as short as 1 day or as long as 10 days. The symptoms often begin slowly with mild belly pain or non-bloody diarrhea that worsens over several days. HUS, if it occurs, develops an average 7 days after the first symptoms, when the diarrhea is improving.


Diagnosis

Diagnosis of infectious diarrhea and identification of antimicrobial resistance is performed using a stool culture with subsequent antibiotic sensitivity testing. It requires a minimum of 2 days and maximum of several weeks to culture gastrointestinal pathogens. The sensitivity (true positive) and specificity (true negative) rates for stool culture vary by pathogen, although a number of human pathogens can not be microbiological culture, cultured. For culture-positive samples, antimicrobial resistance testing takes an additional 12–24 hours to perform. Current point of care molecular diagnostic tests can identify ''E. coli'' and antimicrobial resistance in the identified strains much faster than culture and sensitivity testing. Microarray-based platforms can identify specific pathogenic strains of ''E. coli'' and ''E. coli''-specific AMR genes in two hours or less with high sensitivity and specificity, but the size of the test panel (i.e., total pathogens and antimicrobial resistance genes) is limited. Newer Metagenomics#Infectious disease diagnosis, metagenomics-based infectious disease diagnostic platforms are currently being developed to overcome the various limitations of culture and all currently available molecular diagnostic technologies.


Treatment

The mainstay of treatment is the assessment of dehydration and replacement of fluid and electrolytes. Administration of antibiotics has been shown to shorten the course of illness and duration of excretion of enterotoxigenic ''E. coli'' (ETEC) in adults in endemic areas and in traveller's diarrhea, though the rate of resistance to commonly used antibiotics is increasing and they are generally not recommended. The antibiotic used depends upon susceptibility patterns in the particular geographical region. Currently, the antibiotics of choice are fluoroquinolones or azithromycin, with an emerging role for rifaximin. Oral rifaximin, a semisynthetic rifamycin derivative, is an effective and well-tolerated antibacterial for the management of adults with non-invasive traveller's diarrhea. Rifaximin was significantly more effective than placebo and no less effective than ciprofloxacin in reducing the duration of diarrhea. While rifaximin is effective in patients with ''E. coli''-predominant traveller's diarrhea, it appears ineffective in patients infected with inflammatory or invasive enteropathogens.


Prevention

ETEC is the type of ''E. coli'' that most vaccine development efforts are focused on. Antibodies against the LT and major CFs of ETEC provide protection against LT-producing, ETEC-expressing homology (biology), homologous CFs. Oral inactivated vaccines consisting of toxin antigen and whole cells, i.e. the licensed recombinant cholera B subunit (rCTB)-WC cholera vaccine Dukoral, have been developed. There are currently no licensed vaccines for ETEC, though several are in various stages of development. In different trials, the rCTB-WC cholera vaccine provided high (85–100%) short-term protection. An oral ETEC vaccine candidate consisting of rCTB and formalin inactivated ''E. coli'' bacteria expressing major CFs has been shown in clinical trials to be safe, immunogenic, and effective against severe diarrhoea in American travelers but not against ETEC diarrhoea in young children in Egypt. A modified ETEC vaccine consisting of recombinant ''E. coli'' strains over-expressing the major CFs and a more LT-like hybrid toxoid called LCTBA, are undergoing clinical testing. Other proven prevention methods for ''E. coli'' transmission include handwashing and improved sanitation and drinking water, as transmission occurs through fecal contamination of food and water supplies. Additionally, thoroughly cooking meat and avoiding consumption of raw, unpasteurized beverages, such as juices and milk are other proven methods for preventing ''E. coli''. Lastly, cross-contamination of utensils and work spaces should be avoided when preparing food.


Model organism in life science research

Because of its long history of laboratory culture and ease of manipulation, ''E. coli'' plays an important role in modern biological engineering and industrial microbiology. The work of Stanley Norman Cohen and Herbert Boyer in ''E. coli'', using plasmids and restriction enzymes to create
recombinant DNA Recombinant DNA (rDNA) molecules are DNA molecules formed by laboratory methods of genetic recombination (such as molecular cloning) that bring together genetic material from multiple sources, creating sequences that would not otherwise be fo ...
, became a foundation of biotechnology. ''E. coli'' is a very versatile host for the production of heterologous proteins, and various Protein expression (biotechnology), protein expression systems have been developed which allow the production of
recombinant proteins Recombinant DNA (rDNA) molecules are DNA molecules formed by laboratory methods of genetic recombination (such as molecular cloning) that bring together genetic material from multiple sources, creating sequences that would not otherwise be fou ...
in ''E. coli''. Researchers can introduce genes into the microbes using plasmids which permit high level expression of protein, and such protein may be mass-produced in industrial fermentation processes. One of the first useful applications of recombinant DNA technology was the manipulation of ''E. coli'' to produce human insulin. Many proteins previously thought difficult or impossible to be expressed in ''E. coli'' in folded form have been successfully expressed in ''E. coli''. For example, proteins with multiple disulphide bonds may be produced in the periplasmic space or in the cytoplasm of mutants rendered sufficiently oxidizing to allow disulphide-bonds to form, while proteins requiring
post-translational modification Post-translational modification (PTM) is the covalent and generally enzymatic modification of proteins following protein biosynthesis. This process occurs in the endoplasmic reticulum and the golgi apparatus. Proteins are synthesized by ribos ...
such as glycosylation for stability or function have been expressed using the N-linked glycosylation system of ''Campylobacter jejuni'' engineered into ''E. coli''. Modified ''E. coli'' cells have been used in vaccine development, bioremediation, production of biofuels, lighting, and production of immobilised enzymes. Strain K-12 is a mutant form of ''E. coli'' that over-expresses the enzyme Alkaline phosphatase, Alkaline Phosphatase (ALP). The mutation arises due to a defect in the gene that constantly codes for the enzyme. A gene that is producing a product without any inhibition is said to have Receptor (biochemistry), constitutive activity. This particular mutant form is used to isolate and purify the aforementioned enzyme. Strain OP50 of ''Escherichia coli'' is used for maintenance of ''Caenorhabditis elegans'' cultures. Strain JM109 is a mutant form of ''E. coli'' that is recA and endA deficient. The strain can be utilized for blue/white screening when the cells carry the fertility factor episome. Lack of recA decreases the possibility of unwanted restriction of the DNA of interest and lack of endA inhibit plasmid DNA decomposition. Thus, JM109 is useful for cloning and expression systems.


Model organism

''E. coli'' is frequently used as a model organism in microbiology studies. Cultivated strains (e.g. ''E. coli'' K12) are well-adapted to the laboratory environment, and, unlike wild-type strains, have lost their ability to thrive in the intestine. Many laboratory strains lose their ability to form biofilms. These features protect wild-type strains from antibody, antibodies and other chemical attacks, but require a large expenditure of energy and material resources. ''E. coli'' is often used as a representative microorganism in the research of novel water treatment and sterilisation methods, including photocatalysis. By standard Bacteriological water analysis#Plate count, plate count methods, following sequential dilutions, and growth on agar gel plates, the concentration of viable organisms or CFUs (Colony Forming Units), in a known volume of treated water can be evaluated, allowing the comparative assessment of materials performance. In 1946, Joshua Lederberg and Edward Tatum first described the phenomenon known as bacterial conjugation using ''E. coli'' as a model bacterium, and it remains the primary model to study conjugation. ''E. coli'' was an integral part of the first experiments to understand bacteriophage, phage genetics, and early researchers, such as Seymour Benzer, used ''E. coli'' and phage T4 to understand the topography of gene structure. Prior to Benzer's research, it was not known whether the gene was a linear structure, or if it had a branching pattern. ''E. coli'' was one of the first organisms to have its genome sequenced; the complete genome of ''E. coli'' K12 was published by ''Science'' in 1997. From 2002 to 2010, a team at the Hungarian Academy of Science created a strain of ''Escherichia coli'' called MDS42, which is now sold by Scarab Genomics of Madison, WI under the name of "Clean Genome ''E. coli''", where 15% of the genome of the parental strain (''E. coli'' K-12 MG1655) were removed to aid in molecular biology efficiency, removing IS elements, pseudogenes and phages, resulting in better maintenance of plasmid-encoded toxic genes, which are often inactivated by transposons. Biochemistry and replication machinery were not altered. By evaluating the possible combination of Nanotechnology, nanotechnologies with landscape ecology, complex habitat landscapes can be generated with details at the nanoscale. On such synthetic ecosystems, evolutionary experiments with ''E. coli'' have been performed to study the spatial biophysics of adaptation in an island biogeography on-chip. In other studies, non-pathogenic ''E. coli'' has been used as a model microorganism towards understanding the effects of simulated microgravity (on Earth) on the same.


Uses in biological computing

Since 1961, scientists proposed the idea of genetic circuits used for computational tasks. Collaboration between biologists and computing scientists has allowed designing digital logic gates on the metabolism of ''E. coli''. As Lac operon is a two-stage process, genetic regulation in the bacteria is used to realize computing functions. The process is controlled at the transcription stage of DNA into messenger RNA. Studies are being performed attempting to program ''E. coli'' to solve complicated mathematics problems, such as the Hamiltonian path problem. A computer to control protein production of ''E. coli'' within yeast cells has been developed. A method has also been developed to use bacteria to behave as an Liquid-crystal display, LCD screen. In July 2017, separate experiments with ''E. coli'' published on Nature showed the potential of using living cells for computing tasks and storing information. A team formed with collaborators of the The Biodesign Institute, Biodesign Institute at Arizona State University and Harvard’s Wyss Institute for Biologically Inspired Engineering developed a biological computer inside ''E. coli'' that responded to a dozen inputs. The team called the computer "ribocomputer", as it was composed of RNA, ribonucleic acid. Meanwhile, Harvard researchers probed that is possible to store information in bacteria after successfully archiving images and movies in the DNA of living ''E. coli'' cells. In 2021, a team led by biophysicist Sangram Bagh realized a study with ''E. coli'' to solve Maze, 2 × 2 maze problems to probe the principle for distributed computing among cells.


History

In 1885, the German-Austrian pediatrician Theodor Escherich discovered this organism in the feces of healthy individuals. He called it ''Bacterium coli commune'' because it is found in the colon. Early classifications of prokaryotes placed these in a handful of genera based on their shape and motility (at that time Ernst Haeckel's classification of bacteria in the kingdom Monera was in place). ''Bacterium coli'' was the type species of the now invalid genus Bacterium (genus), ''Bacterium'' when it was revealed that the former type species ("''Bacterium triloculare''") was missing. Following a revision of ''Bacterium'', it was reclassified as ''Bacillus coli'' by Migula in 1895 and later reclassified in the newly created genus ''
Escherichia ''Escherichia'' () is a genus of Gram-negative, non- spore-forming, facultatively anaerobic, rod-shaped bacteria from the family Enterobacteriaceae. In those species which are inhabitants of the gastrointestinal tracts of warm-blooded animals, ...
'', List of bacterial genera named after personal names, named after its original discoverer, by Aldo Castellani and Albert John Chalmers. In 1996, the world's worst to date outbreak of ''E. coli'' food poisoning occurred in Wishaw, Scotland, killing 21 people. This death toll was exceeded in 2011, when the 2011 Germany E. coli O104:H4 outbreak, 2011 Germany ''E. coli'' O104:H4 outbreak, linked to organic fenugreek sprouts, killed 53 people.


Uses

''E. coli'' has several practical uses besides its use as a vector for genetic experiments and processes. For example, ''E. coli'' can be used to generate synthetic propane and recombinant human growth hormone.


See also

* Bacteriological water analysis * BolA-like protein family * Carbon monoxide-releasing molecules * Contamination control * Dam dcm strain * Eijkman test * Fecal coliform * International Code of Nomenclature of Bacteria * List of strains of Escherichia coli, List of strains of ''Escherichia coli'' * Mannan oligosaccharide-based nutritional supplements * Overflow metabolism * T4 rII system


References


Databases and external links

* EcoCyc – literature-based curation of the entire genome, and of transcriptional regulation, transporters, and metabolic pathways * Membranome database provides information about single-pass transmembrane proteins from ''E. coli'' and several other organisms
''E. coli'' statistics



Bacteriome
''E. coli'' interaction database
EcoGene
(genome database and website dedicated to ''Escherichia coli'' K-12 substrain MG1655)
EcoSal
Continually updated Web resource based on the classic ASM Press publication ''Escherichia coli and Salmonella: Cellular and Molecular Biology''
ECODAB
The structure of the O-antigens that form the basis of the serological classification of ''E. coli''
Coli Genetic Stock Center
Strains and genetic information on ''E. coli'' K-12
PortEco (formerly EcoliHub)
– NIH-funded comprehensive data resource for ''E. coli'' K-12 and its phage, plasmids, and mobile genetic elements
EcoliWiki
is the community annotation component o
PortEco

RegulonDB
RegulonDB is a model of the complex regulation of transcription initiation or regulatory network of the cell ''E. coli'' K-12.


AlignACE
Matrices that search for additional binding sites in the ''E. coli'' genomic sequence
''E. coli'' on Protein Data Bank
{{Authority control Escherichia coli, Gut flora bacteria Tropical diseases Model organisms Bacteria described in 1919