
Transmission electron microscopy DNA sequencing is a single-molecule
sequencing
In genetics and biochemistry, sequencing means to determine the primary structure (sometimes incorrectly called the primary sequence) of an unbranched biopolymer. Sequencing results in a symbolic linear depiction known as a sequence which succ ...
technology that uses
transmission electron microscopy
Transmission electron microscopy (TEM) is a microscopy technique in which a beam of electrons is transmitted through a specimen to form an image. The specimen is most often an ultrathin section less than 100 nm thick or a suspension on a g ...
techniques. The method was conceived and developed in the 1960s and 70s,
[[ Michael Beer and Richard Zobel (1961) "Electron stains II: Electron microscopic studies on the visibility of stained DNA molecules" J. Mol. Biol. Volume 3, Issue 6, December 1961, Pages 717–726, IN3–IN5"]] but lost favor when the extent of damage to the sample was recognized.
[[M. Cole et al (1977) "Molecular microscopy of labeled polynucleotides: Stability of osmium atoms" J. Mol. Biol. Volume 117, Issue 2, 5 December 1977, Pages 387–400]]
In order for DNA to be clearly visualized under an
electron microscope
An electron microscope is a microscope that uses a beam of electrons as a source of illumination. It uses electron optics that are analogous to the glass lenses of an optical light microscope to control the electron beam, for instance focusing it ...
, it must be labeled with heavy atoms. In addition, specialized imaging techniques and
aberration corrected optics are beneficial for obtaining the
resolution required to image the labeled DNA molecule. In theory, transmission electron microscopy DNA sequencing could provide extremely long read lengths, but the issue of electron beam damage may still remain and the technology has not yet been commercially developed.
History
Only a few years after
James Watson
James Dewey Watson (born April 6, 1928) is an American molecular biology, molecular biologist, geneticist, and zoologist. In 1953, he co-authored with Francis Crick the academic paper in ''Nature (journal), Nature'' proposing the Nucleic acid ...
and
Francis Crick
Francis Harry Compton Crick (8 June 1916 – 28 July 2004) was an English molecular biologist, biophysicist, and neuroscientist. He, James Watson, Rosalind Franklin, and Maurice Wilkins played crucial roles in deciphering the Nucleic acid doub ...
deduced the
structure of DNA, and nearly two decades before
Frederick Sanger
Frederick Sanger (; 13 August 1918 – 19 November 2013) was a British biochemist who received the Nobel Prize in Chemistry twice.
He won the 1958 Chemistry Prize for determining the amino acid sequence of insulin and numerous other prote ...
published the first method for rapid
DNA sequencing
DNA sequencing is the process of determining the nucleic acid sequence – the order of nucleotides in DNA. It includes any method or technology that is used to determine the order of the four bases: adenine, thymine, cytosine, and guanine. The ...
,
Richard Feynman
Richard Phillips Feynman (; May 11, 1918 – February 15, 1988) was an American theoretical physicist. He is best known for his work in the path integral formulation of quantum mechanics, the theory of quantum electrodynamics, the physics of t ...
, an American physicist, envisioned the
electron microscope
An electron microscope is a microscope that uses a beam of electrons as a source of illumination. It uses electron optics that are analogous to the glass lenses of an optical light microscope to control the electron beam, for instance focusing it ...
as the tool that would one day allow biologists to "see the order of
bases in the
DNA
Deoxyribonucleic acid (; DNA) is a polymer composed of two polynucleotide chains that coil around each other to form a double helix. The polymer carries genetic instructions for the development, functioning, growth and reproduction of al ...
chain".
[Feynman R. (1959]
There's plenty of room at the bottom
Caltech lecture. Feynman believed that if the electron microscope could be made powerful enough, then it would become possible to visualize the atomic structure of any and all chemical compounds, including DNA.
In 1970,
Albert Crewe developed the high-angle
annular dark-field imaging (HAADF) imaging technique in a
scanning transmission electron microscope. Using this technique, he visualized individual heavy atoms on thin amorphous carbon films. In 2010 Krivanek and colleagues reported several technical improvements to the HAADF method, including a combination of aberration corrected electron optics and low accelerating voltage. The latter is crucial for imaging biological objects, as it allows to reduce damage by the beam and increase the image contrast for light atoms. As a result, single atom substitutions in a boron nitride monolayer could be imaged.
Despite the invention of a multitude of chemical and fluorescent sequencing technologies, electron microscopy is still being explored as a means of performing single-molecule DNA sequencing. For example, in 2012 a collaboration between scientists at
Harvard University
Harvard University is a Private university, private Ivy League research university in Cambridge, Massachusetts, United States. Founded in 1636 and named for its first benefactor, the History of the Puritans in North America, Puritan clergyma ...
, the
University of New Hampshire
The University of New Hampshire (UNH) is a Public university, public Land-grant university, land-grant research university with its main campus in Durham, New Hampshire, United States. It was founded and incorporated in 1866 as a land grant coll ...
and
ZS Genetics demonstrated the ability to read long sequences of DNA using the technique,
however transmission electron microscopy DNA sequencing technology is still far from being commercially available.
Principle
The
electron microscope
An electron microscope is a microscope that uses a beam of electrons as a source of illumination. It uses electron optics that are analogous to the glass lenses of an optical light microscope to control the electron beam, for instance focusing it ...
has the capacity to obtain a resolution of up to 100 pm, whereby microscopic biomolecules and structures such as viruses, ribosomes, proteins, lipids, small molecules and even single atoms can be observed.
[Campbell NA and Reece JB. (2002) Biology (6th ed.). San Francisco: Benjamin Cummings. ]
Although
DNA
Deoxyribonucleic acid (; DNA) is a polymer composed of two polynucleotide chains that coil around each other to form a double helix. The polymer carries genetic instructions for the development, functioning, growth and reproduction of al ...
is visible when observed with the electron microscope, the resolution of the image obtained is not high enough to allow for deciphering the sequence of the individual
bases, ''i.e.'',
DNA sequencing
DNA sequencing is the process of determining the nucleic acid sequence – the order of nucleotides in DNA. It includes any method or technology that is used to determine the order of the four bases: adenine, thymine, cytosine, and guanine. The ...
. However, upon differential labeling of the DNA bases with heavy atoms or metals, it is possible to both visualize and distinguish between the individual bases. Therefore,
electron microscopy
An electron microscope is a microscope that uses a beam of electrons as a source of illumination. It uses electron optics that are analogous to the glass lenses of an optical light microscope to control the electron beam, for instance focusing i ...
in conjunction with differential heavy atom DNA labeling could be used to directly image the DNA in order to determine its sequence.
Workflow
Step 1 – DNA denaturation
As in a standard
polymerase chain reaction (PCR), the double stranded DNA molecules to be sequenced must be
denatured before the second strand can be synthesized with labeled nucleotides.
Step 2 – Heavy atom labeling
The elements that make up biological molecules (
C,
H,
N,
O,
P,
S) are too light (low
atomic number, Z) to be clearly visualized as individual atoms by
transmission electron microscopy
Transmission electron microscopy (TEM) is a microscopy technique in which a beam of electrons is transmitted through a specimen to form an image. The specimen is most often an ultrathin section less than 100 nm thick or a suspension on a g ...
. To circumvent this problem, the DNA
bases can be labeled with heavier atoms (higher Z). Each
nucleotide
Nucleotides are Organic compound, organic molecules composed of a nitrogenous base, a pentose sugar and a phosphate. They serve as monomeric units of the nucleic acid polymers – deoxyribonucleic acid (DNA) and ribonucleic acid (RNA), both o ...
is tagged with a characteristic heavy label, so that they can be distinguished in the transmission electron micrograph.
* ZS Genetics proposes using three heavy labels:
bromine
Bromine is a chemical element; it has chemical symbol, symbol Br and atomic number 35. It is a volatile red-brown liquid at room temperature that evaporates readily to form a similarly coloured vapour. Its properties are intermediate between th ...
(Z=35),
iodine
Iodine is a chemical element; it has symbol I and atomic number 53. The heaviest of the stable halogens, it exists at standard conditions as a semi-lustrous, non-metallic solid that melts to form a deep violet liquid at , and boils to a vi ...
(Z=53), and
trichloromethane
Chloroform, or trichloromethane (often abbreviated as TCM), is an organochloride with the chemical formula, formula and a common solvent. It is a Volatility (chemistry), volatile, colorless, sweet-smelling, dense liquid produced on a large scale ...
(total Z=63). These would appear as differential dark and light spots on the micrograph, and the fourth DNA base would remain unlabeled.
* Halcyon Molecular, in collaboration with the Toste group, proposes that
purine
Purine is a heterocyclic aromatic organic compound that consists of two rings (pyrimidine and imidazole) fused together. It is water-soluble. Purine also gives its name to the wider class of molecules, purines, which include substituted puri ...
and
pyrimidine
Pyrimidine (; ) is an aromatic, heterocyclic, organic compound similar to pyridine (). One of the three diazines (six-membered heterocyclics with two nitrogen atoms in the ring), it has nitrogen atoms at positions 1 and 3 in the ring. The oth ...
bases can be functionalized with platinum diamine or osmium tetraoxide bipyridine, respectively. Heavy metal atoms such as
osmium
Osmium () is a chemical element; it has Symbol (chemistry), symbol Os and atomic number 76. It is a hard, brittle, bluish-white transition metal in the platinum group that is found as a Abundance of elements in Earth's crust, trace element in a ...
(Z=76),
iridium
Iridium is a chemical element; it has the symbol Ir and atomic number 77. This very hard, brittle, silvery-white transition metal of the platinum group, is considered the second-densest naturally occurring metal (after osmium) with a density ...
(Z=77),
gold
Gold is a chemical element; it has chemical symbol Au (from Latin ) and atomic number 79. In its pure form, it is a brightness, bright, slightly orange-yellow, dense, soft, malleable, and ductile metal. Chemically, gold is a transition metal ...
(Z=79), or
uranium
Uranium is a chemical element; it has chemical symbol, symbol U and atomic number 92. It is a silvery-grey metal in the actinide series of the periodic table. A uranium atom has 92 protons and 92 electrons, of which 6 are valence electrons. Ura ...
(Z=92) can then form metal-metal bonds with these functional groups to label the individual bases.
Step 3 – DNA alignment on substrate
The DNA molecules must be stretched out on a thin, solid substrate so that order of the labeled bases will be clearly visible on the electron micrograph.
Molecular combing is a technique that utilizes the force of a receding air-water interface to extend DNA molecules, leaving them irreversibly bound to a silane layer once dry.
This is one means by which alignment of the DNA on a solid substrate may be achieved.
Step 4 – TEM imaging

Transmission electron microscopy (TEM) produces high magnification,
high resolution images by passing a beam of electrons through a very thin sample. Whereas atomic resolution has been demonstrated with conventional TEM, further improvement in spatial resolution requires correcting the
spherical
A sphere (from Ancient Greek, Greek , ) is a surface (mathematics), surface analogous to the circle, a curve. In solid geometry, a sphere is the Locus (mathematics), set of points that are all at the same distance from a given point in three ...
and chromatic aberrations of the microscope
lenses
A lens is a transmissive optical device that focuses or disperses a light beam by means of refraction. A simple lens consists of a single piece of transparent material, while a compound lens consists of several simple lenses (''elements''), ...
. This has only been possible in
where the image is obtained by scanning the object with a finely focused electron beam, in a way similar to a
cathode ray tube
A cathode-ray tube (CRT) is a vacuum tube containing one or more electron guns, which emit electron beams that are manipulated to display images on a phosphorescent screen. The images may represent electrical waveforms on an oscilloscope, a ...
. However, the achieved improvement in resolution comes together with irradiation of the studied object by much higher beam intensities, the concomitant sample damage and the associated imaging artefacts.
Different imaging techniques are applied depending on whether the sample contains heavy or light atoms:
*
Annular dark-field imaging measures the scattering of electrons as they deflect off the nuclei of the atoms in the transmission electron microscopy sample.
This is best suited to samples containing heavy atoms, as they cause more scattering of electrons. The technique has been used to image atoms as light as
boron
Boron is a chemical element; it has symbol B and atomic number 5. In its crystalline form it is a brittle, dark, lustrous metalloid; in its amorphous form it is a brown powder. As the lightest element of the boron group it has three ...
,
nitrogen
Nitrogen is a chemical element; it has Symbol (chemistry), symbol N and atomic number 7. Nitrogen is a Nonmetal (chemistry), nonmetal and the lightest member of pnictogen, group 15 of the periodic table, often called the Pnictogen, pnictogens. ...
, and
carbon
Carbon () is a chemical element; it has chemical symbol, symbol C and atomic number 6. It is nonmetallic and tetravalence, tetravalent—meaning that its atoms are able to form up to four covalent bonds due to its valence shell exhibiting 4 ...
;
however, the signal is very weak for such light atoms. If annular dark-field microscopy is put to use for transmission electron microscopy DNA sequencing, it will certainly be necessary to label the DNA bases with heavy atoms so that a strong signal can be detected.
* Annular bright-field imaging detects electrons transmitted directly through the sample, and measures the wave interference produced by their interactions with the atomic nuclei. This technique can detect light atoms with greater sensitivity than annular dark-field imaging methods. In fact,
oxygen
Oxygen is a chemical element; it has chemical symbol, symbol O and atomic number 8. It is a member of the chalcogen group (periodic table), group in the periodic table, a highly reactivity (chemistry), reactive nonmetal (chemistry), non ...
,
nitrogen
Nitrogen is a chemical element; it has Symbol (chemistry), symbol N and atomic number 7. Nitrogen is a Nonmetal (chemistry), nonmetal and the lightest member of pnictogen, group 15 of the periodic table, often called the Pnictogen, pnictogens. ...
,
lithium
Lithium (from , , ) is a chemical element; it has chemical symbol, symbol Li and atomic number 3. It is a soft, silvery-white alkali metal. Under standard temperature and pressure, standard conditions, it is the least dense metal and the ...
,
and
hydrogen
Hydrogen is a chemical element; it has chemical symbol, symbol H and atomic number 1. It is the lightest and abundance of the chemical elements, most abundant chemical element in the universe, constituting about 75% of all baryon, normal matter ...
in crystalline solids have been imaged using annular bright-field electron microscopy. Thus, it is theoretically possible to obtain direct images of the atoms in the DNA chain; however, the structure of DNA is much less geometric than crystalline solids, so direct imaging without prior labeling may not be achievable.
Step 5 – Data analysis
Dark and bright spots on the electron micrograph, corresponding to the differentially labeled DNA bases, are analyzed by computer software.
Applications
Transmission electron microscopy DNA sequencing is not yet commercially available, but the long read lengths that this technology may one day provide will make it useful in a variety of contexts.
''De novo'' genome assembly
When sequencing a genome, it must be broken down into pieces that are short enough to be sequenced in a single read. These reads must then be put back together like a jigsaw puzzle by aligning the regions that overlap between reads; this process is called ''de novo''
genome assembly. The longer the read length that a sequencing platform provides, the longer the overlapping regions, and the easier it is to assemble the genome. From a computational perspective,
microfluidic Sanger sequencing is still the most effective way to sequence and assemble genomes for which no
reference genome
A reference genome (also known as a reference assembly) is a digital nucleic acid sequence database, assembled by scientists as a representative example of the genome, set of genes in one idealized individual organism of a species. As they are a ...
sequence exists. The relatively long read lengths provide substantial overlap between individual sequencing reads, which allows for greater statistical confidence in the assembly. In addition, long Sanger reads are able to span most regions of
repetitive DNA sequence which otherwise confound sequence assembly by causing false alignments. However, ''de novo'' genome assembly by Sanger sequencing is extremely expensive and time-consuming.
Second generation sequencing technologies,
while less expensive, are generally unfit for ''de novo'' genome assembly due to short read lengths. In general, third generation sequencing technologies,
including transmission electron microscopy DNA sequencing, aim to improve read length while maintaining low sequencing cost. Thus, as third generation sequencing technologies improve, rapid and inexpensive ''de novo'' genome assembly will become a reality.
Full haplotypes
A
haplotype
A haplotype (haploid genotype) is a group of alleles in an organism that are inherited together from a single parent.
Many organisms contain genetic material (DNA) which is inherited from two parents. Normally these organisms have their DNA orga ...
is a series of linked
allele
An allele is a variant of the sequence of nucleotides at a particular location, or Locus (genetics), locus, on a DNA molecule.
Alleles can differ at a single position through Single-nucleotide polymorphism, single nucleotide polymorphisms (SNP), ...
s that are inherited together on a single chromosome. DNA sequencing can be used to
genotype
The genotype of an organism is its complete set of genetic material. Genotype can also be used to refer to the alleles or variants an individual carries in a particular gene or genetic location. The number of alleles an individual can have in a ...
all of the
single nucleotide polymorphisms (SNPs) that constitute a haplotype. However, short DNA sequencing reads often cannot be phased; that is,
heterozygous
Zygosity (the noun, zygote, is from the Greek "yoked," from "yoke") () is the degree to which both copies of a chromosome or gene have the same genetic sequence. In other words, it is the degree of similarity of the alleles in an organism.
Mos ...
variants cannot be confidently assigned to the correct haplotype. In fact, haplotyping with short read DNA sequencing data requires very high coverage (average >50x coverage of each DNA base) to accurately identify SNPs, as well as additional sequence data from the parents so that
Mendelian transmission can be used to estimate the haplotypes.
Sequencing technologies that generate long reads, including transmission electron microscopy DNA sequencing, can capture entire haploblocks in a single read. That is, haplotypes are not broken up among multiple reads, and the genetically linked alleles remain together in the sequencing data. Therefore, long reads make haplotyping easier and more accurate, which is beneficial to the field of
population genetics
Population genetics is a subfield of genetics that deals with genetic differences within and among populations, and is a part of evolutionary biology. Studies in this branch of biology examine such phenomena as Adaptation (biology), adaptation, s ...
.
Copy number variants
Genes are normally present in two copies in the
diploid
Ploidy () is the number of complete sets of chromosomes in a cell, and hence the number of possible alleles for autosomal and pseudoautosomal genes. Here ''sets of chromosomes'' refers to the number of maternal and paternal chromosome copies, ...
human genome; genes that deviate from this standard copy number are referred to as
copy number variants (CNVs). Copy number variation can be benign (these are usually common variants, called copy number polymorphisms) or pathogenic.
CNVs are detected by
fluorescence in situ hybridization (FISH) or
comparative genomic hybridization (CGH). To detect the specific breakpoints at which a deletion occurs, or to detect genomic lesions introduced by a duplication or amplification event, CGH can be performed using a
tiling array (
array CGH), or the variant region can be sequenced. Long sequencing reads are especially useful for analyzing duplications or amplifications, as it is possible to analyze the orientation of the amplified segments if they are captured in a single sequencing read.
Cancer
Cancer genomics, or
oncogenomics, is an emerging field in which
high-throughput, second generation DNA sequencing technology is being applied to sequence entire cancer genomes. Analyzing this short read sequencing data encompasses all of the problems associated with ''de novo'' genome assembly using short read data.
Furthermore, cancer genomes are often
aneuploid.
These aberrations, which are essentially large scale copy number variants, can be analyzed by second-generation sequencing technologies using read frequency to estimate the copy number.
Longer reads would, however, provide a more accurate picture of copy number, orientation of amplified regions, and SNPs present in cancer genomes.
Microbiome sequencing
The
microbiome
A microbiome () is the community of microorganisms that can usually be found living together in any given habitat. It was defined more precisely in 1988 by Whipps ''et al.'' as "a characteristic microbial community occupying a reasonably wel ...
refers the total collection of microbes present in a microenvironment and their respective genomes. For example, an estimated 100 trillion microbial cells colonize the human body at any given time.
The human microbiome is of particular interest, as these
commensal bacteria are important for human health and immunity. Most of the Earth's bacterial genomes have not yet been sequenced; undertaking a microbiome sequencing project would require extensive ''de novo'' genome assembly, a prospect which is daunting with short read DNA sequencing technologies.
Longer reads would greatly facilitate the assembly of new microbial genomes.
Strengths and weaknesses
Compared to other second- and third-generation DNA sequencing technologies, transmission electron microscopy DNA sequencing has a number of potential key strengths and weaknesses, which will ultimately determine its usefulness and prominence as a future DNA sequencing technology.
Strengths
* Longer read lengths: ZS Genetics has estimated potential read lengths of transmission electron microscopy DNA sequencing to be 10,000 to 20,000 base pairs with a rate of 1.7 billion base pairs per day.
Such long read lengths would allow easier ''de novo'' genome assembly and direct detection of haplotypes, among other applications.
* Lower cost: Transmission electron microscopy DNA sequencing is estimated to cost just US$5,000-US$10,000 per human genome, compared to the more expensive second-generation DNA sequencing alternatives.
* No dephasing: Dephasing of the DNA strands due to loss in synchronicity during synthesis is a major problem of second-generation sequencing technologies. For transmission electron microscopy DNA sequencing and several other third-generation sequencing technologies, synchronization of the reads is unnecessary as only one molecule is being read at a time.
* Shorter turnaround time: The capacity to read native fragments of DNA renders complex template preparation an unnecessary step in the general workflow of whole genome sequencing. Consequently, shorter turnaround times are possible.
Weaknesses
* High capital cost: A transmission electron microscope with sufficient resolution required for transmission electron microscopy DNA sequencing costs approximately US$1,000,000, therefore pursuing DNA sequencing by this method requires a substantial investment.
* Technically challenging: Selective heavy atom labeling and attaching and straightening the labeled DNA to a substrate are a serious technical challenge.
Further, the DNA sample should be stable to the high vacuum of electron microscope and irradiation by a focused beam of high-energy electrons.
* Potential
PCR bias and artefacts: Although PCR is only being utilized in transmission electron microscopy DNA sequencing as a means to label the DNA strand with heavy atoms or metals, there could be the possibility of introducing bias in template representation or errors during the single amplification.
Comparison to other sequencing technologies
Many non-Sanger second- and third-generation DNA sequencing technologies have been or are currently being developed with the common aim of increasing throughput and decreasing cost such that personalized genetic medicine can be fully realized.
Both the US$10 million
Archon X Prize
The Archon Genomics X PRIZE presented by Express Scripts for Genomics, the second X Prize offered by the X Prize Foundation, based in Playa Vista, California, was announced on October 4, 2006 stating that the prize of "$10 million will be awarded ...
for Genomics supported by the
X Prize Foundation (Santa Monica, CA, USA) and the US$70 million in grant awards supported by the
National Human Genome Research Institute
The National Human Genome Research Institute (NHGRI) is an institute of the National Institutes of Health, located in Bethesda, Maryland.
NHGRI began as the Office of Human Genome Research in The Office of the Director in 1988. This Office transi ...
of the
National Institutes of Health
The National Institutes of Health (NIH) is the primary agency of the United States government responsible for biomedical and public health research. It was founded in 1887 and is part of the United States Department of Health and Human Service ...
(NIH-NHGRI) are fueling the rapid burst of research activity in the development of new DNA sequencing technologies.
Since different approaches, techniques, and strategies are what define each DNA sequencing technology, each has its own strengths and weaknesses. Comparison of important parameters between various second- and third-generation DNA sequencing technologies are presented in Table 1.
References
{{reflist, colwidth=30em
DNA sequencing
Electron microscopy
Laboratory techniques
Microbiology techniques
Molecular biology techniques
Scientific techniques