
An alpha solenoid (sometimes also known as an alpha horseshoe or as stacked pairs of alpha helices, abbreviated SPAH) is a
protein fold composed of
repeating alpha helix
The alpha helix (α-helix) is a common motif in the secondary structure of proteins and is a right hand-helix conformation in which every backbone N−H group hydrogen bonds to the backbone C=O group of the amino acid located four residues earl ...
subunits, commonly
helix-turn-helix
Helix-turn-helix is a DNA-binding protein (DBP). The helix-turn-helix (HTH) is a major structural motif capable of binding DNA. Each monomer incorporates two α helices, joined by a short strand of amino acids, that bind to the major groove o ...
motifs, arranged in
antiparallel fashion to form a superhelix.
Alpha solenoids are known for their flexibility and plasticity.
Like
beta propellers, alpha solenoids are a form of
solenoid protein domain
Solenoid protein domains are a highly modular type of protein domain. They consist of a chain of nearly identical folds, often simply called tandem repeats. They are extremely common among all types of proteins, though exact figures are unknown.
...
commonly found in the proteins comprising the
nuclear pore complex
A nuclear pore is a part of a large complex of proteins, known as a nuclear pore complex that spans the nuclear envelope, which is the double membrane surrounding the eukaryotic cell nucleus. There are approximately 1,000 nuclear pore complexe ...
.
They are also common in membrane coat proteins known as
coatomer
The coatomer is a protein complex that coats membrane-bound transport vesicles. Two types of coatomers are known:
*COPI (retrograde transport from trans-Golgi network to cis-Golgi network and endoplasmic reticulum)
*COPII (anterograde transport ...
s, such as
clathrin
Clathrin is a protein that plays a major role in the formation of coated vesicles. Clathrin was first isolated and named by Barbara Pearse in 1976. It forms a triskelion shape composed of three clathrin heavy chains and three light chains. Wh ...
, and in
regulatory protein
Regulation of gene expression, or gene regulation, includes a wide range of mechanisms that are used by cells to increase or decrease the production of specific gene products (protein or RNA). Sophisticated programs of gene expression are wide ...
s that form extensive
protein-protein interactions with their binding partners.
Examples of alpha solenoid structures binding
RNA
Ribonucleic acid (RNA) is a polymeric molecule essential in various biological roles in coding, decoding, regulation and expression of genes. RNA and deoxyribonucleic acid ( DNA) are nucleic acids. Along with lipids, proteins, and carbohydra ...
and
lipid
Lipids are a broad group of naturally-occurring molecules which includes fats, waxes, sterols, fat-soluble vitamins (such as vitamins A, D, E and K), monoglycerides, diglycerides, phospholipids, and others. The functions of lipids incl ...
s have also been described.
Terminology and classification
The term "alpha solenoid" has been used somewhat inconsistently in the literature.
As originally defined, alpha solenoids were composed of
helix-turn-helix
Helix-turn-helix is a DNA-binding protein (DBP). The helix-turn-helix (HTH) is a major structural motif capable of binding DNA. Each monomer incorporates two α helices, joined by a short strand of amino acids, that bind to the major groove o ...
motifs that stacked into an open superhelix.
However, protein structural classification systems have used varying terminology; the
Structural Classification of Proteins
The Structural Classification of Proteins (SCOP) database is a largely manual classification of protein structural domains based on similarities of their structures and amino acid sequences. A motivation for this classification is to determine ...
(SCOP) database describes these proteins using the term "alpha alpha superhelix". The
CATH database uses the term "alpha horseshoe"
for these proteins, and uses "alpha solenoid" for a somewhat different and more compact structure exemplified by the
peridinin-chlorophyll binding protein.
Structure
Alpha solenoid proteins are composed of repeating structural units containing at least two
alpha helices
The alpha helix (α-helix) is a common motif in the secondary structure of proteins and is a right hand-helix conformation in which every backbone N−H group hydrogen bonds to the backbone C=O group of the amino acid located four residues earli ...
arranged in an
antiparallel orientation. Often the repeating unit is a
helix-turn-helix
Helix-turn-helix is a DNA-binding protein (DBP). The helix-turn-helix (HTH) is a major structural motif capable of binding DNA. Each monomer incorporates two α helices, joined by a short strand of amino acids, that bind to the major groove o ...
motif, but it can be more elaborate, as in variants with an additional helix in the turn segment.
Alpha solenoids can be formed by several different types of helical
tandem repeats, including
HEAT repeats,
Armadillo repeats,
tetratricopeptide (TPR) repeats,
leucine-rich repeat
A leucine-rich repeat (LRR) is a protein structural motif that forms an α/β horseshoe fold. It is composed of repeating 20–30 amino acid stretches that are unusually rich in the hydrophobic amino acid leucine. These tandem repeats common ...
s, and
ankyrin repeat
The ankyrin repeat is a 33-residue motif in proteins consisting of two alpha helices separated by loops, first discovered in signaling proteins in yeast Cdc10 and ''Drosophila'' Notch. Domains consisting of ankyrin tandem repeats mediate prot ...
s.
Alpha solenoids have unusual elasticity and flexibility relative to
globular protein
In biochemistry, globular proteins or spheroproteins are spherical ("globe-like") proteins and are one of the common protein types (the others being fibrous, disordered and membrane proteins). Globular proteins are somewhat water-soluble (fo ...
s.
They are sometimes considered to occupy an intermediate position between globular proteins and fibrous
structural proteins, distinct from the latter in part due to the alpha solenoids' lack of need for intermolecular interactions to maintain their structure.
The extent of the curvature of an alpha solenoid superhelix varies considerably among the class, resulting in the ability of these proteins to form large, extended
protein-protein interaction surfaces or to form deep concave areas for binding globular proteins.
Because they are composed of repeating relatively short subunits, alpha solenoids can acquire additional subunits relatively easily, resulting in new interaction surface properties.
As a result, known alpha solenoid proteins vary substantially in length.
Function
Nuclear pore complex components
Alpha solenoids feature prominently in the proteins making up the
nuclear pore complex
A nuclear pore is a part of a large complex of proteins, known as a nuclear pore complex that spans the nuclear envelope, which is the double membrane surrounding the eukaryotic cell nucleus. There are approximately 1,000 nuclear pore complexe ...
(NPC); alpha solenoid and
beta propeller domains together account for up to half of the core NPC scaffold by mass.
A large number of the conserved
nucleoporin
Nucleoporins are a family of proteins which are the constituent building blocks of the nuclear pore complex (NPC). The nuclear pore complex is a massive structure embedded in the nuclear envelope at sites where the inner and outer nuclear membr ...
proteins forming the NPC are either alpha solenoid proteins or consist of a beta propeller domain at the
N-terminus
The N-terminus (also known as the amino-terminus, NH2-terminus, N-terminal end or amine-terminus) is the start of a protein or polypeptide, referring to the free amine group (-NH2) located at the end of a polypeptide. Within a peptide, the ami ...
and an alpha solenoid at the
C-terminus
The C-terminus (also known as the carboxyl-terminus, carboxy-terminus, C-terminal tail, C-terminal end, or COOH-terminus) is the end of an amino acid chain (protein or polypeptide), terminated by a free carboxyl group (-COOH). When the protein i ...
.
This latter domain architecture also occurs in
clathrin
Clathrin is a protein that plays a major role in the formation of coated vesicles. Clathrin was first isolated and named by Barbara Pearse in 1976. It forms a triskelion shape composed of three clathrin heavy chains and three light chains. Wh ...
and
Sec31
SEC31 is a protein which in yeast promotes the formation of COPII transport vesicles from the Endoplasmic Reticulum (ER). The coat has two main functions, the physical deformation of the endoplasmic reticulum membrane into vesicles and the selectio ...
, and was thought to be unique to
eukaryote
Eukaryotes () are organisms whose cells have a nucleus. All animals, plants, fungi, and many unicellular organisms, are Eukaryotes. They belong to the group of organisms Eukaryota or Eukarya, which is one of the three domains of life. Bact ...
s,
though a few examples have been reported in
planctomycetes.
Vesicle coat proteins

Vesicle coat proteins frequently contain alpha solenoids and share common domain architecture with some NPC proteins.
Three major coat complexes involved in distinct cellular pathways all contain alpha solenoid proteins: the
clathrin
Clathrin is a protein that plays a major role in the formation of coated vesicles. Clathrin was first isolated and named by Barbara Pearse in 1976. It forms a triskelion shape composed of three clathrin heavy chains and three light chains. Wh ...
/
adaptin complex, which buds vesicles from the
plasma membrane
The cell membrane (also known as the plasma membrane (PM) or cytoplasmic membrane, and historically referred to as the plasmalemma) is a biological membrane that separates and protects the interior of all cells from the outside environment (t ...
and is involved in
endocytosis
Endocytosis is a cellular process in which substances are brought into the cell. The material to be internalized is surrounded by an area of cell membrane, which then buds off inside the cell to form a vesicle containing the ingested material. E ...
; the
COPI
COPI is a coatomer, a protein complex that coats vesicles transporting proteins from the ''cis'' end of the Golgi complex back to the rough endoplasmic reticulum (ER), where they were originally synthesized, and between Golgi compartments. This ...
complex, which buds vesicles from the
Golgi apparatus
The Golgi apparatus (), also known as the Golgi complex, Golgi body, or simply the Golgi, is an organelle found in most eukaryotic cells. Part of the endomembrane system in the cytoplasm, it packages proteins into membrane-bound vesicles ...
and is associated with
retrograde transport; and the
COPII
The Coat Protein Complex II, or COPII, is a group of proteins that facilitate the formation of vesicles to transport proteins from the endoplasmic reticulum to the Golgi apparatus or endoplasmic-reticulum–Golgi intermediate compartment. This pr ...
complex, which buds vesicles from the
endoplasmic reticulum
The endoplasmic reticulum (ER) is, in essence, the transportation system of the eukaryotic cell, and has many other important functions such as protein folding. It is a type of organelle made up of two subunits – rough endoplasmic reticulum ( ...
and is associated with
anterograde transport.
Transport proteins
Due to their propensity for forming large interaction surfaces well-suited to
protein-protein interactions, and their flexible surfaces permitting binding of various cargo molecules, alpha solenoid proteins commonly function as
transport protein
A transport protein (variously referred to as a transmembrane pump, transporter, escort protein, acid transport protein, cation transport protein, or anion transport protein) is a protein that serves the function of moving other materials within ...
s, particularly in transport between the
nucleus
Nucleus ( : nuclei) is a Latin word for the seed inside a fruit. It most often refers to:
*Atomic nucleus, the very dense central region of an atom
* Cell nucleus, a central organelle of a eukaryotic cell, containing most of the cell's DNA
Nucl ...
and the
cytoplasm
In cell biology, the cytoplasm is all of the material within a eukaryotic cell, enclosed by the cell membrane, except for the cell nucleus. The material inside the nucleus and contained within the nuclear membrane is termed the nucleoplasm. ...
.
For example, the beta-
karyopherin
Karyopherins are proteins involved in transporting molecules between the cytoplasm and the nucleus of a eukaryotic cell. The inside of the nucleus is called the karyoplasm (or nucleoplasm). Generally, karyopherin-mediated transport occurs through ...
superfamily consists of alpha solenoid proteins formed from
HEAT repeats;
importin
Importin is a type of karyopherin that transports protein molecules from the cell's cytoplasm to the nucleus. It does so by binding to specific recognition sequences, called nuclear localization sequences (NLS).
Importin has two subunits, imp ...
beta is a member of this family, and its
adaptor protein importin alpha is an alpha solenoid formed from
Armadillo repeats.
Transporters of other molecules, such as
RNA
Ribonucleic acid (RNA) is a polymeric molecule essential in various biological roles in coding, decoding, regulation and expression of genes. RNA and deoxyribonucleic acid ( DNA) are nucleic acids. Along with lipids, proteins, and carbohydra ...
, can also be of alpha solenoid architecture, as in
exportin-5 or
pentatricopeptide-repeat-containing RNA-binding proteins, which are particularly common in plants.
Regulatory proteins

The protein-protein interaction capacity of alpha solenoid proteins also makes them well suited to function as
regulatory protein
Regulation of gene expression, or gene regulation, includes a wide range of mechanisms that are used by cells to increase or decrease the production of specific gene products (protein or RNA). Sophisticated programs of gene expression are wide ...
s. For example, regulatory subunit A (also known as PR65) of
protein phosphatase 2A is a HEAT-repeat alpha solenoid whose conformational flexibility regulates access to the enzyme binding site.
Taxonomic distribution
Alpha solenoid proteins are found in all
domains of life
In biological taxonomy, a domain ( or ) ( Latin: ''regio''), also dominion, superkingdom, realm, or empire, is the highest taxonomic rank of all organisms taken together. It was introduced in the three-domain system of taxonomy devised by Ca ...
; however, their frequencies in different
proteome
The proteome is the entire set of proteins that is, or can be, expressed by a genome, cell, tissue, or organism at a certain time. It is the set of expressed proteins in a given type of cell or organism, at a given time, under defined conditions. ...
s vary significantly. They are rare in
virus
A virus is a wikt:submicroscopic, submicroscopic infectious agent that replicates only inside the living Cell (biology), cells of an organism. Viruses infect all life forms, from animals and plants to microorganisms, including bacteria and ...
es and
bacteria
Bacteria (; singular: bacterium) are ubiquitous, mostly free-living organisms often consisting of one biological cell. They constitute a large domain of prokaryotic microorganisms. Typically a few micrometres in length, bacteria were am ...
, somewhat more common in
archaea, and quite common in
eukaryote
Eukaryotes () are organisms whose cells have a nucleus. All animals, plants, fungi, and many unicellular organisms, are Eukaryotes. They belong to the group of organisms Eukaryota or Eukarya, which is one of the three domains of life. Bact ...
s. Many of the eukaryotic alpha solenoid proteins have detectable homologs only in other eukaryotes and are often restricted even further, to the
chordate
A chordate () is an animal of the phylum Chordata (). All chordates possess, at some point during their larval or adult stages, five synapomorphies, or primary physical characteristics, that distinguish them from all the other taxa. These five ...
s.
Prokaryotic
A prokaryote () is a single-celled organism that lacks a nucleus and other membrane-bound organelles. The word ''prokaryote'' comes from the Greek πρό (, 'before') and κάρυον (, 'nut' or 'kernel').Campbell, N. "Biology:Concepts & Connec ...
alpha solenoid proteins are concentrated in particular taxa, notably the
cyanobacteria and
planctomycetes, which have unusually complex intracellular compartmentalization relative to most prokaryotes.
Evolution
Evolutionary relationships between different alpha solenoid proteins are difficult to trace due to the low
sequence homology
Sequence homology is the biological homology between DNA, RNA, or protein sequences, defined in terms of shared ancestry in the evolutionary history of life. Two segments of DNA can have shared ancestry because of three phenomena: either a sp ...
of the repeats.
Convergent evolution
Convergent evolution is the independent evolution of similar features in species of different periods or epochs in time. Convergent evolution creates analogous structures that have similar form or function but were not present in the last com ...
of similar protein structures from ancestrally unrelated proteins is thought to be significant in the evolutionary history of this fold class.
Nuclear pore complexes and vesicle transport
The
nuclear pore complex
A nuclear pore is a part of a large complex of proteins, known as a nuclear pore complex that spans the nuclear envelope, which is the double membrane surrounding the eukaryotic cell nucleus. There are approximately 1,000 nuclear pore complexe ...
is an extremely large
protein complex
A protein complex or multiprotein complex is a group of two or more associated polypeptide chains. Protein complexes are distinct from multienzyme complexes, in which multiple catalytic domains are found in a single polypeptide chain.
Protein ...
that mediates transit into and out of the
cell nucleus. Homologous structures from which the NPC might have evolved have not been detected in prokaryotic transmembrane transport proteins; however, it has been suggested that the NPC components show distinct homology to vesicle coat proteins found in
clathrin
Clathrin is a protein that plays a major role in the formation of coated vesicles. Clathrin was first isolated and named by Barbara Pearse in 1976. It forms a triskelion shape composed of three clathrin heavy chains and three light chains. Wh ...
/
adaptin,
COPI
COPI is a coatomer, a protein complex that coats vesicles transporting proteins from the ''cis'' end of the Golgi complex back to the rough endoplasmic reticulum (ER), where they were originally synthesized, and between Golgi compartments. This ...
, and
COPII
The Coat Protein Complex II, or COPII, is a group of proteins that facilitate the formation of vesicles to transport proteins from the endoplasmic reticulum to the Golgi apparatus or endoplasmic-reticulum–Golgi intermediate compartment. This pr ...
complexes. Most distinctively, a shared domain architecture consisting of an N-terminal
beta propeller and a C-terminal alpha solenoid has been detected in both NPC and coat proteins, suggesting a possible common origin.
An ancestral "protocoatomer" that diversified to acquire derived characteristics of all four modern complexes has been proposed.
Examination of the genome of
Lokiarchaeum, thought to be among the closest
archaeal relatives to eukaryotes, did not reveal any examples of the beta propeller/alpha solenoid domain architecture, although homologs of other proteins involved in eukaryotic membrane trafficking were identified. However, it is unclear whether this observation means that the propeller/solenoid architecture evolved later or was lost from modern lokiarchaea.
Membrane coat proteins in prokaryotes
A survey of the sequenced genomes of complex prokaryotes from the
PVC superphylum
The PVC superphylum is a superphylum of bacteria named after its three important members, Planctomycetota, Verrucomicrobiota, and Chlamydiota. Cavalier-Smith postulated that the PVC bacteria probably lost or reduced their peptidoglycan cell wall ...
(
Planctomycetota
The Planctomycetota are a phylum of widely distributed bacteria, occurring in both aquatic and terrestrial habitats. They play a considerable role in global carbon and nitrogen cycles, with many species of this phylum capable of anaerobic ammoniu ...
-
Verrucomicrobiota
Verrucomicrobiota is a phylum of Gram-negative bacteria that contains only a few described species. The species identified have been isolated from fresh water, marine and soil environments and human faeces. A number of as-yet uncultivated specie ...
-
Chlamydiota
The Chlamydiota (synonym Chlamydiae) are a bacterial phylum and class whose members are remarkably diverse, including pathogens of humans and animals, symbionts of ubiquitous protozoa, and marine sediment forms not yet well understood. All of ...
) identified examples of proteins with homology to eukaryotic membrane trafficking proteins, including examples of the distinctive beta-propeller/alpha-solenoid domain architecture previously believed to be unique to eukaryotes.
The PVC superphylum is known for containing bacteria with unusually complex membrane morphology, and this discovery has been cited as evidence in favor of these organisms' status as an intermediate form between prokaryotes and eukaryotes. The planctomycete ''
Gemmata obscuriglobus
''Gemmata obscuriglobus'' is a species of Gram-negative, aerobic, heterotrophic bacteria of the phylum Planctomycetota. ''G. obscuriglobus'' occur in freshwater habitats and was first described in 1984, and is the only described species in its ...
'' has exceptionally complex membrane architecture and has been a source of controversy in the literature regarding the possibility that it has a membrane-bound "nucleoid" compartment enclosing its DNA.
The identification of proteins with sequence similarities to HEAT repeats in the ''G. obscuriglobus''
proteome
The proteome is the entire set of proteins that is, or can be, expressed by a genome, cell, tissue, or organism at a certain time. It is the set of expressed proteins in a given type of cell or organism, at a given time, under defined conditions. ...
has been interpreted as support for the membrane-bound nucleoid hypothesis;
however, this has been disputed.
Bioinformatics
Low sequence similarity among alpha solenoid proteins of similar structure has impeded their identification using
bioinformatics
Bioinformatics () is an interdisciplinary field that develops methods and software tools for understanding biological data, in particular when the data sets are large and complex. As an interdisciplinary field of science, bioinformatics combin ...
methods, since the repeats are often not well defined in sequence. A large number of different computational methods have been developed to identify candidate alpha solenoid proteins based on their
amino acid sequence
Protein primary structure is the linear sequence of amino acids in a peptide or protein. By convention, the primary structure of a protein is reported starting from the amino-terminal (N) end to the carboxyl-terminal (C) end. Protein biosynthesi ...
.
External links
RepeatsDBα-solenoid class
References
{{Protein tandem repeats
Protein folds