HOME

TheInfoList



OR:

An alpha solenoid (sometimes also known as an alpha horseshoe or as stacked pairs of alpha helices, abbreviated SPAH) is a protein fold composed of repeating
alpha helix The alpha helix (α-helix) is a common motif in the secondary structure of proteins and is a right hand-helix conformation in which every backbone N−H group hydrogen bonds to the backbone C=O group of the amino acid located four residues earl ...
subunits, commonly
helix-turn-helix Helix-turn-helix is a DNA-binding protein (DBP). The helix-turn-helix (HTH) is a major structural motif capable of binding DNA. Each monomer incorporates two α helices, joined by a short strand of amino acids, that bind to the major groove o ...
motifs, arranged in antiparallel fashion to form a superhelix. Alpha solenoids are known for their flexibility and plasticity. Like beta propellers, alpha solenoids are a form of
solenoid protein domain Solenoid protein domains are a highly modular type of protein domain. They consist of a chain of nearly identical folds, often simply called tandem repeats. They are extremely common among all types of proteins, though exact figures are unknown. ...
commonly found in the proteins comprising the
nuclear pore complex A nuclear pore is a part of a large complex of proteins, known as a nuclear pore complex that spans the nuclear envelope, which is the double membrane surrounding the eukaryotic cell nucleus. There are approximately 1,000 nuclear pore complexe ...
. They are also common in membrane coat proteins known as
coatomer The coatomer is a protein complex that coats membrane-bound transport vesicles. Two types of coatomers are known: *COPI (retrograde transport from trans-Golgi network to cis-Golgi network and endoplasmic reticulum) *COPII (anterograde transport ...
s, such as
clathrin Clathrin is a protein that plays a major role in the formation of coated vesicles. Clathrin was first isolated and named by Barbara Pearse in 1976. It forms a triskelion shape composed of three clathrin heavy chains and three light chains. Wh ...
, and in
regulatory protein Regulation of gene expression, or gene regulation, includes a wide range of mechanisms that are used by cells to increase or decrease the production of specific gene products (protein or RNA). Sophisticated programs of gene expression are wide ...
s that form extensive protein-protein interactions with their binding partners. Examples of alpha solenoid structures binding
RNA Ribonucleic acid (RNA) is a polymeric molecule essential in various biological roles in coding, decoding, regulation and expression of genes. RNA and deoxyribonucleic acid ( DNA) are nucleic acids. Along with lipids, proteins, and carbohydra ...
and
lipid Lipids are a broad group of naturally-occurring molecules which includes fats, waxes, sterols, fat-soluble vitamins (such as vitamins A, D, E and K), monoglycerides, diglycerides, phospholipids, and others. The functions of lipids incl ...
s have also been described.


Terminology and classification

The term "alpha solenoid" has been used somewhat inconsistently in the literature. As originally defined, alpha solenoids were composed of
helix-turn-helix Helix-turn-helix is a DNA-binding protein (DBP). The helix-turn-helix (HTH) is a major structural motif capable of binding DNA. Each monomer incorporates two α helices, joined by a short strand of amino acids, that bind to the major groove o ...
motifs that stacked into an open superhelix. However, protein structural classification systems have used varying terminology; the
Structural Classification of Proteins The Structural Classification of Proteins (SCOP) database is a largely manual classification of protein structural domains based on similarities of their structures and amino acid sequences. A motivation for this classification is to determine ...
(SCOP) database describes these proteins using the term "alpha alpha superhelix". The CATH database uses the term "alpha horseshoe" for these proteins, and uses "alpha solenoid" for a somewhat different and more compact structure exemplified by the peridinin-chlorophyll binding protein.


Structure

Alpha solenoid proteins are composed of repeating structural units containing at least two
alpha helices The alpha helix (α-helix) is a common motif in the secondary structure of proteins and is a right hand-helix conformation in which every backbone N−H group hydrogen bonds to the backbone C=O group of the amino acid located four residues earli ...
arranged in an antiparallel orientation. Often the repeating unit is a
helix-turn-helix Helix-turn-helix is a DNA-binding protein (DBP). The helix-turn-helix (HTH) is a major structural motif capable of binding DNA. Each monomer incorporates two α helices, joined by a short strand of amino acids, that bind to the major groove o ...
motif, but it can be more elaborate, as in variants with an additional helix in the turn segment. Alpha solenoids can be formed by several different types of helical tandem repeats, including HEAT repeats, Armadillo repeats, tetratricopeptide (TPR) repeats,
leucine-rich repeat A leucine-rich repeat (LRR) is a protein structural motif that forms an α/β horseshoe fold. It is composed of repeating 20–30 amino acid stretches that are unusually rich in the hydrophobic amino acid leucine. These tandem repeats common ...
s, and
ankyrin repeat The ankyrin repeat is a 33-residue motif in proteins consisting of two alpha helices separated by loops, first discovered in signaling proteins in yeast Cdc10 and ''Drosophila'' Notch. Domains consisting of ankyrin tandem repeats mediate prot ...
s. Alpha solenoids have unusual elasticity and flexibility relative to
globular protein In biochemistry, globular proteins or spheroproteins are spherical ("globe-like") proteins and are one of the common protein types (the others being fibrous, disordered and membrane proteins). Globular proteins are somewhat water-soluble (fo ...
s. They are sometimes considered to occupy an intermediate position between globular proteins and fibrous structural proteins, distinct from the latter in part due to the alpha solenoids' lack of need for intermolecular interactions to maintain their structure. The extent of the curvature of an alpha solenoid superhelix varies considerably among the class, resulting in the ability of these proteins to form large, extended protein-protein interaction surfaces or to form deep concave areas for binding globular proteins. Because they are composed of repeating relatively short subunits, alpha solenoids can acquire additional subunits relatively easily, resulting in new interaction surface properties. As a result, known alpha solenoid proteins vary substantially in length.


Function


Nuclear pore complex components

Alpha solenoids feature prominently in the proteins making up the
nuclear pore complex A nuclear pore is a part of a large complex of proteins, known as a nuclear pore complex that spans the nuclear envelope, which is the double membrane surrounding the eukaryotic cell nucleus. There are approximately 1,000 nuclear pore complexe ...
(NPC); alpha solenoid and beta propeller domains together account for up to half of the core NPC scaffold by mass. A large number of the conserved
nucleoporin Nucleoporins are a family of proteins which are the constituent building blocks of the nuclear pore complex (NPC). The nuclear pore complex is a massive structure embedded in the nuclear envelope at sites where the inner and outer nuclear membr ...
proteins forming the NPC are either alpha solenoid proteins or consist of a beta propeller domain at the
N-terminus The N-terminus (also known as the amino-terminus, NH2-terminus, N-terminal end or amine-terminus) is the start of a protein or polypeptide, referring to the free amine group (-NH2) located at the end of a polypeptide. Within a peptide, the ami ...
and an alpha solenoid at the
C-terminus The C-terminus (also known as the carboxyl-terminus, carboxy-terminus, C-terminal tail, C-terminal end, or COOH-terminus) is the end of an amino acid chain (protein or polypeptide), terminated by a free carboxyl group (-COOH). When the protein i ...
. This latter domain architecture also occurs in
clathrin Clathrin is a protein that plays a major role in the formation of coated vesicles. Clathrin was first isolated and named by Barbara Pearse in 1976. It forms a triskelion shape composed of three clathrin heavy chains and three light chains. Wh ...
and
Sec31 SEC31 is a protein which in yeast promotes the formation of COPII transport vesicles from the Endoplasmic Reticulum (ER). The coat has two main functions, the physical deformation of the endoplasmic reticulum membrane into vesicles and the selectio ...
, and was thought to be unique to
eukaryote Eukaryotes () are organisms whose cells have a nucleus. All animals, plants, fungi, and many unicellular organisms, are Eukaryotes. They belong to the group of organisms Eukaryota or Eukarya, which is one of the three domains of life. Bact ...
s, though a few examples have been reported in planctomycetes.


Vesicle coat proteins

Vesicle coat proteins frequently contain alpha solenoids and share common domain architecture with some NPC proteins. Three major coat complexes involved in distinct cellular pathways all contain alpha solenoid proteins: the
clathrin Clathrin is a protein that plays a major role in the formation of coated vesicles. Clathrin was first isolated and named by Barbara Pearse in 1976. It forms a triskelion shape composed of three clathrin heavy chains and three light chains. Wh ...
/ adaptin complex, which buds vesicles from the
plasma membrane The cell membrane (also known as the plasma membrane (PM) or cytoplasmic membrane, and historically referred to as the plasmalemma) is a biological membrane that separates and protects the interior of all cells from the outside environment (t ...
and is involved in
endocytosis Endocytosis is a cellular process in which substances are brought into the cell. The material to be internalized is surrounded by an area of cell membrane, which then buds off inside the cell to form a vesicle containing the ingested material. E ...
; the
COPI COPI is a coatomer, a protein complex that coats vesicles transporting proteins from the ''cis'' end of the Golgi complex back to the rough endoplasmic reticulum (ER), where they were originally synthesized, and between Golgi compartments. This ...
complex, which buds vesicles from the
Golgi apparatus The Golgi apparatus (), also known as the Golgi complex, Golgi body, or simply the Golgi, is an organelle found in most eukaryotic cells. Part of the endomembrane system in the cytoplasm, it packages proteins into membrane-bound vesicles ...
and is associated with retrograde transport; and the
COPII The Coat Protein Complex II, or COPII, is a group of proteins that facilitate the formation of vesicles to transport proteins from the endoplasmic reticulum to the Golgi apparatus or endoplasmic-reticulum–Golgi intermediate compartment. This pr ...
complex, which buds vesicles from the
endoplasmic reticulum The endoplasmic reticulum (ER) is, in essence, the transportation system of the eukaryotic cell, and has many other important functions such as protein folding. It is a type of organelle made up of two subunits – rough endoplasmic reticulum ( ...
and is associated with anterograde transport.


Transport proteins

Due to their propensity for forming large interaction surfaces well-suited to protein-protein interactions, and their flexible surfaces permitting binding of various cargo molecules, alpha solenoid proteins commonly function as
transport protein A transport protein (variously referred to as a transmembrane pump, transporter, escort protein, acid transport protein, cation transport protein, or anion transport protein) is a protein that serves the function of moving other materials within ...
s, particularly in transport between the
nucleus Nucleus ( : nuclei) is a Latin word for the seed inside a fruit. It most often refers to: *Atomic nucleus, the very dense central region of an atom * Cell nucleus, a central organelle of a eukaryotic cell, containing most of the cell's DNA Nucl ...
and the
cytoplasm In cell biology, the cytoplasm is all of the material within a eukaryotic cell, enclosed by the cell membrane, except for the cell nucleus. The material inside the nucleus and contained within the nuclear membrane is termed the nucleoplasm. ...
. For example, the beta-
karyopherin Karyopherins are proteins involved in transporting molecules between the cytoplasm and the nucleus of a eukaryotic cell. The inside of the nucleus is called the karyoplasm (or nucleoplasm). Generally, karyopherin-mediated transport occurs through ...
superfamily consists of alpha solenoid proteins formed from HEAT repeats;
importin Importin is a type of karyopherin that transports protein molecules from the cell's cytoplasm to the nucleus. It does so by binding to specific recognition sequences, called nuclear localization sequences (NLS). Importin has two subunits, imp ...
beta is a member of this family, and its adaptor protein importin alpha is an alpha solenoid formed from Armadillo repeats. Transporters of other molecules, such as
RNA Ribonucleic acid (RNA) is a polymeric molecule essential in various biological roles in coding, decoding, regulation and expression of genes. RNA and deoxyribonucleic acid ( DNA) are nucleic acids. Along with lipids, proteins, and carbohydra ...
, can also be of alpha solenoid architecture, as in exportin-5 or pentatricopeptide-repeat-containing RNA-binding proteins, which are particularly common in plants.


Regulatory proteins

The protein-protein interaction capacity of alpha solenoid proteins also makes them well suited to function as
regulatory protein Regulation of gene expression, or gene regulation, includes a wide range of mechanisms that are used by cells to increase or decrease the production of specific gene products (protein or RNA). Sophisticated programs of gene expression are wide ...
s. For example, regulatory subunit A (also known as PR65) of protein phosphatase 2A is a HEAT-repeat alpha solenoid whose conformational flexibility regulates access to the enzyme binding site.


Taxonomic distribution

Alpha solenoid proteins are found in all
domains of life In biological taxonomy, a domain ( or ) ( Latin: ''regio''), also dominion, superkingdom, realm, or empire, is the highest taxonomic rank of all organisms taken together. It was introduced in the three-domain system of taxonomy devised by Ca ...
; however, their frequencies in different
proteome The proteome is the entire set of proteins that is, or can be, expressed by a genome, cell, tissue, or organism at a certain time. It is the set of expressed proteins in a given type of cell or organism, at a given time, under defined conditions. ...
s vary significantly. They are rare in
virus A virus is a wikt:submicroscopic, submicroscopic infectious agent that replicates only inside the living Cell (biology), cells of an organism. Viruses infect all life forms, from animals and plants to microorganisms, including bacteria and ...
es and
bacteria Bacteria (; singular: bacterium) are ubiquitous, mostly free-living organisms often consisting of one biological cell. They constitute a large domain of prokaryotic microorganisms. Typically a few micrometres in length, bacteria were am ...
, somewhat more common in archaea, and quite common in
eukaryote Eukaryotes () are organisms whose cells have a nucleus. All animals, plants, fungi, and many unicellular organisms, are Eukaryotes. They belong to the group of organisms Eukaryota or Eukarya, which is one of the three domains of life. Bact ...
s. Many of the eukaryotic alpha solenoid proteins have detectable homologs only in other eukaryotes and are often restricted even further, to the
chordate A chordate () is an animal of the phylum Chordata (). All chordates possess, at some point during their larval or adult stages, five synapomorphies, or primary physical characteristics, that distinguish them from all the other taxa. These five ...
s.
Prokaryotic A prokaryote () is a single-celled organism that lacks a nucleus and other membrane-bound organelles. The word ''prokaryote'' comes from the Greek πρό (, 'before') and κάρυον (, 'nut' or 'kernel').Campbell, N. "Biology:Concepts & Connec ...
alpha solenoid proteins are concentrated in particular taxa, notably the cyanobacteria and planctomycetes, which have unusually complex intracellular compartmentalization relative to most prokaryotes.


Evolution

Evolutionary relationships between different alpha solenoid proteins are difficult to trace due to the low
sequence homology Sequence homology is the biological homology between DNA, RNA, or protein sequences, defined in terms of shared ancestry in the evolutionary history of life. Two segments of DNA can have shared ancestry because of three phenomena: either a sp ...
of the repeats.
Convergent evolution Convergent evolution is the independent evolution of similar features in species of different periods or epochs in time. Convergent evolution creates analogous structures that have similar form or function but were not present in the last com ...
of similar protein structures from ancestrally unrelated proteins is thought to be significant in the evolutionary history of this fold class.


Nuclear pore complexes and vesicle transport

The
nuclear pore complex A nuclear pore is a part of a large complex of proteins, known as a nuclear pore complex that spans the nuclear envelope, which is the double membrane surrounding the eukaryotic cell nucleus. There are approximately 1,000 nuclear pore complexe ...
is an extremely large
protein complex A protein complex or multiprotein complex is a group of two or more associated polypeptide chains. Protein complexes are distinct from multienzyme complexes, in which multiple catalytic domains are found in a single polypeptide chain. Protein ...
that mediates transit into and out of the cell nucleus. Homologous structures from which the NPC might have evolved have not been detected in prokaryotic transmembrane transport proteins; however, it has been suggested that the NPC components show distinct homology to vesicle coat proteins found in
clathrin Clathrin is a protein that plays a major role in the formation of coated vesicles. Clathrin was first isolated and named by Barbara Pearse in 1976. It forms a triskelion shape composed of three clathrin heavy chains and three light chains. Wh ...
/ adaptin,
COPI COPI is a coatomer, a protein complex that coats vesicles transporting proteins from the ''cis'' end of the Golgi complex back to the rough endoplasmic reticulum (ER), where they were originally synthesized, and between Golgi compartments. This ...
, and
COPII The Coat Protein Complex II, or COPII, is a group of proteins that facilitate the formation of vesicles to transport proteins from the endoplasmic reticulum to the Golgi apparatus or endoplasmic-reticulum–Golgi intermediate compartment. This pr ...
complexes. Most distinctively, a shared domain architecture consisting of an N-terminal beta propeller and a C-terminal alpha solenoid has been detected in both NPC and coat proteins, suggesting a possible common origin. An ancestral "protocoatomer" that diversified to acquire derived characteristics of all four modern complexes has been proposed. Examination of the genome of Lokiarchaeum, thought to be among the closest archaeal relatives to eukaryotes, did not reveal any examples of the beta propeller/alpha solenoid domain architecture, although homologs of other proteins involved in eukaryotic membrane trafficking were identified. However, it is unclear whether this observation means that the propeller/solenoid architecture evolved later or was lost from modern lokiarchaea.


Membrane coat proteins in prokaryotes

A survey of the sequenced genomes of complex prokaryotes from the
PVC superphylum The PVC superphylum is a superphylum of bacteria named after its three important members, Planctomycetota, Verrucomicrobiota, and Chlamydiota. Cavalier-Smith postulated that the PVC bacteria probably lost or reduced their peptidoglycan cell wall ...
(
Planctomycetota The Planctomycetota are a phylum of widely distributed bacteria, occurring in both aquatic and terrestrial habitats. They play a considerable role in global carbon and nitrogen cycles, with many species of this phylum capable of anaerobic ammoniu ...
-
Verrucomicrobiota Verrucomicrobiota is a phylum of Gram-negative bacteria that contains only a few described species. The species identified have been isolated from fresh water, marine and soil environments and human faeces. A number of as-yet uncultivated specie ...
-
Chlamydiota The Chlamydiota (synonym Chlamydiae) are a bacterial phylum and class whose members are remarkably diverse, including pathogens of humans and animals, symbionts of ubiquitous protozoa, and marine sediment forms not yet well understood. All of ...
) identified examples of proteins with homology to eukaryotic membrane trafficking proteins, including examples of the distinctive beta-propeller/alpha-solenoid domain architecture previously believed to be unique to eukaryotes. The PVC superphylum is known for containing bacteria with unusually complex membrane morphology, and this discovery has been cited as evidence in favor of these organisms' status as an intermediate form between prokaryotes and eukaryotes. The planctomycete ''
Gemmata obscuriglobus ''Gemmata obscuriglobus'' is a species of Gram-negative, aerobic, heterotrophic bacteria of the phylum Planctomycetota. ''G. obscuriglobus'' occur in freshwater habitats and was first described in 1984, and is the only described species in its ...
'' has exceptionally complex membrane architecture and has been a source of controversy in the literature regarding the possibility that it has a membrane-bound "nucleoid" compartment enclosing its DNA. The identification of proteins with sequence similarities to HEAT repeats in the ''G. obscuriglobus''
proteome The proteome is the entire set of proteins that is, or can be, expressed by a genome, cell, tissue, or organism at a certain time. It is the set of expressed proteins in a given type of cell or organism, at a given time, under defined conditions. ...
has been interpreted as support for the membrane-bound nucleoid hypothesis; however, this has been disputed.


Bioinformatics

Low sequence similarity among alpha solenoid proteins of similar structure has impeded their identification using
bioinformatics Bioinformatics () is an interdisciplinary field that develops methods and software tools for understanding biological data, in particular when the data sets are large and complex. As an interdisciplinary field of science, bioinformatics combin ...
methods, since the repeats are often not well defined in sequence. A large number of different computational methods have been developed to identify candidate alpha solenoid proteins based on their
amino acid sequence Protein primary structure is the linear sequence of amino acids in a peptide or protein. By convention, the primary structure of a protein is reported starting from the amino-terminal (N) end to the carboxyl-terminal (C) end. Protein biosynthesi ...
.


External links


RepeatsDB
α-solenoid class


References

{{Protein tandem repeats Protein folds