Ankyrin Repeat

The ankyrin repeat is a 33-residue protein motif consisting of two alpha helices separated by loops. It was first discovered in the signaling proteins Cdc10 in yeast and Notch in Drosophila. Domains consisting of tandem ankyrin repeats mediate protein–protein interactions and are among the most common structural motifs in known proteins. They appear in bacterial, archaeal, and eukaryotic proteins, but are far more common in eukaryotes. Ankyrin repeat proteins, though absent in most viruses, are common among poxviruses. Most proteins that contain the motif have four to six repeats, although its namesake ankyrin contains 24, and the largest known number of repeats is 34, predicted in a protein expressed by Giardia lamblia.

Ankyrin repeats typically fold together to form a single, linear solenoid structure called an ankyrin repeat domain. These domains are one of the most common protein–protein interaction platforms in nature. They occur in a large number of functionally diverse proteins, mainly from eukaryotes. The few known examples from prokaryotes and viruses may be the result of horizontal gene transfers. The repeat has been found in proteins of diverse function, such as transcriptional initiators, cell cycle regulators, cytoskeletal proteins, ion transporters, and signal transducers. The ankyrin fold appears to be defined by its structure rather than its function, since there is no specific sequence or structure that is universally recognised by it.

Considering the atomic structures of individual ankyrin repeats, the loop is often a type 1 beta bulge loop, while both alpha helices commonly have a Schellman loop at their N-terminus.


Role in protein folding

The ankyrin-repeat sequence motif has been studied using multiple sequence alignment to determine the conserved amino acid residues critical for folding and stability. The residues on the wide lateral surface of ankyrin repeat structures are variable, often hydrophobic, and involved mainly in mediating protein–protein interactions. An artificial protein based on a consensus sequence derived from sequence alignment has been synthesized and found to fold stably, representing the first designed protein with multiple repeats. More extensive design strategies have used combinatorial sequences to "evolve" ankyrin repeats that recognize particular protein targets, a technique that has been presented as an alternative to antibody design for applications requiring high-affinity binding.

A structure-based study of a range of ankyrin proteins of known structure shows that consensus-based ankyrin proteins are very stable because they maximize the energetic gap between the folded and unfolded structures, encoding a densely connected network of favourable interactions among conserved sequence motifs such as the TPLX motif. The same study shows that insertions in the canonical framework of ankyrin repeats are enriched in conflictive interactions that are related to function. The same applies to interactions surrounding deletion hotspots. These might be related to complex folding/unfolding transitions that are important for partner recognition and interaction.

Ankyrin-repeat proteins present an unusual problem in the study of protein folding, which has largely focused on globular proteins that form well-defined tertiary structure stabilized by long-range, nonlocal residue–residue contacts. Ankyrin repeats, by contrast, contain very few such contacts (that is, they have a low contact order). Most studies have found that ankyrin repeats fold by a two-state mechanism, suggesting a high degree of folding cooperativity despite the local inter-residue contacts and the evident need for successful folding with varying numbers of repeats. Some evidence, based on the synthesis of truncated versions of natural repeat proteins and on the examination of phi values, suggests that the C-terminus forms the folding nucleation site.
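
The idea of a low contact order can be made concrete with a small calculation. The sketch below uses the commonly used definition of relative contact order: the mean sequence separation of contacting residue pairs, divided by the chain length. The function name and both contact lists are hypothetical and invented for illustration only. They are not taken from any real ankyrin structure.

```python
def relative_contact_order(contacts, length):
    """Mean sequence separation of contacting residue pairs, divided by chain length.

    contacts: list of (i, j) residue-index pairs that are in contact
    length:   total number of residues in the chain
    """
    if not contacts:
        raise ValueError("need at least one contact")
    total_separation = sum(abs(j - i) for i, j in contacts)
    return total_separation / (len(contacts) * length)


# Toy chain of 99 residues, i.e. three 33-residue repeats (made-up data).
LENGTH = 99

# Local helix-like contacts (i, i+4), shared by both toy models.
local = [(i, i + 4) for i in range(1, 90, 3)]

# Repeat-like model: local contacts plus contacts to the adjacent repeat (i, i+33).
repeat_like = local + [(i, i + 33) for i in range(5, 60, 11)]

# Globular-like model: local contacts plus contacts between distant chain segments.
globular_like = local + [(i, LENGTH - i) for i in range(1, 30, 3)]

rco_repeat = relative_contact_order(repeat_like, LENGTH)
rco_globular = relative_contact_order(globular_like, LENGTH)

print(f"repeat-like RCO:   {rco_repeat:.3f}")
print(f"globular-like RCO: {rco_globular:.3f}")
```

In this toy setup the repeat-like model scores lower because none of its contacts spans more than one repeat, which is the sense in which solenoid proteins are described as having low contact order.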


Clinical significance

Ankyrin-repeat proteins have been associated with a number of human diseases. These proteins include the cell cycle inhibitor p16, which is associated with cancer, and the Notch protein (a key component of cell signalling pathways), which can cause the neurological disorder CADASIL when the repeat domain is disrupted by mutations. A specialized family of ankyrin proteins known as muscle ankyrin repeat proteins (MARPs) is involved in the repair and regeneration of muscle tissue following damage due to injury and stress. A natural variation between glutamine and lysine at position 703 in the 11th ankyrin repeat of ANKK1, known as the TaqI A1 allele, has been credited with encouraging addictive behaviours such as obesity, alcoholism, nicotine dependency and the Eros love style, while discouraging juvenile delinquency and neuroticism-anxiety. The variation may affect the specificity of protein interactions made by the ANKK1 protein kinase through this repeat.


Human proteins containing this repeat

ABTB1; ABTB2; ACBD6; ACTBL1; ANK1; ANK2; ANK3; ANKAR; ANKDD1A; ANKEF1; ANKFY1; ANKHD1; ANKIB1; ANKK1; ANKMY1; ANKMY2; ANKRA2; ANKRD1; ANKRD10; ANKRD11; ANKRD12; ANKRD13; ANKRD13A; ANKRD13B; ANKRD13C; ANKRD13D; ANKRD15; ANKRD16; ANKRD17; ANKRD18A; ANKRD18B; ANKRD19; ANKRD2; ANKRD20A1; ANKRD20A2; ANKRD20A3; ANKRD20A4; ANKRD21; ANKRD22; ANKRD23; ANKRD24; ANKRD25; ANKRD26; ANKRD27; ANKRD28; ANKRD30A; ANKRD30B; ANKRD30BL; ANKRD32; ANKRD33; ANKRD35; ANKRD36; ANKRD36B; ANKRD37; ANKRD38; ANKRD39; ANKRD40; ANKRD41; ANKRD42; ANKRD43; ANKRD44; ANKRD45; ANKRD46; ANKRD47; ANKRD50; ANKRD52; ANKRD53; ANKRD54; ANKRD55; ANKRD56; ANKRD57; ANKRD58; ANKRD60; ANKRD6; ANKRD7; ANKRD9; ANKS1A; ANKS3; ANKS4B; ANKS6; ANKZF1; ASB1; ASB10; ASB11; ASB12; ASB13; ASB14; ASB15; ASB16; ASB2; ASB3; ASB4; ASB5; ASB6; ASB7; ASB8; ASB9; ASZ1; BARD1; BAT4; BAT8; BCL3; BCOR; BCORL1; BTBD11; CAMTA1; CAMTA2; CASKIN1; CASKIN2; CCM1; CDKN2A; CDKN2B; CDKN2C; CDKN2D; CENTB1; CENTB2; CENTB5; CENTG1; CENTG2; CENTG3; CLIP3; CLIP4; CLPB; CTGLF1; CTGLF2; CTGLF3; CTGLF4; CTGLF5; CTTNBP2; DAPK1; DDEF1; DDEF2; DDEFL1; DGKI; DGKZ; DP58; DYSFIP1; DZANK; EHMT1; EHMT2; ESPN; FANK1; FEM1A; FEM1B; GABPB2; GIT1; GIT2; GLS; GLS2; HACE1; HECTD1; IBTK; ILK; INVS; KIDINS220; KRIT1; LRRK1; MAIL; MIB1; MIB2; MPHOSPH8; MTPN; MYO16; NFKB1; NFKB2; NFKBIA; NFKBIB; NFKBIE; NFKBIL1; NFKBIL2; NOTCH1; NOTCH2; NOTCH3; NOTCH4; NRARP; NUDT12; OSBPL1A; OSTF1; PLA2G6; POTE14; POTE15; POTE8; PPP1R12A; PPP1R12B; PPP1R12C; PPP1R13B; PPP1R13L; PPP1R16A; PPP1R16B; PSMD10; RAI14; RFXANK; RIPK4; RNASEL; SHANK1; SHANK2; SHANK3; SNCAIP; TA-NFKBH; TEX14; TNKS; TNKS2; TNNI3K; TP53BP2; TRP7; TRPA1; TRPC3; TRPC4; TRPC5; TRPC6; TRPC7; TRPV1; TRPV2; TRPV3; TRPV4; TRPV5; TRPV6; UACA; USH1G; ZDHHC13; ZDHHC17


See also

* DARPin (designed ankyrin repeat protein), an engineered antibody mimetic based on the structure of ankyrin repeats

