HOME

TheInfoList



OR:

SMC proteins represent a large family of ATPases that participate in many aspects of higher-order chromosome organization and dynamics. SMC proteins are widely conserved across bacteria, archaea, and eukaryotes. In eukaryotes, they function as the core ATPase subunits of large protein complexes such as condensin, cohesin, and SMC5/6. The term SMC derives from a mutant strain of ''
Saccharomyces cerevisiae ''Saccharomyces cerevisiae'' () (brewer's yeast or baker's yeast) is a species of yeast (single-celled fungal microorganisms). The species has been instrumental in winemaking, baking, and brewing since ancient times. It is believed to have be ...
'' named ''smc1'' (stability of mini-chromosomes 1), which was identified based on its defect in maintaining the stability of mini-chromosomes. After the gene product of ''SMC1'' was characterized, and homologous proteins were found to be essential for chromosome structure and dynamics in many organisms, the acronym SMC was redefined to stand for "Structural Maintenance of Chromosomes".


Classification


Eukaryotic SMCs

Eukaryotes have at least six SMC proteins in individual organisms, and they form three distinct heterodimers with specialized functions: *SMC1-SMC3: A pair of SMC1 and SMC3 constitutes the core subunits of the
cohesin Cohesin is a protein complex that mediates Establishment of sister chromatid cohesion, sister chromatid cohesion, homologous recombination, and Topologically associating domain, DNA looping. Cohesin is formed of SMC3, SMC1A, SMC1, RAD21, SCC1 an ...
complexes involved in sister chromatid cohesion. *SMC2-SMC4: A pair of SMC2 and SMC4 acts as the core of the condensin complexes implicated in chromosome condensation. *SMC5-SMC6: A pair of SMC5 and SMC6 functions as part of a yet-to-be-named complex implicated in
DNA repair DNA repair is a collection of processes by which a cell (biology), cell identifies and corrects damage to the DNA molecules that encode its genome. A weakened capacity for DNA repair is a risk factor for the development of cancer. DNA is cons ...
and checkpoint responses. The pairings of SMC proteins in eukaryotes, SMC1-SMC3, SMC2–SMC4, and SMC5–SMC6, are highly specific and invariant; no exceptions to these combinations have been reported to date. Sequence comparisons reveal that SMC1 and SMC4, as well as SMC2 and SMC3, share a high degree of similarity, while SMC5 and SMC6 form a more distinct clade (Figure 1). It is hypothesized that the last eukaryotic common ancestor (LECA) possessed all six SMC proteins. While SMC1–4 are conserved in all known eukaryotic species, some lineages (such as the
ciliate The ciliates are a group of alveolates characterized by the presence of hair-like organelles called cilia, which are identical in structure to flagellum, eukaryotic flagella, but are in general shorter and present in much larger numbers, with a ...
'' Tetrahymena thermophila'') have lost SMC5 and SMC6 during evolution, suggesting that the SMC5/6 complex may not be strictly essential for eukaryotic cell viability. In addition to the six subtypes, some organisms have variants of SMC proteins. For instance, mammals have a
meiosis Meiosis () is a special type of cell division of germ cells in sexually-reproducing organisms that produces the gametes, the sperm or egg cells. It involves two rounds of division that ultimately result in four cells, each with only one c ...
-specific variant of SMC1, known as SMC1β. The nematode ''
Caenorhabditis elegans ''Caenorhabditis elegans'' () is a free-living transparent nematode about 1 mm in length that lives in temperate soil environments. It is the type species of its genus. The name is a Hybrid word, blend of the Greek ''caeno-'' (recent), ''r ...
'' has an SMC4-variant that has a specialized role in dosage compensation. The following table shows the SMC proteins names for several
model organism A model organism is a non-human species that is extensively studied to understand particular biological phenomena, with the expectation that discoveries made in the model organism will provide insight into the workings of other organisms. Mo ...
s and vertebrates:


Prokaryotic SMCs

The evolutionary origin of SMC proteins is ancient, and homologs are widely conserved in both
bacteria Bacteria (; : bacterium) are ubiquitous, mostly free-living organisms often consisting of one Cell (biology), biological cell. They constitute a large domain (biology), domain of Prokaryote, prokaryotic microorganisms. Typically a few micr ...
and
archaea Archaea ( ) is a Domain (biology), domain of organisms. Traditionally, Archaea only included its Prokaryote, prokaryotic members, but this has since been found to be paraphyletic, as eukaryotes are known to have evolved from archaea. Even thou ...
. *SMC (canonical type): Many bacteria (e.g., ''
Bacillus subtilis ''Bacillus subtilis'' (), known also as the hay bacillus or grass bacillus, is a gram-positive, catalase-positive bacterium, found in soil and the gastrointestinal tract of ruminants, humans and marine sponges. As a member of the genus ''Bacill ...
'') and archaea possess canonical SMC proteins that closely resemble their eukaryotic counterparts. These bacterial and archaeal SMCs form homodimers and associate with regulatory subunits to form condensin-like complexes, SMC-ScpAB. It is hypothesized that the eukaryotic ancestor (most likely the Asgard archaeon) possessed two types of SMC proteins: a canonical SMC (SMCc) and a non-canonical SMC (SMCnc). Gene duplications of these two ancestral types are thought to have given rise to the six SMC subfamilies present in the last eukaryotic common ancestor (LECA): SMC1–4 evolved from the canonical lineage, while SMC5 and SMC6 evolved from the non-canonical lineage (Figure 1). *MukB: In some
γ-proteobacteria ''Gammaproteobacteria'' is a class of bacteria in the phylum ''Pseudomonadota'' (synonym ''Proteobacteria''). It contains about 250 genera, which makes it the most genus-rich taxon of the Prokaryotes. Several medically, ecologically, and scienti ...
, including ''
Escherichia coli ''Escherichia coli'' ( )Wells, J. C. (2000) Longman Pronunciation Dictionary. Harlow ngland Pearson Education Ltd. is a gram-negative, facultative anaerobic, rod-shaped, coliform bacterium of the genus '' Escherichia'' that is commonly fo ...
'', SMC function is carried out by a distantly related protein called MukB. MukB also forms homodimers and, together with regulatory subunits, assembles into a MukBEF complex, which performs condensin-like functions in organizing bacterial chromosomes. *MksB/JetC/EptC: A third type of prokaryotic SMC protein, known as MksB, has been identified in certain bacterial species. Like MukB, MksB forms a distantly-related condensin-like complex, MksBEF. More recently, a variant complex called MksBEFG, which includes a nuclease subunit MksG, has been shown to function in plasmid defense. In other bacterial lineages, orthologous systems have been identified, including JetABCD and EptABCD. These systems are collectively referred to as the Wadjet family of SMC-like complexes.


SMC-related proteins

In a broader sense, several proteins with structural similarities to SMC are considered members of the SMC superfamily. * In eukaryotes,
Rad50 DNA repair protein RAD50, also known as RAD50, is a protein that in humans is encoded by the ''RAD50'' gene. Function The protein encoded by this gene is highly similar to ''Saccharomyces cerevisiae'' Rad50, a protein involved in DNA double- ...
is a well-known SMC-related protein involved in the repair of DNA double-strand breaks. * In bacteria, several proteins related to DNA repair also belong to the extended SMC family, including SbcC, RecF, and RecN. * In archaea, a subfamily known as Archaea-specific SMC-related proteins (ASRPs) has been identified. Previously described archaeal proteins such as Sph1/2 and ClsN (also known as coalescin) are now considered members of this ASRP subfamily.


Subunit composition of SMC protein complexes

The subunit composition of SMC protein complexes varies across domains of life. The table below and Figures 2 & 3 summarize the representative complexes found in
eukaryotes The eukaryotes ( ) constitute the domain of Eukaryota or Eukarya, organisms whose cells have a membrane-bound nucleus. All animals, plants, fungi, seaweeds, and many unicellular organisms are eukaryotes. They constitute a major group of ...
and
prokaryotes A prokaryote (; less commonly spelled procaryote) is a single-celled organism whose cell lacks a nucleus and other membrane-bound organelles. The word ''prokaryote'' comes from the Ancient Greek (), meaning 'before', and (), meaning 'nut' ...
.
All SMC dimers, whether of eukaryotic or prokaryotic origin, associate with a kleisin subunit. In condensins and
cohesin Cohesin is a protein complex that mediates Establishment of sister chromatid cohesion, sister chromatid cohesion, homologous recombination, and Topologically associating domain, DNA looping. Cohesin is formed of SMC3, SMC1A, SMC1, RAD21, SCC1 an ...
, the kleisin subunit is further associated with a pair of HEAT-repeat subunits. Notably, the eukaryotic SMC5/6 complex contains "kite" (kleisin interacting tandem winged-helix elements) subunits instead of HEAT-repeat subunits, making it structurally more similar to prokaryotic complexes such as SMC–ScpAB, MukBEF, and MksBEF. However, unlike their typically homodimeric prokaryotic counterparts, both the SMC and kite subunits in the SMC5/6 complex are heterodimeric, resulting in a more elaborate subunit architecture. The SMC5/6 complex and the Wadjet complex (JetABCD) each possess an additional catalytic subunit: the SUMO ligase Nse2 in SMC5/6, and the nuclease JetD in JetABCD.


Molecular structure


Primary structure

SMC proteins are 1,000-1,500 amino-acid long. They have a modular structure that is composed of the following domains: * Walker A
ATP-binding motif An ATP-binding motif is a 250-residue sequence within an ATP-binding protein’s primary structure. The binding motif is associated with a protein’s structure and/or function. ATP is a molecule of energy, and can be a coenzyme, involved in a num ...
* coiled-coil region I * hinge region * coiled-coil region II * Walker B ATP-binding motif; signature motif


Secondary and tertiary structure

SMC dimers form a V-shaped molecule with two long coiled-coil arms (Figure 4). To make such a unique structure, an SMC protomer is self-folded through anti-parallel coiled-coil interactions, forming a rod-shaped molecule. At one end of the molecule, the N-terminal and C-terminal domains form an ATP-binding domain. The other end is called a hinge domain. Two protomers then dimerize through their hinge domains and assemble a V-shaped dimer. The length of the coiled-coil arms is ~50 nm long. Such long "antiparallel" coiled coils are very rare and found only among SMC proteins (and their relatives such as Rad50). The ATP-binding domain of SMC proteins is structurally related to that of ABC transporters, a large family of transmembrane proteins that actively transport small molecules across cellular membranes. It is thought that the cycle of ATP binding and
hydrolysis Hydrolysis (; ) is any chemical reaction in which a molecule of water breaks one or more chemical bonds. The term is used broadly for substitution reaction, substitution, elimination reaction, elimination, and solvation reactions in which water ...
modulates the cycle of closing and opening of the V-shaped molecule. Still, the detailed mechanisms of action of SMC proteins remain to be determined.


Holo-complex assembly

The formation of an SMC protein complex involves the association of an SMC dimer with non-SMC subunits (Figure 4). First, the N-terminal domain of the kleisin subunit binds to the neck region (a segment of the coiled coil near the head domain) of one SMC protein, while its C-terminal domain binds to the cap region (part of the head domain) of the other SMC subunit. These interactions result in the formation of a ring-like architecture. As a consequence, the SMC–kleisin trimer adopts an asymmetric configuration. Accordingly, the SMC subunit bound at the N-terminal domain of the kleisin is sometimes referred to as the ν-SMC, while the one bound at the C-terminal domain is called the κ-SMC. Finally, two HEAT-repeat subunits (or two KITE subunits depending on the complex) associate with the central region of the kleisin, completing the assembly of the holo-complex. MukBEF and JetABC form higher-order assemblies through dimerization mediated by their kleisin subunits, a configuration often referred to as a "dimer-of-dimers" (Figure 3).


Molecular activities

SMC protein complexes are involved in a wide range of chromosome-related functions, and each complex is thought to possess distinct molecular activities tailored to its specific role. At the same time, based on their evolutionary origins and conserved structural features, it has been suggested that certain molecular activities may be shared across multiple SMC complexes. For example, several SMC complexes are known to exhibit DNA entrapment activity, in which DNA is topologically entrapped within the ring-like structure formed by their long coiled-coil arms. This activity has been demonstrated in cohesin, condensin, and the SMC5/6 complex. More recent studies have highlighted DNA loop extrusion as a conserved molecular activity shared by many SMC protein complexes. Single-molecule analyses have demonstrated that condensin,
cohesin Cohesin is a protein complex that mediates Establishment of sister chromatid cohesion, sister chromatid cohesion, homologous recombination, and Topologically associating domain, DNA looping. Cohesin is formed of SMC3, SMC1A, SMC1, RAD21, SCC1 an ...
, the SMC5/6 complex, and Wadjet are capable of extruding DNA loops in an ATP-dependent manner. During loop extrusion, the ATPase cycle of the SMC subunits is thought to be coupled with dynamic and multivalent interactions between various subunits and DNA. These interactions likely occur in multiple modes, making the molecular mechanism of
loop extrusion Loop extrusion is a major mechanism of Nuclear organization. It is a dynamic process in which structural maintenance of chromosomes (SMC) protein complexes progressively grow loops of DNA or chromatin. In this process, SMC complexes, such as con ...
highly complex and still incompletely understood.


Genetic Disorders

Several genetic disorders have been linked to mutations in genes encoding components or regulators of SMC protein complexes: *
Cohesin Cohesin is a protein complex that mediates Establishment of sister chromatid cohesion, sister chromatid cohesion, homologous recombination, and Topologically associating domain, DNA looping. Cohesin is formed of SMC3, SMC1A, SMC1, RAD21, SCC1 an ...
-related disorders **
Cornelia de Lange syndrome Cornelia de Lange syndrome (CdLS) is a genetic disorder. People with Cornelia de Lange syndrome experience a range of physical, cognitive, and medical challenges ranging from mild to severe. Cornelia de Lange syndrome has a widely varied phenotype, ...
: caused by mutations in NIPBL, SMC1A or SMC3. ** Roberts syndrome: caused by mutations in ESCO2, a gene encoding a cohesin acetyltransferase. ** Aneuploidy in cancer cells: frequently associated with mutations in STAG2. * Condensin-related disorders **
Microcephaly Microcephaly (from Neo-Latin ''microcephalia'', from Ancient Greek μικρός ''mikrós'' "small" and κεφαλή ''kephalé'' "head") is a medical condition involving a smaller-than-normal head. Microcephaly may be present at birth or it m ...
: linked to mutations in CAP-D2, CAP-H, or CAP-D3. * SMC5/6-related disorders **
Primordial dwarfism Primordial dwarfism (PD) is a form of dwarfism that results in a smaller body size in all stages of life beginning from before birth. More specifically, primordial dwarfism is a diagnostic category including specific types of profoundly proportion ...
: associated with mutations in NSE2. ** Severe lung disease: linked to mutations in NSE3. ** Atelís syndrome: caused by mutations in SMC5.


International SMC meetings

Active research on SMC proteins began in the 1990s. As global interest in this field increased, international meetings dedicated to SMC proteins have been held regularly since the 2010s. These meetings, which are organized approximately every two years, cover a wide range of topics reflecting the diverse functions of SMC protein complexes, from bacterial chromosome segregation to human genetic disorders. * The 0th International SMC meeting(The 18th IMCB Symposium)“SMC proteins: from molecule to disease”, November 29, 2013,
Tokyo Tokyo, officially the Tokyo Metropolis, is the capital of Japan, capital and List of cities in Japan, most populous city in Japan. With a population of over 14 million in the city proper in 2023, it is List of largest cities, one of the most ...
,
Japan Japan is an island country in East Asia. Located in the Pacific Ocean off the northeast coast of the Asia, Asian mainland, it is bordered on the west by the Sea of Japan and extends from the Sea of Okhotsk in the north to the East China Sea ...
. * The 1st International SMC meeting( EMBO Workshop)“SMC proteins: chromosomal organizers from bacteria to human”, May 12-15, 2015,
Vienna Vienna ( ; ; ) is the capital city, capital, List of largest cities in Austria, most populous city, and one of Federal states of Austria, nine federal states of Austria. It is Austria's primate city, with just over two million inhabitants. ...
,
Austria Austria, formally the Republic of Austria, is a landlocked country in Central Europe, lying in the Eastern Alps. It is a federation of nine Federal states of Austria, states, of which the capital Vienna is the List of largest cities in Aust ...
. * The 2nd International SMC meeting “SMC proteins: chromosomal organizers from bacteria to human”, June 13-16, 2017, Nanyo, Yamagata,
Japan Japan is an island country in East Asia. Located in the Pacific Ocean off the northeast coast of the Asia, Asian mainland, it is bordered on the west by the Sea of Japan and extends from the Sea of Okhotsk in the north to the East China Sea ...
. * The 3rd International SMC meeting( EMBO Workshop)“Organization of bacterial and eukaryotic genomes by SMC complexes”, September 10-13, 2019,
Vienna Vienna ( ; ; ) is the capital city, capital, List of largest cities in Austria, most populous city, and one of Federal states of Austria, nine federal states of Austria. It is Austria's primate city, with just over two million inhabitants. ...
,
Austria Austria, formally the Republic of Austria, is a landlocked country in Central Europe, lying in the Eastern Alps. It is a federation of nine Federal states of Austria, states, of which the capital Vienna is the List of largest cities in Aust ...
. * The 4th International SMC meeting ( Biochemical Society of the UK)“Genome Organisation by SMC protein complexes”, September 27-30, 2022,
Edinburgh Edinburgh is the capital city of Scotland and one of its 32 Council areas of Scotland, council areas. The city is located in southeast Scotland and is bounded to the north by the Firth of Forth and to the south by the Pentland Hills. Edinburgh ...
, UK. * The 5th International SMC meetingNIG & RIKEN International Symposium 2024
/ref>( NIG &
RIKEN is a national scientific research institute in Japan. Founded in 1917, it now has about 3,000 scientists on seven campuses across Japan, including the main site at Wakō, Saitama, Wakō, Saitama Prefecture, on the outskirts of Tokyo. Riken is a ...
International Symposium 2024)“SMC complexes: orchestrating diverse genome functions”, October 15-18, 2024,
Numazu is a city located in eastern Shizuoka Prefecture, Japan. , the city had an estimated population of 189,486 in 91,986 households, and a population density of 1,014 persons per km2. The total area of the city is . Geography Numazu is at the n ...
, Shizuoka,
Japan Japan is an island country in East Asia. Located in the Pacific Ocean off the northeast coast of the Asia, Asian mainland, it is bordered on the west by the Sea of Japan and extends from the Sea of Okhotsk in the north to the East China Sea ...
.


See also

*
cohesin Cohesin is a protein complex that mediates Establishment of sister chromatid cohesion, sister chromatid cohesion, homologous recombination, and Topologically associating domain, DNA looping. Cohesin is formed of SMC3, SMC1A, SMC1, RAD21, SCC1 an ...
* condensin *
Cornelia de Lange Syndrome Cornelia de Lange syndrome (CdLS) is a genetic disorder. People with Cornelia de Lange syndrome experience a range of physical, cognitive, and medical challenges ranging from mild to severe. Cornelia de Lange syndrome has a widely varied phenotype, ...


References

{{Portal bar, Biology, border=no EC 3.6.3 Cell biology Mitosis Cell cycle