A structural gene is a gene that codes for any RNA or protein product other than a regulatory factor (i.e. a regulatory protein). Structural genes are typically viewed as those containing sequences of DNA corresponding to the amino acids of a protein that will be produced, as long as that protein does not function to regulate gene expression. Structural gene products include enzymes and structural proteins. Structural genes also encode non-coding RNAs, such as rRNAs and tRNAs (but excluding regulatory miRNAs and siRNAs).
The distinction between structural and regulatory genes can be traced back to 1959 and work by Pardee, Jacob, and Monod (the so-called PaJaMo experiment) on the ''lac'' operon and the synthesis of proteins in ''E. coli''. In that system, a single regulatory protein was detected that affected the transcription of the genes for the other proteins now known to be encoded by the ''lac'' operon.
Placement in the genome
In prokaryotes, structural genes of related function are typically adjacent to one another on a single strand of DNA, forming an operon. This permits simpler regulation of gene expression, as a single regulatory factor can affect transcription of all associated genes. This is best illustrated by the well-studied ''lac'' operon, in which three structural genes (''lacZ'', ''lacY'', and ''lacA'') are all regulated by a single promoter and a single operator. Prokaryotic structural genes are transcribed into a polycistronic mRNA and subsequently translated.
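The shared control described above can be illustrated with a minimal Python sketch. The `Operon` class and its `repressor_bound` flag are hypothetical names introduced only for this illustration, and the model is deliberately simplified: it treats the operator as a single on/off switch and ignores every other influence on ''lac'' transcription.

```python
from dataclasses import dataclass


@dataclass
class Operon:
    """Toy model: one promoter/operator controlling several structural genes."""
    name: str
    structural_genes: tuple
    repressor_bound: bool = True  # assumption: a bound repressor blocks all transcription

    def transcribe(self):
        """Return the genes carried on one polycistronic mRNA, or an empty tuple if repressed."""
        if self.repressor_bound:
            return ()
        return self.structural_genes


lac = Operon("lac", ("lacZ", "lacY", "lacA"))
repressed = lac.transcribe()   # no transcript while the repressor is bound

lac.repressor_bound = False    # the single regulatory factor is released
induced = lac.transcribe()     # all three genes appear on the same transcript
```

Changing one flag switches all three genes together, which is the point of the operon layout: regulation is attached to the cluster, not to each gene.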
In eukaryotes, structural genes are not sequentially placed. Each gene is instead composed of coding exons and interspersed non-coding introns. Regulatory sequences are typically found in non-coding regions upstream and downstream from the gene. Transcripts of structural genes must be spliced prior to translation to remove intronic sequences. This in turn lends itself to the eukaryotic phenomenon of alternative splicing, in which a single precursor mRNA from a single structural gene can produce several different proteins depending on which exons are included. Despite the complexity of this process, it is estimated that up to 94% of human genes are spliced in some way. Furthermore, different splicing patterns occur in different tissue types.
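As a rough illustration of how exon choice multiplies the products of one gene, the following Python sketch enumerates the mature transcripts obtainable when some exons are always kept and the rest may be skipped. The function name `splice_isoforms`, the exon sequences, and the choice of which exons are constitutive are all hypothetical; real splicing is constrained by splice sites and regulatory factors that this toy model ignores.

```python
from itertools import combinations


def splice_isoforms(exons, constitutive):
    """Enumerate transcripts formed by keeping all constitutive exons and any
    subset of the remaining (optional) exons, preserving exon order."""
    optional = [i for i in range(len(exons)) if i not in constitutive]
    isoforms = []
    for k in range(len(optional) + 1):
        for kept in combinations(optional, k):
            included = sorted(set(constitutive) | set(kept))
            isoforms.append("".join(exons[i] for i in included))
    return isoforms


# Hypothetical four-exon gene: the first and last exons are always included.
exons = ["AUGGCU", "GAAUUC", "CCGGGA", "UAA"]
isoforms = splice_isoforms(exons, constitutive={0, 3})

for transcript in isoforms:
    print(transcript)
```

With two optional exons the sketch yields four distinct transcripts from one gene; each additional optional exon doubles that count in this simplified model.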
Exceptions to this layout in eukaryotes include the genes for histone proteins, which lack introns entirely. Also distinct are the rDNA clusters of structural genes, in which the 28S, 5.8S, and 18S sequences are adjacent, separated by short internal transcribed spacers. Likewise, the 45S rDNA occurs at five distinct places in the genome, but is clustered into adjacent repeats. In eubacteria these genes are organized into operons. However, in archaebacteria these genes are non-adjacent and exhibit no linkage.
Role in human disease
The identification of the genetic basis for the causative agent of a disease can be an important component of understanding its effects and spread. The location and content of structural genes can elucidate the evolution of virulence, as well as provide necessary information for treatment. Likewise, understanding the specific changes in structural gene sequences underlying a gain or loss of virulence aids in understanding the mechanism by which diseases affect their hosts.
For example, ''Yersinia pestis'' (the causative agent of bubonic plague) was found to carry several virulence and inflammation-related structural genes on plasmids. Likewise, the structural gene responsible for tetanus was determined to be carried on a plasmid as well. Diphtheria is caused by a bacterium, but only after that bacterium has been infected by a bacteriophage carrying the structural genes for the toxin.
In Herpes simplex virus, the structural gene sequence responsible for virulence was found in two locations in the genome despite only one location actually producing the viral gene product. This was hypothesized to serve as a potential mechanism for strains to regain virulence if lost through mutation.
Understanding the specific changes in structural genes underlying a gain or loss of virulence is a necessary step in the formation of specific treatments, as well as in the study of possible medicinal uses of toxins.
Phylogenetics
As far back as 1974, DNA sequence similarity was recognized as a valuable tool for determining relationships among taxa. Structural genes in general are more highly conserved due to functional constraint, and so can prove useful in examinations of more disparate taxa. Original analyses enriched samples for structural genes via hybridization to mRNA.
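One simple way to quantify sequence similarity between taxa is the percentage of matching positions in a pairwise alignment. The Python sketch below is a minimal illustration under stated assumptions: the sequences are invented, they are assumed to be already aligned to equal length with `-` marking gaps, and `percent_identity` is a hypothetical helper, not the method used in the studies referred to above.

```python
def percent_identity(seq_a, seq_b):
    """Percentage of identical bases over aligned positions where neither sequence has a gap."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = [(x, y) for x, y in zip(seq_a, seq_b) if x != "-" and y != "-"]
    if not compared:
        return 0.0
    matches = sum(1 for x, y in compared if x == y)
    return 100.0 * matches / len(compared)


# Invented, pre-aligned fragments of a conserved structural gene in three taxa.
taxon_a = "ACGTACGTAC"
taxon_b = "ACGTACGTTC"
taxon_c = "ACGAAC-TTC"

identity_ab = percent_identity(taxon_a, taxon_b)
identity_ac = percent_identity(taxon_a, taxon_c)
```

In this toy data taxon_b is more similar to taxon_a than taxon_c is, which would be read as a closer relationship; real analyses use substitution models and many more sites.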
More recent phylogenetic approaches have focused on structural genes of known function, conserved to varying degrees. rRNA sequences are frequent targets, as they are conserved in all species. Microbiology has specifically targeted the 16S rRNA gene to determine species-level differences. In higher-order taxa, COI is now considered the “barcode of life” and is applied for most biological identification.
Debate
Despite the widespread classification of genes as either structural or regulatory, these categories are not an absolute division. Recent genetic discoveries call into question the distinction between regulatory and structural genes, suggesting greater complexity. Structural gene expression is regulated by numerous factors including epigenetics (e.g. methylation) and RNA interference (RNAi). Structural genes and regulatory genes alike can be regulated epigenetically in the same way, so not all regulation is encoded by “regulatory genes”.
There are also examples of proteins that do not clearly fit either category, such as chaperone proteins. These proteins aid in the folding of other proteins, a seemingly regulatory role. Yet these same proteins also aid in the movement of their chaperoned proteins across membranes, and have now been implicated in immune responses (see Hsp60) and in the apoptotic pathway (see Hsp70).
More recently, microRNAs were found to be produced from the internal transcribed spacers of rRNA genes. Thus an internal component of a structural gene is, in fact, regulatory. Binding sites for microRNAs were also detected within coding sequences of genes. Typically, interfering RNAs target the 3′ UTR, but inclusion of binding sites within the protein-coding sequence itself allows the transcripts of these proteins to effectively regulate the microRNAs within the cell. This interaction was demonstrated to have an effect on expression, and thus again a structural gene contains a regulatory component.