Metaproteomics (also Community Proteomics, Environmental Proteomics, or Community Proteogenomics) is an umbrella term for experimental approaches to study all
protein
Proteins are large biomolecules and macromolecules that comprise one or more long chains of amino acid residue (biochemistry), residues. Proteins perform a vast array of functions within organisms, including Enzyme catalysis, catalysing metab ...
s in microbial communities and
microbiome
A microbiome () is the community of microorganisms that can usually be found living together in any given habitat. It was defined more precisely in 1988 by Whipps ''et al.'' as "a characteristic microbial community occupying a reasonably wel ...
s from
environmental sources. Metaproteomics is used to classify experiments that deal with all proteins identified and quantified from complex microbial communities. Metaproteomics approaches are comparable to gene-centric environmental
genomics
Genomics is an interdisciplinary field of molecular biology focusing on the structure, function, evolution, mapping, and editing of genomes. A genome is an organism's complete set of DNA, including all of its genes as well as its hierarchical, ...
, or
metagenomics
Metagenomics is the study of all genetics, genetic material from all organisms in a particular environment, providing insights into their composition, diversity, and functional potential. Metagenomics has allowed researchers to profile the mic ...
.
Origin of the term
The term "metaproteomics" was proposed by Francisco RodrÃguez-Valera to describe the genes and/or proteins most abundantly
expressed in environmental samples. The term was derived from "metagenome". Wilmes and Bond proposed the term "metaproteomics" for the large-scale characterization of the entire protein complement of environmental
microbiota
Microbiota are the range of microorganisms that may be commensal, mutualistic, or pathogenic found in and on all multicellular organisms, including plants. Microbiota include bacteria, archaea, protists, fungi, and viruses, and have been found ...
at a given point in time. At the same time, the terms "microbial community proteomics" and "microbial community proteogenomics" are sometimes used interchangeably for different types of experiments and results.
Questions addressed by metaproteomics
Metaproteomics allows for scientists to better understand organisms' gene functions, as genes in DNA are transcribed to mRNA which is then translated to protein. Gene expression changes can therefore be monitored through this method. Furthermore, proteins represent cellular activity and structure, so using metaproteomics in research can lead to functional information at the molecular level. Metaproteomics can also be used as a tool to assess the composition of a microbial community in terms of biomass contributions of individual members species in the community and can thus complement approaches that assess community composition based on gene copy counts such as 16S rRNA gene
amplicon or metagenome sequencing.
Proteomics of microbial communities
The first
proteomics
Proteomics is the large-scale study of proteins. Proteins are vital macromolecules of all living organisms, with many functions such as the formation of structural fibers of muscle tissue, enzymatic digestion of food, or synthesis and replicatio ...
experiment was conducted with the invention of
two-dimensional
A two-dimensional space is a mathematical space with two dimensions, meaning points have two degrees of freedom: their locations can be locally described with two coordinates or they can move in two independent directions. Common two-dimension ...
polyacrylamide gel electrophoresis (2D-PAGE). The 1980s and 1990s saw the development of
mass spectrometry
Mass spectrometry (MS) is an analytical technique that is used to measure the mass-to-charge ratio of ions. The results are presented as a ''mass spectrum'', a plot of intensity as a function of the mass-to-charge ratio. Mass spectrometry is used ...
and mass spectrometry based proteomics. The current proteomics of microbial community makes use of both gel-based (one-dimensional and two-dimensional) and non-gel
liquid chromatography based separation, where both rely on mass spectrometry based peptide identification.
While proteomics is largely a discovery-based approach that is followed by other molecular or analytical techniques to provide a full picture of the subject system, it is not limited to simple cataloging of proteins present in a sample. With the combined capabilities of
"top-down" and
"bottom-up" approaches, proteomics can pursue inquiries ranging from quantitation of
gene expression
Gene expression is the process (including its Regulation of gene expression, regulation) by which information from a gene is used in the synthesis of a functional gene product that enables it to produce end products, proteins or non-coding RNA, ...
between growth conditions (whether nutritional, spatial, temporal, or chemical) to
protein structural information.
A metaproteomics study of the human oral microbiome found 50 bacterial genera using
shotgun proteomics. The results agreed with the Human Microbiome Project, a metagenomic based approach.
Similarly, metaproteomics approaches have been used in larger clinical studies linking the bacterial proteome with human health. A recent paper used shotgun proteomics to characterize the vaginal microbiome, identifying 188 unique bacterial species in 688 women profiled. This study linked vaginal microbiome groups to the efficacy of topical antiretroviral drugs to prevent HIV acquisition in women, which was attributed to bacterial metabolism of the drug in vivo. In addition, metaproteomic approaches have been used to study other aspects of the vaginal microbiome, including the immunological and inflammatory consequences of vaginal microbial dysbiosis, as well as the influence of hormonal contraceptives on the vaginal microbiome.
Metaproteomics and the human intestinal microbiome
Aside from the
oral
The word oral may refer to:
Relating to the mouth
* Relating to the mouth, the first portion of the alimentary canal that primarily receives food and liquid
**Oral administration of medicines
** Oral examination (also known as an oral exam or ora ...
and
vaginal microbiomes, several intestinal microbiome studies have used metaproteomic approaches. A 2020 study done by Long et al. has shown, using metaproteomic approaches, that
colorectal cancer
Colorectal cancer (CRC), also known as bowel cancer, colon cancer, or rectal cancer, is the development of cancer from the Colon (anatomy), colon or rectum (parts of the large intestine). Signs and symptoms may include Lower gastrointestinal ...
pathogenesis
In pathology, pathogenesis is the process by which a disease or disorder develops. It can include factors which contribute not only to the onset of the disease or disorder, but also to its progression and maintenance. The word comes .
Descript ...
may be due to changes in the
intestinal microbiome. Several proteins examined in this study were associated with
iron intake and transport as well as
oxidative stress
Oxidative stress reflects an imbalance between the systemic manifestation of reactive oxygen species and a biological system's ability to readily detoxify the reactive intermediates or to repair the resulting damage. Disturbances in the normal ...
, as high intestinal iron content and oxidative stress are indicative of colorectal cancer.
Another study done in 2017 by Xiong et al. used metaproteomics along with metagenomics in analyzing
gut microbiome
Gut microbiota, gut microbiome, or gut flora are the microorganisms, including bacteria, archaea, fungi, and viruses, that live in the digestive tracts of animals. The gastrointestinal metagenome is the aggregate of all the genomes of the g ...
changes during
human development Human development may refer to:
* Development of the human body
** This includes physical developments such as growth, and also development of the brain
* Developmental psychology
* Development theory
* Human development (economics)
* Human Develo ...
. Xiong et al. found that the infant
gut microbiome
Gut microbiota, gut microbiome, or gut flora are the microorganisms, including bacteria, archaea, fungi, and viruses, that live in the digestive tracts of animals. The gastrointestinal metagenome is the aggregate of all the genomes of the g ...
may be initially populated with
facultative anaerobes such as ''
Enterococcus
''Enterococcus'' is a large genus of lactic acid bacteria of the phylum Bacillota. Enterococci are Gram-positive cocci that often occur in pairs ( diplococci) or short chains, and are difficult to distinguish from streptococci on physical ch ...
'' and ''
Klebsiella
''Klebsiella'' is a genus of Gram-negative, oxidase-negative, rod-shaped bacteria with a prominent polysaccharide-based capsule.
''Klebsiella'' is named after German-Swiss microbiologist Edwin Klebs (1834–1913). Carl Friedlander described ' ...
'', and then later populated by
obligate anaerobe
Obligate anaerobes are microorganisms killed by normal Atmosphere of Earth, atmospheric concentrations of oxygen (20.95% O2). Oxygen tolerance varies between species, with some species capable of surviving in up to 8% oxygen, while others lose vi ...
s like ''
Clostridium
''Clostridium'' is a genus of anaerobic, Gram-positive bacteria. Species of ''Clostridium'' inhabit soils and the intestinal tracts of animals, including humans. This genus includes several significant human pathogens, including the causative ...
'', ''
Bifidobacterium'', and ''
Bacteroides
''Bacteroides'' is a genus of Gram-negative, obligate anaerobic bacteria. ''Bacteroides'' species are non endospore–forming bacilli, and may be either motile or nonmotile, depending on the species. The DNA base composition is 40–48% GC. Un ...
''. While the human gut microbiome shifted over time, microbial metabolic functions remained consistent, including
carbohydrate
A carbohydrate () is a biomolecule composed of carbon (C), hydrogen (H), and oxygen (O) atoms. The typical hydrogen-to-oxygen atomic ratio is 2:1, analogous to that of water, and is represented by the empirical formula (where ''m'' and ''n'' ...
,
amino acid
Amino acids are organic compounds that contain both amino and carboxylic acid functional groups. Although over 500 amino acids exist in nature, by far the most important are the 22 α-amino acids incorporated into proteins. Only these 22 a ...
and
nucleotide
Nucleotides are Organic compound, organic molecules composed of a nitrogenous base, a pentose sugar and a phosphate. They serve as monomeric units of the nucleic acid polymers – deoxyribonucleic acid (DNA) and ribonucleic acid (RNA), both o ...
metabolism
Metabolism (, from ''metabolē'', "change") is the set of life-sustaining chemical reactions in organisms. The three main functions of metabolism are: the conversion of the energy in food to energy available to run cellular processes; the co ...
.
A similar study done in 2017 by Maier et al. combined metaproteomics with metagenomics and
metabolomics
Metabolomics is the scientific study of chemical processes involving metabolites, the small molecule substrates, intermediates, and products of cell metabolism. Specifically, metabolomics is the "systematic study of the unique chemical fingerpri ...
to show the effects of resistant
starch
Starch or amylum is a polymeric carbohydrate consisting of numerous glucose units joined by glycosidic bonds. This polysaccharide is produced by most green plants for energy storage. Worldwide, it is the most common carbohydrate in human diet ...
on the human intestinal microbiome. After subjects consumed diets high in resistant starch, it was discovered that several microbial proteins were altered such as
butyrate kinase,
enoyl coenzyme A (
enoyl-CoA) hydratase,
phosphotransacetylase,
adenylosuccinate synthase,
adenine phosphoribosyltransferases, and
guanine
Guanine () (symbol G or Gua) is one of the four main nucleotide bases found in the nucleic acids DNA and RNA, the others being adenine, cytosine, and thymine ( uracil in RNA). In DNA, guanine is paired with cytosine. The guanine nucleoside ...
phosphoribosyltransferases. The human subjects experienced increases in
colipase
Colipase, abbreviated CLPS, is a protein co-enzyme that counteracts the inhibitory effect of intestinal bile acid on the enzymatic activity of pancreatic lipase. It is secreted by the pancreas in an Zymogen, inactive form, procolipase, which is a ...
,
pancreatic triglyceride lipase
The enzyme triacylglycerol lipase (also triglyceride lipase, EC 3.1.1.3;systematic name triacylglycerol acylhydrolase) catalyses the hydrolysis of ester linkages of triglycerides:
: triacylglycerol + H2O diacylglycerol + a carboxylate
These l ...
, bile salt-stimulated lipase abundance while also experiencing a decrease in
α-amylase. Metaproteomics has also been used to understand the human-microbiome interactions that may underlie cardiovascular health. Using machine learning, a 2025 study by Yang et al. showed that human and microbial proteins could identify those at high-risk of cardiovascular disease in healthy and heart failure cohorts. These were proteins were primarily associated with intestinal inflammation and production of
short-chain fatty acids.
Overall, metaproteomics has gained immense popularity in human intestinal microbiome studies as it has led to important discoveries in the health field.
Metaproteomics in environmental microbiome studies
Metaproteomics has been especially useful in the identification of microbes involved in various biodegradation processes. A 2017 study done by Jia et al. has shown the application of metaproteomics in examining protein expression profiles of biofuel-producing microorganisms. According to this study, bacterial and archaeal proteins are involved in producing hydrogen and methane-derived biofuels. Bacterial proteins involved are ferredoxin-NADP reductase, acetate kinase, and NADH-quinone oxidoreductase found in the ''Firmicutes, Proteobacteria, Actinobacteria'' and ''Bacteroidetes'' taxa. These particular proteins are involved in carbohydrate, lipid, and amino acid metabolism. The archaeal proteins involved are acetyl-CoA decarboxylase and methyl-coenzyme M reductase found in ''Methanosarcina''. These proteins participate in biochemical pathways involving acetic acid utilization, reduction, and methyl nutrient usage.
The first quantification method for metaproteomics was reported by Laloo et al. 2018 on an engineered biological reactor enriched for ammonia and nitrite oxidising bacteria. Here the authors used a robust SWATH-MS quantification method ( protein requirement 5μg) for studying the change in expression levels of protein to a perturbed condition. The study noted that the changes in protein expression of the dominant species i.e. ammonia oxidising bacteria were clearly observed but this was not so for the nitrite oxidising bacteria which was found in low abundance.
A 2019 study by Li et al. has demonstrated the use of metaproteomics in observing protein expression of polycyclic aromatic hydrocarbon (PAH) degradation genes. The authors of this study specifically focused on identifying the degradable microbial communities in activated sludge during wastewater treatment, as PAHs are highly prevalent wastewater pollutants. They showed that ''Burkholderiales'' bacteria are heavily involved in PAH degradation, and that the bacterial proteins are involved in DNA replication, fatty acid and glucose metabolism, stress response, protein synthesis, and aromatic hydrocarbon metabolism.
A similar study done in 2020 by Zhang et al. involved metaproteomic profiling of azo dye-degrading microorganisms. As azo dyes are hazardous industrial pollutants, metaproteomics was used to observe the overall biodegradation mechanism. ''Pseudomonas Burkholderia, Enterobacter, Lactococcus'' and ''Clostridium'' strains were identified using metagenomic shotgun sequencing, and many bacterial proteins were found to show degradative activity. These proteins identified using metaproteomics include those involved in the TCA cycle, glycolysis, and aldehyde dehydrogenation. Identification of these proteins therefore led the scientists into proposing potential azo dye degradation pathways in ''Pseudomonas'' and ''Burkholderia''.
All in all, metaproteomics is applicable not only to human health studies, but also to environmental studies involving potentially harmful contaminants.
See also
*
Metatranscriptomics
References
{{Genomics
Biological techniques and tools
Environmental microbiology