PhylomeDB is a public
biological database
Biological databases are libraries of biological sciences, collected from scientific experiments, published literature, high-throughput experiment technology, and computational analysis. They contain information from research areas including genom ...
for complete catalogs of gene phylogenies (
phylomes).
It allows users to interactively explore the evolutionary history of genes through the visualization of
phylogenetic trees and
multiple sequence alignments. Moreover, phylomeDB provides genome-wide orthology and paralogy predictions which are based on the analysis of the phylogenetic trees. The automated pipeline used to reconstruct trees aims at providing a high-quality phylogenetic analysis of different genomes, including Maximum Likelihood tree inference, alignment trimming
and evolutionary model testing.
PhylomeDB includes also a public download section with the complete set of trees, alignments and orthology predictions, as well as a web API that facilitates cross linking trees from external sources. Finally, phylomeDB provides an advanced tree visualization interface based on the ETE toolkit,
[{{cite journal , last=Huerta-Cepas, J, author2= Dopazo, J, author3=Gabaldón, T , date=Jan 2010, title=ETE: a python Environment for Tree Exploration., journal = BMC Bioinformatics , volume=11, issue=Database issue, pages=24, pmid = 20070885, doi =10.1186/1471-2105-11-24, pmc = 2820433 ] which integrates tree topologies, taxonomic information, domain mapping and alignment visualization in a single and interactive tree image.
New steps on phylomeDB
The tree searching engine of PhylomeDB was updated to provide a gene-centric view of all phylomeDB resources. Thus, after a protein or gene search, all the available trees in phylomeDB are listed and organized by phylome and tree type. Users can switch among all available seed and collateral trees without missing the focus on the searched protein or gene.
In phylomeDB v4 all the information available for each tree is now shown using an integrated layout in which tree topology, taxonomy data, alignments and domain annotations, and event-age (phylostratigraphy) information are rendered in the same figure using the newest visualization features provided by the ETE toolkit v2.2:
#
Pfam
Pfam is a database of protein families that includes their annotations and multiple sequence alignments generated using hidden Markov models. The most recent version, Pfam 35.0, was released in November 2021 and contains 19,632 families.
Use ...
domains have been mapped to each alignment in our database and are now displayed in a compact panel at the right side of the tree. For each sequence, domains and their names are shown, they can be clicked to obtain a short description and the external link to Pfam. Protein regions not mapped to domains are shown using the standard amino acid color codes, while gap regions are represented by a flat line.
# Tree images have been also simplified to improve readability. Mappings and/or cross-linking to general and organism-oriented databases has been extended to include the major Arabidopsis thaliana sequence database
TAIR, Drosophila’s
Flybase
FlyBase is an online bioinformatics database and the primary repository of genetic and molecular data for the insect family Drosophilidae. For the most extensively studied species and model organism, ''Drosophila melanogaster'', a wide range of d ...
, as well as the Ascomycete-based genome database
Genolevures.
# Speciation and duplication events are indicated using different node colors and branch support values are now automatically highlighted for lowly supported partitions using a transparent red bubble inversely proportional to the branch bootstrap or aLRT value.
# Internal tree searches can be performed for any of the annotated node attributes while links to other databases are provided through the contextual menu of the tree browser that appear when clicking any node.
Also, users can download relevant data, including the whole database, a specific phylome or, from the tree entry page, the relevant data corresponding to that tree. In this new release we have implemented the possibility to download orthology predictions from a tree in the recently developed
OrthoXML standard format, in addition to a tabulated format.
Quest for Orthologs
The
Quest for Orthologs
A quest is a journey toward a specific mission or a goal. The word serves as a plot device in mythology and fiction: a difficult journey towards a goal, often symbolic or allegorical. Tales of quests figure prominently in the folklore of ever ...
(QfO) consortium involve more than 30 phylogenomic databases. The main of the consortium is improve and standardize orthology predictions through collaboration and discuss about new emerging methods.
#link to
Quest for Orthologs#link to
See also
*
Genomics
Genomics is an interdisciplinary field of biology focusing on the structure, function, evolution, mapping, and editing of genomes. A genome is an organism's complete set of DNA, including all of its genes as well as its hierarchical, three-dim ...
*
Phylogeny
A phylogenetic tree (also phylogeny or evolutionary tree Felsenstein J. (2004). ''Inferring Phylogenies'' Sinauer Associates: Sunderland, MA.) is a branching diagram or a tree showing the evolutionary relationships among various biological spe ...
*
Phylogenomics
Phylogenomics is the intersection of the fields of evolution and genomics. The term has been used in multiple ways to refer to analysis that involves genome data and evolutionary reconstructions. It is a group of techniques within the larger fields ...
References
External links
PhylomeDBToni Gabaldón LabCentre for Genomic Regulation
Biological databases
Computational phylogenetics
Genetics databases