Volume, Area, Dihedral Angle Reporter (VADAR) is a freely available protein structure validation web server that was developed as a collaboration between Dr. Brian Sykes and Dr. David Wishart at the
University of Alberta
The University of Alberta (also known as U of A or UAlberta, ) is a public research university located in Edmonton, Alberta, Canada. It was founded in 1908 by Alexander Cameron Rutherford, the first premier of Alberta, and Henry Marshall Tory, t ...
.
VADAR consists of over 15 different algorithms and programs for assessing and validating
peptide
Peptides are short chains of amino acids linked by peptide bonds. A polypeptide is a longer, continuous, unbranched peptide chain. Polypeptides that have a molecular mass of 10,000 Da or more are called proteins. Chains of fewer than twenty am ...
and
protein structure
Protein structure is the three-dimensional arrangement of atoms in an amino acid-chain molecule. Proteins are polymers specifically polypeptides formed from sequences of amino acids, which are the monomers of the polymer. A single amino acid ...
s from their
PDB coordinate data. VADAR is capable of determining
secondary structure
Protein secondary structure is the local spatial conformation of the polypeptide backbone excluding the side chains. The two most common Protein structure#Secondary structure, secondary structural elements are alpha helix, alpha helices and beta ...
(using three different algorithms), identifying and classifying six different types of
beta turns, determining and calculating the strength of C=O -- N-H hydrogen bonds, calculating residue-specific
accessible surface area
The accessible surface area (ASA) or solvent-accessible surface area (SASA) is the surface area of a biomolecule that is accessible to a solvent. Measurement of ASA is usually described in units of square angstroms (a standard unit of measuremen ...
s (ASA), calculating residue volumes, determining
backbone and side chain torsion angles (phi, psi, omega and chi angles), assessing local structure quality (through numerous quality indices), evaluating global structure quality, and identifying residue "outliers" (residues with unusual structural features). The results have been validated through extensive comparison to published data and careful visual inspection. VADAR produces both text and graphical output with most of the quantitative data presented in easily viewed tables. In particular, VADAR's output is presented in a vertical, tabular format with most of the sequence data, residue numbering and any other calculated property or feature presented from top to bottom, rather than from left to right.
Identification of Secondary Structure
VADAR identifies and assigns protein
secondary structure
Protein secondary structure is the local spatial conformation of the polypeptide backbone excluding the side chains. The two most common Protein structure#Secondary structure, secondary structural elements are alpha helix, alpha helices and beta ...
using 3 different algorithms. These three methods are then combined to create a consensus
secondary structure
Protein secondary structure is the local spatial conformation of the polypeptide backbone excluding the side chains. The two most common Protein structure#Secondary structure, secondary structural elements are alpha helix, alpha helices and beta ...
assignment. Only 3 types of
secondary structure
Protein secondary structure is the local spatial conformation of the polypeptide backbone excluding the side chains. The two most common Protein structure#Secondary structure, secondary structural elements are alpha helix, alpha helices and beta ...
are identified: Helices are indicated with an "H", beta-strands are indicated with a "B" and coil or unstructured regions are identified with a "C".
Secondary structure
Protein secondary structure is the local spatial conformation of the polypeptide backbone excluding the side chains. The two most common Protein structure#Secondary structure, secondary structural elements are alpha helix, alpha helices and beta ...
assignments for each residue are listed under the column labeled SCND STRUC. The first
secondary structure
Protein secondary structure is the local spatial conformation of the polypeptide backbone excluding the side chains. The two most common Protein structure#Secondary structure, secondary structural elements are alpha helix, alpha helices and beta ...
identification method (which appears in column 1) uses a geometric masking approach that was first described by Richards and Kundrot
with slight modifications. The second method (which appears in column 2) uses
backbone dihedral angles to identify
secondary structure
Protein secondary structure is the local spatial conformation of the polypeptide backbone excluding the side chains. The two most common Protein structure#Secondary structure, secondary structural elements are alpha helix, alpha helices and beta ...
elements in a manner initially described by Levitt and Greer
as well as Chou and Fasman.
The third
secondary structure
Protein secondary structure is the local spatial conformation of the polypeptide backbone excluding the side chains. The two most common Protein structure#Secondary structure, secondary structural elements are alpha helix, alpha helices and beta ...
identification method uses hydrogen bonding patterns (in association with measured dihedral angles) to identify helices, beta strands and coil regions. This third method is somewhat similar to the method originally described by Kabsch and Sander.
The net result or consensus
secondary structure
Protein secondary structure is the local spatial conformation of the polypeptide backbone excluding the side chains. The two most common Protein structure#Secondary structure, secondary structural elements are alpha helix, alpha helices and beta ...
is a weighted combination of each of the three methods. VADAR’s method of
secondary structure
Protein secondary structure is the local spatial conformation of the polypeptide backbone excluding the side chains. The two most common Protein structure#Secondary structure, secondary structural elements are alpha helix, alpha helices and beta ...
identification generally identifies a higher fraction of
secondary structure
Protein secondary structure is the local spatial conformation of the polypeptide backbone excluding the side chains. The two most common Protein structure#Secondary structure, secondary structural elements are alpha helix, alpha helices and beta ...
elements than the DSSP algorithm (64% helices and beta strands for VADAR versus 51% helices and beta strands for DSSP). In particular, VADAR’s
secondary structure
Protein secondary structure is the local spatial conformation of the polypeptide backbone excluding the side chains. The two most common Protein structure#Secondary structure, secondary structural elements are alpha helix, alpha helices and beta ...
assignments appear to agree more closely with
secondary structures identified by visual inspection (i.e. author assignments), by STRIDE (another
secondary structure
Protein secondary structure is the local spatial conformation of the polypeptide backbone excluding the side chains. The two most common Protein structure#Secondary structure, secondary structural elements are alpha helix, alpha helices and beta ...
assignment algorithm) or via independent methods (i.e. NMR-based NOE methods).
Calculation of Accessible Surface Area
Accessible surface area
The accessible surface area (ASA) or solvent-accessible surface area (SASA) is the surface area of a biomolecule that is accessible to a solvent. Measurement of ASA is usually described in units of square angstroms (a standard unit of measuremen ...
s is a measure of the solvent exposure of individual atoms or residues (measured in square Angstroms). It corresponds to the surface area of an atom (or residue) that a water molecule can access or touch. In VADAR, the
accessible surface area
The accessible surface area (ASA) or solvent-accessible surface area (SASA) is the surface area of a biomolecule that is accessible to a solvent. Measurement of ASA is usually described in units of square angstroms (a standard unit of measuremen ...
s (ASA) for each residue is presented under two different column headers: RES ASA (residue ASA) and FRAC ASA (fractional ASA). The data listed under the RES ASA column refers to the “residue
accessible surface area
The accessible surface area (ASA) or solvent-accessible surface area (SASA) is the surface area of a biomolecule that is accessible to a solvent. Measurement of ASA is usually described in units of square angstroms (a standard unit of measuremen ...
s” as measured in square Angstroms. The data listed under the FRAC ASA column refers to the fractional residue
accessible surface area
The accessible surface area (ASA) or solvent-accessible surface area (SASA) is the surface area of a biomolecule that is accessible to a solvent. Measurement of ASA is usually described in units of square angstroms (a standard unit of measuremen ...
s (a value ranging from 0 to 1.0). Exposed, exterior, random coil or hydrophilic residues typically have a large fractional
accessible surface area
The accessible surface area (ASA) or solvent-accessible surface area (SASA) is the surface area of a biomolecule that is accessible to a solvent. Measurement of ASA is usually described in units of square angstroms (a standard unit of measuremen ...
s (>0.5), while hydrophobic, beta sheet or interior residues have a small fractional
accessible surface area
The accessible surface area (ASA) or solvent-accessible surface area (SASA) is the surface area of a biomolecule that is accessible to a solvent. Measurement of ASA is usually described in units of square angstroms (a standard unit of measuremen ...
s (<0.2). The fractional
accessible surface area
The accessible surface area (ASA) or solvent-accessible surface area (SASA) is the surface area of a biomolecule that is accessible to a solvent. Measurement of ASA is usually described in units of square angstroms (a standard unit of measuremen ...
s is calculated by dividing a given residue’s observed
accessible surface area
The accessible surface area (ASA) or solvent-accessible surface area (SASA) is the surface area of a biomolecule that is accessible to a solvent. Measurement of ASA is usually described in units of square angstroms (a standard unit of measuremen ...
s by the calculated
accessible surface area
The accessible surface area (ASA) or solvent-accessible surface area (SASA) is the surface area of a biomolecule that is accessible to a solvent. Measurement of ASA is usually described in units of square angstroms (a standard unit of measuremen ...
s for that residue in an extended Gly-Xaa-Gly tripeptide (where Xaa is the residue of interest). VADAR reports accessible surface are values both for the entire amino acid residue and for the amino acid side chains. The
accessible surface area
The accessible surface area (ASA) or solvent-accessible surface area (SASA) is the surface area of a biomolecule that is accessible to a solvent. Measurement of ASA is usually described in units of square angstroms (a standard unit of measuremen ...
s is also calculated for charged atoms (N, O), polar (N, O, S) atoms and for non-polar atoms (C). This information can be used to calculate charged, polar and non-polar surface area.
Accessible surface area
The accessible surface area (ASA) or solvent-accessible surface area (SASA) is the surface area of a biomolecule that is accessible to a solvent. Measurement of ASA is usually described in units of square angstroms (a standard unit of measuremen ...
s measurements/estimates are particularly useful in protein structure assessment, protein structure validation and thermodynamic calculations. The values calculated for
accessible surface area
The accessible surface area (ASA) or solvent-accessible surface area (SASA) is the surface area of a biomolecule that is accessible to a solvent. Measurement of ASA is usually described in units of square angstroms (a standard unit of measuremen ...
s (ASA) depend critically on the selection or choice of atomic or Van der Waals radii. Different methods and different authors have advocated the use of different
atomic radii. As a result, VADAR provides several choices for
atomic or
Van der Waals radii.
Calculation of Backbone and Side Chain Torsion Angles
Protein torsion angles are calculated for phi, psi, omega (which corresponds to the peptide bond) and chi1 (the first side chain torsion angle) using standard
IUPAC
The International Union of Pure and Applied Chemistry (IUPAC ) is an international federation of National Adhering Organizations working for the advancement of the chemical sciences, especially by developing nomenclature and terminology. It is ...
definitions. These values are listed under four different column headers: PHI, PSI, OMEGA and CHI1. All torsion angles are reported in degrees. Torsion angles are a very useful indicator of the
stereochemistry
Stereochemistry, a subdiscipline of chemistry, studies the spatial arrangement of atoms that form the structure of molecules and their manipulation. The study of stereochemistry focuses on the relationships between stereoisomers, which are defined ...
and the stereochemical quality of a
protein structure
Protein structure is the three-dimensional arrangement of atoms in an amino acid-chain molecule. Proteins are polymers specifically polypeptides formed from sequences of amino acids, which are the monomers of the polymer. A single amino acid ...
, with most high-quality proteins exhibiting a relatively tight clustering of phi/psi angles and relatively little deviation in the omega angles.
Determination of Beta Turns
Beta turns are another type of “short” or local
secondary structure
Protein secondary structure is the local spatial conformation of the polypeptide backbone excluding the side chains. The two most common Protein structure#Secondary structure, secondary structural elements are alpha helix, alpha helices and beta ...
that is distinct from the more common helices,
beta sheets
The beta sheet (β-sheet, also β-pleated sheet) is a common motif of the regular protein secondary structure. Beta sheets consist of beta strands (β-strands) connected laterally by at least two or three backbone hydrogen bonds, forming a gene ...
or
random coil
In polymer chemistry, a random coil is a conformation of polymers where the monomer subunits are oriented randomly while still being bonded to adjacent units. It is not one specific shape, but a statistical distribution of shapes for all the cha ...
s.
Beta turns are reasonably abundant (15%) and very important
secondary structures in proteins. In particular,
beta turns play a critical role in defining the
topology of proteins. They also likely play a role in initiating early packing events during the protein folding process. In VADAR
beta turns are identified under the BTURN header using standard Roman numeral notation (I = type I, II = type II, etc.). In VADAR,
beta turns are identified using a combination of different pieces of information including hydrogen bond data, the location of previously identified
secondary structures and the value of their local dihedral angles. In VADAR the classification and nomenclature used for
beta turns follows the definitions proposed by Wilmot and Thornton.
Calculation of Amino Acid Residue Volume
Because of the
van der Waals force
In molecular physics and chemistry, the van der Waals force (sometimes van der Waals' force) is a distance-dependent interaction between atoms or molecules. Unlike ionic or covalent bonds, these attractions do not result from a chemical elec ...
, atoms occupy space, which prevents other atoms from passing through each other. This 3D space or volume is called the excluded volume. Excluded volume is defined as the volume occupied by an atom or residue as determined by its
atomic radii and its nearest neighbors. Excluded volume it is typically given in units of cubic Angstroms. In VADAR the excluded volume for each amino acid residue is listed under two different column headers: RES VOL (residue volume) and FRAC VOL (fractional volume). Residue volume is presented in cubic Angstraoms and calculated using the Voronoi polyhedra algorithm that was first introduced by Dr. Frederic Richards.
In VADAR the number listed under the RES VOL header corresponds to the excluded volume (in cubic Angstroms) while the value under the FRAC VOL header corresponds to the fractional volume (which ranges from 0 to 1.0 or more). If a protein is efficiently packed, all of its residues should have fractional volumes close to 1.0 (+/- 0.1). In certain circumstances, if an amino acid residue is located in an interior cavity (or it has been placed improperly through poor refinement) it could have a fractional volume greater than 1.20. An amino acid residue located in a compressed region or a poorly refined region will have a fractional volume less than 0.80. Structural biologists often use excluded volume to help them find cavities, water-binding pockets, unexpected atomic overlaps or to identify problem areas in a protein structure. High quality
protein structure
Protein structure is the three-dimensional arrangement of atoms in an amino acid-chain molecule. Proteins are polymers specifically polypeptides formed from sequences of amino acids, which are the monomers of the polymer. A single amino acid ...
s should have relatively few residues with fractional volumes greater than 1.20 or less than 0.80.
History
Initially released in 2002, the VADAR web server has gone through a number of revisions and updates (now in version 1.8). The latest version of the VADAR web server supports the submission of either PDB formatted files or PDB accession numbers and generates extensive tables and high quality graphs for quantitatively and qualitatively assessing
protein structure
Protein structure is the three-dimensional arrangement of atoms in an amino acid-chain molecule. Proteins are polymers specifically polypeptides formed from sequences of amino acids, which are the monomers of the polymer. A single amino acid ...
s determined by
X-ray crystallography
X-ray crystallography is the experimental science of determining the atomic and molecular structure of a crystal, in which the crystalline structure causes a beam of incident X-rays to Diffraction, diffract in specific directions. By measuring th ...
,
NMR spectroscopy
Nuclear magnetic resonance spectroscopy, most commonly known as NMR spectroscopy or magnetic resonance spectroscopy (MRS), is a spectroscopic technique based on re-orientation of atomic nuclei with non-zero nuclear spins in an external magnetic f ...
, 3D-threading or
homology modelling
Homology modeling, also known as comparative modeling of protein, refers to constructing an atomic-resolution model of the "''target''" protein from its amino acid sequence and an experimental three-dimensional structure of a related homologous pr ...
. A separate website supports the analysis of multiple protein chains – as might be generated from a standard NMR structure determination effort.
See also
*
Crystallography
Crystallography is the branch of science devoted to the study of molecular and crystalline structure and properties. The word ''crystallography'' is derived from the Ancient Greek word (; "clear ice, rock-crystal"), and (; "to write"). In J ...
*
DSSP (hydrogen bond estimation algorithm)
*
Nuclear magnetic resonance
Nuclear magnetic resonance (NMR) is a physical phenomenon in which nuclei in a strong constant magnetic field are disturbed by a weak oscillating magnetic field (in the near field) and respond by producing an electromagnetic signal with a ...
*
Protein Structure Evaluation Suite & Server (PROSESS)
*
Protein structure prediction
Protein structure prediction is the inference of the three-dimensional structure of a protein from its amino acid sequence—that is, the prediction of its Protein secondary structure, secondary and Protein tertiary structure, tertiary structure ...
*
Resolution by Proxy (ResProx)
*
Root-mean-square deviation
The root mean square deviation (RMSD) or root mean square error (RMSE) is either one of two closely related and frequently used measures of the differences between true or predicted values on the one hand and observed values or an estimator on th ...
*
Structural bioinformatics
*
Structure validation
Macromolecular structure validation is the process of evaluating reliability for 3-dimensional atomic models of large biological molecules such as proteins and nucleic acids. These models, which provide 3D coordinates for each atom in the molecu ...
References
{{reflist
Protein structure databases