Climate And Forecast Metadata Conventions
   HOME

TheInfoList



OR:

The Climate and Forecast (CF)
metadata Metadata (or metainformation) is "data that provides information about other data", but not the content of the data itself, such as the text of a message or the image itself. There are many distinct types of metadata, including: * Descriptive ...
conventions are conventions for the description of Earth sciences data, intended to promote the processing and sharing of data files. The metadata defined by the CF conventions are generally included in the same file as the data, thus making the file "self-describing". The conventions provide a definitive description of what the data values found in each netCDF variable represent, and of the spatial and temporal properties of the data, including information about grids, such as grid cell bounds and cell averaging methods. This enables users of files from different sources to decide which variables are comparable, and is a basis for building software applications with powerful data extraction, grid remapping,
data analysis Data analysis is the process of inspecting, Data cleansing, cleansing, Data transformation, transforming, and Data modeling, modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making. Da ...
, and data visualization capabilities.


History and evolution

The CF conventions were introduced in 2003, after several years of development by a collaboration that included staff from U.S. and European climate and weather laboratories. The conventions contained generalizations and extensions to the earlier Cooperative Ocean/Atmosphere Research Data Service (COARDS) conventions and the Gregory/Drach/Tett (GDT) conventions. As the scope of the CF conventions grew along with its user base, the CF community adopted an open governance model. In December 2008 the trio of standards, netCDF+CF+ OPeNDAP, was adopted by IOOS as a recommended standard (number 08-012) for the representation and transport of gridded data. The CF conventions are being considered by the
NASA The National Aeronautics and Space Administration (NASA ) is an independent agencies of the United States government, independent agency of the federal government of the United States, US federal government responsible for the United States ...
Standards Process Group (SPG) and others as more broadly applicable standards.


Applications and user base

The CF conventions have been adopted by a wide variety of national and international programs and activities in the Earth sciences. For example, they were required for the
climate model Numerical climate models (or climate system models) are mathematical models that can simulate the interactions of important drivers of climate. These drivers are the atmosphere, oceans, land surface and ice. Scientists use climate models to st ...
output data collected for
Coupled model intercomparison project In climatology, the Coupled Model Intercomparison Project (CMIP) is a collaborative framework designed to improve knowledge of climate change. It was organized in 1995 by the Working Group on Coupled Modelling (WGCM) of the World Climate Research ...
s, which are widely used for the
Intergovernmental Panel on Climate Change The Intergovernmental Panel on Climate Change (IPCC) is an intergovernmental body of the United Nations. Its job is to "provide governments at all levels with scientific information that they can use to develop climate policies". The World Met ...
assessment reports. They are promoted as an important element of scientific community coordination by the
World Climate Research Programme The World Climate Research Programme (WCRP) is an international programme that helps to coordinate global climate research. The WCRP was established in 1980, under the joint sponsorship of the World Meteorological Organization (WMO) and the Inte ...
. They are also used as a technical foundation for a number of software packages and data systems, including the Climate Model Output Rewriter (CMOR), which is post processing software for climate model data, and the Earth System Grid, which distributes climate and other data. The CF conventions have also been used to describe the physical fields transferred between individual Earth system model
software components Component-based software engineering (CBSE), also called component-based development (CBD), is a style of software engineering that aims to construct a software system from components that are loosely-coupled and reusable. This emphasizes the sep ...
, such as atmosphere and ocean components, as the model runs .


Supported data types

CF is intended for use with state estimation and
forecasting Forecasting is the process of making predictions based on past and present data. Later these can be compared with what actually happens. For example, a company might Estimation, estimate their revenue in the next year, then compare it against the ...
data, in the atmosphere, ocean, and other physical domains. It was designed primarily to address gridded data types such as
numerical weather prediction Numerical weather prediction (NWP) uses mathematical models of the atmosphere and oceans to weather forecasting, predict the weather based on current weather conditions. Though first attempted in the 1920s, it was not until the advent of comput ...
model outputs and
climatology Climatology (from Greek , ''klima'', "slope"; and , '' -logia'') or climate science is the scientific study of Earth's climate, typically defined as weather conditions averaged over a period of at least 30 years. Climate concerns the atmospher ...
data in which
data binning Data binning, also called data discrete binning or data bucketing, is a data pre-processing technique used to reduce the effects of minor observation errors. The original data values which fall into a given small interval, a '' bin'', are replace ...
is used to impose a regular structure. However, the CF conventions are also applicable to many classes of observational data and have been adopted by a number of groups for such applications.


Supported data formats

CF originated as a standard for data written in netCDF, but its structure is general and it has been adapted for use with other data formats. For example, using the CF conventions with
Hierarchical Data Format Hierarchical Data Format (HDF) is a set of file formats (HDF4, HDF5) designed to store and organize large amounts of data. Originally developed at the U.S. National Center for Supercomputing Applications, it is supported by The HDF Group, a non- ...
data has been explored.


Design principles

Several principles guide the development of CF conventions: * Data should be self-describing, without external tables needed for interpretation. * Conventions should be developed only as needed, rather than anticipating possible needs. * Conventions should not be onerous to use for either data-writers or data-readers. * Metadata should be readable by humans as well as interpretable by programs. * Redundancy should be avoided to prevent inconsistencies when writing data. Specific CF metadata descriptors use values of attributes to represent *
Data provenance Data lineage refers to the process of tracking how data is generated, transformed, transmitted and used across a system over time. It documents data's origins, transformations and movements, providing detailed visibility into its life cycle. This ...
: title, institution, contact, source (e.g. model), history (
audit trail An audit trail (also called audit log) is a security-relevant chronological record, set of records, and/or destination and source of records that provide documentary evidence of the sequence of activities that have affected at any time a specific ...
of operations), references, comment * Description of associated activity: project, experiment * Description of data: units, standard_name, long_name, auxiliary_variables, missing_value, valid_range, flag_values, flag_meanings * Description of coordinates: coordinates, bounds, grid_mapping (with formula_terms); time specified with reference_time ("time since T0") and calendar attributes. * Meaning of grid cells: cell_methods, cell_measures, and climatological statistics. A central element of the CF Conventions is the CF Standard Name Table. The CF Standard Name Table uniquely associates a standard name with each geophysical parameter in a
data set A data set (or dataset) is a collection of data. In the case of tabular data, a data set corresponds to one or more table (database), database tables, where every column (database), column of a table represents a particular Variable (computer sci ...
, where each name provides a precise description of physical quantities being represented. Note that this is the string value of the standard_name attribute, not the name of the parameter. The CF standard name table identifies over 1,000 physical quantities, each with a precise description and associated
canonical units A canonical unit is a unit of measurement agreed upon as default in a certain context. In astrodynamics In astrodynamics, canonical units are defined in terms of some important object’s orbit that serves as a reference. In this system, a refere ...
. Guidelines for construction of CF standard names are documented on the conventions web site. As an example of the information provided by CF standard names, the entry for sea-level atmospheric pressure includes: * standard name: air_pressure_at_sea_level * description: sea_level means mean
sea level Mean sea level (MSL, often shortened to sea level) is an mean, average surface level of one or more among Earth's coastal Body of water, bodies of water from which heights such as elevation may be measured. The global MSL is a type of vertical ...
, which is close to the
geoid The geoid ( ) is the shape that the ocean surface would take under the influence of the gravity of Earth, including gravitational attraction and Earth's rotation, if other influences such as winds and tides were absent. This surface is exte ...
in sea areas.
Air pressure Atmospheric pressure, also known as air pressure or barometric pressure (after the barometer), is the pressure within the atmosphere of Earth. The Standard atmosphere (unit), standard atmosphere (symbol: atm) is a unit of pressure defined as , whi ...
at sea level is the quantity often abbreviated as MSLP or PMSL. * canonical units: Pa


Software


CF-Python
is a data analysis package built on a complete implementation
CFDM
of the CF conventions. The authors of the CFDM and CF-Python currently assert a desire to fully support all aspects of the CF conventions.
NetCDF-Java Library
parses CF Conventions and create

objects from them * OriginPro version 2021b supports netCDF CF Convention. Averaging can be performed during import to allow handling of large datasets in a GUI software. * Th
xarray
Python library parses and decodes data stored according to CF Conventions. * Th

Python library "draws heavily from the NetCDF CF Metadata Conventions as a source for its data model".


References

{{Reflist


External links


CF Metadata Home PageNASA Standards Process GroupStandard for the CF Metadata Conventions (Marine Metadata Interoperability Project page)Ocean Data Standards on Metadata
Metadata Earth sciences metadata conventions Meteorological data and networks Science software