The Climate and Forecast (CF)
metadata conventions are conventions for the description of
Earth sciences
Earth science or geoscience includes all fields of natural science related to the planet Earth. This is a branch of science dealing with the physical, chemical, and biological complex constitutions and synergistic linkages of Earth's four spher ...
data, intended to promote the processing and sharing of
data file
A data file is a computer file which stores data to be used by a computer application or system, including input and output data. A data file usually does not contain instructions or code to be executed (that is, a computer program).
Most of the ...
s. The metadata defined by the CF conventions are generally included in the same file as the data, thus making the file "self-describing". The conventions provide a definitive description of what the data values found in each
netCDF
NetCDF (Network Common Data Form) is a set of software libraries and self-describing, machine-independent data formats that support the creation, access, and sharing of array-oriented scientific data. The project homepage is hosted by the Unidata ...
variable represent, and of the spatial and temporal properties of the data, including information about grids, such as grid cell bounds and cell averaging methods. This enables users of files from different sources to decide which variables are comparable, and is a basis for building software applications with powerful
data extraction
Data extraction is the act or process of retrieving data out of (usually unstructured or poorly structured) data sources for further data processing or data storage ( data migration). The import into the intermediate extracting system is thus usu ...
, grid remapping,
data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making. Data analysis has multiple facets and approaches, en ...
, and
data visualization
Data and information visualization (data viz or info viz) is an interdisciplinary field that deals with the graphic representation of data and information. It is a particularly efficient way of communicating when the data or information is nume ...
capabilities.
History and evolution
The CF conventions were introduced in 2003, after several years of development by a collaboration that included staff from U.S. and European climate and weather laboratories. The conventions contained generalizations and extensions to the earlier Cooperative Ocean/Atmosphere Research Data Service (COARDS) conventions and the Gregory/Drach/Tett (GDT) conventions. As the scope of the CF conventions grew along with its user base, the CF community adopted an
open governance
Open-source governance (also known as open governance and open politics) is a political philosophy which advocates the application of the philosophies of the open-source model, open-source and open-content movements to democracy, democratic prin ...
model. In December 2008 the trio of standards, netCDF+CF+
OPeNDAP OPeNDAP is an acronym for "Open-source Project for a Network Data Access Protocol," an endeavor focused on enhancing the retrieval of remote, structured data through a Web-based architecture and a discipline-neutral Data Access Protocol (DAP). Widel ...
, was adopted by
IOOS as a recommended standard (number 08-012) for the representation and transport of gridded data. The CF conventions are being considered by the
NASA
The National Aeronautics and Space Administration (NASA ) is an independent agency of the US federal government responsible for the civil space program, aeronautics research, and space research.
NASA was established in 1958, succeedi ...
Standards Process Group (SPG) and others as more broadly applicable standards.
Applications and user base
The CF conventions have been adopted by a wide variety of national and international programs and activities in the Earth sciences. For example, they were required for the
climate model
Numerical climate models use quantitative methods to simulate the interactions of the important drivers of climate, including atmosphere, oceans, land surface and ice. They are used for a variety of purposes from study of the dynamics of the ...
output data collected for
Coupled model intercomparison project
In climatology, the Coupled Model Intercomparison Project (CMIP) is a collaborative framework designed to improve knowledge of climate change. It was organized in 1995 by the Working Group on Coupled Modelling (WGCM) of the World Climate Researc ...
s, which are widely used for the
Intergovernmental Panel on Climate Change
The Intergovernmental Panel on Climate Change (IPCC) is an intergovernmental body of the United Nations. Its job is to advance scientific knowledge about climate change caused by human activities. The World Meteorological Organization (WMO) a ...
assessment reports.
They are promoted as an important element of scientific community coordination by the
World Climate Research Programme
The World Climate Research Programme (WCRP) is an international programme that helps to coordinate global climate research. The WCRP was established in 1980, under the joint sponsorship of the World Meteorological Organization (WMO) and the Int ...
. They are also used as a technical foundation for a number of software packages and data systems, including the Climate Model Output Rewriter (CMOR), which is post processing software for climate model data, and the
Earth System Grid, which distributes climate and other data.
[
] The CF conventions have also been used to describe the physical fields transferred between individual Earth system model
software components
Component-based software engineering (CBSE), also called component-based development (CBD), is a branch of software engineering that emphasizes the separation of concerns with respect to the wide-ranging functionality available throughout a giv ...
, such as atmosphere and ocean components, as the model runs
.
Supported data types
CF is intended for use with
state estimation In control theory, a state observer or state estimator is a system that provides an estimate of the internal state of a given real system, from measurements of the input and output of the real system. It is typically computer-implemented, and pr ...
and
forecasting
Forecasting is the process of making predictions based on past and present data. Later these can be compared (resolved) against what happens. For example, a company might estimate their revenue in the next year, then compare it against the actual ...
data, in the atmosphere, ocean, and other physical domains. It was designed primarily to address gridded data types such as
numerical weather prediction
Numerical weather prediction (NWP) uses mathematical models of the atmosphere and oceans to predict the weather based on current weather conditions. Though first attempted in the 1920s, it was not until the advent of computer simulation in th ...
model outputs and
climatology
Climatology (from Ancient Greek, Greek , ''klima'', "place, zone"; and , ''wiktionary:-logia, -logia'') or climate science is the scientific study of Earth's climate, typically defined as weather conditions averaged over a period of at least 30 ...
data in which
data binning Data binning, also called data discrete binning or data bucketing, is a data pre-processing technique used to reduce the effects of minor observation errors. The original data values which fall into a given small interval, a '' bin'', are replaced b ...
is used to impose a regular structure.
However, the CF conventions are also applicable to many classes of
observational data and have been adopted by a number of groups for such applications.
Supported data formats
CF originated as a standard for data written in
netCDF
NetCDF (Network Common Data Form) is a set of software libraries and self-describing, machine-independent data formats that support the creation, access, and sharing of array-oriented scientific data. The project homepage is hosted by the Unidata ...
, but its structure is general and it has been adapted for use with other data formats. For example, using the CF conventions with
Hierarchical Data Format
Hierarchical Data Format (HDF) is a set of file formats (HDF4, HDF5) designed to store and organize large amounts of data. Originally developed at the U.S. National Center for Supercomputing Applications, it is supported by The HDF Group, a non-p ...
data has been explored.
Design principles
Several principles guide the development of CF conventions:
* Data should be self-describing, without external tables needed for interpretation.
* Conventions should be developed only as needed, rather than anticipating possible needs.
* Conventions should not be onerous to use for either data-writers or data-readers.
* Metadata should be readable by humans as well as interpretable by programs.
* Redundancy should be avoided to prevent inconsistencies when writing data.
Specific CF metadata descriptors use values of attributes to represent
*
Data provenance:
title
,
institution
,
contact
,
source
(e.g. model),
history
(audit trail of operations),
references
,
comment
* Description of associated activity:
project
,
experiment
* Description of data:
units
,
standard_name
,
long_name
,
auxiliary_variables
,
missing_value
,
valid_range
,
flag_values
,
flag_meanings
* Description of coordinates:
coordinates
,
bounds
,
grid_mapping
(with
formula_terms
); time specified with
reference_time
("time since T0") and
calendar
attributes.
* Meaning of grid cells:
cell_methods
,
cell_measures
, and climatological statistics.
A central element of the CF Conventions is the CF Standard Name Table. The CF Standard Name Table uniquely associates a standard name with each geophysical parameter in a data set, where each name provides a precise description of physical quantities being represented. Note that this is the string value of the
standard_name
attribute, not the name of the parameter. The CF standard name table identifies over 1,000 physical quantities, each with a precise description and associated
canonical units A canonical unit is a unit of measurement agreed upon as default in a certain context.
In astrodynamics
In astrodynamics, canonical units are defined in terms of some important object’s orbit that serves as a reference. In this system, a refere ...
. Guidelines for construction of CF standard names are documented on the conventions web site.
As an example of the information provided by CF standard names, the entry for sea-level atmospheric pressure includes:
* standard name:
air_pressure_at_sea_level
* description: sea_level means mean
sea level
Mean sea level (MSL, often shortened to sea level) is an average surface level of one or more among Earth's coastal bodies of water from which heights such as elevation may be measured. The global MSL is a type of vertical datuma standardise ...
, which is close to the
geoid
The geoid () is the shape that the ocean surface would take under the influence of the gravity of Earth, including gravitational attraction and Earth's rotation, if other influences such as winds and tides were absent. This surface is exten ...
in sea areas.
Air pressure
Atmospheric pressure, also known as barometric pressure (after the barometer), is the pressure within the atmosphere of Earth. The standard atmosphere (symbol: atm) is a unit of pressure defined as , which is equivalent to 1013.25 millibars, ...
at sea level is the quantity often abbreviated as MSLP or PMSL.
* canonical units: Pa
Software
NetCDF-Java Libraryparses CF Conventions and create
objects from them
*
OriginPro version 2021b supports netCDF CF Convention. Averaging can be performed during import to allow handling of large datasets in a GUI software.
* Th
xarrayPython library parses and decodes data stored according to CF Conventions.
* Th
Python library "draws heavily from the NetCDF CF Metadata Conventions as a source for its data model".
[ ]
References
{{Reflist
External links
CF Metadata Home PageNASA Standards Process GroupStandard for the CF Metadata Conventions (Marine Metadata Interoperability Project page)Ocean Data Standards on Metadata
Metadata
Earth sciences metadata conventions
Meteorological data and networks
Science software