A geocode is a
code
In communications and information processing, code is a system of rules to convert information—such as a letter, word, sound, image, or gesture—into another form, sometimes shortened or secret, for communication through a communication ...
that represents a geographic entity (
location or
object). It is a
unique identifier of the entity, to distinguish it from others in a
finite set of geographic entities. In general the ''geocode'' is a
human-readable and short identifier.
Typical geocodes and entities represented by it:
* ''Country code'' and subdivision code. Polygon of the administrative boundaries of a country or a subdivision.
The main examples are ISO codes:
ISO 3166-1 alpha-2 code (e.g.
AF
for
Afghanistan or
BR
for
Brazil), and its subdivision conventions, such as
subdivision codes (e.g.
AF-GHO
for
Ghor province
Ghōr (Dari: ), also spelled Ghowr or Ghur, is one of the thirty-four provinces of Afghanistan. It is located in the western Hindu Kush in central Afghanistan, towards the northwest. The province contains eleven districts, encompassing hundreds ...
) or
subdivision codes (e.g.
BR-AM
for
Amazonas state).
* ''DGG cell ID''. Identifier of a cell of a
discrete global grid: a
Geohash code (e.g. ~0.023 km
2 cell
6vjyngd
at the
Brazilian's center) or an
OLC code (e.g. ~0.004 km
2 cell
58PJ642P+4
at the same point).
* ''Postal code''. Polygon of a
postal area
The list of postcode areas in the United Kingdom is a tabulation of the postcode areas used by Royal Mail for the purposes of directing mail within the United Kingdom. The postcode area is the largest geographical unit used and forms the initia ...
: a
CEP
''Boletus edulis'' (English: cep, penny bun, porcino or porcini) is a basidiomycete fungus, and the type species of the genus ''Boletus''. Widely distributed in the Northern Hemisphere across Europe, Asia, and North America, it does not occu ...
code (e.g.
70040
represents a Brazilian's central area for postal distribution).
Geocodes are mainly used (in general as an
atomic data type) for
labelling,
data integrity,
geotagging and
spatial indexing.
In
theoretical computer science a ''geocode system'' is a
locality-preserving hashing function.
Classification

There are some common aspects of many geocodes (or
geocode systems) that can be used as classification criteria:
* ''Ownership'': proprietary or
free
Free may refer to:
Concept
* Freedom, having the ability to do something, without having to obey anyone/anything
* Freethought, a position that beliefs should be formed only on the basis of logic, reason, and empiricism
* Emancipate, to procur ...
, differing by its
licence
A license (or licence) is an official permission or permit to do, use, or own something (as well as the document of that permission or permit).
A license is granted by a party (licensor) to another party (licensee) as an element of an agreeme ...
s.
* ''Formation'': the geocode can be originated from a name (ex. abbreviation of official name the country) or from mathematical function (
encoding algorithm to compress
latitude-longitude). See ''geocode system'' types below (of
names
A name is a term used for identification by an external observer. They can identify a class or category of things, or a single thing, either uniquely, or within a given context. The entity identified by a name is called its referent. A persona ...
and of
grids
AIDS is caused by a human immunodeficiency virus (HIV), which originated in non-human primates
in Central and West Africa. While various sub-groups of the virus acquired human infectivity at different times, the present pandemic had its origins i ...
).
* ''Covering'': global or partial. The entities (represented by the geocodes) are in all globe (e. g. geographical points) or is delimited the theme (e.g. only terrestrial areas) or by the ownership's jurisdiction (e.g. only into a country).
* ''Type of the represented entity'':
type of geometry. Point (the geocode can be translated to a
Geo URI), grid cell (the geocode system is related with a
DGG DGG may refer to:
* Deutsche Geophysikalische Gesellschaft, a society for geophysics
* Deutsche Grammophon Gesellschaft, a record label
* Discrete Global Grid
*Dutch Game Garden an organization with the aim of promoting and improving the video g ...
) or polygon (typically administrative boundaries delimitations).
** special hierarchical grids, with global covering and equal-area cells, can be classified as ''DGGS cell''
[The OGS's standard]
Discrete Global Grid Systems
definition.
** some non-standard geographic entities, can be classified also by its
coordinate system and elipsoid of reference (e.g.
UTM
UTM may refer to:
Computing
* Unified threat management, an approach to network security
* Universal Turing machine, a theoretical computer
* Urchin Tracking Module, a Web analytics package that served as the base for Google Analytics
* Usabil ...
). The
''de facto'' standard is the
WGS84.
* ''Scope of use'': general use vs specialized (e.g. airport geocodes).
* ''Hierarchy'': geocode's syntax hierarchy corresponding to the spatial hierarchy of its represented entities. A geocode system can hierarchical (
name
A name is a term used for identification by an external observer. They can identify a class or category of things, or a single thing, either uniquely, or within a given context. The entity identified by a name is called its referent. A personal ...
or
grid) or non-hierarchical.
Geocode system
The set of all geocodes used as unique identifiers of the cells of a
full-coverage of the
geographic surface (or any well-defined area like a country or the oceans), is a geocode system (also named ''geocode scheme''). The
syntax
In linguistics, syntax () is the study of how words and morphemes combine to form larger units such as phrases and sentences. Central concerns of syntax include word order, grammatical relations, hierarchical sentence structure ( constituency) ...
and
semantic
Semantics (from grc, σημαντικός ''sēmantikós'', "significant") is the study of reference, meaning, or truth. The term can be used to refer to subfields of several distinct disciplines, including philosophy, linguistics and comput ...
of the geocodes are also components of the system definition:
* geocode syntax: the characters that can be used, blocks of characters and its size and order. Example: country codes use two letters of the alphabet (chacacter set A-Z). The most commom way to describe formally is by
regular expression (e.g.
/ -Z
).
* geocode semantic: the meaning of the geocode, usually expressed by associating the code with a geographical entity type. Can be described formally is by an
ontology, an
UML class diagram
In software engineering, a class diagram in the Unified Modeling Language (UML) is a type of static structure diagram that describes the structure of a system by showing the system's class (computer science), classes, their attributes, operations ...
or any
Entity-relationship model.
In general the semantic can be deduced by its formation or encoding/decoding process. Example: each Geohash code can be expressed by a rectangular area in the map, and the rectangle coordinates is obtained by its decoding process.
Many syntax and semantic characteristics are also summarized by classification.
Encode and decode
Any geocode can be translated from a formal (and expanded) expression of the geographical entity, or vice versa, the geocode translated to entity. The first is named encode process, the second decode. The actors and process involved, as defined by
OGC,
[Definitions of the OGC's]
Glossary of Terms
. are:
;geocoder: A
software agent that transforms the description of a geographic entity (e.g. location name or latitude/longitude coordinates), into a normalized data and encodes it as a geocode.
;geocoder service: A geocoder implemented as
web service (or similar service interface), that accepts a set of geographic entity descriptors as input. The request is "sent" to the Geocoder Service, which processes the request and returns the resulting geocodes. More general services can also return geographic features (e.g.
GeoJSON object) represented by the geocodes.
;geocoding: Geocoding refers to the assignment of geocodes or coordinates to geographically reference data provided in a textual format. Examples are the two letter country codes and coordinates computed from addresses.
Note: when a
physical addressing schemes (street name and house number) is expressed in a standardized and simplified way, it can be conceived as geocode. So, the term
geocoding (used for addresses) sometimes is generalized for geocodes.
In spatial indexing applications the geocode can also be translated between human-readable (e.g.
hexadecimal
In mathematics and computing, the hexadecimal (also base-16 or simply hex) numeral system is a positional numeral system that represents numbers using a radix (base) of 16. Unlike the decimal system representing numbers using 10 symbols, hexa ...
) and internal (e.g.
binary 64-bit unsigned integer) representations.
Systems of standard names
Geocodes like ''country codes'', city codes, etc. comes from a table of official names, and the corresponding official codes and geometries (typically polygon of administrative areas). "Official" in the context of control and consensus, typically a table controlled by a
standards organization or governmental authority. So, the most general case is a table of ''standard names'' and the corresponding ''standard codes'' (and its official geometries).

Strictly speaking, the "name" related to a geocode is a
toponym, and the table (e.g. toponym to standard code) is the resource for toponym resolution: is the
relationship process, usually effectuated by a software agent, between a toponym and "an unambiguous spatial footprint of the same place".
Any standardized system of toponym resolution, having codes or encoded abbreviations, can be used as ''geocode system''. The "resolver" agent in this context is also a ''geocoder''.
Sometimes names are translated into numeric codes, to be compact or machine-readable. Since numbers, in this case, are name identifiers, we can consider "numeric names" — so this set of codes will be a kind of "system of standard names".
Hierarchical naming
In the geocode context,
space partitioning is the process of dividing a
geographical space
Space is the boundless three-dimensional extent in which objects and events have relative position and direction. In classical physics, physical space is often conceived in three linear dimensions, although modern physicists usually consider ...
into two or more
disjoint subset
In mathematics, Set (mathematics), set ''A'' is a subset of a set ''B'' if all Element (mathematics), elements of ''A'' are also elements of ''B''; ''B'' is then a superset of ''A''. It is possible for ''A'' and ''B'' to be equal; if they are ...
s, resulting in a
mosaic of subdivisions. Each subdivision can be partitioned again,
recursively, resulting in an hierarchical mosaic.
When subdivisions's names are expressed as codes, and code syntax can be decomposed into a parent-child relations, through a well-defined syntactic scheme, the geocode set configures a hierarchical system. A geocode fragment (associated to a subdivision name) can be an abbreviation, numeric or alphanumeric code.
A popular example is the
ISO 3166-2 geocode system, representing country names and the names of respective
administrative subdivisions
Administrative division, administrative unit,Article 3(1). country subdivision, administrative region, subnational entity, constituent state, as well as many similar terms, are generic names for geographical areas into which a particular, ind ...
separated by hyphen. For example
DE
is
Germany, a simple geocode, and its subdivisions (illustrated) are
DE-BW
for
Baden-Württemberg,
DE-BY
for
Bayern, ...,
DE-NW
for
Nordrhein-Westfalen, etc. The scope is only the first level of the hierarchy. For more levels there are other conventions, like HASC code. The HASC codes are alphabetic and its fragments have constant length (2 letters). Examples:
:
DE.NW
-
North Rhine-Westphalia. A two-level hierarchical geocode.
:
DE.NW.CE
- Kreis
Coesfeld
Coesfeld (; Westphalian: ''Koosfeld'') is the capital of the district of Coesfeld in the German state of North Rhine-Westphalia.
History
Coesfeld received its city rights in 1197, but was first recorded earlier than that in the biography of St ...
. A 3-level hierarchical geocode.
Two geocodes of a ''hierarchical geocode system'' with same prefix represents different parts of the same location. For instance
DE.NW.CE
and
DE.NW.BN
represents geographically interior parts of
DE.NW
, the common prefix.
Changing the ''subdivision criteria'' we can obtain other hierarchical systems. For example, for
hydrological criteria there is a geocode system, the US's
hydrologic unit code (HUC), that is a numeric representation of ''basin names'' in a hierarchical syntax schema (first level illustred). For example, the HUC
17
is the identifier of "
Pacific Northwest Columbia basin"; HUC
1706
of "
Lower Snake basin", a spatial
subset
In mathematics, Set (mathematics), set ''A'' is a subset of a set ''B'' if all Element (mathematics), elements of ''A'' are also elements of ''B''; ''B'' is then a superset of ''A''. It is possible for ''A'' and ''B'' to be equal; if they are ...
of HUC
17
and a superset of
17060102
("Imnaha River").
Systems of regular grids
file:Ordnance_Survey_National_Grid.svg, 420px, Each cell of a regular grid represents a geocode. The non-global grids were the most used before the 2000s.
This ''hierarchical system of local grids'', used since the 1930s as Ordnance Survey National Grid, British National Grid, generates hierarchical geocodes. Each cell subdivides recurrently its area into a new 10x10 grid.
Inspired in the classic alphanumeric grids, a
discrete global grid (DGG) is a regular
mosaic which covers the entire
Earth's surface (the globe). The
regularity of the mosaic is defined by the use of cells of same shape in all the grid, or "near the same shape and near same area" in a region of interest, like a country.
All cells of the grid have an identifier (DGG's cell ID), and the center of the cell can be used as reference for cell ID conversion into geographical point. When a compact human-readable expression of the cell ID is standardized, it becomes a geocode.
Geocodes of different ''geocode systems'' can represent the same position in the globe, with same shape and precision, but differ in
string
String or strings may refer to:
*String (structure), a long flexible structure made from threads twisted together, which is used to tie, bind, or hang other objects
Arts, entertainment, and media Films
* ''Strings'' (1991 film), a Canadian anim ...
-length, digit-alphabet, separators, etc. Non-global grids also differ by scope, and in general are geometrically optimized (avoid overlaps, gaps or loss of uniformity) for the local use.
Hierarchical grids
Each cell of a grid can be transformed into a new local grid, in a
recurring process. In the illustrated example, the cell
TQ 2980
is a sub-cell of
TQ 29
, that is a sub-cell of
TQ
. A system of geographic regular
grid references is the base of a ''hierarchical geocode system''.
Two geocodes of a ''hierarchical geocode grid system'' can use the prefix rule: geocodes with same prefix represents different parts of the same broader location. Using again the side illustration:
TQ 28
and
TQ 61
represents geographically
interior parts of
TQ
, the common prefix.
Hierarchical geocode can be split into keys. The
Geohash 6vd23gq
is the key
q
of the cell
6vd23g
, that is a cell of
6vd23
(key
g
), and so on, per-digit keys. The
OLC 58PJ642P
is the key
48
of the cell
58PJ64
, that is a cell of
58Q8
(key
48
), and so on, two-digit keys. In the case of OLC there is a second key schema, after the
+
separator:
58PJ642P+48
is the key
2
of the cell
58PJ642P+4
. It uses two key schemas. Some geocodes systems (e.g. S2 geometry) also use initial prefix with non-hierarchical key schema.
In general, as technical and non-compact optional representation, geocode systems (based on hierarchical grids) also offer the possibility of expressing their cell identifier with a fine-grained schema, by longer path of keys. For example, the Geohash
6vd2
, which is a
base32 code, can be expanded to
base4 0312312002
, which is also a schema with per-digit keys. Geometrically, each Geohash cell is a rectangle that subdivides space recurrently into 32 new rectangles, so, base4 subdividing into 4, is the encoding-expansion limit.
The uniformity of shape and area of cells in a grid can be important for other uses, like
spatial statistics. There are standard ways to build a grid covering the entire globe with cells of equal area, regular shape and other properties: Discrete Global Grid System (DGGS) is a series of discrete global grids satisfying all standardized requirements defined in 2017 by the
OGC.
When human-readable codes obtained from cell identifiers of a DGGS are also standardized, it can be classified as ''DGGS based geocode system''.
Name-and-grid systems
There are also mixed systems, using a syntactical partition, where for example the first part (code prefix) is a name-code and the other part (code suffix) is a grid-code. Example:
:
Mapcode entrance to the elevator of the
Eiffel Tower in Paris is
FR-4J.Q2
, where
FR
is the name-code and
4J.Q2
is the grid-code. Semantically France is the context, to obtain its local grid.
For
mnemonic coherent semantics, in fine-grained geocode applications, the mixed solutions are most suitable.
Shortening grid-based codes by context
Any ''geocode system''
based on regular grid, in general is also a shorter way to express a latitudinal/longitudinal coordinate. But geocode of over 6 characters is difficult for remember. On the other hand, a
geocode based on standard name (or abbreviation or the complete name) is easier to remember.
So, this suggests that a "mixed code" can solve the problem, reducing the number of characters when a name can be used as "context" of the grid-based geocode. For example, in a book where the author says "all geocodes here are contextualized by the chapter's city". In the chapter about Paris, where all places have Geohash with prefix
u09
, it can be removed — for instance Geohash
u09tut
can be reduced to
tut
, or, by an explicit code for context "FR-Paris
tut
". It is only possible when the context resolution (e.g. translation from "FR-Paris" to the prefix
u09
) is well-known.
In fact a methodology exists for
hierarchical grid-based geocodes with non-variable size, where the code prefix describes a broader area, which can be associated with a name. So, it is possible to shorten by replacing the prefix to the associated context. The most usual context is an official name. Examples:
The examples of the ''Mixed reference'' column are significantly easier than remembering ''DGG code'' column. The methods vary, for example OLC can be shortened by elimination of its first four digits and using
Plus Codes naming conventions.
When the mixed reference is also short (9 characters in the second example) and there are a syntax convention to express it (suppose
CP‑PR~bgxed
), this convention is generating a new
name-and-grid geocode system. This is not the case of the first example because, strictly speaking, "Cape Verde, Praia" is not a code.
To be both, a name-and-grid system and also a mixed reference convention, the system must be reversible. Pure name-and-grid systems, like
Mapcode, with no way to transform it into a global code, is not a mixed reference, because there is no algorithm to transform the mixed geocode into a grid-based geocode.
Cataloged examples
In use, general scope
Geocodes in use and with general scope:
In use, alternative address
Geocodes can be used in place of official
street names
A street name is an identifying name given to a street or road. In toponymic terminology, names of streets and roads are referred to as hodonyms (from Greek ‘road’, and ‘name’). The street name usually forms part of the address (th ...
and/or
house numbers, particularly when a given location has not been assigned an address by authorities. They can also be used as an "alternative address" if it can be converted to a
Geo URI. Even if the geocode is not the official designation for a location, it can be used as a "local standard" to allow homes to receive deliveries, access emergency services, register to vote, etc.
In use, postal codes
Geocodes in use, as
postal codes. A geocode recognized by
Universal Postal Union and adopted as "official postal code" by a
country, is also a valid postal code. Not all postal codes are geographic, and for some postal code systems, there are codes that are not geocodes (e.g.
in UK system). Samples, not a complete list:
In use, telephony and radio
Geocodes in use for telephony or radio broadcasting scope:
*
ITU-R country codes
*
ITU-T country calling code
Country calling codes or country dial-in codes are telephone number prefixes for reaching telephone subscribers in the networks of the member countries or regions of the International Telecommunication Union (ITU). The codes are defined by the ...
s
*
ITU-T mobile calling code
The ITU-T Recommendation E.212 defines mobile country codes (MCC) as well as mobile network codes (MNC).
Overview
The mobile country code consists of three decimal digits and the mobile network code consists of two or three decimal digits (for e ...
s
*
Maidenhead Locator System (used by amateur radio operators)
*
Marsden Squares
In use, others
Geocodes in use and with specific scope:
Historical or less widely used
Other examples
Other geocodes:
* ''S2'': a geocoding scheme using
spherical geometry
300px, A sphere with a spherical triangle on it.
Spherical geometry is the geometry of the two-dimensional surface of a sphere. In this context the word "sphere" refers only to the 2-dimensional surface and other terms like "ball" or "solid sp ...
and the space-filling
Hilbert curve, developed at
Google
* ''Munich Orientation Convention'': converts lat/lon to metrical monopolar codes for targets, crossings, stations, stop points, bridges, tunnels, towns, islands, volcanoes, highway exits etc.
* ''SALB'' (Second Administrative Level Boundaries), by U
*
Postal addresses in the Republic of Ireland#OpenPostcode, OpenPostcode, opensource global algorithm (local adaptations as Irish & Hong Kong postcodes).
*
WOEID
*
OpenStreetMap
OpenStreetMap (OSM) is a free, open geographic database updated and maintained by a community of volunteers via open collaboration. Contributors collect data from surveys, trace from aerial imagery and also import from other freely licensed g ...
shortlink, used as a short permanent link to map locations
*
Quarter Degree Grid Cells
*
NAC (patended), area codes (area can be indefinitely small)
* GEOID, the name of
United States Census Bureau geographic identifiers.
* In the United States, the American National Standards Institute (ANSI) Codes are often used. ANSI INCITS 446-2008 is entitled "Identifying Attributes for Named Physical and Cultural Geographic Features (Except Roads and Highways) of the United States, Its Territories, Outlying Areas, and Freely Associated Areas, and the Waters of the Same to the Limit of the Twelve-Mile Statutory Zone".
*
National Topographic System
The National Topographic System or NTS is the system used by Natural Resources Canada for providing general purpose topographic maps of the country. NTS maps are available in a variety of scales, the standard being 1:50,000 and 1:250,000 scales. T ...
in Canada
Other standards
Some standards and name servers include: ISO 3166, FIPS, INSEE, Geonames, IATA and
ICAO
The International Civil Aviation Organization (ICAO, ) is a specialized agency of the United Nations that coordinates the principles and techniques of international air navigation, and fosters the planning and development of international a ...
.
A number of commercial solutions have also been proposed:
*
WOEID (Where on Earth IDentifier) is a unique 32-bit reference identifier that identifies any feature on Earth.
* NAC Locator provides a universal geocoding address for all locations on the planet.
See also
*
Census tract
A census tract, census area, census district or meshblock is a geographic region defined for the purpose of taking a census. Sometimes these coincide with the limits of cities, towns or other administrative areas and several tracts commonly exist ...
*
Geolocation
*
Geotagging
*
Geographic information retrieval
*
Global Navigation Grid Code
The Global Navigation Grid Code (GNGC) is a Chinese-developed point reference system designed for global navigation. It is similar in design to national grid reference systems used throughout the world. GNGC was based upon the work of the GeoSOT ...
(China geocode?)
*
ISO 6709
ISO 6709, ''Standard representation of geographic point location by coordinates'', is the international standard for representation of latitude, longitude and altitude for geographic point locations.
The first edition (ISO 6709:1983) was develo ...
, standard representation of geographic point location by coordinates
*
Place code A P-code, short for place code, is a kind of geocode used mostly by emergency response teams. It provides unique identifiers to thousands of locations and administrative units in a humanitarian operation. The p-codes are represented by combinations ...
*
Unique Property Reference Number
*
Unique Street Reference Number
References
External links
*
{{Geocoding-systems