Unicode
Unicode, formally The Unicode Standard,The formal version reference is is an information technology standard for the consistent encoding, representation, and handling of text expressed in most of the world's writing systems. The standard, ...
supports several
phonetic script
Phonetic transcription (also known as phonetic script or phonetic notation) is the visual representation of speech sounds (or ''phones'') by means of symbols. The most common type of phonetic transcription uses a phonetic alphabet, such as the I ...
s and notations through the existing writing systems and the addition of extra blocks with phonetic characters. These phonetic extras are derived from an existing script, usually Latin, Greek or Cyrillic. Apart from
International Phonetic Alphabet (IPA),
extensions to the IPA and
obsolete and nonstandard IPA symbols, these blocks also contain characters from the
Uralic Phonetic Alphabet and the
Americanist Phonetic Alphabet
Americanist phonetic notation, also known as the North American Phonetic Alphabet (NAPA), the Americanist Phonetic Alphabet or the American Phonetic Alphabet (APA), is a system of phonetic notation originally developed by European and American a ...
.
Phonetic scripts
The
International Phonetic Alphabet (IPA) makes use of letters from other writing systems as most phonetic scripts do. IPA notably uses Latin, Greek and Cyrillic characters. Combining diacritics also adds meaning to the phonetic text. Finally, these phonetic alphabets make use of modifier letters, that are specially constructed for the phonetic meaning. A "modifier letter" is strictly intended not as an independent grapheme but as a modification of the preceding character
resulting in a distinct grapheme, notably in the context of the International Phonetic Alphabet. For example,
ʰ should not occur on its own but modifies the preceding or following symbol. Thus, is a single IPA symbol, distinct from . In practice, however, several of these "modifier letters" are also used as full graphemes, e.g.
ʿ as transliterating Semitic
ayin
''Ayin'' (also ''ayn'' or ''ain''; transliterated ) is the sixteenth letter of the Semitic scripts, including Phoenician , Hebrew , Aramaic , Syriac ܥ, and Arabic (where it is sixteenth in abjadi order only).
The letter represen ...
or Hawaiian
okina Okina may refer to:
* ʻOkina, a letter used in some Polynesian languages, visually resembling a left single quotation mark
* Okina () or , a character from the ''Rurouni Kenshin'' manga series
* Okina, Spain, a village in the Basque Country
* , ...
, or
˚ transliterating Abkhaz
ә.
From to Unicode
Consonants
The following tables indicates the Unicode code point sequences for phonemes as used in the
International Phonetic Alphabet. A bold code point indicates that the Unicode chart provides an application note such as "voiced retroflex lateral" for . An entry in bold italics indicates the character name itself refers to a phoneme such as
Vowels
The following figures depict the phonetic vowels and their Unicode / UCS code points, arranged to represent the phonetic
vowel trapezium
A vowel diagram or vowel chart is a schematic arrangement of the vowels. Depending on the particular language being discussed, it can take the form of a triangle or a quadrilateral. Vertical position on the diagram denotes the vowel closeness, ...
. Vowels appearing in pairs in the figure to the right indicate rounded and unrounded variations respectively. Again, characters with Unicode names referring to phonemes are indicated by bold text. Those with explicit application notes are indicated by bold italic text. Those from borrowed unchanged from another script (Latin, Greek or Cyrillic) are indicated by italics. Before and after a bullet are the
unrounded • rounded vowels.
Diacritics
Diacritics may be encoded as either
modifier (e.g. ˳) or
combining (e.g.
◌̥) characters.
Unicode blocks
*
Basic Latin (0020–007E),
IPA
IPA commonly refers to:
* India pale ale, a style of beer
* International Phonetic Alphabet, a system of phonetic notation
* Isopropyl alcohol, a chemical compound
IPA may also refer to:
Organizations International
* Insolvency Practitioner ...
example:
Open front unrounded vowel
The open front unrounded vowel, or low front unrounded vowel, is a type of vowel sound used in some spoken languages. It is one of the eight primary cardinal vowels, not directly intended to correspond to a vowel sound of a specific language b ...
(0061)
*
Latin-1 Supplement (00A0–00FF),
IPA
IPA commonly refers to:
* India pale ale, a style of beer
* International Phonetic Alphabet, a system of phonetic notation
* Isopropyl alcohol, a chemical compound
IPA may also refer to:
Organizations International
* Insolvency Practitioner ...
example:
Near-open front unrounded vowel (00E6)
*
Latin Extended-A (0100–017F),
IPA
IPA commonly refers to:
* India pale ale, a style of beer
* International Phonetic Alphabet, a system of phonetic notation
* Isopropyl alcohol, a chemical compound
IPA may also refer to:
Organizations International
* Insolvency Practitioner ...
example:
Voiceless pharyngeal fricative (0127)
*
Latin Extended-B
Latin Extended-B is the fourth block (0180-024F) of the Unicode Standard. It has been included since version 1.0, where it was only allocated to the code points 0180-01FF and contained 113 characters. During unification with ISO 10646 for version ...
(0180–024F),
IPA
IPA commonly refers to:
* India pale ale, a style of beer
* International Phonetic Alphabet, a system of phonetic notation
* Isopropyl alcohol, a chemical compound
IPA may also refer to:
Organizations International
* Insolvency Practitioner ...
example:
Tenuis dental click (01C0 0287)
*
IPA Extensions (0250–02AF),
IPA
IPA commonly refers to:
* India pale ale, a style of beer
* International Phonetic Alphabet, a system of phonetic notation
* Isopropyl alcohol, a chemical compound
IPA may also refer to:
Organizations International
* Insolvency Practitioner ...
example:
Near-open central vowel (0250)
*
Spacing Modifier Letters
Spacing Modifier Letters is a Unicode block containing characters for the IPA, UPA, and other phonetic transcriptions. Included are the IPA tone marks, and modifiers for aspiration and palatalization
Palatalization may refer to:
*Palatalizat ...
(02B0–02FF),
IPA
IPA commonly refers to:
* India pale ale, a style of beer
* International Phonetic Alphabet, a system of phonetic notation
* Isopropyl alcohol, a chemical compound
IPA may also refer to:
Organizations International
* Insolvency Practitioner ...
example:
Palatal ejective (0063 02BC)
*
Combining Diacritical Marks
Combining Diacritical Marks is a Unicode block containing the most common combining characters. It also contains the character " Combining Grapheme Joiner", which prevents canonical reordering of combining characters, and despite the name, act ...
(0300–036F),
IPA
IPA commonly refers to:
* India pale ale, a style of beer
* International Phonetic Alphabet, a system of phonetic notation
* Isopropyl alcohol, a chemical compound
IPA may also refer to:
Organizations International
* Insolvency Practitioner ...
example:
Near-close central unrounded vowel
The close central unrounded vowel, or high central unrounded vowel, is a type of vowel sound used in some languages. The symbol in the International Phonetic Alphabet that represents this sound is , namely the lower-case letter ''i'' with a hor ...
(026A 0308)
*
Greek and Coptic (0370–03FF),
IPA
IPA commonly refers to:
* India pale ale, a style of beer
* International Phonetic Alphabet, a system of phonetic notation
* Isopropyl alcohol, a chemical compound
IPA may also refer to:
Organizations International
* Insolvency Practitioner ...
example:
Voiced bilabial fricative
The voiced bilabial fricative is a type of consonantal sound, used in some Speech communication, spoken languages. The symbol in the International Phonetic Alphabet that represents this sound is , and the equivalent X-SAMPA symbol is B. The offi ...
(03B2)
*
Combining Diacritical Marks Supplement (1DC0–1DFF),
IPA
IPA commonly refers to:
* India pale ale, a style of beer
* International Phonetic Alphabet, a system of phonetic notation
* Isopropyl alcohol, a chemical compound
IPA may also refer to:
Organizations International
* Insolvency Practitioner ...
example: Rising-falling contour tone (1DC8)
*
General Punctuation (2000–206F),
IPA
IPA commonly refers to:
* India pale ale, a style of beer
* International Phonetic Alphabet, a system of phonetic notation
* Isopropyl alcohol, a chemical compound
IPA may also refer to:
Organizations International
* Insolvency Practitioner ...
example:
Linking (absence of a break) (203F)
*
Superscripts and Subscripts (2070–209F),
IPA
IPA commonly refers to:
* India pale ale, a style of beer
* International Phonetic Alphabet, a system of phonetic notation
* Isopropyl alcohol, a chemical compound
IPA may also refer to:
Organizations International
* Insolvency Practitioner ...
example:
Nasal release
In phonetics, a nasal release is the release of a stop consonant into a nasal. Such sounds are transcribed in the IPA with superscript nasal letters, for example as in English ''catnip'' . In English words such as ''sudden'' in which historical ...
(207F)
*
Arrows (2190–21FF),
IPA
IPA commonly refers to:
* India pale ale, a style of beer
* International Phonetic Alphabet, a system of phonetic notation
* Isopropyl alcohol, a chemical compound
IPA may also refer to:
Organizations International
* Insolvency Practitioner ...
example:
Global rise (2197)
*
Latin Extended-C
Latin Extended-C is a Unicode block containing Latin
Latin (, or , ) is a classical language belonging to the Italic branch of the Indo-European languages. Latin was originally a dialect spoken in the lower Tiber area (then known as ...
(2C60–2C7F),
IPA
IPA commonly refers to:
* India pale ale, a style of beer
* International Phonetic Alphabet, a system of phonetic notation
* Isopropyl alcohol, a chemical compound
IPA may also refer to:
Organizations International
* Insolvency Practitioner ...
example:
Labiodental flap
In phonetics, the voiced labiodental flap is a speech sound found primarily in languages of Central Africa, such as Kera and Mangbetu. It has also been reported in the Austronesian language Sika. It is one of the few non- rhotic flaps. The s ...
(2C71)
*
Modifier Tone Letters (A700–A71F),
IPA
IPA commonly refers to:
* India pale ale, a style of beer
* International Phonetic Alphabet, a system of phonetic notation
* Isopropyl alcohol, a chemical compound
IPA may also refer to:
Organizations International
* Insolvency Practitioner ...
example:
Upstep (A71B)
*
Phonetic Extensions
Phonetic Extensions is a Unicode block containing phonetic characters used in the Uralic Phonetic Alphabet, Old Irish phonetic notation, the Oxford English dictionary and American dictionaries, and Americanist and Russianist phonetic notation ...
(1D00–1D7F)
*
Phonetic Extensions Supplement
Phonetic Extensions Supplement is a Unicode block containing characters for specialized and deprecated forms of the International Phonetic Alphabet.
Block
History
The following Unicode-related documents record the purpose and process of defi ...
(1D80–1DBF)
*
Latin Extended-D
Latin Extended-D is a Unicode block containing Latin characters for phonetic, Mayanist, and Medieval
In the history of Europe, the Middle Ages or medieval period lasted approximately from the late 5th to the late 15th centuries, simi ...
(A720–A7FF)
*
Latin Extended-E
Latin Extended-E is a Unicode block containing Latin script characters used in German dialectology (Teuthonista
Teuthonista is a phonetic transcription system used predominantly for the transcription of (High) German dialects. It is very s ...
(AB30–AB6F)
*
Latin Extended-F
Latin Extended-F is a Unicode block containing modifier letters, nearly all IPA and extIPA, for phonetic transcription. The Latin Extended-F and -G blocks contain the first Latin characters defined outside of the Basic Multilingual Plane
In ...
(10780–107BF)
*
Latin Extended-G (1DF00–1DFFF)
Unicode blocks with many phonetic symbols
Six
Unicode blocks contain many phonetic symbols:
IPA Extensions (U+0250–02AF)
Spacing Modifier Letters (U+02B0–02FF)
The characters in the "Spacing Modifier Letters" block are intended as forming a unity with the preceding letter (which they "modify"). E.g. the character isn't intended simply as a superscript ''h'' (
h), but as the mark of aspiration placed after the letter being aspirated, as in "
aspirated voiceless bilabial plosive
The voiceless bilabial plosive or stop is a type of consonantal sound used in most spoken languages. The symbol in the International Phonetic Alphabet that represents this sound is , and the equivalent X-SAMPA symbol is p.
Features
Features o ...
". The block contains:
*Latin superscript modifier letters: (U+02B0–U+02B8): ʰ aspiration; ʱ breathy voice, murmured; ʲ palatalization; ʳ, ʴ, ʵ, ʶ r-coloring or r-offglides; ʷ labialization; ʸ palatalization,
Americanist usage for U+02B2
*Miscellaneous phonetic modifiers: (U+02B9–U+02D7): ʹ ʺ ʻ ʼ ʽ ʾ ʿ ˀ ˁ ˂ ˃ ˄ ˅ ˆ ˇ ˈ ˉ ˊ ˋ ˌ ˍ ˎ ˏ ː ˑ ˒ ˓ ˔ ˕ ˖ ˗
*Spacing clones of diacritics: (U+02D8–U+02DD): ˘
breve
A breve (, less often , neuter form of the Latin "short, brief") is the diacritic mark ˘, shaped like the bottom half of a circle. As used in Ancient Greek, it is also called , . It resembles the caron (the wedge or in Czech, in ...
; ˙
dot above; ˚
ring above; ˛
ogonek; ˜
small tilde; ˝
double acute accent
*Additions based on 1989 IPA: (U+02DE–U+02E4): ˞ ˟ ˠ ˡ ˢ ˣ ˤ
*
Tone letters: (U+02E5–U+02E9): ˥ ˦ ˧ ˨ ˩
*Extended
Bopomofo
Bopomofo (), or Mandarin Phonetic Symbols, also named Zhuyin (), is a Chinese transliteration system for Mandarin Chinese and other related languages and dialects. More commonly used in Taiwanese Mandarin, it may also be used to transcribe ...
tone marks: ;
*IPA modifiers: , unaspirated
*Other modifier letters: for
Nenets
*
Uralic Phonetic Alphabet (UPA) modifiers: (U+02EF–U+02FF): ˯ ˰ ˱ ˲ ˳ ˴ ˵ ˶ ˷ ˸ ˹ ˺ ˻ ˼ ˽ ˾ ˿
Phonetic Extensions (U+1D00–1D7F)
This block, together with Phonetic Extensions Supplement below, contains:
* Small capitals "ɢ ɪ ɴ ɶ ʀ ʏ ʙ ʜ ʟ"
* Turned small letters "ɐ ɥ ɯ ɹ ɺ ɻ ʇ ʌ ʍ ʎ ʞ ʮ ʯ"
* Extra small capitals "ʁ ʛ ᴀ ᴁ ᴃ ᴄ ᴅ ᴆ ᴇ ᴊ ᴋ ᴌ ᴍ ᴎ ᴏ ᴐ ᴘ ᴙ ᴚ ᴛ ᴜ ᴠ ᴡ ᴢ ᴣ ᴦ ᴧ ᴨ ᴩ ᴪ"
* Letters with palatal hooks "ƫ ᶀ ᶁ ᶂ ᶃ ᶄ ᶅ ᶆ ᶇ ᶈ ᶉ ᶊ ᶋ ᶌ ᶍ ᶎ ᶪ ᶵ"
* Letters with retroflex hooks "ᶏ ᶐ ᶒ ᶓ ᶔ ᶕ ᶖ ᶗ ᶘ ᶙ ᶚ ᶩ ᶯ ᶼ"
Phonetic Extensions Supplement (U+1D80–1DBF)
Modifier Tone Letters (U+A700–A71F)
Superscripts and Subscripts (U+2070–209F)
Font support for IPA
Input by selection from a screen

Many systems provide a way to select Unicode characters visually.
ISO/IEC 14755 refers to this as a ''screen-selection entry method''.
Microsoft Windows has provided a Unicode version of the Character Map program (find it by hitting then type
charmap
then hit ) since version NT 4.0 – appearing in the consumer edition since XP. This is limited to characters in the
Basic Multilingual Plane
In the Unicode standard, a plane is a continuous group of 65,536 (216) code points. There are 17 planes, identified by the numbers 0 to 16, which corresponds with the possible values 00–1016 of the first two positions in six position hexadecim ...
(BMP). Characters are searchable by Unicode character name, and the table can be limited to a particular code block. More advanced third-party tools of the same type are also available (a notable
freeware
Freeware is software, most often proprietary, that is distributed at no monetary cost to the end user. There is no agreed-upon set of rights, license, or EULA that defines ''freeware'' unambiguously; every publisher defines its own rules for t ...
example is
BabelMap
Andrew Christopher West (; born 31 March 1960) is an English Sinologist. His first works concerned Chinese novels of the Ming and Qing
The Qing dynasty ( ), officially the Great Qing,, was a Manchu-led imperial dynasty of China an ...
).
macOS
macOS (; previously OS X and originally Mac OS X) is a Unix operating system developed and marketed by Apple Inc. since 2001. It is the primary operating system for Apple's Mac (computer), Mac computers. Within the market of ...
provides a "character palette" with much the same functionality, along with searching by related characters, glyph tables in a font, etc. It can b
enabledin the input menu in the menu bar under System Preferences → International → Input Menu (or System Preferences → Language and Text → Input Sources) or can be viewed under Edit → Emoji & Symbols in many programs.
Equivalent tools – such as
gucharmap (
GNOME) or
kcharselect
The KDE Gear (also known as the KDE Applications Bundle or KDE Applications) is a set of applications and supporting libraries that are developed by the KDE, KDE community, primarily used on Linux-based operating systems but mostly multiplatform, ...
(
KDE) – exist on most Linux desktop environments.
See also
*
Unicode symbols
*
Universal Character Set characters
*
Latin script in Unicode
*
IPA
IPA commonly refers to:
* India pale ale, a style of beer
* International Phonetic Alphabet, a system of phonetic notation
* Isopropyl alcohol, a chemical compound
IPA may also refer to:
Organizations International
* Insolvency Practitioner ...
References
External links
Links to PDFs of Unicode codes for several phonetic symbol sets
{{IPA navigation
Unicode
Unicode, formally The Unicode Standard,The formal version reference is is an information technology standard for the consistent encoding, representation, and handling of text expressed in most of the world's writing systems. The standard, ...
*