Zero width joiner
   HOME

TheInfoList



OR:

The zero-width joiner (ZWJ, ) is a non-printing character used in the computerized
typesetting Typesetting is the composition of text by means of arranging physical ''type'' (or ''sort'') in mechanical systems or '' glyphs'' in digital systems representing '' characters'' (letters and other symbols).Dictionary.com Unabridged. Random ...
of
writing system A writing system is a method of visually representing verbal communication, based on a script and a set of rules regulating its use. While both writing and speech are useful in conveying messages, writing differs in also being a reliable fo ...
s in which the shape or positioning of a
grapheme In linguistics, a grapheme is the smallest functional unit of a writing system. The word ''grapheme'' is derived and the suffix ''-eme'' by analogy with ''phoneme'' and other names of emic units. The study of graphemes is called '' graphemi ...
depends on its relation to other graphemes (
complex script Complex text layout (CTL) or complex text rendering is the typesetting of writing systems in which the shape or positioning of a grapheme depends on its relation to other graphemes. The term is used in the field of software internationalizatio ...
s), such as the Arabic script or any
Indic script The Brahmic scripts, also known as Indic scripts, are a family of abugida writing systems. They are used throughout the Indian subcontinent, Southeast Asia and parts of East Asia. They are descended from the Brahmi script of ancient India ...
. Sometimes the
Roman script The Latin script, also known as Roman script, is an alphabetic writing system based on the letters of the classical Latin alphabet, derived from a form of the Greek alphabet which was in use in the ancient Greek city of Cumae, in southern Ital ...
is to be counted as complex, e.g. when using a
Fraktur Fraktur () is a calligraphic hand of the Latin alphabet and any of several blackletter typefaces derived from this hand. The blackletter lines are broken up; that is, their forms contain many angles when compared to the curves of the Antiqu ...
typeface. When placed between two characters that would otherwise not be connected, a ZWJ causes them to be printed in their connected forms. The exact behaviour of the ZWJ varies depending on whether the use of a
conjunct consonant Conjunct consonants are a type of letters, used for example in Brahmi or Brahmi derived modern scripts such as Balinese, Bengali, Devanagari, Gujarati, etc to write consonant clusters such as or . Although most of the time, letters are formed ...
or ligature (where multiple characters are shown with a single glyph) is expected by default; for instance, it suppresses the use of conjuncts in
Devanagari Devanagari ( ; , , Sanskrit pronunciation: ), also called Nagari (),Kathleen Kuiper (2010), The Culture of India, New York: The Rosen Publishing Group, , page 83 is a left-to-right abugida (a type of segmental writing system), based on the ...
(whilst still allowing the use of the individual joining form of a dead consonant, as opposed to a
halant Virama ( ्) is a Sanskrit phonological concept to suppress the inherent vowel that otherwise occurs with every consonant letter, commonly used as a generic term for a codepoint in Unicode, representing either # halanta, hasanta or explicit virā ...
form as would be required by the
zero-width non-joiner The zero-width non-joiner (ZWNJ) is a non-printing character used in the computerization of writing systems that make use of ligatures. When placed between two characters that would otherwise be connected into a ligature, a ZWNJ causes them to b ...
), but induces the use of conjuncts in Sinhala (which does not use them by default). Similarly to Sinhala, when a ZWJ is placed between two emoji characters (or interspersed between multiple), it can result in a single glyph being shown, such as the family emoji, made up of two adult emoji and one or two child emoji. In some cases, such as the second Devanagari example below, the ZWJ can be used to display a joining form in isolation, when included after the character and combining halant code. The character's code point is . In the
InScript InScript (short for Indic Script) is the decreed standard keyboard layout for Indian scripts using a standard 104- or 105-key layout. This keyboard layout was standardised by the Government of India for inputting text in languages of India writ ...
keyboard layout for Indian languages, it is typed by the key combination ''Ctrl+Shift+1''. However, many layouts use the position of QWERTY's ']' key for this character.


Examples


See also

*
Word joiner The word joiner (WJ) is a format character in Unicode used to indicate that word separation should not occur at a position, when using scripts such as Arabic that do not use explicit spacing. It is encoded since Unicode version 3.2 (released i ...
*
Zero-width non-joiner The zero-width non-joiner (ZWNJ) is a non-printing character used in the computerization of writing systems that make use of ligatures. When placed between two characters that would otherwise be connected into a ligature, a ZWNJ causes them to b ...


References


External links


Proposal on Clarification and Consolidation of the Function of ZERO WIDTH JOINER in Indic Scripts
Control characters Typography Unicode formatting code points {{Digital-typography-stub