A typographic approximation is a replacement of an element of the
writing system
A writing system comprises a set of symbols, called a ''script'', as well as the rules by which the script represents a particular language. The earliest writing appeared during the late 4th millennium BC. Throughout history, each independen ...
(usually a
glyph
A glyph ( ) is any kind of purposeful mark. In typography, a glyph is "the specific shape, design, or representation of a character". It is a particular graphical representation, in a particular typeface, of an element of written language. A ...
) with another glyph or glyphs. The replacement may be a nearly
homographic character, a
digraph, or a character string. An approximation is different from a
typographical error
A typographical error (often shortened to typo), also called a misprint, is a mistake (such as a spelling or transposition error) made in the typing of printed or electronic material. Historically, this referred to mistakes in manual typesettin ...
in that an approximation is intentional and aims to preserve the visual appearance of the original. The concept of approximation also applies to the
World Wide Web
The World Wide Web (WWW or simply the Web) is an information system that enables Content (media), content sharing over the Internet through user-friendly ways meant to appeal to users beyond Information technology, IT specialists and hobbyis ...
and other forms of textual information available via digital media, though usually at the level of
characters, not glyphs.
Historically, the main cause of typographic approximation was a low quantity of glyphs (such as
letterform
A letterform, letter-form or letter form is a term used especially in typography, palaeography, calligraphy and epigraphy to mean a letter (alphabet), letter's shape. A letterform is a type of glyph, which is a specific, concrete way of writing a ...
s and
symbol
A symbol is a mark, Sign (semiotics), sign, or word that indicates, signifies, or is understood as representing an idea, physical object, object, or wikt:relationship, relationship. Symbols allow people to go beyond what is known or seen by cr ...
s) available for printing.
In the age of World Wide Web and
digital typesetting, especially after the advent of
Unicode
Unicode or ''The Unicode Standard'' or TUS is a character encoding standard maintained by the Unicode Consortium designed to support the use of text in all of the world's writing systems that can be digitized. Version 16.0 defines 154,998 Char ...
and enormous amount of
computer font
A computer font is implemented as a digital data file containing a set of graphically related glyphs. A computer font is designed and created using a font editor. A computer font specifically designed for the computer screen, and not for printi ...
s, typographic approximations are usually caused either by low ability of humans to distinguish and find needed symbols or by inadequate replacement patterns in
word processor A word processor (WP) is a device or computer program that provides for input, editing, formatting, and output of text, often with some additional features.
Early word processors were stand-alone devices dedicated to the function, but current word ...
s,
rather than by lack of available characters.
Typewriter and line printer approximations
Merger of characters
On
typewriter
A typewriter is a Machine, mechanical or electromechanical machine for typing characters. Typically, a typewriter has an array of Button (control), keys, and each one causes a different single character to be produced on paper by striking an i ...
, several characters were merged due to limited size of glyph repertoire. Several modern computing
characters appeared by merger of different symbols, such as the "typewriter"
apostrophe
The apostrophe (, ) is a punctuation mark, and sometimes a diacritical mark, in languages that use the Latin alphabet and some other alphabets. In English, the apostrophe is used for two basic purposes:
* The marking of the omission of one o ...
, ', which can denote an apostrophe proper, ’, a single
quotation mark
Quotation marks are punctuation marks used in pairs in various writing systems to identify direct speech, a quotation, or a phrase. The pair consists of an opening quotation mark and a closing quotation mark, which may or may not be the sam ...
, or the
prime symbol.
Non-spacing modifiers
Some typewriters have
''non-spacing'' keys for use as
diacritic
A diacritic (also diacritical mark, diacritical point, diacritical sign, or accent) is a glyph added to a letter or to a basic glyph. The term derives from the Ancient Greek (, "distinguishing"), from (, "to distinguish"). The word ''diacrit ...
al marks. After the typist pushes, say, acute accent ◌́ the caret does not move. This allows the typist to
overstrike
In typography, overstrike is a method of printing characters that are missing from the printer's character set. The character is created by placing one character on another one – for example, overstriking ⟨L⟩ with ⟨-⟩ results in prin ...
this mark by a
spacing letter, say, e and obtain é, an accented letter. Due to geometrical restrictions of a
monospaced font
A monospaced font, also called a fixed-pitch, fixed-width, or non-proportional font, is a font whose letters and characters each occupy the same amount of horizontal space. This contrasts with Typeface#Proportion, variable-width fonts, where t ...
, the result could not always be perfect. For example, overstriking was unlikely to be a feasible method to produce
uppercase
Letter case is the distinction between the letters that are in larger uppercase or capitals (more formally ''#Majuscule, majuscule'') and smaller lowercase (more formally ''#Minuscule, minuscule'') in the written representation of certain langua ...
accented letters, such as É.
Overstrike was used on
line printers for the same function. This contributed to standardization of such characters as .
Overstrike of the same letter was used to simulate
boldface
In typography, emphasis is the strengthening of words in a text with a font in a different style from the rest of the text, to highlight them. It is the equivalent of prosody stress in speech.
Methods and use
The most common methods in We ...
letters on line printers.
ASCII approximations
The
US-ASCII
ASCII ( ), an acronym for American Standard Code for Information Interchange, is a character encoding standard for representing a particular set of 95 (English language focused) printable character, printable and 33 control character, control c ...
character set and other variants of
ISO/IEC 646
ISO/IEC 646 ''Information technology — ISO 7-bit coded character set for information interchange'', is an International Organization for Standardization, ISO/International Electrotechnical Commission, IEC standard in the ...
contains 95
graphic characters. It is comparable with a (Latin script) typewriter and insufficient for a quality
typography
Typography is the art and technique of Typesetting, arranging type to make written language legibility, legible, readability, readable and beauty, appealing when displayed. The arrangement of type involves selecting typefaces, Point (typogra ...
. But high availability and robustness of ASCII character encoding prompted computer users to invent ASCII substitutes for various glyphs.
The following ASCII characters are used to approximate certain characters. Note that there are many Latin letters that are homographic to letters of other scripts, however those Latin letters are not listed below.
* (space):
alignment and justification.
* : various type of
double quotes, double prime
″.
* : sharp symbol
♯.
* : various type of
single quotes,
apostrophe
The apostrophe (, ) is a punctuation mark, and sometimes a diacritical mark, in languages that use the Latin alphabet and some other alphabets. In English, the apostrophe is used for two basic purposes:
* The marking of the omission of one o ...
’, prime
′.
* Parentheses :
encircled characters, such as for Copyright symbol
©.
* – multiplication sign ×, bullet point •
* – various symbols with strokes extending to left, up, right and down.
* – probably an ASCII character the most used for approximations. A conventional representation of
hyphen
The hyphen is a punctuation mark used to join words and to separate syllables of a single word. The use of hyphens is called hyphenation.
The hyphen is sometimes confused with dashes (en dash , em dash and others), which are wider, or with t ...
, an approximation of
dash
The dash is a punctuation mark consisting of a long horizontal line. It is similar in appearance to the hyphen but is longer and sometimes higher from the baseline. The most common versions are the endash , generally longer than the hyphen ...
(especially as and ), minus sign
− and
line drawing horizontal line ─ (see the image).
* : various
dot-like symbols, see
Full stop
The full stop ( Commonwealth English), period (North American English), or full point is a punctuation mark used for several purposes, most often to mark the end of a declarative sentence (as distinguished from a question or exclamation).
A ...
.
* – see
Slash (punctuation)
The slash is a slanting line punctuation mark . It is also known as a stroke, a solidus, a forward slash and #Alternative names, several other historical or technical names. Once used as the equivalent of the modern full stop, period and comma ...
.
* : Turkish
dotless ı, Cyrillic
palochka
The palochka () is a letter in the Cyrillic script. The letter is usually caseless. It was introduced in the late 1930s as the Hindu-Arabic digit ' 1', and on Cyrillic keyboards, it is usually typeset as the Roman numeral ''. Unicode currentl ...
Ӏ.
* : IPA reversed epsilon
ɜ, Cyrillic letter
З.
* : Cyrillic letter
Ч.
* : various non-Latin letters and symbols with similar grapheme.
* – see
Colon (punctuation)
The colon, , is a punctuation mark consisting of two equally sized dots aligned vertically. A colon often precedes an explanation, a list, or a quoted sentence. It is also used between hours and minutes in time, between certain elements in medi ...
.
* and :
chevrons ⟨ ⟩, angle quotes ‹ ›, horizontal
arrows (especially as digraphs and ).
* :
line drawing horizontal double line ═ (see the image),
double hyphen.
* – although not an approximation, the question sign sometimes replaces unrepresented and unrecognized characters.
* – see
At sign
The at sign () is an accounting and invoice abbreviation meaning "at a rate of" (e.g. 7 Widget (economics), widgets @ £2 per widget = £14), now seen more widely in email addresses and social media platform User (computing), handles. It is norm ...
.
* :
Numero sign
The numero sign or numero symbol, (also represented as Nº, No̱, No., or no.), is a Typography, typographic abbreviation of the word ''number''(''s'') indicating Ordinal numeral, ordinal numeration, especially in names and titles. For example ...
№.
* : various symbols with strokes extending to left, right and down, but not up.
* :
set union
In set theory, the union (denoted by ∪) of a collection of sets is the set of all elements in the collection. It is one of the fundamental operations through which sets can be combined and related to each other.
A refers to a union of ze ...
∪.
* :
logical OR ∨.
* :
X mark
An X mark (also known as an ex mark or a cross mark or simply an X or ex or a cross) is used to indicate the concept of negation (for example "no, this has not been verified", "no, that is not the correct answer" or "no, I do not agree") as well ...
✗.
* and :
checkbox
A checkbox (check box, tickbox, tick box) is a graphical widget that allows the user to make a binary choice, i.e. a choice between one of two possible mutually exclusive options. For example, the user may have to answer 'yes' (checked) or 'n ...
and similar
rectangular pictograms.
* :
logical AND ∧, upwards arrow ↑, and similar symbols with the wedge at the top.
* – see
Underscore
An underscore or underline is a line drawn under a segment of text. In proofreading, underscoring is a convention that says "set this text in italic type", traditionally used on manuscript or typescript as an instruction to the printer. Its ...
.
* –
opening single quote ‘.
* - flat symbol
♭
* :
bullets
A bullet is a Kinetic energy weapon, kinetic projectile, a component of firearm ammunition that is Shooting, shot from a gun barrel. They are made of a variety of materials, such as copper, lead, steel, polymer, rubber and even wax; and are made ...
and various
circle
A circle is a shape consisting of all point (geometry), points in a plane (mathematics), plane that are at a given distance from a given point, the Centre (geometry), centre. The distance between any point of the circle and the centre is cal ...
-like symbols such as
∘ and ∞ (using two consecutive characters).
* : μ — SI prefix ''
micro-
''Micro'' (Greek letter μ, Mu (letter), mu, non-Italic type, italic) is a metric prefix, unit prefix in the metric system denoting a factor of one millionth (10−6). It comes from the Ancient Greek, Greek word (), meaning "small".
It is the ...
'' or lowercase Greek letter
mu
* : downwards arrow ↓, and similar symbols with the wedge at the bottom.
* :
multiplication sign
The multiplication sign (), also known as the times sign or the dimension sign, is a mathematical symbol used to denote the operation of multiplication, which results in a product.
The symbol is also used in botany, in botanical hybrid nam ...
×.
* (on the image, this ASCII character is rendered as a ''broken bar'' ¦):
line drawing vertical symbols.
* – see
Tilde.
Approximation of non-glyphs
There exist various approximation for
typographic alignment
In typesetting and page layout, alignment or range is the setting of typography, text flow or image placement relative to a page (paper), page, column (typography), column (measure), table cell, or tabulation, tab (and often to an image above it o ...
. For example,
justification may be emulated with inserting of
spaces, and flush-right alignment may be done by padding with spaces.
There are various techniques for approximation of
tables (historically used for
text mode
Text mode is a computer display mode in which content is internally represented on a computer screen in terms of characters rather than individual pixels. Typically, the screen consists of a uniform rectangular grid of ''character cells'', ea ...
displays), such as
box-drawing characters.
References
{{reflist
Typography
Typesetting