Underline(I)
   HOME

TheInfoList



OR:

An underscore or underline is a line drawn under a segment of text. In
proofreading Proofreading is a phase in the process of publishing where galley proofs are compared against the original manuscripts or graphic artworks, to identify transcription errors in the typesetting process. In the past, proofreaders would place corr ...
, underscoring is a convention that says "set this text in
italic type In typography, italic type is a cursive font based on a stylised form of calligraphic handwriting. Along with blackletter and roman type, it served as one of the major typefaces in the history of Western typography. Owing to the influence f ...
", traditionally used on
manuscript A manuscript (abbreviated MS for singular and MSS for plural) was, traditionally, any document written by hand or typewritten, as opposed to mechanically printed or reproduced in some indirect or automated way. More recently, the term has ...
or
typescript TypeScript (abbreviated as TS) is a high-level programming language that adds static typing with optional type annotations to JavaScript. It is designed for developing large applications and transpiles to JavaScript. It is developed by Micr ...
as an instruction to the printer. Its use to add emphasis in modern finished documents is generally avoided. The (freestanding) underscore character, , also called a low line, or low dash, originally appeared on the
typewriter A typewriter is a Machine, mechanical or electromechanical machine for typing characters. Typically, a typewriter has an array of Button (control), keys, and each one causes a different single character to be produced on paper by striking an i ...
so that underscores could be typed. To produce an underscored word, the word was typed, the
typewriter carriage A typewriter is a mechanical or electromechanical machine for typing characters. Typically, a typewriter has an array of keys, and each one causes a different single character to be produced on paper by striking an inked ribbon selectively a ...
was moved back to the beginning of the word, and the word was overtyped with the underscore character. In modern usage, underscoring is achieved with a
markup language A markup language is a Encoding, text-encoding system which specifies the structure and formatting of a document and potentially the relationships among its parts. Markup can control the display of a document or enrich its content to facilitate au ...
, with the Unicode combining low line or as a standard facility of
word processing A word processor (WP) is a device or computer program that provides for input, editing, formatting, and output of text, often with some additional features. Word processor (electronic device), Early word processors were stand-alone devices dedicate ...
software. The free-standing underscore character is used to indicate word boundaries in situations where spaces are not allowed, such as in computer
filename A filename or file name is a name used to uniquely identify a computer file in a file system. Different file systems impose different restrictions on filename lengths. A filename may (depending on the file system) include: * name – base ...
s,
email address An email address identifies an email box to which messages are delivered. While early messaging systems used a variety of formats for addressing, today, email addresses follow a set of specific rules originally standardized by the Internet Enginee ...
es, and in Internet
URL A uniform resource locator (URL), colloquially known as an address on the Web, is a reference to a resource that specifies its location on a computer network and a mechanism for retrieving it. A URL is a specific type of Uniform Resource Identi ...
s, for example . It is also used as a proofreader's mark, to indicate that text should be italicised when
typeset Typesetting is the composition of text for publication, display, or distribution by means of arranging physical ''type'' (or ''sort'') in mechanical systems or ''glyphs'' in digital systems representing '' characters'' (letters and other ...
, for instance is to be rendered as ''thus''.


Diacritic

The combining diacritic, , is similar to the combining low line but is shorter. The difference between "macron below" and "low line" is that the latter results in an unbroken underline when it is run together: compare and (only the latter should look like abc).


Modern use

In a
manuscript A manuscript (abbreviated MS for singular and MSS for plural) was, traditionally, any document written by hand or typewritten, as opposed to mechanically printed or reproduced in some indirect or automated way. More recently, the term has ...
(or typescript) to be
typeset Typesetting is the composition of text for publication, display, or distribution by means of arranging physical ''type'' (or ''sort'') in mechanical systems or ''glyphs'' in digital systems representing '' characters'' (letters and other ...
, various forms of underlining (see
below Below may refer to: *Earth *Ground (disambiguation) *Soil *Floor * Bottom (disambiguation) *Less than *Temperatures below freezing *Hell or underworld People with the surname * Ernst von Below (1863–1955), German World War I general * Fred Belo ...
) were therefore conventionally used to indicate that text should be set in special type such as
italics In typography, italic type is a cursive font based on a stylised form of calligraphic handwriting. Along with blackletter and roman type, it served as one of the major typefaces in the history of Western typography. Owing to the influence f ...
, part of a procedure known as markup. In printed documents underlining is generally avoided, with
italics In typography, italic type is a cursive font based on a stylised form of calligraphic handwriting. Along with blackletter and roman type, it served as one of the major typefaces in the history of Western typography. Owing to the influence f ...
or
small caps In typography, small caps (short for small capitals) are grapheme, characters typeset with glyphs that resemble uppercase letters but reduced in height and weight close to the surrounding lowercase letters or text figures. Small caps are used i ...
often used instead, or (especially in headings) using
capitalization Capitalization ( North American spelling; also British spelling in Oxford) or capitalisation (Commonwealth English; all other meanings) is writing a word with its first letter as a capital letter (uppercase letter) and the remaining letters in ...
,
bold type In typography, emphasis is the strengthening of words in a text with a font in a different style from the rest of the text, to highlight them. It is the equivalent of prosody stress in speech. Methods and use The most common methods in We ...
or greater body height (font size). Underlining may still be seen in display work. A series of underscores (like __________ ) may be used to reserve a blank space in text that is later to be filled in by hand, such as on a paper form. It is also sometimes used to create a horizontal line; other symbols with similar
glyph A glyph ( ) is any kind of purposeful mark. In typography, a glyph is "the specific shape, design, or representation of a character". It is a particular graphical representation, in a particular typeface, of an element of written language. A ...
s, such as hyphens and dashes, are also used for this purpose. In
German German(s) may refer to: * Germany, the country of the Germans and German things **Germania (Roman era) * Germans, citizens of Germany, people of German ancestry, or native speakers of the German language ** For citizenship in Germany, see also Ge ...
, Slovene and some other
Slavic languages The Slavic languages, also known as the Slavonic languages, are Indo-European languages spoken primarily by the Slavs, Slavic peoples and their descendants. They are thought to descend from a proto-language called Proto-Slavic language, Proto- ...
, the underscore has recently gained prominence as the punctuation to form gender-neutral suffixes in gendered nouns and other parts of the speech. The underscore is also used in modern editions of Spanish vocal sheet music to indicate
elision In linguistics, an elision or deletion is the omission of one or more sounds (such as a vowel, a consonant, or a whole syllable) in a word or phrase. However, these terms are also used to refer more narrowly to cases where two words are run to ...
, instead of the
breve A breve ( , less often , grammatical gender, neuter form of the Latin "short, brief") is the diacritic mark , shaped like the bottom half of a circle. As used in Ancient Greek, it is also called , . It resembles the caron (, the wedge or in ...
below (), which is less convenient to input on a computer.


Mathematics

In mathematical notations, underscores are sometimes used in the following contexts: * vectors. Underscores are sometimes used instead of bold face or a variable with an overarrow. E.g.: \underline = \mathbf = \left _1\;v_2\right\mathbf. *
complex number In mathematics, a complex number is an element of a number system that extends the real numbers with a specific element denoted , called the imaginary unit and satisfying the equation i^= -1; every complex number can be expressed in the for ...
s. Similar to vectors, underscored variable names can indicate a complex-valued variable, especially if the real-valued variable name is reused to indicate the complex variable. For instance, in optics, n is often used for a medium's
index of refraction In optics, the refractive index (or refraction index) of an optical medium is the ratio of the apparent speed of light in the air or vacuum to the speed in the medium. The refractive index determines how much the path of light is bent, or refrac ...
. When the refractive index has a wavelength-dependent extinction coefficient \kappa, the combined complex index of refraction is sometimes notated \underline = n + \mathrm\kappa. * minimum bound. Example: \underline < x < \overline, where \underline is the lower bound of x, and \overline is the upper bound.


Usage in computing

In
web browser A web browser, often shortened to browser, is an application for accessing websites. When a user requests a web page from a particular website, the browser retrieves its files from a web server and then displays the page on the user's scr ...
s, default settings typically distinguish
hyperlink In computing, a hyperlink, or simply a link, is a digital reference providing direct access to Data (computing), data by a user (computing), user's point and click, clicking or touchscreen, tapping. A hyperlink points to a whole document or to ...
s by underlining them (and usually changing their color), but both users and websites can change the settings to make some or all hyperlinks appear differently (or even without distinction from normal text).


History

As early output devices (
Teleprinter A teleprinter (teletypewriter, teletype or TTY) is an electromechanical device that can be used to send and receive typed messages through various communications channels, in both point-to-point (telecommunications), point-to-point and point- ...
s,
CRT CRT or Crt most commonly refers to: * Cathode-ray tube, a display * Critical race theory, an academic framework of analysis CRT may also refer to: Law * Charitable remainder trust, United States * Civil Resolution Tribunal, Canada * Columbia ...
s and
line printer A line printer Printer (computing), prints one entire line of text before advancing to another line. Most early line printers were printer (computing)#Impact printers, impact printers. Line printers are mostly associated with unit record eq ...
s) could not produce more than one character at a location, it was not possible to underscore text, so early encodings such as
ITA2 The Baudot code () is an early character encoding for telegraphy invented by Émile Baudot in the 1870s. It was the predecessor to the International Telegraph Alphabet No. 2 (ITA2), the most common teleprinter code in use before ASCII. Each Chara ...
and the first versions of
ASCII ASCII ( ), an acronym for American Standard Code for Information Interchange, is a character encoding standard for representing a particular set of 95 (English language focused) printable character, printable and 33 control character, control c ...
had no underscore. IBM's
EBCDIC Extended Binary Coded Decimal Interchange Code (EBCDIC; ) is an eight- bit character encoding used mainly on IBM mainframe and IBM midrange computer operating systems. It descended from the code used with punched cards and the corresponding si ...
character-coding system, introduced in 1964, added the underscore, which IBM referred to as the "break character". IBM's report on NPL (the early name of what is now called
PL/I PL/I (Programming Language One, pronounced and sometimes written PL/1) is a procedural, imperative computer programming language initially developed by IBM. It is designed for scientific, engineering, business and system programming. It has b ...
) leaves the character set undefined, but specifically mentions the break character, and gives as an example identifier. By 1967 the underscore had spread to
ASCII ASCII ( ), an acronym for American Standard Code for Information Interchange, is a character encoding standard for representing a particular set of 95 (English language focused) printable character, printable and 33 control character, control c ...
, replacing the similarly shaped left-arrow character, (see also: PIP). C, developed at Bell Labs in the early 1970s, allowed the underscore in identifiers. Underscore predates the existence of lower-case letters in many systems, so often it had to be used to make multi-word identifiers, since camelCase (see below) was not available.


Programming conventions

Underscores inserted between letters are very common to make a "multi-word" identifier in languages that cannot handle spaces in identifiers. This convention is known as "
snake_case Snake case (sometimes stylized autological, autologically as snake_case) is the Naming convention (programming), naming convention in which each space is replaced with an underscore (_) character, and words are written in lowercase. It is a comm ...
" (the other popular method is called camelCase, where capital letters are used to show where the words start). An underscore as the first character in an ID is often used to indicate an internal implementation that is not considered part of the
API An application programming interface (API) is a connection between computers or between computer programs. It is a type of software interface, offering a service to other pieces of software. A document or standard that describes how to build ...
and should not be called by code outside that implementation. In Dart, all private properties of classes must start with an underscore; this usage is also common in other languages such as C++ even though those provide keywords to indicate that members are private. It is extensively used to hide variables and functions used for implementations in
header file An include directive instructs a text file processor to replace the directive text with the content of a specified file. The act of including may be logical in nature. The processor may simply process the include file content at the location of ...
s. In fact, the use of a single underscore for this became so common that C compilers had to standardize on a ''double'' leading underscore (for instance __DATE__) for actual built-in variables to avoid conflicts with the ones in header files.
PHP PHP is a general-purpose scripting language geared towards web development. It was originally created by Danish-Canadian programmer Rasmus Lerdorf in 1993 and released in 1995. The PHP reference implementation is now produced by the PHP Group. ...
"reserves all function names starting with __ ouble underscoreas magical." Python uses names that both start and end with double underscores (so called "dunder methods", as in double underscore) for magic members used for purposes such as operator overloading and reflection, and names starting but not ending with a double underscore to denote private
member variable In object-oriented programming Object-oriented programming (OOP) is a programming paradigm based on the concept of '' objects''. Objects can contain data (called fields, attributes or properties) and have actions they can perform (called ...
s of classes which should be mangled in a manner which prevents them from colliding with members of derived classes unless the classes have the same name ( in class will be mangled to ). By convention, members starting with a single underscore are considered private or protected, although this behavior only has inherent effect for modules, where statements by default import all names that do not start with an underscore, unless an export list is explicitly defined by the module. A variable named with just an underscore often has special meaning. $_ or _ is the previous command or result in many interactive shells, such as those of Python,
Ruby Ruby is a pinkish-red-to-blood-red-colored gemstone, a variety of the mineral corundum ( aluminium oxide). Ruby is one of the most popular traditional jewelry gems and is very durable. Other varieties of gem-quality corundum are called sapph ...
, and
Perl Perl is a high-level, general-purpose, interpreted, dynamic programming language. Though Perl is not officially an acronym, there are various backronyms in use, including "Practical Extraction and Reporting Language". Perl was developed ...
. In
Perl Perl is a high-level, general-purpose, interpreted, dynamic programming language. Though Perl is not officially an acronym, there are various backronyms in use, including "Practical Extraction and Reporting Language". Perl was developed ...
, @_ is a special
array An array is a systematic arrangement of similar objects, usually in rows and columns. Things called an array include: {{TOC right Music * In twelve-tone and serial composition, the presentation of simultaneous twelve-tone sets such that the ...
variable that holds the
argument An argument is a series of sentences, statements, or propositions some of which are called premises and one is the conclusion. The purpose of an argument is to give reasons for one's conclusion via justification, explanation, and/or persu ...
s to a
function Function or functionality may refer to: Computing * Function key, a type of key on computer keyboards * Function model, a structured representation of processes in a system * Function object or functor or functionoid, a concept of object-orie ...
. In
Clojure Clojure (, like ''closure'') is a dynamic programming language, dynamic and functional programming, functional dialect (computing), dialect of the programming language Lisp (programming language), Lisp on the Java (software platform), Java platfo ...
, it indicates an argument whose value will be ignored. In some languages with
pattern matching In computer science, pattern matching is the act of checking a given sequence of tokens for the presence of the constituents of some pattern. In contrast to pattern recognition, the match usually must be exact: "either it will or will not be a ...
, such as
Haskell Haskell () is a general-purpose, statically typed, purely functional programming language with type inference and lazy evaluation. Designed for teaching, research, and industrial applications, Haskell pioneered several programming language ...
and other ML-family languages,
Rust Rust is an iron oxide, a usually reddish-brown oxide formed by the reaction of iron and oxygen in the catalytic presence of water or air moisture. Rust consists of hydrous iron(III) oxides (Fe2O3·nH2O) and iron(III) oxide-hydroxide (FeO(OH) ...
, C#, Python,
Prolog Prolog is a logic programming language that has its origins in artificial intelligence, automated theorem proving, and computational linguistics. Prolog has its roots in first-order logic, a formal logic. Unlike many other programming language ...
, and the
Wolfram Language The Wolfram Language ( ) is a proprietary, very high-level multi-paradigm programming language developed by Wolfram Research. It emphasizes symbolic computation, functional programming, and rule-based programming and can employ arbitrary stru ...
, the pattern _ matches any value, but does not perform binding, hence it can be reused multiple times in a single pattern and does not require declaration.


HTML <u> and CSS

The ASCII underscore character can be inserted with the entities or (or or ). HTML has a presentational element that was originally used to underline text; this usage was
deprecated Deprecation is the discouragement of use of something human-made, such as a term, feature, design, or practice. Typically something is deprecated because it is claimed to be inferior compared to other options available. Something may be deprec ...
in HTML4 in favor of the CSS style . In HTML5, the tag reappeared but its meaning was changed significantly: it now "represents a span of inline text which should be rendered in a way that indicates that it has a non-textual annotation". This facility is intended for example to provide a red wavy line (or wiggly line) underline to flag spelling errors at input time but which are not to be embedded in any stored file (unlike an emphasis mark, which would be). Other styles are also available: doubled, dotted, and dashed. The elements may also exist in other
markup language A markup language is a Encoding, text-encoding system which specifies the structure and formatting of a document and potentially the relationships among its parts. Markup can control the display of a document or enrich its content to facilitate au ...
s, such as
MediaWiki MediaWiki is free and open-source wiki software originally developed by Magnus Manske for use on Wikipedia on January 25, 2002, and further improved by Lee Daniel Crocker,mailarchive:wikipedia-l/2001-August/000382.html, Magnus Manske's announc ...
. The
Text Encoding Initiative The Text Encoding Initiative (TEI) is a text-centric community of practice in the academic field of digital humanities, operating continuously since the 1980s. The community currently runs a mailing list, meetings and conference series, and ma ...
(TEI) provides an extensive selection of related elements for marking editorial activity (insertion, deletion, correction, addition, etc.).


Unicode

Unicode Unicode or ''The Unicode Standard'' or TUS is a character encoding standard maintained by the Unicode Consortium designed to support the use of text in all of the world's writing systems that can be digitized. Version 16.0 defines 154,998 Char ...
has a free-standing underscore at U+005F, inherited from ASCII, which is a legacy of the typewriter practice of underlining using backspace and overtype. Modern practice uses the
combining diacritic In digital typography, combining characters are characters that are intended to modify other characters. The most common combining characters in the Latin script are the combining diacritical marks (including combining accents). Unicode al ...
that results in an underline when run together: u̲n̲d̲e̲r̲l̲i̲n̲e̲. Unicode also has . In addition, there are single line and double line versions of the combining macron below, a diacritic that applies to single letters only.


"Simulated" underlines in plain-text

In
plain-text In computing, plain text is a loose term for data (e.g. file contents) that represent only characters of readable material but not its graphical representation nor other objects (floating-point numbers, images, etc.). It may also include a limi ...
applications, including plain-text
e-mail Electronic mail (usually shortened to email; alternatively hyphenated e-mail) is a method of transmitting and receiving Digital media, digital messages using electronics, electronic devices over a computer network. It was conceived in the ...
s where emphasis markup is not possible, the desired emphasis is often indicated by surrounding words with underscore characters. For example, "You must use _emulsion_ paint on the ceiling". Some applications will automatically add emphasis to text manually bracketed by underscores, either by underlining or by italicizing it (e.g. may render as either or ''string'').


As a marker for incorrectness

Underline (typically red or wavy or both) is often used by
spell checker In software, a spell checker (or spelling checker or spell check) is a software feature that checks for misspellings in a text. Spell-checking features are often embedded in software or services, such as a word processor, email client, electronic ...
s (and
grammar checker A grammar checker, in computing terms, is a Computer program, program, or part of a program, that attempts to verify written text for grammatical correctness. Grammar checkers are most often implemented as a feature of a larger program, such as a ...
s) to denote misspelled or otherwise incorrect text.


Manuscripts

Depending on local conventions, the following kinds of underlines may be used inline on manuscripts to indicate the special
typeface A typeface (or font family) is a design of Letter (alphabet), letters, Numerical digit, numbers and other symbols, to be used in printing or for electronic display. Most typefaces include variations in size (e.g., 24 point), weight (e.g., light, ...
s to be used: *single dashed underline for , 'let it stand', proof-reading mark cancelled. *single straight underline for ''italic type'' *single wavy underline for bold type *double straight underline for *double underline of one straight line and one wavy line for ''bold italic'' *triple underline for FULL CAPITAL LETTERS (used among small caps or to change text already typed as lower case).


Underlines in Chinese

In Chinese, the underline is a little-used punctuation mark for proper names (;
pinyin Hanyu Pinyin, or simply pinyin, officially the Chinese Phonetic Alphabet, is the most common romanization system for Standard Chinese. ''Hanyu'' () literally means 'Han Chinese, Han language'—that is, the Chinese language—while ''pinyin' ...
: zhuānmínghào; literally " proper name mark", used for personal and geographic names). Its meaning is somewhat akin to capitalization in English and should never be used for emphasis even if the influence of English computing makes the latter sometimes occur. A wavy underline (;
pinyin Hanyu Pinyin, or simply pinyin, officially the Chinese Phonetic Alphabet, is the most common romanization system for Standard Chinese. ''Hanyu'' () literally means 'Han Chinese, Han language'—that is, the Chinese language—while ''pinyin' ...
: shūmínghào; literally, "book title mark") serves a similar function, but marks names of literary works instead of proper names. In the case of two or more adjacent proper names, each individual proper name is separately underlined so there should be a slight gap between the underlining of each proper name.


See also

*
Overline An overline, overscore, or overbar, is a typographical feature of a horizontal and vertical, horizontal line drawn immediately above the text. In old mathematical notation, an overline was called a ''vinculum (symbol), vinculum'', a notation fo ...
*
Space Character A whitespace character is a character data element that represents white space when text is rendered for display by a computer. For example, a ''space'' character (, ASCII 32) represents blank space such as a word divider in a Western scri ...
*
Strikethrough Strikethrough, or strikeout, is a typographical presentation of words with a horizontal line through their center, resulting in , sometimes an X or a forward slash is typed over the top instead of using a horizontal line. Strike-through was u ...
*
Undertie The tie is a symbol in the shape of an arc similar to a large breve, used in Greek, phonetic alphabets, and Z notation. It can be used between two characters with spacing as punctuation, non-spacing as a diacritic, or (underneath) as a proofread ...
* Visible space


Notes


References


External links

* {{Typography terms Punctuation Typography Typographical symbols