The hyphen is a
punctuation
Punctuation marks are marks indicating how a piece of writing, written text should be read (silently or aloud) and, consequently, understood. The oldest known examples of punctuation marks were found in the Mesha Stele from the 9th century BC, c ...
mark used to join
word
A word is a basic element of language that carries semantics, meaning, can be used on its own, and is uninterruptible. Despite the fact that language speakers often have an intuitive grasp of what a word is, there is no consensus among linguist ...
s and to separate
syllable
A syllable is a basic unit of organization within a sequence of speech sounds, such as within a word, typically defined by linguists as a ''nucleus'' (most often a vowel) with optional sounds before or after that nucleus (''margins'', which are ...
s of a single word. The use of hyphens is called hyphenation.
The hyphen is sometimes confused with
dash
The dash is a punctuation mark consisting of a long horizontal line. It is similar in appearance to the hyphen but is longer and sometimes higher from the baseline. The most common versions are the endash , generally longer than the hyphen ...
es (
en dash ,
em dash
The dash is a punctuation mark consisting of a long horizontal line. It is similar in appearance to the hyphen but is longer and sometimes higher from the baseline. The most common versions are the endash , generally longer than the hyphen ...
and others), which are wider, or with the
minus sign
The plus sign () and the minus sign () are mathematical symbols used to denote positive and negative functions, respectively. In addition, the symbol represents the operation of addition, which results in a sum, while the symbol represent ...
, which is also wider and usually drawn a little higher to match the crossbar in the
plus sign
The plus sign () and the minus sign () are mathematical symbols used to denote positive and negative functions, respectively. In addition, the symbol represents the operation of addition, which results in a sum, while the symbol represents ...
.
As an
orthographic concept, the hyphen is a single entity. In
character encoding
Character encoding is the process of assigning numbers to graphical character (computing), characters, especially the written characters of human language, allowing them to be stored, transmitted, and transformed using computers. The numerical v ...
for use with computers, it is represented in
Unicode
Unicode or ''The Unicode Standard'' or TUS is a character encoding standard maintained by the Unicode Consortium designed to support the use of text in all of the world's writing systems that can be digitized. Version 16.0 defines 154,998 Char ...
by any of several
characters. These include the dual-use
hyphen-minus
The symbol , known in Unicode as hyphen-minus, is the form of hyphen most commonly used in digital documents. On most keyboards, it is the only character that resembles a minus sign or a dash, so it is also used for these. The name ''hyphen-mi ...
, the
soft hyphen, the
nonbreaking hyphen, and an unambiguous form known familiarly as the "Unicode hyphen", shown at the top of the infobox on this page. The character most often used to represent a hyphen (and the one produced by the key on a keyboard) is called the "hyphen-minus" by Unicode, deriving from the original
ASCII
ASCII ( ), an acronym for American Standard Code for Information Interchange, is a character encoding standard for representing a particular set of 95 (English language focused) printable character, printable and 33 control character, control c ...
standard, where it was called "hyphen(minus)".
Etymology
The word is derived from Ancient Greek (), contracted from (), "in one" (literally "under one"). An () was an
undertie-like sign written below two adjacent letters to indicate that they belong to the same word when it was necessary to avoid ambiguity, before
word spacing was practiced.
History

The first known documentation of the hyphen is in the grammatical works of
Dionysius Thrax
Dionysius Thrax ( ''Dionýsios ho Thrâix'', 170–90 BC) was a Greek grammarian and a pupil of Aristarchus of Samothrace. He was long considered to be the author of the earliest grammatical text on the Greek language, one that was used as a st ...
. At the time hyphenation was joining two words that would otherwise be read separately by a low
tie mark between the two words.
[Nicolas, Nick.]
Greek Unicode Issues: Punctuation
". 2005. Accessed 7 October 2014. In Greek these marks were known as ''
enotikon'', officially
romanized
In linguistics, romanization is the conversion of text from a different writing system to the Roman (Latin) script, or a system for doing so. Methods of romanization include transliteration, for representing written text, and transcription, ...
as a hyphen.
[ 'Ellīnikós Organismós Typopoíīsīs'', "Hellenic Organization for Standardization"]. [''ELOT 743, 2ī Ekdosī'', "ELOT 743, "]. ELOT (Athens), 2001. ]
With the introduction of letter spacing in the Middle Ages, the hyphen, still written beneath the text, reversed its meaning. Scribes used the mark to connect two words that had been incorrectly separated by a space. This era also saw the introduction of the marginal hyphen, for words broken across lines.
The modern format of the hyphen originated with
Johannes Gutenberg
Johannes Gensfleisch zur Laden zum Gutenberg ( – 3 February 1468) was a German inventor and Artisan, craftsman who invented the movable type, movable-type printing press. Though movable type was already in use in East Asia, Gutenberg's inven ...
of Mainz, Germany, with the publication of his 42-line
Bible
The Bible is a collection of religious texts that are central to Christianity and Judaism, and esteemed in other Abrahamic religions such as Islam. The Bible is an anthology (a compilation of texts of a variety of forms) originally writt ...
. His tools did not allow for a
sublinear hyphen, and he thus moved it to the middle of the line.
Examination of an original copy on
vellum
Vellum is prepared animal skin or membrane, typically used as writing material. It is often distinguished from parchment, either by being made from calfskin (rather than the skin of other animals), or simply by being of a higher quality. Vellu ...
(Hubay index #35) in the
U. S. Library of Congress shows that Gutenberg's movable type was set justified in a uniform style, 42 equal lines per page. The Gutenberg printing press required words made up of individual letters of type to be held in place by a surrounding nonprinting rigid frame. Gutenberg solved the problem of making each line the same length to fit the frame by inserting a hyphen as the last element at the right-side margin. This interrupted the letters in the last word, requiring the remaining letters be carried over to the start of the line below. His
double hyphen,
, appears throughout the Bible as a short, double line inclined to the right at a 60-degree angle.
Use in English
The
English language
English is a West Germanic language that developed in early medieval England and has since become a English as a lingua franca, global lingua franca. The namesake of the language is the Angles (tribe), Angles, one of the Germanic peoples th ...
does not have definitive hyphenation rules,
though various
style guide
A style guide is a set of standards for the writing, formatting, and design of documents. A book-length style guide is often called a style manual or a manual of style. A short style guide, typically ranging from several to several dozen page ...
s provide detailed usage recommendations and have a significant amount of overlap in what they advise. Hyphens are mostly used to break single words into parts or to join ordinarily separate words into single words. Spaces are not placed between a hyphen and either of the elements it connects except when using a suspended or "hanging" hyphen that stands in for a repeated word (e.g., ''nineteenth- and writers''). Style conventions that apply to hyphens (and dashes) have evolved to support ease of reading in complex constructions; editors often accept deviations if they aid rather than hinder easy comprehension.
The use of the hyphen in
English compound
A compound is a word composed of more than one free morpheme. The English language, like many others, uses compounds frequently. English compounds may be classified in several ways, such as the word classes or the semantic relationship of thei ...
nouns and verbs has, in general, been steadily declining. Compounds that might once have been hyphenated are increasingly left with spaces or are combined into one word. Reflecting this changing usage, in 2007, the sixth edition of the ''
Shorter Oxford English Dictionary
The ''Shorter Oxford English Dictionary'' (''SOED'') is an English language dictionary published by the Oxford University Press. The SOED is a two-volume abridgement of the twenty-volume ''Oxford English Dictionary'' (''OED'').
Print editions ...
'' removed the hyphens from 16,000 entries, such as ''fig-leaf'' (now ''fig leaf''), ''pot-belly'' (now ''pot belly''), and ''pigeon-hole'' (now ''pigeonhole''). The increasing prevalence of computer technology and the advent of the Internet have given rise to a subset of common nouns that might have been hyphenated in the past (e.g., ''
toolbar
The toolbar, also called a bar or standard toolbar (originally known as ribbon), is a graphical control element on which on-screen icons can be used. A toolbar often allows for quick access to functions that are commonly used in the program. Some ...
'', ''
hyperlink
In computing, a hyperlink, or simply a link, is a digital reference providing direct access to Data (computing), data by a user (computing), user's point and click, clicking or touchscreen, tapping. A hyperlink points to a whole document or to ...
'', and ''
pastebin'').
Despite decreased use, hyphenation remains the norm in certain compound-modifier constructions and, among some authors, with certain prefixes (see
below). Hyphenation is also routinely used as part of
syllabification
Syllabification () or syllabication (), also known as hyphenation, is the separation of a word into syllables, whether spoken, written or signed.
Overview
The written separation into syllables is usually marked by a hyphen when using English o ...
in
justified texts to avoid unsightly spacing (especially in
columns
A column or pillar in architecture and structural engineering is a structural element that transmits, through compression, the weight of the structure above to other structural elements below. In other words, a column is a compression member ...
with narrow
line length
In typography, line length is the width of a block of typeset text, usually measured in units of length like inches or points or in characters per line (in which case it is a measure). A block of text or paragraph has a maximum line length tha ...
s, as when used with
newspaper
A newspaper is a Periodical literature, periodical publication containing written News, information about current events and is often typed in black ink with a white or gray background. Newspapers can cover a wide variety of fields such as poli ...
s).
Separating
Justification and line-wrapping
When flowing text, it is sometimes preferable to break a word into two so that it continues on another line rather than moving the entire word to the next line. The word may be divided at the nearest break point between syllables (
syllabification
Syllabification () or syllabication (), also known as hyphenation, is the separation of a word into syllables, whether spoken, written or signed.
Overview
The written separation into syllables is usually marked by a hyphen when using English o ...
) and a hyphen inserted to indicate that the letters form a word fragment, rather than a full word. This allows more efficient use of paper, allows flush appearance of right-side margins (
justification) without oddly large word spaces, and decreases the problem of
rivers
A river is a natural stream of fresh water that flows on land or inside caves towards another body of water at a lower elevation, such as an ocean, lake, or another river. A river may run dry before reaching the end of its course if it ru ...
. This kind of hyphenation is most useful when the width of the column (called the "line length" in typography) is very narrow. For example:
Rules (or guidelines) for correct hyphenation vary between languages, and may be complex, and they can interact with other
orthographic and
typesetting
Typesetting is the composition of text for publication, display, or distribution by means of arranging physical ''type'' (or ''sort'') in mechanical systems or '' glyphs'' in digital systems representing '' characters'' (letters and other ...
practices.
Hyphenation algorithm
Syllabification () or syllabication (), also known as hyphenation, is the separation of a word into syllables, whether spoken, written or signed.
Overview
The written separation into syllables is usually marked by a hyphen when using English o ...
s, when employed in concert with dictionaries, are sufficient for all but the most formal texts.
It may be necessary to distinguish an incidental line-break hyphen from one integral to a word
being mentioned (as when used in a
dictionary
A dictionary is a listing of lexemes from the lexicon of one or more specific languages, often arranged Alphabetical order, alphabetically (or by Semitic root, consonantal root for Semitic languages or radical-and-stroke sorting, radical an ...
) or present in an original text being quoted (when in a
critical edition
Textual criticism is a branch of textual scholarship, philology, and literary criticism that is concerned with the identification of textual variants, or different versions, of either manuscripts (mss) or of printed books. Such texts may range i ...
), not only to control its
word wrap behavior (which
encoding
In communications and Data processing, information processing, code is a system of rules to convert information—such as a letter (alphabet), letter, word, sound, image, or gesture—into another form, sometimes data compression, shortened or ...
handles with
hard and soft hyphens having the same
glyph
A glyph ( ) is any kind of purposeful mark. In typography, a glyph is "the specific shape, design, or representation of a character". It is a particular graphical representation, in a particular typeface, of an element of written language. A ...
) but also to differentiate appearance (with a different glyph). ''
Webster's Third New International Dictionary
''Webster's Third New International Dictionary of the English Language, Unabridged'' (commonly known as ''Webster's Third'', or ''W3'') is an American English-language dictionary published in September 1961. It was edited by Philip Babcock Gove a ...
''
and the ''
Chambers Dictionary
''The Chambers Dictionary'' was first published by William and Robert Chambers as ''Chambers's English Dictionary'' in 1872. It was an expanded version of ''Chambers's Etymological Dictionary'' of 1867, compiled by James Donald. A second editio ...
''
use a
double hyphen for integral hyphens and a single hyphen for line-breaks, whereas Kromhout's Afrikaans–English dictionary uses the opposite convention.
The ''
Concise Oxford Dictionary (fifth edition)'' suggested repeating an integral hyphen at the start of the following line.
Prefixes and suffixes
Prefix
A prefix is an affix which is placed before the stem of a word. Particularly in the study of languages, a prefix is also called a preformative, because it alters the form of the word to which it is affixed.
Prefixes, like other affixes, can b ...
es (such as ''de-'', ''pre-'', ''re-'', and ''non-''
[A fairly comprehensive list, although not exhaustive, is given at Prefix > List of English derivational prefixes.]) and
suffix
In linguistics, a suffix is an affix which is placed after the stem of a word. Common examples are case endings, which indicate the grammatical case of nouns and adjectives, and verb endings, which form the conjugation of verbs. Suffixes can ca ...
es (such as ', ', ', and ') are sometimes hyphenated, especially when the unhyphenated spelling resembles another word or when the
affix
In linguistics, an affix is a morpheme that is attached to a word stem to form a new word or word form. The main two categories are Morphological derivation, derivational and inflectional affixes. Derivational affixes, such as ''un-'', ''-ation' ...
ation is deemed misinterpretable, ambiguous, or somehow "odd-looking" (for example, having two consecutive
monograph
A monograph is generally a long-form work on one (usually scholarly) subject, or one aspect of a subject, typically created by a single author or artist (or, sometimes, by two or more authors). Traditionally it is in written form and published a ...
s that look like the
digraphs of English, like e+a, e+e, or e+i). However, the unhyphenated style, which is also called ''closed up'' or ''solid'', is usually preferred, particularly when the
derivative
In mathematics, the derivative is a fundamental tool that quantifies the sensitivity to change of a function's output with respect to its input. The derivative of a function of a single variable at a chosen input value, when it exists, is t ...
has been relatively familiarized or popularized through extensive use in various contexts. As a
rule of thumb
In English language, English, the phrase ''rule of thumb'' refers to an approximate method for doing something, based on practical experience rather than theory. This usage of the phrase can be traced back to the 17th century and has been associat ...
, affixes are not hyphenated unless the lack of a hyphen would hurt clarity.
The hyphen may be used between vowel letters (e.g., ''ee'', ''ea'', ''ei'') to indicate that they do not form a
digraph. Some words have both hyphenated and unhyphenated variants: ''/deescalate'', ''/cooperation'', ''/reexamine'', ''/deemphasize'', and so on. Words often lose their hyphen as they become more common, such as ''
email
Electronic mail (usually shortened to email; alternatively hyphenated e-mail) is a method of transmitting and receiving Digital media, digital messages using electronics, electronic devices over a computer network. It was conceived in the ...
'' instead of '. When there are tripled letters, the hyphenated variant of these words is often more common (as in ' instead of ').
Closed-up style is avoided in some cases: possible
homograph
A homograph (from the , and , ) is a word that shares the same written form as another word but has a different meaning. However, some dictionaries insist that the words must also be pronounced differently, while the Oxford English Dictionar ...
s, such as ''
recreation
Recreation is an activity of leisure, leisure being discretionary time. The "need to do something for recreation" is an essential element of human biology and psychology. Recreational activities are often done for happiness, enjoyment, amusement, ...
'' (fun or sport) versus ' (the act of creating again), ''retreat'' (turn back) versus ' (give
therapy
A therapy or medical treatment is the attempted remediation of a health problem, usually following a medical diagnosis. Both words, ''treatment'' and ''therapy'', are often abbreviated tx, Tx, or Tx.
As a rule, each therapy has indications a ...
again), and ' (not in
ion form) versus ''unionized'' (organized into
trade union
A trade union (British English) or labor union (American English), often simply referred to as a union, is an organization of workers whose purpose is to maintain or improve the conditions of their employment, such as attaining better wages ...
s); combinations with
proper nouns or adjectives (', ');
[.] acronym
An acronym is a type of abbreviation consisting of a phrase whose only pronounced elements are the initial letters or initial sounds of words inside that phrase. Acronyms are often spelled with the initial Letter (alphabet), letter of each wor ...
s (', '); or numbers (', '). Although ' is still hyphenated by both ''Dorland's'' and ''Merriam-Webster's Medical'', the solid (that is, unhyphenated) styling (''protooncogene'') is a common variant, particularly among oncologists and geneticists.
A
diaeresis may also be used in a like fashion, either to separate and mark off monographs (as in ''coöperation'') or to signalize a
vocalic terminal e (for example, ''
Brontë''). This use of the diaeresis peaked in the late 19th and early 20th centuries, but it was never applied extensively across the language: only a handful of diaereses, including ''coöperation'' and ''Brontë'', are encountered with any appreciable frequency in English; thus ''reëxamine'', ''reïterate'', ''deëmphasize'', etc. are seldom encountered. In borrowings from Modern French, whose
orthography
An orthography is a set of convention (norm), conventions for writing a language, including norms of spelling, punctuation, Word#Word boundaries, word boundaries, capitalization, hyphenation, and Emphasis (typography), emphasis.
Most national ...
utilizes the diaeresis as a means to differentiate
grapheme
In linguistics, a grapheme is the smallest functional unit of a writing system.
The word ''grapheme'' is derived from Ancient Greek ('write'), and the suffix ''-eme'' by analogy with ''phoneme'' and other emic units. The study of graphemes ...
s, various English dictionaries list the dieresis as optional (as in ''naive'' and ''naïve'') despite the juxtaposition of a and i.
Syllabification and spelling
Hyphens are occasionally used to denote
syllabification
Syllabification () or syllabication (), also known as hyphenation, is the separation of a word into syllables, whether spoken, written or signed.
Overview
The written separation into syllables is usually marked by a hyphen when using English o ...
, as in ''syl-la-bi-fi-ca-tion''. Various British and North American dictionaries use an
interpunct
An interpunct , also known as an interpoint, middle dot, middot, centered dot or centred dot, is a punctuation mark consisting of a vertically centered dot used for interword separation in Classical Latin. ( Word-separating spaces did not appe ...
, sometimes called a "middle dot" or "hyphenation point", for this purpose, as in ''syl·la·bi·fi·ca·tion''. This allows the hyphen to be reserved only for places where a hard hyphen is intended (for example, ', ', '). Similarly, hyphens may be used to indicate how a word is being or should be spelled. For example, ''W-O-R-D spells "word"''.
In nineteenth-century American literature, hyphens were also used irregularly to divide syllables in words from indigenous North American languages, without regard for etymology or pronunciation, such as "Shuh-shuh-gah" (from
Ojibwe
The Ojibwe (; Ojibwe writing systems#Ojibwe syllabics, syll.: ᐅᒋᐺ; plural: ''Ojibweg'' ᐅᒋᐺᒃ) are an Anishinaabe people whose homeland (''Ojibwewaki'' ᐅᒋᐺᐘᑭ) covers much of the Great Lakes region and the Great Plains, n ...
''zhashagi'', "blue heron") in ''
The Song of Hiawatha''. This usage is now rare and proscribed, except in some place names such as
Ah-gwah-ching.
Joining
Compound modifiers
Compound modifier
A compound modifier (also called a compound adjective, phrasal adjective, or adjectival phrase) is a compound of two or more attributive words: that is, two or more words that collectively modify a noun. Compound modifiers are grammatically equiva ...
s are groups of two or more words that jointly modify the meaning of another word. When a compound modifier other than an
adverb An adverb is a word or an expression that generally modifies a verb, an adjective, another adverb, a determiner, a clause, a preposition, or a sentence. Adverbs typically express manner, place, time, frequency, degree, or level of certainty by ...
–
adjective
An adjective (abbreviations, abbreviated ) is a word that describes or defines a noun or noun phrase. Its semantic role is to change information given by the noun.
Traditionally, adjectives are considered one of the main part of speech, parts of ...
combination appears ''before'' a term, the compound modifier is often hyphenated to prevent misunderstanding, such as in '' player'' or '' paintings''. Without the hyphen, there is potential confusion about whether the writer means a "player of American football" or an "American player of football" and whether the writer means paintings that are "little celebrated" or "celebrated paintings" that are little. Compound modifiers can extend to three or more words, as in '' candy'', and can be adverbial as well as adjectival ('' frightening''). However, if the compound is a familiar one, it is usually unhyphenated. For example, some style guides prefer the construction ''high school students'', to '' students''.
Although the expression is technically ambiguous ("students of a high school"/"school students who are high"), it would normally be formulated differently if other than the first meaning were intended. Noun–noun compound modifiers may also be written without a hyphen when no confusion is likely: ''grade point average'' and ''department store manager''.
When a compound modifier ''follows'' the term to which it applies, a hyphen is typically not used if the compound is a temporary compound. For example, "that gentleman is well respected", not "that gentleman is well-respected"; or "a patient-centered approach was used" but "the approach was patient centered." But permanent compounds, found as headwords in dictionaries, are treated as invariable, so if they are hyphenated in the cited dictionary, the hyphenation will be used in both attributive and predicative positions. For example, "A cost-effective method was used" and "The method was cost-effective" (''cost-effective'' is a permanent compound that is hyphenated as a headword in various dictionaries). When one of the parts of the modifier is a
proper noun
A proper noun is a noun that identifies a single entity and is used to refer to that entity ('' Africa''; ''Jupiter''; '' Sarah''; ''Walmart'') as distinguished from a common noun, which is a noun that refers to a class of entities (''continent, ...
or a
proper adjective
In English orthography, the term proper adjective is used to mean adjectives that take initial capital letters, and common adjective to mean those that do not. For example, a person from India is Indian—''Indian'' is a proper adjective.
Etymolo ...
, there is no hyphen (e.g., "a South American actor").
When the first modifier in a compound is an adverb ending in ''-ly'' (e.g., "a poorly written novel"), various style guides advise no hyphen.
However, some do allow for this use. For example, ''
The Economist
''The Economist'' is a British newspaper published weekly in printed magazine format and daily on Electronic publishing, digital platforms. It publishes stories on topics that include economics, business, geopolitics, technology and culture. M ...
Style Guide'' advises: "Adverbs do not need to be linked to participles or adjectives by hyphens in simple constructions... Less common adverbs, including all those that end ''-ly'', are less likely to need hyphens."
In the 19th century, it was common to hyphenate adverb–adjective modifiers with the adverb ending in ''-ly'' (e.g., "a craftily-constructed chair"). However, this has become rare. For example, ''wholly owned subsidiary'' and ''quickly moving vehicle'' are unambiguous, because the adverbs clearly modify the adjectives: "quickly" cannot modify "vehicle".
However, if an adverb can also function as an adjective, then a hyphen may be or should be used for clarity, depending on the style guide.
For example, the phrase '' reasons'' ("reasons that are more important") is distinguished from ''more important reasons'' ("additional important reasons"), where ''more'' is an adjective. Similarly, '' scenery'' (with a
mass-noun) is distinct from ''more beautiful scenery''. (In contrast, the hyphen in "a reason" is not necessary, because the syntax cannot be misinterpreted.) A few short and common words—such as ''well'', ''ill'', ''little'', and ''much''—attract special attention in this category.
The hyphen in "well-
ast_participlednoun", such as in "
well-differentiated cells", might reasonably be judged superfluous (the syntax is unlikely to be misinterpreted), yet plenty of style guides call for it. Because ''early'' has both adverbial and adjectival senses, its hyphenation can attract attention; some editors, due to comparison with ''advanced-stage disease'' and ''adult-onset disease'', like the parallelism of ''early-stage disease'' and ''early-onset disease''. Similarly, the hyphen in ''little-celebrated paintings'' clarifies that one is not speaking of little paintings.
Hyphens are usually used to connect numbers and words in modifying phrases. Such is the case when used to describe dimensional measurements of weight, size, and time, under the rationale that, like other compound modifiers, they take hyphens in attributive position (before the modified noun),
although not in predicative position (after the modified noun). This is applied whether numerals or words are used for the numbers. Thus '' woman'' and '' woman'' or '' wingspan'' and '' wingspan'', but ''the woman is 28 years old'' and ''a wingspan of 32 feet''. However, with symbols for
SI units (such as ''
m'' or ''
kg'')—in contrast to the ''names'' of these units (such as ''
metre
The metre (or meter in US spelling; symbol: m) is the base unit of length in the International System of Units (SI). Since 2019, the metre has been defined as the length of the path travelled by light in vacuum during a time interval of of ...
'' or ''
kilogram
The kilogram (also spelled kilogramme) is the base unit of mass in the International System of Units (SI), equal to one thousand grams. It has the unit symbol kg. The word "kilogram" is formed from the combination of the metric prefix kilo- (m ...
'')—the numerical value is always separated from it with a space: ''a 25 kg sphere''. When the unit names are spelled out, this recommendation does not apply: ''a sphere'', ''a roll of film''.
In spelled-out
fractions
A fraction (from , "broken") represents a part of a whole or, more generally, any number of equal parts. When spoken in everyday English, a fraction describes how many parts of a certain size there are, for example, one-half, eight-fifths, thre ...
, hyphens are usually used when the fraction is used as an adjective but not when it is used as a noun: thus '' majority'' and '' portion'' but ''I drank two thirds of the bottle'' or ''I kept three quarters of it for myself''.
However, at least one major style guide
hyphenates spelled-out fractions invariably (whether adjective or noun).
In English, an
en dash, , sometimes replaces the hyphen in hyphenated compounds if either of its constituent parts is already hyphenated or contains a space (for example, ''San Francisco–area residents'', ''hormone receptor–positive cells'', ''cell cycle–related factors'', and ''public-school–private-school rivalries'').
A commonly used alternative style is the hyphenated string (''hormone-receptor-positive cells'', ''cell-cycle-related factors''). (For other aspects of en dash–versus–hyphen use, see ''
Dash § En dash''.)
Object–verbal-noun compounds
When an object is compounded with a verbal noun, such as ''
egg-beater'' (a tool that beats eggs), the result is sometimes hyphenated. Some authors do this consistently, others only for disambiguation; in this case, ''egg-beater, egg beater,'' and ''eggbeater'' are all common.
An example of an ambiguous phrase appears in ''they stood near a group of alien lovers'', which without a hyphen implies that they stood near a group of lovers who were aliens; ''they stood near a group of alien-lovers'' clarifies that they stood near a group of people who loved aliens, as "alien" can be either an adjective or a noun. On the other hand, in the phrase ''a hungry pizza-lover'', the hyphen will often be omitted (a hungry pizza lover), as "pizza" cannot be an adjective and the phrase is therefore unambiguous.
Similarly, ''a man-eating shark'' is nearly the opposite of ''a man eating shark''; the first refers to a shark that eats people, and the second to a man who eats
shark meat. ''A government-monitoring program'' is a program that monitors the government, whereas ''a government monitoring program'' is a government program that monitors something else.
Personal names
Some married couples compose a new
surname
In many societies, a surname, family name, or last name is the mostly hereditary portion of one's personal name that indicates one's family. It is typically combined with a given name to form the full name of a person, although several give ...
(sometimes referred to as a
double-barrelled name
A double-barrelled name is a type of compound surname, typically featuring two words (occasionally more), often joined by a hyphen. Notable people with double-barrelled names include Winnie Madikizela-Mandela, Julia Louis-Dreyfus, and Beyonc ...
) for their new family by combining their two surnames with a hyphen. Jane Doe and John Smith might become Jane and John Smith-Doe, or Doe-Smith, for instance. In some countries only the woman hyphenates her birth surname, appending her husband's surname.
With already-hyphenated names, some parts are typically dropped. For example, Aaron Johnson and Samantha Taylor-Wood became
Aaron Taylor-Johnson and
Sam Taylor-Johnson
Samantha Louise Taylor-Johnson ( Taylor-Wood; born 1967) is a British filmmaker. Her directorial feature film debut was 2009's ''Nowhere Boy'', a film based on the childhood experiences of the Beatles' singer and songwriter John Lennon. She is ...
. Not all hyphenated surnames are the result of marriage. For example
Julia Louis-Dreyfus
Julia Scarlett Elizabeth Louis-Dreyfus ( ; born January 13, 1961) is an American actress and comedian. She has gained acclaim for starring in a string of successful comedy series as well as several comedy films. She has received List of awards ...
is a descendant of Louis Lemlé Dreyfus whose son was Léopold Louis-Dreyfus.
Other compounds
Connecting hyphens are used in a large number of miscellaneous compounds, other than modifiers, such as in ''lily-of-the-valley'', ''cock-a-hoop'', ''clever-clever'', ''tittle-tattle'' and ''orang-utan''. Use is often dictated by convention rather than fixed rules, and hyphenation styles may vary between authors; for example, ''orang-utan'' is also written as ''orangutan'' or ''orang utan'', and ''lily-of-the-valley'' may be hyphenated or not.
Suspended hyphens
A suspended hyphen (also called a suspensive hyphen or hanging hyphen, or less commonly a dangling or floating hyphen) may be used when a single base word is used with separate, consecutive, hyphenated words that are connected by "and", "or", or "to". For example, ''short-term and long-term plans'' may be written as ''short- and long-term plans.'' This usage is now common and specifically recommended in some style guides.
[E.g. ] Suspended hyphens are also used, though less commonly, when the base word comes first, such as in "investor-owned and ". Uses such as "applied and sociolinguistics" (instead of "applied linguistics and sociolinguistics") are frowned upon; the Indiana University style guide uses this example and says "Do not 'take a shortcut' when the first expression is ordinarily open" (i.e., ordinarily two separate words).
This is different, however, from instances where prefixes that are normally closed up (styled solidly) are used suspensively. For example, ' becomes ' (not ') when suspended. Some editors prefer to avoid suspending such pairs, choosing instead to write out both words in full.
Other uses
A hyphen may be used to connect groups of numbers, such as in
dates (see
§ Usage in date notation),
telephone numbers or sports
scores.
It can also be used to indicate a range of values, although many styles prefer an en dash (see ).
It is sometimes used to hide letters in words (
filleting for redaction or censoring), as in "
G-d", although an en dash can be used as well ("G–d").
It is often used in
reduplication
In linguistics, reduplication is a Morphology (linguistics), morphological process in which the Root (linguistics), root or Stem (linguistics), stem of a word, part of that, or the whole word is repeated exactly or with a slight change.
The cla ...
.
Due to their similar appearances, hyphens are sometimes mistakenly used where an en dash or em dash would be more appropriate.
Varied meanings
Some stark examples of semantic changes caused by the placement of hyphens to mark attributive phrases:
* ''Disease-causing poor nutrition'' is poor nutrition that causes disease.
** ''Disease causing poor nutrition'' is a disease that causes poor nutrition.
* A ''hard-working man'' is a man who works hard.
** A ''hard working man'' is a working man who is tough.
* A ''man-eating shark'' is a shark that eats humans.
** A ''man eating shark'' is a man who is eating shark meat.
* ''Three-hundred-year-old trees'' are an indeterminate number of trees that are each 300 years old.
** ''Three hundred-year-old trees'' are three trees that are each 100 years old.
** ''Three hundred year-old trees'' are 300 trees that are each a year old.
Use in computing
Hyphen-minuses
In the
ASCII
ASCII ( ), an acronym for American Standard Code for Information Interchange, is a character encoding standard for representing a particular set of 95 (English language focused) printable character, printable and 33 control character, control c ...
character encoding, the hyphen (or minus) is character 45
10. As
Unicode
Unicode or ''The Unicode Standard'' or TUS is a character encoding standard maintained by the Unicode Consortium designed to support the use of text in all of the world's writing systems that can be digitized. Version 16.0 defines 154,998 Char ...
is identical to ASCII (the 1967 version) for all encodings up to 127
10, the number 45
10 (2D
16) is also assigned to this character in Unicode, where it is denoted as . Unicode has, in addition, other encodings for minus and hyphen characters: and , respectively. The unambiguous at U+2010 is generally inconvenient to enter on most keyboards and the glyphs for this hyphen and the hyphen-minus are identical in most fonts (
Lucida Sans Unicode
Lucida Sans Unicode is an OpenType typeface from the design studio of Bigelow & Holmes,All Bigelow & Holmes Lucida typefaces are distributed by the designers througThe Lucida Fonts Storeand a subset of Lucida fonts is distributed bAscender Corpo ...
is one of the few exceptions). Consequently, use of the hyphen-minus as the hyphen character is very common. Even the
Unicode Standard
Unicode or ''The Unicode Standard'' or TUS is a character encoding standard maintained by the Unicode Consortium designed to support the use of text in all of the world's writing systems that can be digitized. Version 16.0 defines 154,998 cha ...
regularly uses the hyphen-minus rather than the U+2010 hyphen.
The hyphen-minus has limited use in indicating subtraction; for example, compare (minus) and (hyphen-minus) — in most typefaces, the
glyph
A glyph ( ) is any kind of purposeful mark. In typography, a glyph is "the specific shape, design, or representation of a character". It is a particular graphical representation, in a particular typeface, of an element of written language. A ...
for hyphen-minus will not have the optimal width, thickness, or vertical position, whereas the minus character is typically designed so that it does. Nevertheless, in many spreadsheet and programming applications the hyphen-minus must be typed to indicate subtraction, as use of the Unicode minus sign will not be recognised.
The hyphen-minus is often used instead of dashes or minus signs in situations where the latter characters are unavailable (such as
type-written or ASCII-only text), where they take effort to enter (via
dialog box
In computing, a dialog box (also simply dialog) is a graphical control element in the form of a small window that communicates information to the user and prompts them for a response.
Dialog boxes are classified as " modal" or "modeless", dep ...
es or multi-key
keyboard shortcut
In computing, a keyboard shortcut (also hotkey/hot key or key binding) is a software-based assignment of an action to one or more keys on a computer keyboard. Most Operating system, operating systems and Application software, applications come ...
s), or when the writer is unaware of the distinction. Consequently, some writers use two or three hyphen-minuses ( or ) to represent an em dash. In the TeX typesetting languages, a single hyphen-minus (
-
) renders a hyphen, a single hyphen-minus in math mode (
$-$
) renders a minus sign, two hyphen-minuses (
--
) renders an en dash, and three hyphen-minuses (
---
) renders an em dash.
The hyphen-minus character is also often used when specifying
command-line options. The character is usually followed by one or more letters that indicate specific actions. Typically it is called a dash or switch in this context. Various implementations of the
getopt
Getopt is a C library function used to parse command-line options of the Unix/POSIX style. It is a part of the POSIX specification, and is universal to Unix-like systems.
It is also the name of a Unix program for parsing command line argument ...
function to parse command-line options additionally allow the use of two hyphen-minus characters, , to specify long option names that are more descriptive than their single-letter equivalents. Another use of hyphens is that employed by programs written with
pipelining in mind: a single hyphen may be recognized
in lieu of a filename, with the hyphen then serving as an indicator that a
standard stream, instead of a file, is to be worked with.
Soft and hard hyphens
Although software (
hyphenation algorithm
Syllabification () or syllabication (), also known as hyphenation, is the separation of a word into syllables, whether spoken, written or signed.
Overview
The written separation into syllables is usually marked by a hyphen when using English o ...
s) can often automatically make decisions on when to hyphenate a word at a line break, it is also sometimes useful for the user to be able to insert cues for those decisions (which are dynamic in the online medium, given that text can be
reflowed). For this purpose, the concept of a
soft hyphen (discretionary hyphen, optional hyphen) was introduced, allowing such manual specification of a place where a hyphenated break is ''allowed'' but not ''forced''. That is, it does not force a line break in an inconvenient place when the text is later reflowed.
Soft hyphens are inserted into the text at the positions where hyphenation ''may'' occur. It can be a tedious task to insert the soft hyphens by hand, and tools using hyphenation algorithms are available that do this automatically. Current modules of the
Cascading Style Sheets (CSS) standard provide language-specific hyphenation dictionaries.
In contrast, a hyphen that is always displayed and printed is called a "hard hyphen". This can be a Unicode hyphen, a hyphen-minus, or a nonbreaking hyphen (see
below). Confusingly, the term is sometimes limited to nonbreaking hyphens.
Nonbreaking hyphens
The
word segmentation
A word is a basic element of language that carries meaning, can be used on its own, and is uninterruptible. Despite the fact that language speakers often have an intuitive grasp of what a word is, there is no consensus among linguists on its ...
rules of most text systems consider a hyphen to be a
word boundary and a valid point at which to break a line when flowing text. This is not always desirable, it could lead to ambiguity (e.g. ''retreat'' and ''re‑treat'' would be indistinguishable with a line break after ''re''), it can split off an ending as in "''n''th" (though ''n''
th or "''n''th" could be used), and it is inappropriate in some languages other than English (e.g., a line break at the hyphen in
Irish or
Romanian would be undesirable). The non-breaking hyphen, nonbreaking hyphen, or no-break hyphen looks identical to the regular hyphen, but word processors do not break words at it. The
nonbreaking space exists for similar reasons.
"Unicode hyphen"
Because the conventional ''hyphen-minus'' mark on keyboards is ambiguous (it can be interpreted sometimes unexpectedly as a hyphen or a minus, depending on context), in addition the
Unicode consortium
The Unicode Consortium (legally Unicode, Inc.) is a 501(c)(3) non-profit organization incorporated and based in Mountain View, California, U.S. Its primary purpose is to maintain and publish the Unicode Standard which was developed with the in ...
allocated
codepoint
A code point, codepoint or code position is a particular position in a table, where the position has been assigned a meaning. The table may be one dimensional (a column), two dimensional (like cells in a spreadsheet), three dimensional (sheets in ...
s for an unambiguous minus and an unambiguous hyphen. The Unicode hyphen () is seldom used. Even the
Unicode Standard
Unicode or ''The Unicode Standard'' or TUS is a character encoding standard maintained by the Unicode Consortium designed to support the use of text in all of the world's writing systems that can be digitized. Version 16.0 defines 154,998 cha ...
uses U+002D instead of U+2010 in its text.
Use in date notation
Use of hyphens to delineate the parts of a written date (rather than the
slashes used conventionally in
Anglophone
The English-speaking world comprises the 88 countries and territories in which English is an official, administrative, or cultural language. In the early 2000s, between one and two billion people spoke English, making it the largest language ...
countries) is specified in the
international standard
An international standard is a technical standard developed by one or more international standards organizations. International standards are available for consideration and use worldwide. The most prominent such organization is the International O ...
ISO 8601
ISO 8601 is an international standard covering the worldwide exchange and communication of date and time-related data. It is maintained by the International Organization for Standardization (ISO) and was first published in 1988, with updates in ...
. Thus, for example, 1789-07-14 is the standard way of writing the date of
Bastille Day
Bastille Day is the common name given in English-speaking countries to the national day of France, which is celebrated on 14 July each year. It is referred to, both legally and commonly, as () in French, though ''la fête nationale'' is also u ...
. This standard has been transposed as European Standard EN 28601 and has been incorporated into various national typographic style guides (e.g.,
DIN 5008 in Germany). Now all official European Union (and many member state) documents use this style. This is also the typical date format used in large parts of Europe and Asia, although sometimes with other separators than the hyphen.
This method has gained influence within North America, as most common computer
file systems make the use of slashes in
file names difficult or impossible.
DOS,
OS/2
OS/2 is a Proprietary software, proprietary computer operating system for x86 and PowerPC based personal computers. It was created and initially developed jointly by IBM and Microsoft, under the leadership of IBM software designer Ed Iacobucci, ...
and
Windows
Windows is a Product lining, product line of Proprietary software, proprietary graphical user interface, graphical operating systems developed and marketed by Microsoft. It is grouped into families and subfamilies that cater to particular sec ...
use to introduce and separate switches to
shell
Shell may refer to:
Architecture and design
* Shell (structure), a thin structure
** Concrete shell, a thin shell of concrete, usually with no interior columns or exterior buttresses
Science Biology
* Seashell, a hard outer layer of a marine ani ...
commands, and on both Windows and
Unix-like
A Unix-like (sometimes referred to as UN*X, *nix or *NIX) operating system is one that behaves in a manner similar to a Unix system, although not necessarily conforming to or being certified to any version of the Single UNIX Specification. A Uni ...
systems slashes in a filename introduce subdirectories which may not be desirable. Besides encouraging use of dashes, the Y-M-D order and zero-padding of numbers less than 10 are also copied from ISO 8601 to make the filenames sort by date order.
Unicode
Unicode has multiple hyphen characters:
* , a character of multiple uses
*
*
*
* for medieval texts
And in non-Latin scripts:
*
*
*
* (used only as a line-breaking hyphen)
* (used in ancient Near-Eastern linguistics and in
blackletter
Blackletter (sometimes black letter or black-letter), also known as Gothic script, Gothic minuscule or Gothic type, was a script used throughout Western Europe from approximately 1150 until the 17th century. It continued to be commonly used for ...
typefaces)
* (has the Unicode property of "Hyphen" despite its name)
* (compatibility character for a small hyphen-minus, used in East Asian typography)
* (compatibility character for a wide hyphen-minus, used in East Asian typography)
* (compatibility character for a wide katakana middle dot, has the Unicode property of "Hyphen" despite its name)
Unicode distinguishes the hyphen from the general
interpunct
An interpunct , also known as an interpoint, middle dot, middot, centered dot or centred dot, is a punctuation mark consisting of a vertically centered dot used for interword separation in Classical Latin. ( Word-separating spaces did not appe ...
. The characters below do not have the Unicode property of "Hyphen" despite their names:
*
*
*
*
*
*
*
*
(See
interpunct
An interpunct , also known as an interpoint, middle dot, middot, centered dot or centred dot, is a punctuation mark consisting of a vertically centered dot used for interword separation in Classical Latin. ( Word-separating spaces did not appe ...
and
bullet (typography)
In typography, a bullet or bullet point, , is a typographical symbol or glyph used to introduce items in a list. For example:
* Monica
* Erica
* Rita
* Tina
* Sandra
* Mary
* Jessica
The bullet symbol may take any of a variety of shapes, s ...
for more round characters.)
See also
*
*
*
*
* ("Papyrological hyphen")
* , whose name refers to the hyphen
Notes
References
External links
*
Wiktionary list of English phrases spelled with a hyphen
Economist Style Guide—Hyphens* Jukka Korpela
(See also his article o
.
* Markus Kuhn,
ttp://www.cl.cam.ac.uk/~mgk25/ucs/L2/03155r-kuhn-soft-hyphen.pdf Unicode interpretation of SOFT HYPHEN breaks ISO 8859-1 compatibility Unicode Technical Committee document L2/03-155R, June 2003.
United States Government Printing Office Style Manual 20006. Compounding Rules]
{{navbox punctuation
Punctuation
Punctuation of English