Hungarian orthography
   HOME

TheInfoList



OR:

Hungarian orthography ( hu, helyesírás, lit=correct writing, link=no) consists of rules defining the standard written form of the
Hungarian language Hungarian () is an Uralic language spoken in Hungary and parts of several neighbouring countries. It is the official language of Hungary and one of the 24 official languages of the European Union. Outside Hungary, it is also spoken by Hungar ...
. It includes the spelling of lexical words,
proper noun A proper noun is a noun that identifies a single entity and is used to refer to that entity (''Africa'', ''Jupiter'', '' Sarah'', ''Microsoft)'' as distinguished from a common noun, which is a noun that refers to a class of entities (''continent, ...
s and foreign words (
loanword A loanword (also loan word or loan-word) is a word at least partly assimilated from one language (the donor language) into another language. This is in contrast to cognates, which are words in two or more languages that are similar because ...
s) in themselves, with
suffix In linguistics, a suffix is an affix which is placed after the stem of a word. Common examples are case endings, which indicate the grammatical case of nouns, adjectives, and verb endings, which form the conjugation of verbs. Suffixes can carr ...
es, and in compounds, as well as the
hyphen The hyphen is a punctuation mark used to join words and to separate syllables of a single word. The use of hyphens is called hyphenation. ''Son-in-law'' is an example of a hyphenated word. The hyphen is sometimes confused with dashes ( figure ...
ation of words,
punctuation Punctuation (or sometimes interpunction) is the use of spacing, conventional signs (called punctuation marks), and certain typographical devices as aids to the understanding and correct reading of written text, whether read silently or aloud. A ...
,
abbreviation An abbreviation (from Latin ''brevis'', meaning ''short'') is a shortened form of a word or phrase, by any method. It may consist of a group of letters or words taken from the full version of the word or phrase; for example, the word ''abbrevia ...
s,
collation Collation is the assembly of written information into a standard order. Many systems of collation are based on numerical order or alphabetical order, or extensions and combinations thereof. Collation is a fundamental element of most office filin ...
(alphabetical ordering), and other information (such as how to write dates).


Alphabet

Hungarian is written with the
Hungarian alphabet The Hungarian alphabet () is an extension of the Latin alphabet used for writing the Hungarian language. The alphabet is based on the Latin alphabet, with several added variations of letters. The alphabet consists of the 26 letters of the ISO b ...
, an extended version of the
Latin alphabet The Latin alphabet or Roman alphabet is the collection of letters originally used by the ancient Romans to write the Latin language. Largely unaltered with the exception of extensions (such as diacritics), it used to write English and the ...
. Its letters usually indicate sounds, except when morphemes are to be marked (see below). The extensions include consonants written with digraphs or a trigraph and vowel letters marked with
diacritics A diacritic (also diacritical mark, diacritical point, diacritical sign, or accent) is a glyph added to a letter or to a basic glyph. The term derives from the Ancient Greek (, "distinguishing"), from (, "to distinguish"). The word ''diacriti ...
. Long consonants are marked by a double letter (e.g. > and > ) while long vowels get an
acute accent The acute accent (), , is a diacritic used in many modern written languages with alphabets based on the Latin, Cyrillic, and Greek scripts. For the most commonly encountered uses of the accent in the Latin and Greek alphabets, precomposed ...
(e.g. > ) or their umlaut is replaced with a double acute accent ( > ). Only the first letter of digraphs and of the trigraph is written in upper case when capitalizing in normal text, but all letters are capitalized in acronyms and all-uppercase inscriptions. The letters ''q, x, y, w'' are only part of the extended Hungarian alphabet and they are rarely used in Hungarian words – they are normally replaced with their usual phonetic equivalents ''kv, ksz, i, v'' (only the ''x'' is relatively common, e.g. ). ''Ch'' is not a part of the alphabet but it still exists in some words (like , 'technology' or 'technique'). In traditional surnames, other digraphs may occur as well, both for vowels and consonants.


Four principles of spelling

Hungarian orthography is based on four main principles: following pronunciation, word analysis (reflecting the constituting
morpheme A morpheme is the smallest meaningful Constituent (linguistics), constituent of a linguistic expression. The field of linguistics, linguistic study dedicated to morphemes is called morphology (linguistics), morphology. In English, morphemes are ...
s),AkH.12 49. keeping traditions and simplification. The first principle is that the Hungarian writing system is
phonemic In phonology and linguistics, a phoneme () is a unit of sound that can distinguish one word from another in a particular language. For example, in most dialects of English, with the notable exception of the West Midlands and the north-west ...
by default, i.e. letters correspond to
phoneme In phonology and linguistics, a phoneme () is a unit of sound that can distinguish one word from another in a particular language. For example, in most dialects of English, with the notable exception of the West Midlands and the north-wes ...
s (roughly, sounds) and vice versa. In some cases, however, vowel length or consonant length is not the same in writing as in pronunciation (e.g. uɲog'mosquito', yːzd'fight', ːlɒt'animal', ɟː'one'). Suffixed or compound words usually obey the second main principle, word analysis. This means that the original constituents (
morphemes A morpheme is the smallest meaningful constituent of a linguistic expression. The field of linguistic study dedicated to morphemes is called morphology. In English, morphemes are often but not necessarily words. Morphemes that stand alone a ...
) of a word should be written the same way, regardless of pronunciation assimilations. This, however, applies only when the resulting pronunciation conforms to some regular pattern; irregular assimilations are reflected in writing too. For example, + ("you should leave
ome Ome may refer to: Places * Ome (Bora Bora), a public island in the lagoon of Bora Bora * Ome, Lombardy, Italy, a town and ''comune'' in the Province of Brescia * Ōme, Tokyo, a city in the Prefecture of Tokyo * Ome (crater), a crater on Mars Tran ...
) is pronounced like "haggy", but written as according to the principle of word analysis. This is because the composition of and gives a long in
Hungarian phonology The phonology of the Hungarian language is notable for its process of vowel harmony, the frequent occurrence of geminate consonants and the presence of otherwise uncommon palatal stops. Consonants This is the standard Hungarian consonanta ...
anyway, so spelling out the original morphemes is considered clearer. By contrast + ('you should believe') is pronounced "higgy" and also written as , since this pronunciation cannot be regularly deduced from the morphemes and basic phonological rules. Compound words are generally written so that all constituents retain their spelling, but some compounds have become vague enough not to be considered true compounds any more, especially if one of the elements is obsolete. An example is 'glove', which originally comes from 'hand' and an obsolete and in this case the spelling no longer reflects the derivation. The third principle, tradition, affects, for example, surnames, where many spellings predate the modern spelling rules of Hungarian. For example, "smith" may be spelt , or as a surname. Another example of tradition is that the digraph is still used despite the fact that it stands for the same sound as in today's standard Hungarian. The fourth principle, simplification, affects only a handful of cases. If a common noun ending in a double consonant has a suffix beginning with the same consonant, the third instance is dropped, e.g. + > . This rule doesn't apply to given names or surnames, e.g. + > "from Bernadette", "Mrs. Wittmann": all three consecutive identical consonants are preserved, but a hyphen is inserted, just as in compounds and in suffixed proper names ( "chess group", "with Bonn"). The simplification principle is also applied to double digraphs at the border of suffixes, thus + becomes (e.g. + > "with Kovács"). However, there is no simplification in compounds: e.g. "bunch of keys". Also, in cases of suffix-like derivational elements such as and "-like", simplification is not applied to words ending in single digraphs, such as + > "wax-like", similarly to doubled forms: + > "jazz-like". :The rules concerning cases like (Hungarian given names) and cases like changed in 2015, with the 12th edition of the regulations being released. Prior to that, spellings like and were preferred.


Word breaks: writing as one word or as separate words

Compound words are typically spelt as one word (without spaces) and phrases are normally spelt as more than one word (with one or more spaces), but this is not always the case. Hyphenated spelling is considered an alternative to writing as one word and is used, e.g., if a compound contains a proper name.


Coordination

As far as repeated words are concerned, they are normally written separately (with a comma), but a hyphen is used if their connection is more than occasional (e.g. 'who' but 'everyone'). If a word is repeated with a different suffix or postposition, the words are written separately ( 'day by day', lit. 'from day to day'), except if an element only exists in this phrase, in which case the words are written with a hyphen ( 'old' 'ancient old'). Coordinated words are normally written separately (with a comma). If the meaning of the result is different from that of the two words together, but both elements take suffixes, they are written with a hyphen (e.g. 'cook', consisting of words referring to cooking in the oven and cooking in water, 'they cook'). A hyphen is needed in cases when a phrase is only used with certain suffixes. Connections of words which are completely fused and thus take suffixes only at the end of the second element are written as one word (e.g. 'sorrow and grief', 'stricken with sorrow and grief'). However, there are phrases that only take suffixes at the end but their elements are still connected with a hyphen, as when words are contrasted (e.g. 'bittersweet'). Certain phrases can be suffixed either at the end of both elements or only at the end of the second element (e.g. 'fame': or 'his/her/its fame').


Subordination

As shown by printed material and street inscriptions, this field is probably the most problematic for the majority of native speakers even at a reasonably educated level. The main principle is that these compounds have to be written without spaces if any of these three criteria are met: # there is a change of meaning, which cannot be deduced from the elements alone, # an inflectional suffix is omitted, # tradition (the examples in this group are limited, though). This applies to phrases and compounds of many types, like those where the first element is the subject of the second (which is a participle), or it is the adjective of the second (e.g. means 'fast train', while means 'express train' as a type of train: the change in meaning makes it necessary to write the latter as one word). :Problematic point(s): it is virtually unpredictable whether the change in meaning (as compared to the mere sum of its elements) is attributed to one element or the whole compound. For example, 'live programme' is written as two words, even though the word is used differently from its basic meaning – probably because it was decided that this component can carry the change of meaning, so writing the compound as one word is not necessary. On the other hand, 'fast food restaurant' (lit. 'fast restaurant') is written as one word – probably because the change of meaning was attributed to the whole compound, leaving the lexical meaning of the word intact. Some phrases without any change of meaning are written as one word, e.g. 'travel companion', while most other phrases are written regularly: 'travel destination.' :Sometimes the original meaning of the adjective is retained, but the whole compound still means something more specific than the sum of the elements. For example, , lit. 'sour cabbage', actually 'sauerkraut', is more than a cabbage that tastes sour: it means a type of pickled food, yet it is written with a space. On the other hand, 'mobile phone' is written in one word, although it is actually a telephone that is mobile – writing in one word may be justified by the different technology, as distinguished from a
cordless telephone A cordless telephone or portable telephone has a portable telephone handset that connects by radio to a base station connected to the public telephone network. The operational range is limited, usually to the same building or within some short ...
, which is also portable.
As far as the suffix omission is concerned, often there is a grammatical relationship between two nouns of a compound which could also be expressed in a marked, more explicit way: for example 'window pane' could be expressed as 'the pane of the window,' and based on this derivation, it needs to be written as one word. The word 'confusion, turmoil' also needs to be written as one word, despite the marked possessive, so as to avoid the literal meaning 'house of fools' (1st case). Other compounds, where the first element gives the object, the adverb, or the possessor, are also written in one word where the suffix is omitted, or if the actual meaning is different from the sum of its elements. Thus, 'wind-protected' can be deduced from 'protected from hewind', and it is written together as the suffix is omitted. Verbal phrases where the suffix is marked are usually written in two words, even if the meaning has become figurative, (e.g. 'take part'), while other phrases with a marked suffix are written in one word (e.g. 'notice, spot', literally "take on the mind"). :Problematic point(s): there are more than a hundred verbal phrases that are used exactly like verbs with a verbal prefix (cf. "eat up" in English), like above, but they must be written as two words. Verbal
prefix A prefix is an affix which is placed before the stem of a word. Adding it to the beginning of one word changes it into another word. For example, when the prefix ''un-'' is added to the word ''happy'', it creates the word ''unhappy''. Particul ...
es (cf. in German) are only written together with the verb they belong to if they immediately precede that verb. If the same verbal prefix is repeated to express repeated action, the first is divided by a hyphen, the second is written in one word ( 'keep stopping once in a while'). If two verbal prefixes with an opposite meaning follow each other, both are written separately ( 'walk up and down'). Verbal prefixes may be written separately if the meaning of the prefix is stressed and the prefix is meant in a literal sense, but they must be written as one word if the meaning is changed (e.g. 'stay upstairs' but 'survive, remain'). Some verbal prefixes coincide with adverbs that can have personal endings. In this case, they can only be written as one word if they are in the third person plural and the prefix/adverb is not stressed on its own (especially if the meaning is changed). Otherwise (if another person is used and/or the prefix/adverb is stressed) they should be written in two words. :Problematic point(s): phonologically speaking, verbal prefixes are always attached to the following verb, even if it is an auxiliary verb wedged in between, which loses its own stress. For example: means 'catch' and means 'see, have a look.' Thus, 'I'll catch the butterfly' but 'I'm going to see it.' In the first example, belongs to ; in the second, belongs to . The pronunciation is megfogomin both cases. These cases can be distinguished, though, by considering the word elements. On the other hand, verbal prefixes with personal suffixes can never be written together with the main verb, even though they are stressed the same way as unsuffixed prefixes, e.g. but 's/he looks at me, you, him/her.' A separate group of compounds with subordinated elements is the one named literally "meaning-condensing" or "meaning-compressing" compounds, which have a more complex internal structure, containing implicit elements outside the constituting words, or sometimes where the present meaning cannot be derived at all from the elements. They are always written in one word, e.g. 'spiral staircase,' lit. "snail-staircase", i.e. a staircase similar to the shell of snails. If a phrase (e.g. an adjective and a noun or a noun and a postposition) written in two words receives a derivational suffix, it will also be written in two words – except if the meaning is changed. However, if they receive a second derivational suffix, the phrase will be written in one word. (For example: 'one after the other', 'successive', but 'successiveness,' i.e. 'succession.' In addition: 'under the ground', 'being under the ground' but 'underground ' or 'subway, tube.') :Problematic point(s): there are more than fifty phrases written in one word only after one single derivational suffix (e.g. 'disembark' but 'disembarkation').
Apposition Apposition is a grammatical construction in which two elements, normally noun phrases, are placed side by side so one element identifies the other in a different way. The two elements are said to be ''in apposition'', and one of the elements is ...
al compounds are normally written in two words, e.g. 'a footballer wife' (a wife who plays football) is expressed as . However, if there is a possessive relationship between the words, i.e. if the wife of a footballer is meant, it is considered a (regular) compound, thus it should be written as one word: . There are several appositional compounds though, which are written as one word, especially where the first element specifies the type of the second (e.g. 'student girl'). :Problematic point(s): people find long words difficult to read, so many still prefer to write them separately, relying on the context to clarify the meaning. In addition, the justification for the above subtype that provides specification is considered vague. Phrases whose first element is a participle are written separately if the participle expresses an occasional action: 'a working woman, a woman at work.' However, if the participle expresses function, purpose, ability, task, or duty, the phrase is considered a compound and is written as one word, e.g. 'washerwoman', someone whose duty is to wash. 'walking edestrianstreet' means a street for walking: writing as one word expresses that it is not the street that walks. :Problematic point(s): there are several received expressions referring to function that are written separately despite the above rules (e.g. 'indicative mood' lit. 'declaring mood', even though the mood is used for declarations, it doesn't declare anything), so it is sometimes not obvious how a newly coined construction should be written. In addition, present participles sometimes become nouns, and their compounds cannot be written in two words, as they cannot be considered adjectives anymore. For example, 'footballer' was created from a participle (lit. 'ball-kicking' erson but it is a noun today, and since 'footballer's strip' implies a possessive relationship, it must be written in one word.


Alternation depending on the elements being compound

The above rule doesn't apply to compounds where an element is already a compound itself, even if the whole compound expresses function or purpose. For example, 'rocket launching platform' is written as two words because of its compound first element, despite the fact that it is not the platform which launches the rocket, but it is only used for it, so a function is expressed. Words containing a suffixed numeral are written as one word (e.g. 'a twenty metres long way,' cf. 'twenty metres'), except if an element is already a compound (e.g. 'a twenty-one metres long way' or 'a twenty kilometres long way'). This rule doesn't apply to compounds with numbers written in digits, e.g. , as they are written with spaces. A similar principle is applied to compounds whose first element expresses the material of the second (e.g. 'wooden table' but 'pine-wood table' and 'wooden kitchen table'). :Problematic point(s):, linguistically speaking, these do not constitute actual compounds (because the meaning is not institutionalized), so there is no sound reason for writing them in one word. In addition, the normal alternative for writing in one word is writing with hyphens, rather than writing separately, so this opposition is unusual in Hungarian orthography.


Hyphenating long compounds


The syllable-counting rule

To avoid too long words, a "syllable-counting rule" is applied. Compounds with more than 6 syllables (excluding all its inflectional suffixes as well as the derivational suffix ) and more than 2 elements take a hyphen at the border of the two main elements. For example, 'with your Lfootball teams' has 10 syllables, but its stem, is only 6 syllables long, so all its forms are written as one word. On the other hand, 'football championship' has 7 syllables even in its base form, so all its forms should take a hyphen. Compounds of whatever length are permitted, supposing they consist of only two elements, e.g. 'nitrogen assimilation' is written as one word despite its 9 syllables. Derivational suffixes (with the exception of the adjective-forming ) are considered part of the base form in terms of counting syllables, so e.g. 'fashion magazine editor' is written as one word, but 'fashion magazine editorial office' will take a hyphen in accordance with the above rules. Due to the exception given above, 'catering industry related' is written as one word, as is not reckoned with when determining its stem 'catering industry'. :The latter point is thanks to a change with the release of the 12th edition in 2015. The preceding edition made no exception with , resulting in homographs differing only in a hyphen, like "his/her/its motorists" and "motorist-related" (the derivational suffix coinciding with the inflectional suffix of the plural possessive). The previous rule also changed the appearance of closely related word pairs, enforcing a hyphen when it was not otherwise justified by readability. :Problematic point(s): not only keeping the numbers and their meanings in mind and the differentiation between inflectional and derivational suffixes, but also that compounds are sometimes far from transparent to today's speakers (e.g. 'system' from 'order' and an obsolete ). In addition, it is not commonly known what is considered an element: it includes e.g. foreign prefixes that are used on their own with Hungarian second elements (there is a list of them) as well as verbal prefixes consisting of no less than two syllables.


Three "mobility rules"

Sometimes word boundaries are flexibly rearranged to reflect the meaning of the whole compound: the three rules dealing with it are referred to as "mobility rules". #If a compound with a hyphen takes another element, its original hyphen is removed, and only the new element takes a hyphen: 'peace treaty draft' but 'peace treaty draft development'. #If a phrase of two words takes another element that belongs to both, the two original elements will be exceptionally written together, and the new element will be attached to them with a hyphen: 'cold water' but 'cold water tap'. #If two compounds with an identical element are contracted, the identical element is written separately and the two other elements are connected with a hyphen: e.g. 'copper alloy' and 'gold alloy' but 'copper-gold alloy'. :Problematic point(s): the resulting very long words are difficult to comprehend, so instead of rephrasing them, people tend to write them separately. In addition, it is debated whether these forms occasionally written in one word should be allowed, because this form only shows the highest relationship at the expense of an easy-to-read overview of the other parts. (One of the OH. authors, Attila Mártonfi noted: the inscription 'change in traffic regulations' is easier to read if written in three words rather than the regular form created with the mobility rules, .) Sometimes this rule is ignored even in the title of linguistic books, such as ('Hungarian historical linguistics,' lit. "Hungarian language history"), which should be written , to reflect that it is not historical linguistics in Hungarian language, but the historical linguistics of the Hungarian language. It may also cause problems when the involved elements are proper names, such as 'Sun–Earth distance,' because the dashes and hyphens follow a different rule in this case (see the part on punctuation).


Capitalization


Proper names

The following type of proper names are distinguished: personal names, animals' names, geographical names, astronomical names, names of institutions, brand names, names of awards and prizes, and titles (of works). Proper names may become common names, and in this case they are written in lowercase (e.g. 'x-ray') and even their derived compounds may become lowercase, losing the hyphen (e.g. rather than 'Adam's apple').


Personal names and animals' names

Surnames and given names are capitalized. Surnames may have an old-fashioned spelling, which is usually retained – except if their form already has variations, and some of them may interfere with reading. They may consist of two or more elements, and they may be given as one word or in several words, but today hyphenation is the most common method. Given names are written phonetically (even modern names like , cf. English ''Jennifer''), except that ''x'' and ''ch'' are retained (even though they are pronounced ''ksz'' and ''h''), e.g. , . Names of gods and religious figures are capitalized, except when they are referred to as common names (like Greek gods) or if they are mentioned as part of common phrases (e.g. 'thank God'). Occasional
epithet An epithet (, ), also byname, is a descriptive term (word or phrase) known for accompanying or occurring in place of a name and having entered common usage. It has various shades of meaning when applied to seemingly real or fictitious people, di ...
s are not capitalized: only their fixed equivalents are. Common nouns expressing rank or relation are written separately ( 'King Stephen', 'Mr Németh, engineer'). Groups of people named after people (or even a fancy name) are written separately, except for groups founded or led by that person (in which case it is a compound, written with a hyphen). :Problematic point(s): sometimes it is not commonly known which is the case, for example, 'Kodály string quartet' is written with a space as it was only named after Kodály, while is written with a hyphen as it was founded by
Vilmos Tátrai Vilmos Tátrai (7 October 1912 – 2 February 1999) was a Hungarian classical violinist and the founder of the Tátrai Quartet. Life Tátrai was born in Kispest, now 19th district of Budapest. A professor at the Franz Liszt Academy of Music, he ...
. Another problematic point is that this rule applies to families (e.g. ''Kovács család'' 'Kovács family') but it does not apply to dynasties ( 'Bourbon family')
Suffixes are added to personal names without hyphens. If a suffix is attached, it follows the pronunciation of the word, including obsolete consonant clusters (e.g. '' Móricz,'' pronounced , suffixed: ). However, if a surname or a foreign name ends in a double consonant, suffixes are added with a hyphen, so that the original form can be restored (e.g. is suffixed as , because would refer to another name, ''Pap'').AkH.12 160. c) Given names are suffixed the same way, even if they are from a fairly limited set, so their original forms could be retraced (e.g. + > ). If an adjective is formed from a proper name, it is not capitalized. (In case of a hyphenated compound, no element is capitalized, e.g. '' Rippl-Rónai'' but 'typical of Rippl-Rónai'.) Suffixes are added directly, except if the name consists of several elements written separately: '' Széchenyi István'' and ''Széchenyi István-i.'' Compounds formed with personal names are always hyphenated, e.g. 'a poem by Ady'. :Problematic point(s): e.g.
Kossuth Prize The Kossuth Prize ( hu, Kossuth-díj) is a state-sponsored award in Hungary, named after the Hungarian politician and revolutionist Lajos Kossuth. The Prize was established in 1948 (on occasion of the centenary of the March 15th revolution, the ...
is also hyphenated, even if it is not a prize by
Lajos Kossuth Lajos Kossuth de Udvard et Kossuthfalva (, hu, udvardi és kossuthfalvi Kossuth Lajos, sk, Ľudovít Košút, anglicised as Louis Kossuth; 19 September 1802 – 20 March 1894) was a Hungarian nobleman, lawyer, journalist, polit ...
, and no reason can be found for an actual compound: he had nothing to do with the prize, it was only named in his honour. This rule is also often ignored when it is considered to be overruled by another rule concerning names of institutions, e.g. in the name of ' Mindszenty Memorial', advertised as . In this case there are actually two reasons to capitalize (as a name of a person and the beginning letter of the institution name) but the second element of the compound should not be affected.
An exception to the hyphenation of compounds with a proper name is when the proper name contains an uncapitalized common noun. For example, if there is a monastery () named after 'the Prophet Jeremiah', the compound cannot have the usual hyphen, as it would falsely suggest a closer relationship between and . (If ''all'' the elements were common nouns, the case would be simpler, as the above mobility rules could be applied.) Animals' names are capitalized, and if the species is added, it is written in lowercase, without a hyphen.


Geographical names

The two most important questions about geographical names are whether a name should be written in one word, with a hyphen, or in separate words, and which elements should be written uppercase and lowercase. Different written forms may refer to different entities, e.g. lit. 'muddy river' refers to a river, but refers to a city (because rivers' names are written with a hyphen, but city names are written as one word). This field is considered one of the most complex parts of Hungarian orthography, so a separate volume has been published about it, and a separate board () working in the Ministry of Agriculture is entitled to give statements. It consists of experts in linguistics, education, transportation, hydrology, natural protection, public administration, ethnic minorities, foreign relations, and other fields. Apart from single-element names, country names with , , or ('country', 'land', 'plain', 'coast') and most regions are written in one word, as well as Hungarian settlements and their districts ("towns") and quarters, and even Hungarian names outside Hungary. The adjective-forming suffix (sometimes ) is attached directly to the name. If it already ends in , this ending is not repeated. :Problematic point(s): certain region names have become one word, dropping the hyphen, such as ; there are about 60 such forms. Quarters also need to be written as one word, even if they contain a proper name (e.g. , lit. "Wekerle's settlement"), and even if they exceed the 6 syllables (e.g. , 7 syllables and 4 elements, despite the above-mentioned syllable-counting rule). If a geographical name contains a common geographical expression (river, lake, mountain, island etc.) or another common noun or an adjective, the compound is written with a hyphen (e.g. 'Lake Huron' or 'New Zealand'). When these forms are converted into an adjective, only those elements are left capitalized which are actual proper names themselves ( and 'Caspian Sea', however and – is not considered a proper name because it carries the adjectival suffix). The same rule is applied to compounds with three or more elements, although compounds with more than four elements are simplified (lower-ranked hyphens are removed). An
en dash The dash is a punctuation mark consisting of a long horizontal line. It is similar in appearance to the hyphen but is longer and sometimes higher from the baseline. The most common versions are the endash , generally longer than the hyphen b ...
is used to express a relation between two places, and its adjectival form becomes completely lower-case (e.g. 'Moscow-Paris oute and 'of the Moscow-Paris route'). However, if a higher-ranked connected element becomes an adjective, the geographical proper names will retain the upper case (e.g. 'Volga-Don canal' vs. ), except when the elements of the name contain adjectives or common nouns, which will become lower-case (e.g. 'Bohemian-Moravian Highlands' vs. ). All elements are written separately (excluding the above-mentioned names that are written as one word or with a hyphen) in current and historical country names and geographical-historical region names. Their adjectival forms are all written with lower case. (For example, 'United Kingdom' vs. 'from/of the U.K.', 'South African Republic' vs. but 'Republic of San Marino' vs. ). Only the first element is capitalized in subnational entities like counties, areas, districts, neighbourhoods. When forming an adjective, this uppercase letter is only kept if this element is a proper name, e.g. 'State of New York' vs. . However, if the first element of such an entity is a common noun or an adjectival form, all elements are written lower case (e.g. in names of
local administrative unit Nomenclature of Territorial Units for Statistics or NUTS (french: Nomenclature des unités territoriales statistiques) is a geocode standard for referencing the subdivisions of countries for statistical purposes. The standard, adopted in 2003, ...
s like vs. ). Names of public spaces (roads, streets, squares, bridges etc.) are written separately (except for elements that are already compounds or hyphenated). Their first element is capitalized, and this capitalization is kept even in the adjectival forms, e.g. ' Váci Street' and . :Problematic point(s): people need to know if a phrase is officially the name of that place or just a designation, e.g. is a name ('
Elisabeth Bridge Elisabeth Bridge ( hu, Erzsébet híd) is the third newest bridge of Budapest, Hungary, connecting Buda and Pest across the River Danube. The bridge is situated at the narrowest part of the Danube in the Budapest area, spanning only 290 m. I ...
') but merely refers to a bridge on the Danube, so a hyphen should be used.
If a common name is added to a geographical name to clarify its nature, it is written separately. :Problematic point(s): it is often unclear whether a common noun is actually part of an official geographical name, e.g. many people believe is the actual name of Fertő Lake so they write it with a hyphen; but the name is only , thus a space must be used before . In addition, names like ('Sahara Desert') or 'Ural Mountains' do not contain the common name, so no hyphen must be used, as opposed to the type. If a geographical name consists of several elements whose relationship is marked by suffixes or postpositions, these elements are also written separately. The uppercase letter of the beginning element is kept even in an adjectival form.AkH. 184. :Problematic point(s): the suffix that marks the possessive relationship is lost in the adjectival form, so the relationship is eventually unmarked, but the hyphen is still not used. For example, when 'Vác environs' becomes 'of/from Vác environs', the possessive-marker is lost, so it seemingly becomes analogous with the above type. In addition, while names like ('Outer Pest Road') makes it obvious that is part of the name (rather than an occasional designation), the adjectival form can only be given correctly with this knowledge. The above case of emerges again with the type of 'Lady Mary Island', where 'island' would normally be connected with a hyphen, were it not for the common noun 'lady' in the original name, which makes it impossible, so all elements have to be written separately.


Stars and other astronomical objects

Stars, constellations, planets, moons are written with an uppercase capital, e.g. 'Earth', 'Milky Way', especially as astronomical terms. In everyday usage, however, names of the Earth, the Moon, and the Sun are normally written in lowercase ( 'a journey around the Earth').


Institutions

Names of offices, social organizations, educational institutions, academic institutes, cooperatives, companies etc. are written capitalizing all elements except conjunctions and articles. In adjectival forms, only actual proper names and
trade name A trade name, trading name, or business name, is a pseudonym used by companies that do not operate under their registered company name. The term for this type of alternative name is a "fictitious" business name. Registering the fictitious name w ...
s ("fantasy names") are left uppercase. For example, '
National Széchényi Library The National Széchényi Library ( hu, Országos Széchényi Könyvtár) (OSZK) is a library in Budapest, Hungary, located in Buda Castle. It is one of two Hungarian national libraries, the other being University of Debrecen Library. History The ...
' vs. or " heJudge’s House Restaurant and Guesthouse" vs. , retaining the capitalization of " heJudge’s House" in the adjectival form. :Problematic point(s): it is not always known whether a specific form is actually the official name of an institution, or what is its official name (e.g. whether the city where it is located is part of the name). In addition, it is not always clear if a group is actually an institution in the sense of having been registered at the court, with a statute, a stamp, letter header etc. on its own. A third problem is the question of whether the spelling of an organization can be corrected if it is not written according to the above rule. In addition, adjectival forms derived from institution names are also often mistaken because people feel the need to distinguish them from genuine common nouns (especially if the name contains a
trade name A trade name, trading name, or business name, is a pseudonym used by companies that do not operate under their registered company name. The term for this type of alternative name is a "fictitious" business name. Registering the fictitious name w ...
that becomes identical with a common noun if written in lowercase). Furthermore, it is unclear why cinemas are treated differently (see above) from theatres, cf. , but .
If a part of the institution name stands for the whole name, its upper case form is preserved if it is a specific keyword of the name. However, if a common noun part is used for the whole name, it is written in lower case (except for for the Hungarian Academy of Sciences and for the Hungarian State Opera House). :This rule is commonly violated in legal documents where the authors want to make it as clear as possible that the names refer to the contracting parties in particular, so they write it in upper case (not only the common noun parts of the company names but also generic words referring to the parties involved). Subordinated units of institutions are written in uppercase if they are major divisions (e.g. 'Geographical Society', under the Hungarian Academy of Sciences), not including the personnel department or the warden's office. Railway stations, airports, cinemas, restaurants, cafés, shops, baths and spas, cemeteries etc. are considered less typical institutions, so normally only their actual proper name elements (including possible trade names, "fantasy names") are written in upper case, apart from the first word. Their adjectival forms retain the original case. For example, 'Eastern Railway Station' vs. ; 'Alumnus Café' (or 'Old Student Café') vs. . However, most of these names may also be full institution names, thus their spelling reflecting the rules given above (see the beginning of this section).


Brand names

Names of products, articles, makes, and brands are written capitalized, e.g. . This does not include names which include the material or origin of the product, e.g. 'orange juice'. If the word showing the type is added to the name for clarification, it is done with a space, and in lowercase, e.g. 'Panangin pill'.


Awards and prizes

Words denoting a prize, an award, a medal etc. are attached with a hyphen to proper names, e.g. 'Kossuth Prize.' If the name consists of several elements, whose relation is marked, all the elements are capitalized, e.g. 'Golden Medal of the academy.' Degrees and types of awards are written in lowercase.


Titles of works

Titles are classified as constant and individual titles: the first being the title of newspapers, periodicals, magazines, and the second used with literary, artistic, musical, and other works, articles etc. All elements of constant titles are written in uppercase (e.g. 'Life and Science' eekly, while only the first word is capitalized in individual titles (e.g. 'Defining Desk Dictionary of the Hungarian Language' or '
A Little Night Music ''A Little Night Music'' is a musical with music and lyrics by Stephen Sondheim and book by Hugh Wheeler. Inspired by the 1955 Ingmar Bergman film ''Smiles of a Summer Night'', it involves the romantic lives of several couples. Its title is a ...
'). Suffixes are attached to titles without a hyphen, except if a title already ends in a suffix or a punctuation mark, or if the suffix creates an adjective: in these cases, a hyphen must be used. (For example: 'in Magyar Hírlap' but 'Magyar Hírlap-like.')


Non-proper names

Names of national and religious holidays, celebrations, notable days, periods, historical events are not capitalized (nor day or month names), neither are names of nationalities and ethnicities, languages and language groups as well as religions. Events, programmes, and arrangements are not capitalized either, except if they have an institutional background. :Problematic point(s): an average person cannot always know if an event has an institutional background. Therefore, events are still usually written with capital letters. Apart from personal names, commons nouns expressing rank or relation may also be capitalized in addresses for reasons of politeness.
suffixes In linguistics, a suffix is an affix which is placed after the stem of a word. Common examples are case endings, which indicate the grammatical case of nouns, adjectives, and verb endings, which form the conjugation of verbs. Suffixes can carry g ...
and titles like Doctor, Junior, Senior, and their abbreviations are only capitalized if they are in a prominent position (e.g. in postal addresses or lists).


Foreign words and loanwords

Foreign words either retain their foreign spelling or they are phonetically respelled according to the Hungarian writing system.


Common nouns

If a word comes from a language using the Latin script, it is only respelled if it has become an integral, widely known part of Hungarian language (e.g. laser > ; manager > ). If it is less widely used, it retains its original spelling, e.g. , , . But there is no hard and consistent rule, and many widely used terms are written in the original spelling, e.g. or . Certain phrases from foreign languages are always written in their original form, even if the individual words would be respelled in isolation, e.g. cf. . :Problematic point(s): Inconsistency in some cases, cf. for 'fitness' and ''wellness'', or ''Milánó'' and ''Torino''. Certain words, long present in Hungarian, are written in the foreign way (such as ), despite their being commonly known, because of uncommon sound clusters in Hungarian (such as juin *mjuzikel), or because of possible confusion with an existing Hungarian word (e.g. for 'salt'). Moreover, the traditional Hungarian transliteration may be rejected for languages like Chinese that already have a Latin version of their writing system. In addition, it may not be obvious whether a Latin or a non-Latin official language of a country should be considered as a basis (e.g. Indian names). Some features of the original spelling are sometimes retained, e.g. football > (pronounced "fudbal"), million > (pronounced "milió"). The digraph ''ch'' is preserved if it is pronounced The letter ''x'', if pronounced "ksz", is usually written ''x'' in Hungarian too. However, if it is pronounced "gz", it is normally written ''gz'', again with a few exceptions. The letters ''qu'' are always respelled as ''kv''. If the source language uses a non-Latin script (Greek, Russian, Chinese etc.) words are respelled phonetically. This does not always mean exact
transliteration Transliteration is a type of conversion of a text from one script to another that involves swapping letters (thus ''trans-'' + '' liter-'') in predictable ways, such as Greek → , Cyrillic → , Greek → the digraph , Armenian → or L ...
: sometimes the foreign pronunciation is bent to conform to Hungarian phonology better (e.g. , 'samovar', 'typhoon'). In practice however, English transliterations are often used instead, such as using instead of ).


Proper names

Proper names from languages with a Latin alphabet are normally written in the original way, e.g. , including all the diacritics (e.g. ). Certain foreign proper names have a Hungarian version, e.g. for Christopher Columbus (in the
Eastern name order A personal name, or full name, in onomastic terminology also known as prosoponym (from Ancient Greek πρόσωπον / ''prósōpon'' - person, and ὄνομα / ''onoma'' - name), is the set of names by which an individual person is know ...
, typical of Hungarian). Other names adapted the given name and the word order to Hungarian customs, but left the surname intact, e.g. for Jules Verne. Recently borrowed names are no longer modified in Hungarian. The only exceptions are some given names which can only be written in Hungarian spelling, e.g. for Christian and for Carmen. As with common nouns, ''ch'' and ''x'' are retained in both personal names and geographical names of foreign origin (e.g. , ). Similarly to common names again, widely-known and fixed forms of proper names from languages with a non-Latin script are preserved (e.g. (Aesop), , ), rather than introducing a more up-to-date or more accurate transliteration (e.g. ). Some well-established foreign names have a popular form used in phrases and another referring to the person (e.g. 'Pythagorean theorem' but for the philosopher himself).


Suffixes

Suffixes are added directly in most cases. The suffix is omitted in writing if the word already ends in the letter ''i'' (e.g. ''Stockholm'' > ; ''Helsinki'' > ). In the case of suffixes of variable forms depending on Hungarian
vowel harmony In phonology, vowel harmony is an assimilatory process in which the vowels of a given domain – typically a phonological word – have to be members of the same natural class (thus "in harmony"). Vowel harmony is typically long distance, me ...
rules, the version in accordance with the actual pronunciation should be used. If a certain suffix requires lengthening of the word-final vowels ''a'', ''e'', ''o'', ''ö'', they are lengthened as usual, e.g. but , . In addition, suffixes will follow the pronunciation of the word in terms of the ending consonant and the front or back vowels (e.g. 'with Bach', 'with Greenwich'). If the last letter of a foreign word is silent (not pronounced) or part of a complex cluster of letters, a hyphen is used when attaching suffixes (e.g. 'with a guillotine', 'with M.').AkH. 217. a) If an adjective is formed from a proper name with only one element, it will be lowercase (e.g. 'Voltaire-esque'). A hyphen is also used if an adjective is formed from a multiword name (e.g. 'typical of V. H.', 'S. F.-based'). The last vowel is lengthened even in writing if it is pronounced and it is required by phonological rules. If the suffix begins with the same letter as a word-final double letter (e.g. 'with Grimm')., a hyphen is used again.


Hyphenation

Hyphenation at the end of a line depends on whether there is an easily recognizable word boundary there. If the word is not a compound (or it is, but the boundary is not nearby) the word is hyphenated by syllables, otherwise by word elements (e.g. 'railway', lit. 'iron-road,' instead of ).AkH. 233. The number of syllables is defined by the number of vowels (i.e., every syllable must contain one and only one vowel) and the main rule can be summarized as follows: a syllable can begin with at most one consonant (except for the first syllable of a word, which may contain up to three initial consonants). It means that a syllable can only begin without a consonant if there is no consonant after the preceding vowel (e.g. 'nut-sized'), and if there are multiple consonants between vowels, only one can go to the next syllable (e.g. 'list'). Hyphenation normally follows pronunciation, rather than the written form. If a word contains several vowel letters but they are pronounced as a single sound, it cannot be hyphenated (e.g. 'a surname', 'blues'). Pronunciation is respected in the case of ''ch,'' which is pronounced as a single sound so both its letters are kept together (e.g. 'from Zürich').AkH. 228. Hungarian surnames are also hyphenated by pronunciation, e.g. > r. bő-ti 'the Balogh family' > r. balog 'typical of Móricz' . The same principle applies to foreign common names and proper names, e.g. for consonants (because ''lj, gi,'' and ''sch'' denote single sounds) and for vowels. Even acronyms can be hyphenated if they contain at least two vowels (e.g. ) or at the boundary of the acronym and the suffix, where a hyphen already exists (e.g. 'for the NATO'). On the other hand, ''x'' denotes two sounds, but it is not separated at the boundary of two syllables (e.g. rather than , based on phonetics). Long double consonants are separated and their original forms are restored if they are at the boundary of two syllables (e.g. 'cherry-flavoured' > ). Although not incorrect, it is not recommended to leave a single vowel at the end or the beginning of a line (e.g. ). Double vowels can be separated (e.g. 'vacuum'), and long consonants can also be separated (e.g. 'ton'). Inflectional suffixes are not considered elements on their own (e.g. although the stem of 'for money' is , its hyphenation is rather than ). Apart from the hyphenation based on pronunciation, foreign compounds may be hyphenated at their boundary, if the prefix or suffix is widely recognized, e.g. (by syllables) or (by elements). The elements are also taken into consideration in compound names (e.g. r. pálfi hyphenated as , rather than ). Sometimes different ways of hyphenation reflect different words (e.g. 'again,' a single word hyphenated by syllables, cf. 'admonish,' a compound with a verbal prefix, hyphenated by elements). Hyphens are not to be repeated at the beginning of the next line, except in specialized textbooks, as a way of warning for the correct form. :Problematic point(s): certain vowel couples are sometimes pronounced as diphthongs, e.g. 'August' is often pronounced in three syllables like , but the syllable-counting rule (above) should consider it as a word of four syllables. Another problematic point is that some words might seem to be compounds although they are not (e.g. 'serf' > , although 'better' and 'bed' are existing words). A third possible problem is that although hyphenation strictly follows pronunciation, long consonants pronounced as short are treated as if they were pronounced long (e.g. is pronounced iliméterso hyphenation could be but it follows the written form and will be , or is hyphenated as , although the ''ss'' is pronounced short.) A further problem is that ''dz'' is not considered as a genuine diphthong by current Hungarian phonology, but it is treated as a digraph, and both of its letters should be moved together (e.g. 'string') and the trigraph ''dzs'' is also treated as a single letter, even when it is pronounced long (e.g. 'impresario'). Finally, the pronunciation and spelling rules of foreign words are not always known, so people may not be able to hyphenate them correctly (although they can separate them at a different point, or take them to the next line altogether).


Punctuation


At the end of a sentence

Punctuation marks are added to the end of the sentence depending on its intended meaning. The exclamation mark is not only used for exclamations, but also for wishes and commands. If the sentence formally reflects one mood, but it actually refers to a different idea, the punctuation mark is selected based on the actual meaning. Punctuation marks may be repeated or combined to express an intense or mixed emotion. (For example, 'How dare you?!') In case of coordinating clauses, the punctuation mark is adapted to the ending clause. Subordinated clauses take a punctuation mark reflecting the main clause – except if the main clause is only symbolic, emphasizing the subordinate clause.


Between clauses

A comma (or a colon, semicolon etc.) should be placed at the border of clauses whether or not there is a conjunction. It also applies to cases when the clause begins with one of the conjunctions 'and' and 'or'. However, it is sometimes difficult to assess whether the part joined with these conjunctions is a separate clause (because if it is not, no comma is needed). For example: 'He banged the door and rushed away in fury.' but . 'He suddenly jumped up and rushed away.'
Simile A simile () is a figure of speech that directly ''compares'' two things. Similes differ from other metaphors by highlighting the similarities between two things using comparison words such as "like", "as", "so", or "than", while other metaphors c ...
s introduced with the word 'as, like' are to be preceded by a comma. The exception is a kind of a 'more than' construction that has a mere intensifying function (as opposed to 'practically' or 'almost'). In case of a double conjunction expressing 'instead of (doing)', 'without (doing)' etc., only the first element should be preceded by a comma – except if the first element closely belongs to the first clause, in which case the comma is placed between the two conjunctions. Semicolons are generally used to separate sets of closely connected clauses, if these larger sets of clauses are loosely connected to each other. A semicolon may also be used to mark that two single clause have but a loose relation to each other. Colons attract the attention to a forthcoming idea, or they may be used to mark that an important explanation or conclusion follows. If a clause introduces several separate sentences, all of them (including the first) are written with an uppercase initial. To express that a fairly distinct set of ideas follows, a dash may be used after the full stop, the question mark, or the exclamation mark.


Between clause elements

Coordinated clause elements are separated by commas if no conjunction is used. (A semicolon may be used to separate series of words whose elements are separated by commas.) If a conjunction is used between coordinated clause elements, a comma is used before it, except if the conjunction is one of the words 'and' or 'or,' where the comma is omitted. Since the abbreviation ''stb.'' 'etc.' includes the conjunction ''s'' 'and,' it doesn't need a comma either. For example: 'an appealing but incorrect theory,' ''a rózsának, a szegfűnek vagy a levendulának az illata'' 'the scent of a rose, a carnation, or a lavender.' If a coordinated sentence element is mentioned at the end of the whole clause, separated from the related elements, in a postponed manner, it is separated from the rest of the clause with a comma. For example: 'Bring an umbrella to the excursion, or a raincoat.' Coordinated structures formed with coupled conjunctions (e.g. "either – or") are written with a comma placed before the second conjunction.
Apposition Apposition is a grammatical construction in which two elements, normally noun phrases, are placed side by side so one element identifies the other in a different way. The two elements are said to be ''in apposition'', and one of the elements is ...
s are separated from the referred element with a comma (or a colon), if they are in the same grammatical position as the referred element. If the apposition gets further back in the sentence, the comma will precede it directly. If the apposition is followed by a pause in speech, a comma may be placed after it, too. If a descriptive phrase is added to a personal name but only the last part takes the suffixes (in which case it is not called an apposition), no comma is used after the personal name. For example: 'to Elemér Nagy, honorary citizen of our town' – because of the possessive structure, both element take the suffixes, and the second part can only be an apposition, so a comma is needed. On the other hand: 'to Elemér Nagy honorary citizen' – the whole structure takes one suffix at the very end, thus it cannot be appositive, and no comma is used. If the apposition or the referred element is a derivative of the word ("himself" etc.), the comma is not used. However, adverbs used like appositions take the comma. Subordinated clause elements take no comma (e.g. 'a man with black glasses' – the word 'black' doesn't belong to 'man' but to 'glasses'). If the word 'as' precedes a phrase expressing status or quality, no comma is used before it (e.g. . 'My brother was heard as a witness.') Structures formed with an
adverbial participle In linguistics, a participle () (from Latin ' a "sharing, partaking") is a nonfinite verb form that has some of the characteristics and functions of both verbs and adjectives. More narrowly, ''participle'' has been defined as "a word derived from ...
are not usually separated from the clause with a comma, especially if the participle is directly connected to it. However, if this part is loosely attached to the clause (especially if the participle has its own
complement A complement is something that completes something else. Complement may refer specifically to: The arts * Complement (music), an interval that, when added to another, spans an octave ** Aggregate complementation, the separation of pitch-clas ...
), it is recommended to use a comma.


Elements wedged into a clause


Interposition

Words or phrases (especially external elements) interposed into a sentence are marked with commas, dashes (with spaces), or parentheses. For example: or or 'My brother, as the witness of the accident, was heard several times.' The comma may be omitted around interposed elements depending on the articulation, reflecting the intention of the author, e.g. 'The train was, of course, late again.' can be written without commas as well. If the conjunction 'as' precedes an interpolation separated by pauses in speech, commas may be used before and after the interjected part. Subordinated clauses are also separated by commas, dashes, or parentheses if they are interposed into another clause. 'Eve, though she could have run through the road, let the truck leave.' If a word, phrase, or clause is interposed into a sentence right next to a punctuation mark, this mark needs to be inserted after the pair of dashes or parentheses. For example: 'He graduated at a technical university – as a chemical engineer – but he became a writer.' However, if an independent sentence is interposed, its punctuation mark is inserted inside the parentheses.


Forms of address

Forms of address are usually followed by an exclamation mark, e.g. 'My dear friends,' or a comma can be used in private letters. If this form stands within a sentence, it is separated from the rest with commas. :Problematic point(s): intonation is sometimes unbroken before appositions wedged into sentences, so even a series of books was published by a notable publishing house with the title 'Do it yourself, Sir, if you are in trouble with the PC,' without a comma preceding .


Quotation

Quotation marks are placed below at the beginning and above at the ending of a quotation, both signs turning left, being curly and double. If another quotation is included in a quotation, angle quotation marks (guillemets) are used, directed towards each other with their tips: ''(„quote1 »quote2« quote1").'' If a quoting sentence introduces the quotation, it is preceded by a colon; the ending punctuation mark should be inserted as in the original. Lowercase initials should only be used if they are lowercase in the original. If a quoting sentence follows the quotation, they are separated by a dash (and spaces). Punctuation marks of the original text are preserved, except for the full stop, which is omitted. If the quoting sentence is interposed in the quotation itself, it is written in lowercase and separated with dashes (and spaces). The second quotation mark stands at the end of the quotation. For example: or or '"Yes, I know," he replied.' If the quotation is organically interwoven into one's own text, the quoted part is marked with quotation marks, and common words beginning the quotation are written in lowercase (even despite the original). For example: 'According to the syllabus, one of the goals of a school is "to bring up a generation healthy in body and mind."' When quoting others' words in terms of their content, the quotation marks are not used: 'Our constitution states that our social system is based on work.' Indirect (reported) speech is treated in the same way. In fiction and prose, quotations are marked by dashes instead of quotation marks, placed at the beginning of a line. If the quotation is written in a separate line, the only dash is the one that precedes it. If quotation is followed by the quoting sentence, they are separated by another dash (the full stop omitted from the end, other punctuation marks retained, as described above). If a quotation is continued after the author's words, another dash follows. For example: : :"I have been waiting for you," he received his friend. "We have a lot to do."


Between words and their elements

Interjection An interjection is a word or expression that occurs as an utterance on its own and expresses a spontaneous feeling or reaction. It is a diverse category, encompassing many different parts of speech, such as exclamations ''(ouch!'', ''wow!''), curse ...
s are preceded and followed by commas. If an interjection is followed by the emphatic words or 'how much,' the commas can be omitted depending on the stress and pause conditions of the sentence. If two conjunctions follow each other (e.g. because of an interposed clause), only the first is preceded by a comma, e.g. 'They invited him, but as it was cold, he didn't set out.' A hyphen is used between words and their elements in the following cases (a taxative list, partly reiterating points mentioned elsewhere): * in case of three successive identical consonant letters at the border of compound elements and between a proper name and its suffix (see above) * in certain kinds of word repetitions and coordinated compounds, in several types of subordinated compounds (see above), as well as in unusual, occasional compounds in the poetic language (e.g. ''bogáncs-szívem'' 'my heart of thistle') * if the ending or beginning word of two or more compounds are the same, and only the last instance is written out in full: the preceding, omitted instances are marked with a hyphen, e.g. ''tej-, zöldség- és gyümölcsfelhozatal'' 'milk, vegetable, and fruits arrival,' 'cooperative for wine production and marketing' * in numbers written in letters: beyond two thousand, if more numerals follow (see below) * nouns and their derived adjectives are connected to proper names in several cases with a hyphen (see above) * with double surnames (see above) * with several types of geographical compounds (see above) * with the
enclitic In morphology and syntax, a clitic (, backformed from Greek "leaning" or "enclitic"Crystal, David. ''A First Dictionary of Linguistics and Phonetics''. Boulder, CO: Westview, 1980. Print.) is a morpheme that has syntactic characteristics of a ...
question word (e.g. 'Do you know which way to go?') * in case of pairs of numbers (whether in digits or in letters) given in an approximative sense (e.g. 'some eight or ten days') The dash is referred to in Hungarian orthography under two names: (lit. "thought mark") and (lit. "big hyphen"). The first form applies to cases where it separates an interposed remark, usually a clause or a phrase (see above): this one is always used with spaces on either side (or a comma and a space after it). The second one is used to connect single words with each other to create a phrase: this one is normally used without spaces. This latter dash is used between words in the following cases (a taxative list): * to connect names of peoples or languages (e.g. 'French-Spanish border') * to link proper names in a loose, occasional (i.e. not institutionalized) relationship (like when authors of a book are mentioned after each other, or for matches of two sports teams) * to express a relationship extending between two points (in time or space, e.g. 'Budapest-Vienna oute). Note: the dash can exceptionally be surrounded by spaces in more complex cases, e.g. ''i. e. 753 – i. sz. 456'' '753 BC – 456 AD' * giving types of machines, between letters or words and a number (e.g. ) The ellipsis sign ''(...)'' is used to mark that an idea is unfinished (and more thoughts can be inferred from what is written), or if a part of a text has been omitted from a quotation.


Suffixation

Suffixes are normally attached to words directly. However, a hyphen is used in a couple of cases (a taxative list, referring to other passages of the regulation): * in case of three successive identical consonant letters in certain cases that cannot be simplified, such as with proper names ending in double letters and having a suffix (see above) * personal and geographical names as well as titles of periodicals consisting of several separate elements take adjective-forming derivational suffixes with a hyphen (see above), e.g. 'typical of L. da Vinci' (but 'with L. da Vinci'), 'N. Y. C.-based' (but 'in N. Y. C.') * proper names (including personal names, geographical names, institution names, titles of periodicals) with one single element take a hyphen before suffix-like derivational elements such as and (e.g. ' Petőfi-like,' cf. 'with Petőfi' and 'typical of Petőfi') * if the word-final letter is not pronounced (silent), or this letter is part of a more complex cluster of letters, suffixes are connected with a hyphen (see above) * digits, punctuation marks, typographical signs, abbreviations, and acronyms take a hyphen before suffixes (see below)


Other information on punctuation

No full stop is needed after titles of periodicals, books, poems, articles, studies, and treatises as well as after institution names and direction signs if they are given highlighted or on their own. However, lower section titles can be inserted in a text and they can be followed by other sentences: in this case, a full stop is used after them. Question and exclamation marks can be used even in highlighted titles. A full stop is used in the following cases: * after Roman and Arabic numerals denoting ordinal numbers (see below) * after certain types of abbreviations (see below) * after numbers marking the year, the month, and the day of a date (see below). A colon is used to highlight a phrase or sentence mentioned as an example. This sign is also used between an author's name and the title of the work, if they are given without a syntactic reference to each other. A possessive case, however, eliminates the colon. (For example: but ' Toldi by János Arany.') A hyphen is used at the end of a line, when a part of a word is taken to the next line. If a word already contains a hyphen for whatever reason, it can be used at the end of the line, just like if it contains a dash. If a part given in parentheses has a fairly close connection to the sentence, the closing punctuation mark is used after it. If the part in parentheses ends in a full stop, the punctuation mark still needs to be used after the parenthetical part. Quotation marks may be used (though should not be overused) to express ironic or other emotional overtones. Quotation marks can be used around the titles of books, works, articles etc. – in this case, suffixes can be connected with a hyphen. The beginning of decimal fractions is marked with a comma. Numerals of more than four digits are divided by spaces, in groups of three, counted from the back. (See more below.) The following signs and symbols are also used relatively frequently (with minor differences from the Anglo-Saxon usage): plus (+) for addition, minus (–) for extraction, the interpunct ( · ) for multiplication, the colon ( : ) for division, the equals sign (=) to mean equality, the percent sign (%) to express percent, the slash (/) to express alternativeness or fractions, the section sign (§) to refer to sections, a combination of an upper dot, a slash, and a lower dot (⁒) to mean "please turn over," the asterisk or superscript numbers (* or 1) to mark notes, a right double curly quote (") to express repetition (as opposed to
ditto mark The ditto mark is a shorthand sign, used mostly in hand-written text, indicating that the words or figures above it are to be repeated. The mark is made using 'a pair of apostrophes'; 'a pair of marks used underneath a word'; the symbol (quota ...
), a right single curly quote (') to express lack, the degree symbol to mark the (Celsius) degree, and the tilde (~) to express repetition or equivalence. Suffixes are connected to the percent sign, the section sign, and the degree symbol with a hyphen, and the suffix will reflect the pronounced form, with respect to assimilations and linking vowels, e.g. r. "három százalékkal"'by 3%.'


Abbreviations and acronyms

These two groups are distinguished by whether the shortened form is only used in writing (abbreviations) or in speech as well (acronyms). Acronyms may be pronounced with the name of their letters (e.g. 'National Savings Bank' r. ótépé, or if possible, in full ( 'Hungarian State Railways' r. máv. The article preceding these forms is always adapted to the spoken form.


Abbreviations

Abbreviations are written in one word whether they are created from single nouns, nouns with derivational suffixes, or compounds, and they are written with a full stop. If an abbreviation retains the ending of the original word, the full stop is still preserved (e.g. < 'headquarters'). Abbreviation of phrases normally contains as many elements as the original phrase contains (e.g. < 'by his/her own hand') but there are exceptions (e.g. < 'compare'). Case is usually kept in abbreviations (e.g. < 'Hungary') but some abbreviations created from lowercase words use the uppercase (e.g. < 'west'). Units of measurement are used in accordance with the international standard, depending on whether the sign comes from a common name ( < ) or a proper name ( < after Isaac Newton). Standard forms of abbreviations are not to be altered even in full-capitalized inscriptions ( 'PRICE: 100 HUF'). Some abbreviations are written without a full stop, such as names of currencies, cardinal and ordinal directions, country codes of cars, codes of country names, chemical, physical, mathematicals symbols, symbols of units, etc. The full stop can be omitted from abbreviations in encyclopedias, but they are to be explained in a legend. A full stop is not used after abbreviations whose last element is a full word (e.g. < 'the same'). Suffixes are attached to abbreviations based on their pronunciation (even if the pronunciation is considerably different from the symbol, e.g. 'iron'> 'with iron' and the article, too, should reflect the pronounced form). If an abbreviation forms a compound with a full word, they are connected with a hyphen (e.g. < 'statement of current account').


Acronyms

Acronyms are classified into two groups: those consisting only of initials ( lit. 'letter-words'), and those consisting parts of the original word ( 'word contractions'). The first group is divided again by whether they denote proper names (written in uppercase, e.g. < 'United Nations Organization', note that both letters of the digraphs ''SZ'' are capitalized) or common names (written in lowercase, e.g. < 'executive committee', note that it is written as one word despite the two elements). Some acronyms created from common names are still written in uppercase, though, especially in sciences ( < 'ultra-high-frequency') but other capitalized acronyms may be accepted too (''TDK < tudományos diákkör'' 'students' scholarly circle'). In some cases, full-fledged words are created from the pronounced form of acronyms standing for common names (e.g. ''tévé < tv < televízió''). Since 2015, a hyphen is required even if the additional word is already contained in the acronym (e.g. literally 'CD disk', though ''CD'' already comprises "disk", versus 'CD burning', in accordance with earlier regulations). Acronyms of the second group are created from longer parts of the original words (in fact, at least one word of the original should keep at least two letters, not including digraphs). Their letters are not all capitalized, only the initial of acronyms that derive from proper names (e.g. < , 'Commercial Quality Control Institute' cf. < 'maternity benefit'). Neither type of acronyms need a full stop between their elements or at their end. Acronyms take suffixes in accordance with their pronounced forms, whether their letters are pronounced one by one or as a full word (e.g. ébécés'one with tuberculosis'). Those from the first group, consisting only of word initials, are suffixed with a hyphen. Their capitalized types will retain their uppercase even in their adjectival forms ( 'one from the UN'), and their ending vowel letter will not be lengthened even if it would be phonologically justified (e.g. ltén'at ELTE'). Those from the second group, however, consisting of shorter pieces of the constituting words, take suffixes without a hyphen (e.g. 'she is on maternity leave'). The same happens to those words that were created from pronounced letters (e.g. 'watch TV'). Proper name types of these acronyms are written in lowercase if an adjective is formed out of them (e.g. 'Kermi-related'). In addition, their ending vowel letter may be lengthened in accordance with general phonological rules (e.g. > 'at Hungexpo'). Compounds are created with acronyms by the following rules: those from the first group take other elements with a hyphen (e.g. 'UHF broadcast'), and proper name types of the second group behave the same way (e.g. 'control by Kermi'). The common name types of the second group, however, can be written as one word with other elements, except if they require a hyphen because of length (e.g. 'TV transmission').


Other information


Numbers

Numerals that can be pronounced with a short word are usually written in letters, just like those having a suffix, a postposition, or another compound element. On the other hand, digits should be used in case of longer or bigger numerals, as well as to note down exact quantities, dates, amounts of money, measurement, statistical data etc. If cardinal numbers are written in letters, they should be written as one word up to 2000 (e.g. '1,999') and they should be divided by hyphens by the usual three-digit division over 2000 (e.g. '2,001'). Numbers written in digits can be written without a space up to four digits; above that, they are divided by spaces from the end by the usual three-digit division (e.g. but ). If numbers are written under each other in a column, all can be divided by spaces. Ordinal numbers written in digits take a full stop (e.g. '3rd line'). The full stop is retained even before the hyphen that connects suffixes (e.g. 'with the 10th'). Dates are an exception to this rule, see below. If a fraction functions as a noun, the quantifier is written separately (e.g. 'one quarter'). However, if a fraction takes an adjectival role in a phrase, the two parts are written in one word (e.g. 'a one-quarter part'). Giving the hour is also done by this rule. The integer part of a decimal is divided from the rest by a comma (e.g. '3.14'). Numbers are usually written in Arabic numerals. Roman numerals are only used in some special traditional cases, only to express ordinal numbers (e.g. to express the numbering of monarchs, popes, districts of a city, congresses, etc.). Their use is advisable if they have a distinctive role as opposed to Arabic numbers, e.g. to denote the month between the year and the day, or to mark the floor number in front of the door number.


Dates

The year is always given in Arabic numerals and it is followed by a full stop. The name of the month can be written in full or abbreviated, or it can be marked with a Roman or Arabic numeral. The day is always written in Arabic numerals. Dates are sometimes written without full stops and spaces, divided only by hyphens. Normally, a full stop is needed after the year. However, it is omitted in three cases: (1) if it is in a possessive relationship with the forthcoming word, (2) if it is followed by a postposition or an adjective coined from it, or (3) if it is the subject of a sentence or it stands solely in parentheses. For example, 'the 1994 spring' but 'the spring of 1994' and 'after 1994.' When digits expressing year and day take suffixes, the full stop is dropped before the hyphen (e.g. 'in 1838' and 'on March 15'). The word 'on the 1st of' and its suffixed forms are abbreviated as etc. If a day is followed by a postposition, the full stop is retained (e.g. 'between the 20th and the 30th').


Other cases

Letters and other postal consignments are to be addressed by the official addressing patterns of the Hungarian Postal Service. (It currently means that the name comes first, then the settlement, the street or the P.O.B., and finally the postcode, written under each other. Street directions contain the street number first, and optionally the floor number and the door number.) The words for "hour" and "minute" ( and ) are not usually abbreviated in fluent texts. If the time is given in digits, a full stop is placed between the hour and the minute without a space (e.g. ). This latter form takes a hyphen before suffixes (e.g. 'at 10:35').


Collation

Digraphs are distinguished in
collation Collation is the assembly of written information into a standard order. Many systems of collation are based on numerical order or alphabetical order, or extensions and combinations thereof. Collation is a fundamental element of most office filin ...
(i.e. to determine the order of entries in a dictionary or directory) from the letters they consist of. For example, is followed by , even though ''s'' precedes , as is considered a single entity, and follows all the words starting with . In general dictionaries, contracted forms of digraphs are collated as if they were written in full, e.g. precedes , even though precedes , because consists of  + , and precedes . Short and long versions of vowels are considered equal for the purposes of collation (e.g. precedes ) unless the words are otherwise identically spelt, in which case the short vowel precedes the long one (e.g. precedes ). Phrases and hyphenated compounds are collated ignoring the space or the hyphen between their elements; lower and upper case don't count either. Obsolete digraphs in traditional Hungarian names and foreign words are treated as a series of individual letters. Diacritics are only taken into consideration if there is no other difference between words. However, in encyclopedias, map indices, and other specialized works, where Hungarian and foreign names are mixed, the universal Latin alphabet is followed.


History

The rules of Hungarian orthography were first published by the
Hungarian Academy of Sciences The Hungarian Academy of Sciences ( hu, Magyar Tudományos Akadémia, MTA) is the most important and prestigious learned society of Hungary. Its seat is at the bank of the Danube in Budapest, between Széchenyi rakpart and Akadémia utca. Its ma ...
(HAS) in 1832, edited by
Mihály Vörösmarty Mihály Vörösmarty (archaically English: Michael Vorosmarthy 1 December 180019 November 1855) was an important Hungarian poet and dramatist. Biography He was born at Puszta-Nyék (now Kápolnásnyék), of a noble Roman Catholic family. ...
. Revisions followed in 1877, 1922, 1954, 1984, and 2015 (the last two being minor). The currently effective version is the 12th edition from 2015. Rules of Hungarian orthography are laid down by the Hungarian Language Committee of the Research Institute for Linguistics (formerly named ''…of the
Hungarian Academy of Sciences The Hungarian Academy of Sciences ( hu, Magyar Tudományos Akadémia, MTA) is the most important and prestigious learned society of Hungary. Its seat is at the bank of the Danube in Budapest, between Széchenyi rakpart and Akadémia utca. Its ma ...
;'' but since 2019 the institute is affiliated with Eötvös Loránd Research Network). They are published in a book titled ''Rules of Hungarian Orthography''
A magyar helyesírás szabályai
. The two most significant orthographic dictionaries are the one published by ''Akadémiai Kiadó'' ("Academy Press") (originally founded by HAS), searchable online, and the one published by Osiris Kiadó. The former is considered as the standard, and comprises 140,000+ words and phrases; the latter is more comprehensive, including more than 210,000 words and phrases as well as a more detailed elaboration of the regulations.


Orthography and society

Although orthography gives only instructions how to note down an existing text, usage-related suggestions are also given in most Hungarian linguistic publications (such as if a construction should be rephrased or a word should be avoided). These periodicals include , , , , and , and several other periodicals have linguistic columns (such as ). Ádám Nádasdy sometimes touched on orthographic issues in his column popularizing linguistics in
Magyar Narancs ''Magyar Narancs'' (''Hungarian Orange'' in English) is a weekly liberal magazine with a strong satirical tone appearing on Thursdays in Hungary. It is informally referred to as Mancs (''Paw'' in English) which is a joking abbreviation of the nam ...
, and in his books based on this column and its forerunners. New entries of (Proofreader's Blog – "The mild Grammar Nazi") are published on the main page of the popular news portal Index.hu. Linguistic educational programmes were broadcast on television, the most famous being "Let's stop for a word", screened more than 500 occasions between 1987 and 1997, and some of its issues were published in a book. Apart from the Geographical Names Committee and the manual on geographical names mentioned above, other fields have their specialized orthographical dictionaries, such as economy, medicine, technology, chemistry, and military affairs, as well as collections of examples in periodicals, such as for zoological and botanical names. Orthographical competitions are organized at primary, secondary, and tertiary level in every year ( competition for upper primary schools – for students aged 10 to 14, competition for secondary schools, and competition for universities). Word processors, some Internet browsers and mailing applications are supplied with a Hungarian spellchecker: Hunspell for
OpenOffice.org OpenOffice.org (OOo), commonly known as OpenOffice, is a discontinued open-source office suite. Active successor projects include LibreOffice (the most actively developed), Apache OpenOffice, Collabora Online (enterprise ready LibreOffice) a ...
,
Firefox Mozilla Firefox, or simply Firefox, is a free and open-source web browser developed by the Mozilla Foundation and its subsidiary, the Mozilla Corporation. It uses the Gecko rendering engine to display web pages, which implements current ...
and
Thunderbird Thunderbird, thunder bird or thunderbirds may refer to: * Thunderbird (mythology), a legendary creature in certain North American indigenous peoples' history and culture * Ford Thunderbird, a car Birds * Dromornithidae, extinct flightless birds ...
. A Hungarian company, MorphoLogic has developed its own proofing tools, which is used in
Microsoft Office Microsoft Office, or simply Office, is the former name of a family of client software, server software, and services developed by Microsoft. It was first announced by Bill Gates on August 1, 1988, at COMDEX in Las Vegas. Initially a marketin ...
. People can seek advice for free in orthography-related and other linguistic topics from the Department of Normative Linguistics at the
Research Institute for Linguistics of the Hungarian Academy of Sciences The Hungarian Research Centre for Linguistics ( hu, Nyelvtudományi Kutatóközpont) was created in 1949. It was under the supervision of the Hungarian Academy of Sciences from 1951 until 2019, when it was moved by a governmental decree to the sup ...
or from the Hungarian Linguistic Service Office.


See also

*
Hungarian alphabet The Hungarian alphabet () is an extension of the Latin alphabet used for writing the Hungarian language. The alphabet is based on the Latin alphabet, with several added variations of letters. The alphabet consists of the 26 letters of the ISO b ...
*
Hungarian braille The braille alphabet used to write Hungarian language, Hungarian is based on the international norm for the 26 basic letters of the Latin script. However, the letters for ''q'' and ''z'' have been replaced, to increase the symmetry of the accente ...


References

*AkH.: ''A magyar helyesírás szabályai.'' akadémiai helyesírás"Akadémiai Kiadó, Budapest (several prints after 1984). . (The numbers refer to passages.) *OH.: Laczkó, Krisztina and Attila Mártonfi. ''Helyesírás.'' Osiris Kiadó, Budapest, 2004. . (The numbers refer to page numbers.)


External links


A test of Hungarian orthography with 100 questions
{{DEFAULTSORT:Hungarian Orthography Hungarian language, Orthography Latin-script orthographies