Gaj's Latin Alphabet
   HOME

TheInfoList



OR:

Gaj's Latin alphabet ( sh-Latn-Cyrl, Gajeva latinica, separator=" / ", Гајева латиница}, ), also known as ( sh-Cyrl, абецеда, ) or ( sh-Cyrl, гајица, link=no, ), is the form of the
Latin script The Latin script, also known as Roman script, is an alphabetic writing system based on the letters of the classical Latin alphabet, derived from a form of the Greek alphabet which was in use in the ancient Greek city of Cumae, in southern ...
used for writing
Serbo-Croatian Serbo-Croatian () – also called Serbo-Croat (), Serbo-Croat-Bosnian (SCB), Bosnian-Croatian-Serbian (BCS), and Bosnian-Croatian-Montenegrin-Serbian (BCMS) – is a South Slavic language and the primary language of Serbia, Croatia, Bosnia an ...
and all of its
standard varieties A standard language (also standard variety, standard dialect, and standard) is a language variety that has undergone substantial codification of grammar and usage, although occasionally the term refers to the entirety of a language that includes ...
: Bosnian, Croatian, Montenegrin, and
Serbian Serbian may refer to: * someone or something related to Serbia, a country in Southeastern Europe * someone or something related to the Serbs, a South Slavic people * Serbian language * Serbian names See also * * * Old Serbian (disambiguation ...
. The alphabet was initially devised by Croatian linguist
Ljudevit Gaj Ljudevit Gaj (; born Ludwig Gay; hu, Gáj Lajos; 8 August 1809 – 20 April 1872) was a Croatian linguist, politician, journalist and writer. He was one of the central figures of the pan-Slavist Illyrian movement. Biography Origin He was bor ...
in 1835 during the Illyrian movement in ethnically Croatian parts of
Austrian Empire The Austrian Empire (german: link=no, Kaiserthum Oesterreich, modern spelling , ) was a Central- Eastern European multinational great power from 1804 to 1867, created by proclamation out of the realms of the Habsburgs. During its existence, ...
. It was largely based on
Jan Hus Jan Hus (; ; 1370 – 6 July 1415), sometimes anglicized as John Hus or John Huss, and referred to in historical texts as ''Iohannes Hus'' or ''Johannes Huss'', was a Czech theologian and philosopher who became a Church reformer and the insp ...
's
Czech alphabet Czech orthography is a system of rules for proper formal writing (orthography) in Czech. The earliest form of separate Latin script specifically designed to suit Czech was devised by Czech theologian and church reformist Jan Hus, the namesake o ...
and was meant to serve as a unified orthography for three Croat-populated kingdoms within the Austrian Empire at the time, namely
Croatia , image_flag = Flag of Croatia.svg , image_coat = Coat of arms of Croatia.svg , anthem = " Lijepa naša domovino"("Our Beautiful Homeland") , image_map = , map_caption = , capi ...
,
Dalmatia Dalmatia (; hr, Dalmacija ; it, Dalmazia; see names in other languages) is one of the four historical regions of Croatia, alongside Croatia proper, Slavonia, and Istria. Dalmatia is a narrow belt of the east shore of the Adriatic Sea, str ...
and
Slavonia Slavonia (; hr, Slavonija) is, with Dalmatia, Croatia proper, and Istria, one of the four historical regions of Croatia. Taking up the east of the country, it roughly corresponds with five Croatian counties: Brod-Posavina, Osijek-Bar ...
, and their three dialect groups,
Kajkavian Kajkavian (Kajkavian noun: ''kajkavščina''; Shtokavian adjective: ''kajkavski'' , noun: ''kajkavica'' or ''kajkavština'' ) is a South Slavic regiolect or language spoken primarily by Croats in much of Central Croatia, Gorski Kotar and no ...
,
Chakavian Chakavian or Čakavian (, , , sh-Latn, čakavski proper name: or own name: ''čokovski, čakavski, čekavski'') is a South Slavic regiolect or language spoken primarily by Croats along the Adriatic coast, in the historical regions of Dalmat ...
and Shtokavian, which historically utilized different spelling rules. A slightly modified version of it was later adopted as the formal Latin writing system for the unified Serbo-Croatian standard language per the Vienna Literary Agreement. It served as one of the official scripts in the unified South Slavic state of
Yugoslavia Yugoslavia (; sh-Latn-Cyrl, separator=" / ", Jugoslavija, Југославија ; sl, Jugoslavija ; mk, Југославија ;; rup, Iugoslavia; hu, Jugoszlávia; rue, label= Pannonian Rusyn, Югославия, translit=Juhoslavij ...
alongside Vuk's Cyrillic alphabet. A slightly reduced version is used as the alphabet for Slovene, and a slightly expanded version is used as the alphabet for modern standard Montenegrin. A modified version is used for the
romanization Romanization or romanisation, in linguistics, is the conversion of text from a different writing system to the Roman (Latin) script, or a system for doing so. Methods of romanization include transliteration, for representing written text, a ...
of
Macedonian Macedonian most often refers to someone or something from or related to Macedonia. Macedonian(s) may specifically refer to: People Modern * Macedonians (ethnic group), a nation and a South Slavic ethnic group primarily associated with North Ma ...
. It further influenced alphabets of Romani languages that are spoken in Southeast Europe, namely Vlax and
Balkan Romani Balkan Romani, Balkaniko Romanes, or Balkan Gypsy is a specific non- Vlax dialect of the Romani language, spoken by groups within the Balkans, which include countries such as Albania, Bosnia-Herzegovina, Bulgaria, Greece, Kosovo, North Macedonia, ...
.


Letters

The alphabet consists of thirty upper and
lower case Letter case is the distinction between the letters that are in larger uppercase or capitals (or more formally ''majuscule'') and smaller lowercase (or more formally ''minuscule'') in the written representation of certain languages. The writing ...
letters: Gaj's original alphabet contained the digraph , which Serbian linguist Đuro Daničić later replaced with the letter . The letters do not have names, and consonants are normally pronounced as such when spelling is necessary (or followed by a short
schwa In linguistics, specifically phonetics and phonology, schwa (, rarely or ; sometimes spelled shwa) is a vowel sound denoted by the IPA symbol , placed in the central position of the vowel chart. In English and some other languages, it rep ...
, e.g. ). When clarity is needed, they are pronounced similar to the German alphabet: ''a, be, ce, če, će, de, dže, đe, e, ef, ge, ha, i, je, ka, el, elj, em, en, enj, o, pe, er, es, eš, te, u, ve, ze, že''. These rules for pronunciation of individual letters are common as far as the 22 letters that match the
ISO basic Latin alphabet The ISO basic Latin alphabet is an international standard (beginning with ISO/IEC 646) for a Latin-script alphabet that consists of two sets ( uppercase and lowercase) of 26 letters, codified in various national and international standards and ...
are concerned. The use of others is mostly limited to the context of linguistics, while in mathematics, is commonly pronounced ''jot'', as in the German of Germany. The missing four letters are pronounced as follows: as ''ku'' or ''kju'', as ''dublve'', ''duplo v'' or ''duplo ve'', as ''iks'', as ''ipsilon''. Letters , , and represent the sounds , , and , but often are transcribed as , , and .


Digraphs

Digraphs , and are considered to be single letters: * In dictionaries, ''njegov'' comes after ''novine'', in a separate section after the end of the section; ''bolje'' comes after ''bolnica''; ''nadžak'' (digraph ) comes after ''nadživjeti'' (prefix ''nad-''), and so forth. *If only the initial letter of a word is capitalized, only the first of the two component letters is capitalized: ''Njemačka'' ('
Germany Germany,, officially the Federal Republic of Germany, is a country in Central Europe. It is the second most populous country in Europe after Russia, and the most populous member state of the European Union. Germany is situated betwee ...
'), not ''NJemačka''. In
Unicode Unicode, formally The Unicode Standard,The formal version reference is is an information technology standard for the consistent encoding, representation, and handling of text expressed in most of the world's writing systems. The standard, ...
, the form is referred to as ''
titlecase Title case or headline case is a style of capitalization used for rendering the titles of published works or works of art in English. When using title case, all words are capitalized, except for minor words (typically articles, short prepositions ...
'', as opposed to the uppercase form , representing one of the few cases in which titlecase and uppercase differ. Uppercase is used only if the entire word was capitalized: ''NJEMAČKA''. *In vertical writing (such as on signs), , , are written horizontally, as a unit. For instance, if ''mjenjačnica'' ('
bureau de change A bureau de change (plural bureaux de change, both ) (British English) or currency exchange (American English) is a business where people can exchange one currency for another. Nomenclature Although originally French, the term "bureau de chang ...
') is written vertically, appears on the fourth line (but note and appear separately on the first and second lines, respectively, because contains two letters, not one). In
crossword A crossword is a word puzzle that usually takes the form of a square or a rectangular grid of white- and black-shaded squares. The goal is to fill the white squares with letters, forming words or phrases, by solving clues which lead to th ...
puzzles, , , each occupy a single square. *If words are written with a space between each letter (such as on signs), each digraph is written as a unit. For instance: ''M J E NJ A Č N I C A''.


Origins

The Serbo-Croatian Latin alphabet was mostly designed by
Ljudevit Gaj Ljudevit Gaj (; born Ludwig Gay; hu, Gáj Lajos; 8 August 1809 – 20 April 1872) was a Croatian linguist, politician, journalist and writer. He was one of the central figures of the pan-Slavist Illyrian movement. Biography Origin He was bor ...
, who modelled it after Czech (č, ž, š) and Polish (ć), and invented , and , according to similar solutions in Hungarian (ly, ny and dzs, although dž combinations exist also in Czech and Polish). In 1830 in
Buda Buda (; german: Ofen, sh-Latn-Cyrl, separator=" / ", Budim, Будим, Czech and sk, Budín, tr, Budin) was the historic capital of the Kingdom of Hungary and since 1873 has been the western part of the Hungarian capital Budapest, on the ...
, he published the book ''Kratka osnova horvatsko-slavenskog pravopisanja'' ("Brief basics of the Croatian-Slavonic orthography"), which was the first common Croatian
orthography An orthography is a set of conventions for writing a language, including norms of spelling, hyphenation, capitalization, word breaks, emphasis, and punctuation. Most transnational languages in the modern period have a writing system, and ...
book. It was not the first ever Croatian orthography work, as it was preceded by works of Rajmund Đamanjić (1639), Ignjat Đurđević and Pavao Ritter Vitezović. Croats had previously used the Latin script, but some of the specific sounds were not uniformly represented. Versions of the
Hungarian alphabet The Hungarian alphabet () is an extension of the Latin alphabet used for writing the Hungarian language. The alphabet is based on the Latin alphabet, with several added variations of letters. The alphabet consists of the 26 letters of the ISO b ...
were most commonly used, but others were too, in an often confused, inconsistent fashion. Gaj followed the example of Pavao Ritter Vitezović and the Czech orthography, making one letter of the Latin script for each sound in the language. Following Vuk Karadžić's reform of Cyrillic in the early nineteenth century, in the 1830s Ljudevit Gaj did the same for ''latinica'', using the Czech system and producing a one-to-one grapheme-phoneme correlation between the Cyrillic and Latin orthographies, resulting in a parallel system. Đuro Daničić suggested in his ''Rječnik hrvatskoga ili srpskoga jezika'' ("Dictionary of Croatian or Serbian language") published in 1880 that Gaj's digraphs , , and should be replaced by single letters : , , and respectively. The original Gaj alphabet was eventually revised, but only the digraph has been replaced with Daničić's , while , and have been kept.


Correspondence Serbian Cyrillic-Latin alphabets

The following table provides the upper and lower case forms of the Serbian Cyrillic alphabet, along with the equivalent forms in the Serbian Latin alphabet and the
International Phonetic Alphabet The International Phonetic Alphabet (IPA) is an alphabetic system of phonetic notation based primarily on the Latin script. It was devised by the International Phonetic Association in the late 19th century as a standardized representation ...
(IPA) value for each letter. The letters do not have names, and consonants are normally pronounced as such when spelling is necessary (or followed by a short schwa, e.g. /fə/).:


Computing

In the 1990s, there was a general confusion about the proper
character encoding Character encoding is the process of assigning numbers to graphical characters, especially the written characters of human language, allowing them to be stored, transmitted, and transformed using digital computers. The numerical values tha ...
to use to write text in Latin Croatian on computers. *An attempt was made to apply the 7-bit " YUSCII", later "CROSCII", which included the five letters with diacritics at the expense of five non-letter characters ( , @), but it was ultimately unsuccessful. Because the ASCII character @ sorts before A, this led to jokes calling it ''žabeceda'' (''žaba''=frog, ''abeceda''=alphabet). *Other short-lived vendor-specific efforts were also undertaken. *The
8-bit In computer architecture, 8-bit integers or other data units are those that are 8 bits wide (1 octet). Also, 8-bit central processing unit (CPU) and arithmetic logic unit (ALU) architectures are those that are based on registers or data buses ...
ISO 8859-2 (Latin-2) standard was developed by ISO. *
MS-DOS MS-DOS ( ; acronym for Microsoft Disk Operating System, also known as Microsoft DOS) is an operating system for x86-based personal computers mostly developed by Microsoft. Collectively, MS-DOS, its rebranding as IBM PC DOS, and a few o ...
introduced 8-bit encoding CP852 for Central European languages, disregarding the ISO standard. *
Microsoft Windows Windows is a group of several proprietary graphical operating system families developed and marketed by Microsoft. Each family caters to a certain sector of the computing industry. For example, Windows NT for consumers, Windows Server for ...
spread yet another 8-bit encoding called CP1250, which had a few letters mapped one-to-one with ISO 8859-2, but also had some mapped elsewhere. *
Apple An apple is an edible fruit produced by an apple tree (''Malus domestica''). Apple trees are cultivated worldwide and are the most widely grown species in the genus '' Malus''. The tree originated in Central Asia, where its wild ancest ...
's
Macintosh Central European encoding Mac OS Central European is a character encoding used on Apple Macintosh computers to represent texts in Central European and Southeastern European languages that use the Latin script. This encoding is also known as Code Page 10029. IBM assigns c ...
does not include the entire Gaj's Latin alphabet. Instead, a separate codepage, called
MacCroatian encoding Mac OS Croatian is a character encoding used on Apple Macintosh computers to represent Gaj's Latin alphabet. It is a derivative of Mac OS Roman. The three digraphs, Dž, Lj, and Nj, are not encoded. IBM uses code page 1284 (CCSID 1284) for Mac ...
, is used. *
EBCDIC Extended Binary Coded Decimal Interchange Code (EBCDIC; ) is an eight- bit character encoding used mainly on IBM mainframe and IBM midrange computer operating systems. It descended from the code used with punched cards and the corresponding ...
also has a Latin-2 encoding. The preferred
character encoding Character encoding is the process of assigning numbers to graphical characters, especially the written characters of human language, allowing them to be stored, transmitted, and transformed using digital computers. The numerical values tha ...
for Croatian today is either the ISO 8859-2, or the
Unicode Unicode, formally The Unicode Standard,The formal version reference is is an information technology standard for the consistent encoding, representation, and handling of text expressed in most of the world's writing systems. The standard, ...
encoding
UTF-8 UTF-8 is a variable-length character encoding used for electronic communication. Defined by the Unicode Standard, the name is derived from ''Unicode'' (or ''Universal Coded Character Set'') ''Transformation Format 8-bit''. UTF-8 is capable of e ...
(with two bytes or 16 bits necessary to use the letters with diacritics). However, , one can still find programs as well as databases that use CP1250, CP852 or even CROSCII. Digraphs , and in their upper case, title case and lower case forms have dedicated Unicode code points as shown in the table below, However, these are included chiefly for backwards compatibility with legacy encodings which kept a one-to-one correspondence with Cyrillic; modern texts use a sequence of characters.


Usage for Slovene

Since the early 1840s, Gaj's alphabet was increasingly used for Slovene. In the beginning, it was most commonly used by Slovene authors who treated Slovene as a variant of Serbo-Croatian (such as Stanko Vraz), but it was later accepted by a large spectrum of Slovene-writing authors. The breakthrough came in 1845, when the Slovene conservative leader Janez Bleiweis started using Gaj's script in his journal ''
Kmetijske in rokodelske novice {{Unreferenced, date=July 2015 ''Kmetijske in rokodelske novice'' ( en, Agricultural and Artisan News), frequently referred to simply as ''Novice'' (''News''), was a Slovene-language newspaper in the 19th century, which had an influential role i ...
'' ("Agricultural and Artisan News"), which was read by a wide public in the countryside. By 1850, Gaj's alphabet (known as ''gajica'' in Slovene) became the only official
Slovene alphabet The Slovene alphabet ( sl, slovenska abeceda, or ''slovenska gajica'' ) is an extension of the Latin script used to write Slovene. The standard language uses a Latin alphabet which is a slight modification of the Croatian Gaj's Latin alphabet ...
, replacing three other writing systems that had circulated in the
Slovene Lands The Slovene lands or Slovenian lands ( sl, Slovenske dežele or in short ) is the historical denomination for the territories in Central and Southern Europe where people primarily spoke Slovene. The Slovene lands were part of the Illyrian provin ...
since the 1830s: the traditional '' bohoričica'', named after
Adam Bohorič Adam Bohorič () ( – after 20 November 1598) was a Slovene Protestant preacher, teacher and author of the first grammar of Slovene. Bohorič was born in the market town of Reichenburg in the Duchy of Styria (now Brestanica in Slov ...
, who codified it; the '' dajnčica'', named after
Peter Dajnko Peter Dajnko (23 April 1787 – 22 February 1873) was a Slovene priest, author, and linguist, known primarily as the inventor of the Dajnko alphabet ( sl, dajnčica), an innovative proposal for the Slovene alphabet. Dajnko was also a proficien ...
; and the '' metelčica'', named after
Franc Serafin Metelko Franc Serafin Metelko, also known as Fran Metelko (14 July 1789 – 27 December 1860) was a Slovene Roman Catholic priest, author, and philologist, best known for his proposal of a new script for the Slovene called the Metelko alphabet, which ...
. The Slovene version of Gaj's alphabet differs from the Serbo-Croatian one in several ways: *The Slovene alphabet does not have the characters and ; the sounds they represent do not occur in Slovene. *In Slovene, the digraphs and are treated as two separate letters and represent separate sounds (the word
polje A polje, also karst polje or karst field, is a large flat plain found in karstic geological regions of the world, with areas usually . The name derives from the Slavic languages and literally means 'field', whereas in English ''polje'' specif ...
is pronounced or in Slovene, as opposed to in Serbo-Croatian). *While the phoneme exists in modern Slovene and is written , it is used in only borrowed words and so and are considered separate letters, not a digraph. As in Serbo-Croatian, Slovene orthography does not make use of diacritics to mark accent in words in regular writing, but
headword In morphology and lexicography, a lemma (plural ''lemmas'' or ''lemmata'') is the canonical form, dictionary form, or citation form of a set of word forms. In English, for example, ''break'', ''breaks'', ''broke'', ''broken'' and ''breaking'' ...
s in dictionaries are given with them to account for homographs. For instance, letter can be pronounced in four ways (, , and ), and letter in two ( and , though the difference is not phonemic). Also, it does not reflect consonant voicing assimilation: compare e.g. Slovene and Serbo-Croatian ('junkyard', 'waste').


Usage for Macedonian

Romanization Romanization or romanisation, in linguistics, is the conversion of text from a different writing system to the Roman (Latin) script, or a system for doing so. Methods of romanization include transliteration, for representing written text, a ...
of
Macedonian Macedonian most often refers to someone or something from or related to Macedonia. Macedonian(s) may specifically refer to: People Modern * Macedonians (ethnic group), a nation and a South Slavic ethnic group primarily associated with North Ma ...
is done according to Gaj's Latin alphabetMacedonian Latin alphabet, Pravopis na makedonskiot literaturen jazik, B. Vidoeski, T. Dimitrovski, K. Koneski, K. Tošev, R. Ugrinova Skalovska - Prosvetno delo Skopje, 1970, p.99 but is slightly modified. Gaj's ''ć'' and ''đ'' are not used at all, with ''ḱ'' and ''ǵ'' introduced instead. The rest of the letters of the alphabet are used to represent the equivalent Cyrillic letters. Also, Macedonian uses the letter ''dz'', which is not part of the Serbo-Croatian phonemic inventory. However, the backs of record sleeves published in the former
Yugoslavia Yugoslavia (; sh-Latn-Cyrl, separator=" / ", Jugoslavija, Југославија ; sl, Jugoslavija ; mk, Југославија ;; rup, Iugoslavia; hu, Jugoszlávia; rue, label= Pannonian Rusyn, Югославия, translit=Juhoslavij ...
, by non-Macedonian publishers, (such as Mizar's debut album) used ''ć'' and ''đ'', like other places.


Keyboard layout

The standard Gaj's Latin alphabet
keyboard layout A keyboard layout is any specific physical, visual or functional arrangement of the keys, legends, or key-meaning associations (respectively) of a computer keyboard, mobile phone, or other computer-controlled typographic keyboard. is the actua ...
for personal computers is as follows: ::


See also

* Glagolitic alphabet * Montenegrin alphabet *
Serbian Cyrillic alphabet The Serbian Cyrillic alphabet ( sr, / , ) is a variation of the Cyrillic script used to write the Serbian language, updated in 1818 by Serbian linguist Vuk Karadžić. It is one of the two alphabets used to write standard modern Serbian, th ...
*
Serbo-Croatian Serbo-Croatian () – also called Serbo-Croat (), Serbo-Croat-Bosnian (SCB), Bosnian-Croatian-Serbian (BCS), and Bosnian-Croatian-Montenegrin-Serbian (BCMS) – is a South Slavic language and the primary language of Serbia, Croatia, Bosnia an ...
*
Slovene alphabet The Slovene alphabet ( sl, slovenska abeceda, or ''slovenska gajica'' ) is an extension of the Latin script used to write Slovene. The standard language uses a Latin alphabet which is a slight modification of the Croatian Gaj's Latin alphabet ...
* Yugoslav braille *
Yugoslav manual alphabet The Yugoslav manual alphabet is two-handed manual alphabet that is used to spell in Yugoslav Sign Language. Some signs are modeled after letters of Serbian Cyrillic alphabet, others after Gaj's Latin alphabet Gaj's Latin alphabet ( sh-Latn-Cyr ...
*
Romanization of Serbian The romanization of Serbian or latinization of Serbian is the representation of the Serbian language using Latin letters. Serbian is written in two alphabets, Serbian Cyrillic, a variation of the Cyrillic alphabet, and Gaj's Latin, or ''latini ...
– describes usage not the alphabet


Sources

* *


References


External links


Omniglot
{{List of writing systems Latin alphabets
Alphabet An alphabet is a standardized set of basic written graphemes (called letters) that represent the phonemes of certain spoken languages. Not all writing systems represent language in this way; in a syllabary, each character represents a syllab ...
Alphabet An alphabet is a standardized set of basic written graphemes (called letters) that represent the phonemes of certain spoken languages. Not all writing systems represent language in this way; in a syllabary, each character represents a syllab ...
Serbo-Croatian language Slovene alphabet Writing systems introduced in the 19th century Bosnian language 1835 introductions