Lojban grammar
   HOME

TheInfoList



OR:

The grammar of
Lojban Lojban (pronounced ) is a logical, constructed, human language created by the Logical Language Group which aims to be syntactically unambigious. It succeeds the Loglan project. The Logical Language Group (LLG) began developing Lojban in 1987. ...
is based on
predicate logic First-order logic—also known as predicate logic, quantificational logic, and first-order predicate calculus—is a collection of formal systems used in mathematics, philosophy, linguistics, and computer science. First-order logic uses quantifie ...
. The majority of the grammar is borrowed from the prior "logical language"
Loglan Loglan is a logical constructed language originally designed for linguistic research, particularly for investigating the Sapir–Whorf hypothesis. The language was developed beginning in 1955 by Dr.  James Cooke Brown with the goal of making ...
, and some of its features come from Láadan. The characteristic regularity, unambiguity, and versatility of Lojban grammar owes much to modern
linguistics Linguistics is the science, scientific study of human language. It is called a scientific study because it entails a comprehensive, systematic, objective, and precise analysis of all aspects of language, particularly its nature and structure ...
and
computer programming Computer programming is the process of performing a particular computation (or more generally, accomplishing a specific computing result), usually by designing and building an executable computer program. Programming involves tasks such as anal ...
—resources that were unavailable to the designers of earlier languages. Lojbanist Bob LeChevalier summarized one advantage of Lojban grammar as follows: "Lojban moves beyond the restrictions of European grammar. It overtly incorporates
linguistic universal A linguistic universal is a pattern that occurs systematically across natural languages, potentially true for all of them. For example, ''All languages have nouns and verbs'', or ''If a language is spoken, it has consonants and vowels.'' Research ...
s, building in what is needed to support the expressivity of the whole variety of natural languages, including non-European ones."


Formal grammars

Lojban texts can be
parsed Parsing, syntax analysis, or syntactic analysis is the process of analyzing a String (computer science), string of Symbol (formal), symbols, either in natural language, computer languages or data structures, conforming to the rules of a formal gra ...
just as texts in programming languages are by using formal grammars such as PEG,
YACC Yacc (Yet Another Compiler-Compiler) is a computer program for the Unix operating system developed by Stephen C. Johnson. It is a Look Ahead Left-to-Right Rightmost Derivation (LALR) parser generator, generating a LALR parser (the part of a co ...
, Backus–Naur form. There are several parsers available.


Phonology

Lojban has 6
vowel A vowel is a syllabic speech sound pronounced without any stricture in the vocal tract. Vowels are one of the two principal classes of speech sounds, the other being the consonant. Vowels vary in quality, in loudness and also in quantity (leng ...
s and 21
consonant In articulatory phonetics, a consonant is a speech sound that is articulated with complete or partial closure of the vocal tract. Examples are and pronounced with the lips; and pronounced with the front of the tongue; and pronounced wi ...
s. The
phoneme In phonology and linguistics, a phoneme () is a unit of sound that can distinguish one word from another in a particular language. For example, in most dialects of English, with the notable exception of the West Midlands and the north-wes ...
s are commensurate with
grapheme In linguistics, a grapheme is the smallest functional unit of a writing system. The word ''grapheme'' is derived and the suffix ''-eme'' by analogy with ''phoneme'' and other names of emic units. The study of graphemes is called '' graphemi ...
s, which means Lojban has 27 letters () each corresponding to a unique phoneme. Lojbanic graphemes can vary in mode; this article employs the
Latin alphabet The Latin alphabet or Roman alphabet is the collection of letters originally used by the ancient Romans to write the Latin language. Largely unaltered with the exception of extensions (such as diacritics), it used to write English and th ...
version, which is currently in the most common usage (see Orthography for more detail). The phonemes, on the other hand, are defined solely according to the
International Phonetic Alphabet The International Phonetic Alphabet (IPA) is an alphabetic system of phonetic notation based primarily on the Latin script. It was devised by the International Phonetic Association in the late 19th century as a standardized representation ...
. The tables below show typical realizations of sounds and the Latin alphabets in Lojban. In all cases except the rhotic consonant the first phoneme represents the preferred pronunciation, while the rest are the permitted variants intended to cover dissimilitude in pronunciation by speakers of different linguistic backgrounds.


Basic sounds

Note that simple vowel graphemes are never diphthongized like in English. Diphthongs are always written with two vowel graphemes.


Diphthongs

Lojban has 16 diphthongs, vowels that change quality during their emission but always being single syllable nuclei like pure vowels. Unlike English and similarly to languages such as Spanish, diphthongs are not distinct phonemes by themselves but are analyzed as a combination of "semi-vowel + vowel" (or the inverse order). The combinations , , and , for instance, are all realized as the corresponding
falling diphthong A diphthong ( ; , ), also known as a gliding vowel, is a combination of two adjacent vowel sounds within the same syllable. Technically, a diphthong is a vowel with two different targets: that is, the tongue (and/or other parts of the speech ...
s.
Triphthong In phonetics, a triphthong (, ) (from Greek τρίφθογγος, "triphthongos", literally "with three sounds," or "with three tones") is a monosyllabic vowel combination involving a quick but smooth movement of the articulator from one vowel q ...
s exist as combinations of a rising and a falling diphthong, e.g. .


Allophones

The
vowel A vowel is a syllabic speech sound pronounced without any stricture in the vocal tract. Vowels are one of the two principal classes of speech sounds, the other being the consonant. Vowels vary in quality, in loudness and also in quantity (leng ...
s can be either rounded or unrounded. The consonants can be either aspirated or unaspirated, although the voiceless stops , and are usually aspirated to some degree. In general, consonants are not palatalized. The affricates / d͡ʒ/, /
t͡ʃ The voiceless palato-alveolar sibilant affricate or voiceless domed postalveolar sibilant affricate is a type of consonantal sound used in some spoken languages. The sound is transcribed in the International Phonetic Alphabet with , (formerly ...
/, / d͡z/, and / t͡s/ occur in Lojban (like in English ''jar'', ''chair'', ''fads'', and ''cats''.), but they are not distinct phonemes, contrary to English / d͡ʒ/ and /
t͡ʃ The voiceless palato-alveolar sibilant affricate or voiceless domed postalveolar sibilant affricate is a type of consonantal sound used in some spoken languages. The sound is transcribed in the International Phonetic Alphabet with , (formerly ...
/. Each is considered to be a combination of an appropriate stop and fricative
phoneme In phonology and linguistics, a phoneme () is a unit of sound that can distinguish one word from another in a particular language. For example, in most dialects of English, with the notable exception of the West Midlands and the north-wes ...
: the sequences , , and , respectively. The rhotic sounds are all equally acceptable as an identical phoneme. , , , and may be syllabic.


Buffering of consonant clusters

For those who, given their native language background, may have trouble pronouncing (certain)
consonant cluster In linguistics, a consonant cluster, consonant sequence or consonant compound, is a group of consonants which have no intervening vowel. In English, for example, the groups and are consonant clusters in the word ''splits''. In the education fie ...
s, there is the option of inserting buffer vowels between them, as long as they differ sufficiently from the phonological vowels and are pronounced as short as possible. This mechanism can be described as a form of anaptyxis (vocalic epenthesis). Possible choices include , , and (but not , which is the rounded counterpart of and thus a valid realization of ). The resulting added syllables are completely ignored by the grammar, including for the purposes of stress determination.


Orthography

Lojban may be written in different orthography systems as long as it meets the required regularities and unambiguities. Some of the reasons for such elasticity would be as follows: # Lojban is rather defined by the phonemes (spoken form of words), therefore, as long as they are correctly rendered so as to maintain the Lojbanic audio-visual
isomorphism In mathematics, an isomorphism is a structure-preserving mapping between two structures of the same type that can be reversed by an inverse mapping. Two mathematical structures are isomorphic if an isomorphism exists between them. The word i ...
, a representational system can be said to be an appropriate orthography of the language; # Lojban is meant to be as culturally neutral as possible, so it is never crucial or fundamental to claim that some particular orthography of some particular languages (e.g. the Latin alphabet) should be the dominant mode. Lojbanist Kena extends this principle to argue that even an original orthography of the language is to be sought.


Latin/Roman mode

Lojban's Latin alphabet consists of 23 lerfu ''a b c d e f g i j k l m n o p r s t u v x y z'' plus 3 semi-lerfu , . ''. They are intentionally ordered in accordance with that of ASCII characters. Capitalization may be applied to mark a non-standard
stressed syllable In linguistics, and particularly phonology, stress or accent is the relative emphasis or prominence given to a certain syllable in a word or to a certain word in a phrase or sentence. That emphasis is typically caused by such properties as i ...
as in cmevla, but they are not considered separate lerfu. Whether a single vowel or the entire syllable is capitalized is a matter of preference; for example, the name "Josephine" can be rendered as either or (without the capitalization, the ordinary rules of Lojban stress will cause the 'ze' syllable to be stressed instead). Punctuation marks are not mandatory; such notions as question or exclamation are expressed with words rather than unpronounced symbols.


Cyrillic mode

This mode was conceived when the introductory Lojban brochure was translated into Russian. 23 lerfu ''а б в г д е ж з и к л м н о п р с т у ф х ш ъ'' plus 3 semi-lerfu , . '' are used. The hard sign ''ъ'' is assigned to the open-mid vowel (as in
Bulgarian Bulgarian may refer to: * Something of, from, or related to the country of Bulgaria * Bulgarians, a South Slavic ethnic group * Bulgarian language, a Slavic language * Bulgarian alphabet * A citizen of Bulgaria, see Demographics of Bulgaria * Bul ...
). Diphthongs are written as vowel pairs, as in the Roman mode.


Tengwar mode

Kena argues for writing Lojban using
Tengwar The Tengwar script is an artificial script, one of several scripts created by J. R. R. Tolkien, the author of ''The Lord of the Rings''. Within the fictional context of Middle-earth, the Tengwar were invented by the Elf Fëanor, and use ...
, arguing that: # the Latin alphabet is too strongly related to Western civilizations, and thus probably introduces some kind of cultural bias in Lojban. Lojban wants to be both logical and culturally neutral, the Tengwar already are; # the Tengwar system inherently contains some main Lojban morphology rules, making Lojban easier to learn when it is written with Tengwar. Advocates of this mode include Eric S. Raymond.


Zbalermorna mode

Another option for cultural neutrality (free from fictional cultures even) is the ''zbalermorna'' alphabet.


Japanese mode

A Japanese hiragana version of Lojban orthography has been proposed, in which case more than 80 lerfu may be used. This mode is not without certain technical issues since the hiragana (and katakana too) are always
syllabic Syllabic may refer to: *Syllable, a unit of speech sound, considered the building block of words **Syllabic consonant, a consonant that forms the nucleus of a syllable *Syllabary, writing system using symbols for syllables *Abugida, writing system ...
, indicating an open syllable (with the exception of the final "n" sound), requiring some special attention when representing the Lojbanic consonant clusters. Several example
haiku is a type of short form poetry originally from Japan. Traditional Japanese haiku consist of three phrases that contain a ''kireji'', or "cutting word", 17 '' on'' (phonetic units similar to syllables) in a 5, 7, 5 pattern, and a ''kigo'', or s ...
compositions in the orthography have been created.


Morphology

Lojban has 3 word-classes: ''brivla'' (verbs or predicate words), ''cmavo'' (structure words), and ''cmevla'' (name words). Each of them has uniquely identifying morphological properties, so that one can unambiguously recognize which word is of which part of speech in a string of the language. They may be further divided in sub-classes (discussed respectively below). There also exist affixes (''rafsi'') assigned to some of the brivla and cmavo.


brivla – verbs

''brivla'' carry the content (semantic information) of an expression. The bare brivla corresponds to the
verb A verb () is a word ( part of speech) that in syntax generally conveys an action (''bring'', ''read'', ''walk'', ''run'', ''learn''), an occurrence (''happen'', ''become''), or a state of being (''be'', ''exist'', ''stand''). In the usual descr ...
in English. Additional words can be used to transform it into what corresponds to English
common noun A proper noun is a noun that identifies a single entity and is used to refer to that entity (''Africa'', ''Jupiter'', ''Sarah'', ''Microsoft)'' as distinguished from a common noun, which is a noun that refers to a class of entities (''continent, ...
s,
adjective In linguistics, an adjective (abbreviated ) is a word that generally modifies a noun or noun phrase or describes its referent. Its semantic role is to change information given by the noun. Traditionally, adjectives were considered one of the ma ...
s, or
adverb An adverb is a word or an expression that generally modifies a verb, adjective, another adverb, determiner, clause, preposition, or sentence. Adverbs typically express manner, place, time, frequency, degree, level of certainty, etc., answering ...
s (although some modal cmavo too may have adverbial purposes). Brivla may be identified by the following morphological properties: *Have more than one syllable *Are penultimately stressed *Have a consonant cluster (at least two adjacent consonants) in the first 5 lerfu including the second consonant *Start with a consonant (except some fu'ivla) *End with a vowel A word such as will still be considered as a brivla because the special gluing vowel ''y'' between ''b'' and ''p'' is to be ignored and therefore a consonant cluster (''b-p'') assumes its existence within it. Unlike verbs in English, brivla do not inflect for tense, person, or number. Brivla's sub-classes are as follows, with some examples.


gismu – core verbs

The brivla which constitute the lexical core of the language are called ''gismu''. They are invariably five-letter words, which distinguishes it from the other types of brivla, and are in a form of either CVCCV or CCVCV (C stands for a consonant and V for a vowel). Being two syllables means that the general rule of gismu to be stressed penultimately will always cause the first syllable to be stressed.
:''viska'' (CVCCV) :''prami'' (CCVCV)
They have been chosen or added as root words because they * represent concepts that are very familiar and basic, * represent concepts the usage of which is equally frequent among different languages, * would be helpful in constructing more complex words, or * represent fundamental grammatical concepts of Lojban. The main source languages from which they were drawn are
Arabic Arabic (, ' ; , ' or ) is a Semitic language spoken primarily across the Arab world.Semitic languages: an international handbook / edited by Stefan Weninger; in collaboration with Geoffrey Khan, Michael P. Streck, Janet C. E.Watson; Walter ...
,
Chinese Chinese can refer to: * Something related to China * Chinese people, people of Chinese nationality, citizenship, and/or ethnicity **''Zhonghua minzu'', the supra-ethnic concept of the Chinese nation ** List of ethnic groups in China, people of ...
,
English English usually refers to: * English language * English people English may also refer to: Peoples, culture, and language * ''English'', an adjective for something of, from, or related to England ** English national ide ...
,
Hindi Hindi ( Devanāgarī: or , ), or more precisely Modern Standard Hindi (Devanagari: ), is an Indo-Aryan language spoken chiefly in the Hindi Belt region encompassing parts of northern, central, eastern, and western India. Hindi has been ...
,
Russian Russian(s) refers to anything related to Russia, including: *Russians (, ''russkiye''), an ethnic group of the East Slavic peoples, primarily living in Russia and neighboring countries *Rossiyane (), Russian language term for all citizens and peo ...
, and
Spanish Spanish might refer to: * Items from or related to Spain: **Spaniards are a nation and ethnic group indigenous to Spain **Spanish language, spoken in Spain and many Latin American countries **Spanish cuisine Other places * Spanish, Ontario, Can ...
. Here is further explanation of the nature of gismu by Cowan: : According to Robin Turner, the creation was algorithmically done by computer. Approximately 1350 gismu exist, which is a relatively small number when compared to that of English words ranging from 450,000 up to 1,000,000. Theoretically, by learning only these root words, as well as their fragmental forms and some major structure words (cmavo), one will be able to communicate effectively in Lojban.


lujvo – verbs made of affixes

The form of brivla made of affixes (''rafsi'') is called ''lujvo''.
: – pregnant ( (an affix from , offspring) + (an affix from , containing)) : – tomorrow ( (from , future) + (from , adjacent) + (from , a day in duration)) : – object of pride ( (an affix from the particle ) + (an affix from , pride))


fu'ivla – borrowed verbs, loan words

''fu'ivla'' usually refers to things that are culture-specific (including Lojban culture) or to kinds of plants or animals, concepts which cannot be easily expressed as mere modifying-modified combinations of Lojban's internal root verbs (''gismu''). fu'ivla can be subdivided into four types according to the extent to which they are modified, namely Stage 1, 2, 3, and 4 fu'ivla.


=Stage 1 fu'ivla

= The longest form, quoting a foreign word/phrase while preserving its original spelling with particular structure words.
:
( indicates that a non-lojbanic text follows. are delimiters of that foreign text. And turns the whole sequence into a selbri so that the word/phrase can form a bridi with its given place structure. In this example, "x1 is a quantity of spaghetti" is a possible place structure.)
*A "hybrid" stage sometimes is enlisted. In this case, it would take the above sentence but Lojbanize "spaghetti" to be phonetic to a (native) Lojban without changing the ending. Therefore, stage "1.5" fu'ivla for "''spaghetti''" is ""


=Stage 2 fu'ivla

= This stage involves lojbanizing the sound and spelling of the word. :
( is still needed since cannot by itself work as a brivla.)


=Stage 3 fu'ivla

= At this stage a borrowed word is fully turned into a single brivla, having its own place structure. Since no brivla may have more than one meaning, it is often the case that they are attached by a rafsi (with a hyphen like "-r-", "-n-", or "-l-") categorizing or limiting the semantic scope of the word (such are called "rafsi classifier"). Again they always start with a consonant and end with a vowel.
: (using longer rafsi: cidj + r + "spaghetti", using , food) : (using shorter rafsi: dja + r + "spaghetti", using a rafsi of ) : ( (music) + "techno") : ( (artifact) + "origami")


=Stage 4 fu'ivla

= These are the borrowings which are so common or so important that have become as short as possible, having no rafsi classifier. Unlike other brivla, they may begin with a vowel (preceded by a pause mark separating it from the previous word). Also the word must not be of a form that one can remove all the initial vowels (and apostrophes) and have a valid word.
: ("Basque" from "") : ("buckthorn" from a species name) : ("wombat") : ("basil" from Spanish)


lujvo with components made of fu'ivla

It is possible to absorb a fu'ivla into a lujvo, with principles varying among Lojbanists. Notable proponents are Pierre Abbat and Jorge Llambías. Here are some comparisons of their methods drawn from the Lojban mailing list (as of July 2007):
: ( henna, from
Gujarati Gujarati may refer to: * something of, from, or related to Gujarat, a state of India * Gujarati people, the major ethnic group of Gujarat * Gujarati language, the Indo-Aryan language spoken by them * Gujarati languages, the Western Indo-Aryan sub ...
) + (colour)
(Abbat)
(Llambías) : (
teff ''Eragrostis tef'', also known as teff, Williams lovegrass or annual bunch grass, is an annual grass, a species of lovegrass native to the Horn of Africa, notably to both Eritrea and Ethiopia. It is cultivated for its edible seeds, also known as ...
) + ''nanba'' (bread)
(Abbat)
(Llambías) : (one millionth,
micro- ''Micro'' (Greek letter μ ( U+03BC) or the legacy symbol µ (U+00B5)) is a unit prefix in the metric system denoting a factor of 10−6 (one millionth). Confirmed in 1960, the prefix comes from the Greek ('), meaning "small". The symbol for ...
) + (
henry Henry may refer to: People *Henry (given name) * Henry (surname) * Henry Lau, Canadian singer and musician who performs under the mononym Henry Royalty * Portuguese royalty ** King-Cardinal Henry, King of Portugal ** Henry, Count of Portugal, ...
)
(Abbat)
(Llambías)


cmavo – particles, structure words

Lojban particles, ''cmavo'', are recognized by following morphological properties: *may be a single syllable *never contain a consonant cluster of any type, whether or not y is counted *end in a vowel *need not be penultimately stressed, though they often are if they have more than one syllable And they display one of the following letter patterns: V, VV, V'V, CV, CVV, CV'V. The form generally does not indicate anything about its grammatical function. cmavo can be sequenced without spaces and without any change to its meaning:
:''pa re ci'' (123) = ''pareci'' (123) :''se pi'o'' (using ...) = ''sepi'o'' (using ...)
As far as the stress rules of Lojban are concerned, such compound cmavo are still separate words, so penultimate stress (e.g. ''paREci'') is not obligatory. Some cmavo have rafsi, which may help converting tanru (sequence of brivla) into lujvo:
:''ve detri'' --> ''veldetri'' ("calendar", from fourth slot of ''detri'', date) :''se ke cpacu djica'' --> ''selkemcpadji'' (roughly "what someone wants to get", from ''cpacu'', get, and ''djica'', desire)


nouns


cmevla – name words

cmevla are mostly used for names of things (including people) in descriptions or in direct address (cf.
proper nouns A proper noun is a noun that identifies a single entity and is used to refer to that entity (''Africa'', ''Jupiter'', ''Sarah'', ''Microsoft)'' as distinguished from a common noun, which is a noun that refers to a class of entities (''continent, ...
). They can be in any form as long as they end in a consonant. The practice by which names in natural languages are modified to be used in Lojban is known as "lojbanization". :''la .bionses.nolz.'', (a possible realization of the name "Beyoncé Knowles")


rafsi – affixes (suffixes, infixes, prefixes)

A special building block, from which a new lujvo may be created, is called ''rafsi''. Rafsi are bound to certain gismu, cmavo or fu'ivla. This also means that lujvo have no rafsi form of their own. Rafsi cannot by themselves function as an individual words; they only form lujvo when at least two are joined together.
:''solri'' (original gismu for "sun"): ''sol'', ''solr'', ''solri'' (assigned rafsi): ''solxrula'' (derivative lujvo, "sunflower") :''ke'' (original cmavo): ''kem'' (assigned rafsi): ''selkemcpadji'' (derivative lujvo, "what someone wants to get") :''sam'', ''pli'' (component rafsi, from ''skami'' (computer) and ''pilno'' (use) respectively): ''sampli'' (derivative lujvo, "computer user")
The unambiguity of Lojban morphology, according to John Woldemar Cowan, gives rise to "significant clues to the meaning and the origin of the word, even if you have never heard the word before". He further says: "The same principle allows you, when speaking or writing, to invent new brivla for new concepts 'on the fly'; yet it offers people that you are trying to communicate with a good chance to figure out your meaning. In this way, Lojban has a flexible vocabulary which can be expanded indefinitely."


Syntax and semantics

According to ''What Is Lojban?'', the language's grammatical structures are "defined by a set of rules that have been tested to be unambiguous using computers", which is called the "machine grammar". Hence the characteristics of the standard syntactic (not semantic) constructs in Lojban: * each word has exactly one grammatical interpretation; * the words relate grammatically to each other in exactly one way. Such standards, however, are to be attained with certain carefulness: The computer-tested, unambiguous rules also include grammar for 'incomplete' sentences e.g. for narrative, quotational, or mathematical phrases. Lojbanic expressions are modular; smaller constructs of words are assembled into larger phrases so that all incorporating pieces manifest as a possible grammatical unity. This mechanism allows for simplistic yet infinitely powerful phrasings; "a more complex phrase can be placed inside a simple structure, which in turn can be used in another instance of the complex phrase structure".


bridi – clauses, predications

Being derived from predicate logic, the basic unit of Lojban expression is predication, a claim that some objects stand in some relationship, or that some single object has some property. In Lojban they are expressed using clauses (''bridi''). Just as a predication is formed by a
predicate Predicate or predication may refer to: * Predicate (grammar), in linguistics * Predication (philosophy) * several closely related uses in mathematics and formal logic: **Predicate (mathematical logic) **Propositional function **Finitary relation, o ...
and
arguments An argument is a statement or group of statements called premises intended to determine the degree of truth or acceptability of another statement called conclusion. Arguments can be studied from three main perspectives: the logical, the dialectic ...
in formal logic, bridi are formed by ''selbri'' and ''sumti'' in Lojban. In linguistic terms ''selbri'' is the predicate of a clause, and ''sumti'' are its arguments. A construct of selbri and sumti produces a claim that something stands in a specified relationship to something else or has a specified property.
:''do , viska , mi''
(Two sumti and one selbri, making up one bridi, claiming that a relation ''viska'' (x1 sees x2) exists between ''do'' (you) and ''mi'' (me). The selbri needs not be literally between sumti. The example can also be rendered as ''do mi viska''. A more detailed discussion on Lojban word order below.)
Multiple bridi can be either sequenced across multiple sentences or compounded in one sentence:
:''do , melbi , .i , do , xendo''
You are beautiful. You are kind.
(Two sentences, each consisting of one sumti and one selbri. ''.i'' separates sentences.) :''do , melbi , .ije , do , xendo''
You are beautiful and you are kind.
(This sentence is syntactically identical to the last one but differs in meaning. ''.ije'' may be spelled as ''.i je''.) :''do , melbi , gi'e , xendo''
You are beautiful and kind.
(One sentence, consisting of one sumti and two selbri. ''gi'e'' separates bridi as well as compounding them.) :''do , melbi , gi'e , xendo , .iki'ubo , mi , nelci , do''
You are beautiful and kind. Because of that, I am fond of you.
(Two sentences, one of which includes compound bridi. While ''.i'' simply marks a division of sentences, ''ki'u'' together with ''bo'' adds that there is a particular logical connection between the first and second sentence. ''.iki'ubo'' may be spelled as ''.i ki'u bo''.)
A compound bridi can include multiple tenses and sumti:
:''mi , puze'u gunka , gi'e , ca tatpi''
''I , for a long interval some time in the past, worked , and , now, am tired''
I worked (for a long time) and I'm now tired. :''mi , ca'o klama , ta , ti , fu le karce , gi'e , ba tavla , do , la .lojban.''
I , continuing/during, come , (to) there , (from) here , using the car , and , after, talk , (to) you
I'll keep driving the car from here to there, and I will talk to you after.
The implicit grammatical divisions can be made explicit by separator words such as ''cu'' and ''vau'', which are often elidable but sometimes need to be present to avoid ambiguity:
:''le nixli cu melbi''
(This instance shows that the left-hand gismu is sumti ("the girl") and the right-hand gismu is selbri ("is beautiful"). Without ''cu'' the two gismu would be grammatically undistinguishable.) :''mi dunda le cukta gi'e lebna lo rupnu vau do''
(''vau'' indicates that the two bridi, ''dunda le cukta'' ("give the book (to)") and ''lebna lo rupnu'' ("take the dollars (from)"), sharing the same first sumti ''mi'', together terminate at that position, enabling them to have the subsequent ''do'' as their mutual second sumti. Compare it with its longer equivalent: ''mi dunda le cukta do .ije mi lebna lo rupnu do'' ("I give the book to you and take the money from you").)
The places of ''cu'' and ''vau'' in the previous examples can be rendered as follows:
:''do (cu) viska le nixli (vau)'' :''do (cu) melbi (vau) .i do (cu) xendo (vau)'' :''do (cu) melbi (vau) gi'e xendo (vau) (vau)''
(The last ''vau'' marks the mutual termination of the two bridi.) :''do (cu) melbi (vau) gi'e xendo (vau) (vau) .i do (cu) xendo (vau)''
The ordered sets of sumti assigned to every selbri are known as "place structures". They are explicitly defined in dictionaries or word lists. :''mi , tavla , do , la .lojban. , le glibau''
(Two sumti ''mi'' and ''do'' are fitting into the place structure of the selbri ''tavla'', which is "x1 talks/speaks to x2 about subject x3 in language x4".)
Some lujvo formations usually operate on the place structure in predictable ways. The rafsi ''gau'', for instance, inserts one place for the agent and pushes all others down one. Thus brivla can have indefinitely many places. This contrasts with the
accusative alignment The accusative case (abbreviated ) of a noun is the grammatical case used to mark the direct object of a transitive verb. In the English language, the only words that occur in the accusative case are pronouns: 'me,' 'him,' 'her,' 'us,' and ‘the ...
or ergative alignment that most languages have, in which there is a small number of named places (subject, direct object, indirect object) and all others are expressed by prepositions. The
typology Typology is the study of types or the systematic classification of the types of something according to their common characteristics. Typology is the act of finding, counting and classification facts with the help of eyes, other senses and logic. Ty ...
of Lojban is basically subject–verb–object, with subject–object–verb also common. However, it can practically be anything. All the following sentences mean "I love you", differing only in word order:
:''mi , prami , do'' (SVO) :''mi , do , prami'' (SOV) :''do , se prami , mi'' (OVS) :''do , mi , se prami'' (OSV) :''prami , fa mi , do'' (VSO) :''prami , fe do , fa mi'' (VOS)
Such flexibility has to do with the language's intended capability to translate as many expressions of natural languages as possible, based on a unique positional case system. The meaning of the sentence ''mi prami do'' is determined by ''prami'' realizing, with its own predefined place structure, a specific semantic relation between ''mi'' and ''do''; when the positional relation between ''mi'' and ''do'' changes, the meaning of the sentence changes too. As shown above, Lojban has particular devices to preserve such semantic structure of words while altering their order. Compare the followings:
:''mi , tavla , do , la .lojban. , le glibau'' ( 1 , selbri , 2 , 3 , 4 )
"x1 (''mi'', I) talks/speaks to x2 (''do'', you) about subject x3 (''la lojban.'', Lojban) in language x4 (''le glibau'', English)" :''do , se tavla , mi , fo le glibau , fi la .lojban.'' ( 2 , selbri , 1 , 4 , 3 )
"x2 (''do'') is talked/spoken to by x1 (''mi'') in language x4 (''le glibau'') about subject x3 (''la lojban.'')"
''se'' converts the x1 and x2 sumti place. ''fo'' tags the x4 place, and ''fi'' the x3. Such conversion and tagging is often used to emphasize particular sumti by bringing it forward. Here are some collations of natural languages and Lojban:
:''Labhraíonn Mícheál Gaeilge le Cáit'' (VSO –
Irish Irish may refer to: Common meanings * Someone or something of, from, or related to: ** Ireland, an island situated off the north-western coast of continental Europe ***Éire, Irish language name for the isle ** Northern Ireland, a constituent unit ...
)
speaks , Mícheál , Irish , with Cáit
''tavla fa la .mixal. fo la sicko'o fe la .kat.''
speaks , Mícheál , in Irish , to/with Cáit :''Mamaky boky ny mpianatra'' (VOS – Malagasy)
reads , book , the student
''tcidu lo cukta fa le tadni''
reads , a book , the student :''Âi ba, wa mo.'' (OSV –
Xavante The Xavante (also Shavante, Chavante, Akuen, A'uwe, Akwe, Awen, or Akwen) are an indigenous people, comprising 15,315 individuals within the territory of eastern Mato Grosso state in Brazil. They speak the Xavante language, part of the Jê lang ...
)
to the-river , I , go
''fe le rirxe fa mi klama''
to the river , I , go :''Ihtébani o'ílaci yawi-pó=ra'' (OVS – Guarijio)
Esteban house-at , dance- assive uture eportative
''ti'e bu'u le zdani be la .esteban. ba nu dansu''
hear!at the house of Esteban , utureevent-of dance :僕がこれを作ったんだよ。 (SOV –
Japanese Japanese may refer to: * Something from or related to Japan, an island country in East Asia * Japanese language, spoken mainly in Japan * Japanese people, the ethnic group that identifies with Japan through ancestry or culture ** Japanese diaspor ...
)
I , this , made- ssertive-callingbr />''mi ti pu zbasu vau je'uju'i''
I , this , astmake ridi-terminator ruth-attention
It is important to note that Lojban selbri is not a real equivalent of verb in natural languages. A selbri can be either a verb, a noun, an adjective, or an adverb. Its function is determined syntactically, not morphologically. An analogy to natural language word orders by using such terms as "subject", "verb", and "object" cannot accurately describe the nature of Lojban bridi.


sumti – arguments of predicates

There are five kinds of simple sumti: #descriptions, which usually begin with a descriptor such as ''le'' or more commonly now ''lo''; #pro-sumti, the Lojban analogue of pronouns, such as ''mi''; #names, which usually begin with ''la'', such as ''la .lojban.''; #quotations, which begin with ''lu'', ''lo'u'', ''zo'', ''zoi''; #pure numbers, which usually begin with ''li''.


description

Basic descriptions in Lojban consist of two units, LE/LA descriptors and a selbri: : Although is quite close in meaning to English "the", it has particularly unique implications. In this example, creates an argument which might occur in the x1 place of the belonging selbri , namely a "market". also specifies that the speaker 1) has one or more specific markets in mind (whether or not the listener knows which ones they are) and 2) is merely describing the things he/she has in mind as markets, without being committed to the truth of that description. Whereas English-speakers must differentiate between "the market" and "the markets", Lojban-speakers are not required to make such a choice (this rule does not mean that Lojban has no way of specifying the number of markets in such a case): :
The market is big. / The markets are big.
Since the construct + ''selbri'' merely describes something or other which the speaker chooses to represent based on his/her observation, such an expression as follows is possible: :
one-or-more-specific-things-which-I-describe as "men" are women
While is specific, is not: :
one-or-more-of-all-the-things-which-really are-markets is/are-big
A market is big. / Some markets are big.
refers generally to one or more markets, without being specific about which. Unlike , must refer to something which actually is a market (that is, which can appear in the x1 place of a truthful bridi whose selbri is ). is false as there are no objects in the real world which are both dead and alive. dissociates the subsequent selbri from its normal meaning, usually making a name (this usage should not be confused with the other usage before regular Lojbanized names). Like descriptions, descriptions are implicitly restricted to those the speaker has in mind: :
the-one-named "bear" astcreates the story.
Bear wrote the story.
All descriptions implicitly terminate with , which can almost always be omitted with no danger of ambiguity. The main exceptions are a) when relative clauses are involved and b) when a description immediately precedes the selbri (in which case using an explicit before the selbri makes the unnecessary). Other usages of include making a compound negator () and terminating place-structure/tense/modal tags (, ).


selbri – main verbs of clauses, logical predicates

The selbri is the
logical predicate In logic, a predicate is a symbol which represents a property or a relation. For instance, in the first order formula P(a), the symbol P is a predicate which applies to the individual constant a. Similarly, in the formula R(a,b), R is a predicat ...
of a bridi. This is not to be confused with the meaning of predicate in terms of the English Language, but as a ''logical'' predicate. Whereas a predicate in English contains everything that the subject is doing, a logical predicate is simply the relation between all involved parties. In this context, the selbri is roughly the equivalent of a
verb A verb () is a word ( part of speech) that in syntax generally conveys an action (''bring'', ''read'', ''walk'', ''run'', ''learn''), an occurrence (''happen'', ''become''), or a state of being (''be'', ''exist'', ''stand''). In the usual descr ...
in English. For instance:
:''mi nelci le gerku'' :I like the dog. / I like the dogs.
The gismu ''nelci'' is being used as the selbri in this bridi. It is describing the relationship between the sumtis ''mi'' (I) and ''le gerku'' (the dog). The relationship is that of a liker and that which is liked. The roles in the relationship are determined by the sumti placements inherent in the word being used as the selbri. The cmavo ''se/te/ve/xe'' are used to swap the first sumti placement of the selbri with the second, third, fourth, and fifth sumti placement, respectively. This functionality allows for the flexibility in bridi. For instance, the gismu ''klama'' has the sumti of: * x1: One which goes * x2: The destination of a goer * x3: The source of a goer * x4: The route taken by a goer * x5: The vehicle used by a goer Thus:
:''ti klama ta'' ::x1 = ti ::x2 = ta :This goes to that. :''ti se klama ta'' ::x2 = ti ::x1 = ta :This is the destination of that. :''ti te klama ta'' ::x3 = ti ::x2 = ta :This is the source of something that goes to that. : ''ti ve klama ta'' ::x4 = ti ::x2 = ta :This is the route of something that goes to that. : ''ti xe klama ta'' ::x5 = ti ::x2 = ta :This is the vehicle of something that goes to that.
Selbri can also be tanru, where the sumti placements are determined by the last brivla that is part of the tanru. For instance:
:''mi gleki klama ta'' :I am a happy-goer that is going to that. :''mi klama gleki ta'' :I am a going-happy-thing that is happy about that.


tanru – compound verbs

A group of two or more selbri put one after another is called ''tanru'' and acts as a single selbri (while the selbri it is composed of are called tanru units (''selbrisle'')). They correspond to English compound verbs like ''drip fry'' or ''drip dry''. Tanru can be converted to sumti as usual selbri. Tanru may be used so as to more specifically conceptualize the intended meaning. In ''lo skami pilno'' ("computer user(s)"), there is a tanru ''skami pilno'' where the modifying brivla ''skami'' narrows the sense of the modified brivla ''pilno'' to form a more specific concept (in which case the modifier may resemble English adverbs or adjectives). Without ''skami'', ''lo pilno'' will just mean "user". Other examples:
:''ti mutce xajmi'' ("This is very funny.") :''do melbi se kanla'' ("You have beautiful eye(s).") :''.ue.oi le mabla bebna cu zvati ti'' ("Oh my gosh the damn idiot is here.")
Arguments of selbri representing tanru units may be filled as well, in which case they are attached using the particle ''be'' (starting from x2 place of the tanru unit):
:''ti melbi be mi ractu'' ("This is a pretty to me rabbit.")


cmavo

The ''cmavo'' consist of various particles and structure words. These include descriptors, connectives, attitudinals, prepositions, and tense words.


Descriptors

There are five descriptors (''gadri'' in Lojban): ''lo'', ''le'', ''la'', ''li'', and ''me'o''; of which the first three inflect to show individual, mass, or set (though as far as the formal grammar is concerned, the inflected forms are separate words, not inflected forms). The individual/mass distinction is similar to the distinction between mass nouns and
count noun In linguistics, a count noun (also countable noun) is a noun that can be modified by a quantity and that occurs in both singular and plural forms, and that can co-occur with quantificational determiners like ''every'', ''each'', ''several'', ...
s, but things that are normally counted can be considered as a mass. The set articles consider the mathematical set of the referents.
:''lo'i jurme bene'i mi cu bramau lo'i mi mivysle'' ("The set of bacteria inside me is bigger than the set of my cells." With ''loi'' this would be false, as the bacteria, though more in number, have less mass.) :''lo mi kerfa cu jdari .iku'i loi mi kerfa cu ranti'' ("My hairs are hard, but my hair is soft.")
The number and mathematical expression articles are used when talking about numbers and numerals or letters as themselves.
:''bi jgena'' ("eight knots") :''lo me li bi jgena'' ("an eight knot", whatever that is; perhaps it has eight loops) :''lo me me'o bi jgena'' ("a figure-eight knot")
Note that ''gadri'' are sometimes called ''articles'', which doesn't correspond to the usual notion of
article Article often refers to: * Article (grammar), a grammatical element used to indicate definiteness or indefiniteness * Article (publishing), a piece of nonfictional prose that is an independent part of a publication Article may also refer to: G ...
s in linguistics.


Connectives

As befits a logical language, there is a large assortment of
conjunction Conjunction may refer to: * Conjunction (grammar), a part of speech * Logical conjunction, a mathematical operator ** Conjunction introduction, a rule of inference of propositional logic * Conjunction (astronomy), in which two astronomical bodies ...
s. There exist 16 possible different binary
truth function In logic, a truth function is a function that accepts truth values as input and produces a unique truth value as output. In other words: The input and output of a truth function are all truth values; a truth function will always output exactly o ...
s, the four fundamental ones of which are assigned four vowels in Lojban. These vowels are a component sound from which actual logical-connective cmavo are built up. :
:''la .djekl. .a la .xaid. zvati ti''
Jekyll and/or Hyde is/are here. :''la .djekl. .e la .xaid. zvati ti''
Jekyll and Hyde is here. :''la .djekl. .o la .xaid. zvati ti''
Jekyll if-and-only-if Hyde is here. :''la .djekl. .u la .xaid. zvati ti''
Jekyll whether-or-not Hyde is here.
With the four vowels, the ability to negate either sentence (with ''na'' or ''nai''), and the ability to exchange the sentences, as if their order had been reversed (with ''se''), Lojban can create all of the 16 possible truth functions except TTTT and FFFF. : ''.onai'' is preferred over ''na.o''. Note that in ''se.unai'', the negation occurs before "swapping" the positions of A and B. While ''se'' can be used for the other connectives, it is unnecessary since ''or'', ''and'', and ''if and only if'' have the same relationship if their inputs are swapped (i.e. they are
commutative In mathematics, a binary operation is commutative if changing the order of the operands does not change the result. It is a fundamental property of many binary operations, and many mathematical proofs depend on it. Most familiar as the name of ...
). :
:''la .djekl. na.a la .xaid. zvati ti''
Jekyll only-if Hyde is here. / Jekyll is here only if Hyde is here :''la .djekl. .enai la .xaid. zvati ti''
Jekyll and-not Hyde is here. / Jekyll, but not Hyde, is here. :''la .djekl. .onai la .xaid. zvati ti''
Jekyll either/or Hyde is here. / Either Jekyll or Hyde is here, but not both. :''la .djekl. se.u la .xaid. zvati ti''
Regardless of Jekyll, Hyde is here. / Whether or not Jekyll is here, Hyde is.
In order to remain unambiguous, each place in the grammar of the language where logical connection is permitted has its appropriate set of connectives. If the connective suitable for sumti were used to connect selbri, ambiguity would result. Connections between components other than sumti can be expressed as follows (note that their functions are in accordance with the assigned vowels):
:''la .djekl. tavla .ija la .xaid. tavla'' (between sentences)
Jekyll speaks. And/or Hyde speaks. :''la .djekl. mikce la xaid. gi'e nanmu'' (between bridi)
Jekyll is a doctor of Hyde and is a man. :''la .djekl. sipna je cadzu'' (between gismu)
Jekyll sleeps-and-walks.
Negatives and ''se'' can be used with these connectives with similar forms.
:''la .djekl. tavla .ijanai la .xaid. tavla'' (between sentences)
Jekyll speaks. If Hyde speaks. :''la .djekl. mikce nagi'e nanmu'' (between bridi)
Jekyll is a doctor but not a man. :''la .djekl. sipna seju cadzu'' (between gismu)
Regardless as to whether Jekyll sleeps, he walks.
Connections can also be given using "forethought" forms
:''ge la .djekl. gi la .xaid. zvati ti''
Jekyll and Hyde are here. :''la .djekl. gu'e sipna gi cadzu''
Jekyll sleeps-and-walks.
Negatives are created by adding ''nai'' to the connective before the negated construct.
:''genai la .djekl. gi la .xaid. zvati ti''
Not Jekyll but Hyde is here. :''ge la .djekl. ginai la .xaid. zvati ti''
Jekyll but not Hyde is here.
Connections can be questioned:
:''la .djekl. ji la .xaid. tavla''
Jekyll ''what?Hyde speaks.
Does Jekyll or Hyde speak? :''la .djekl. sipna je'i cadzu''
Jekyll sleeps ''what?walks.
Does Jekyll sleep or walk?
Besides the logical connectives, there are several non-logical connectives. These do not change form depending on what they are connecting: :
:''lo lanme ufa'u lo guzme cu danlu fa'u spati''
Sheep and melons are animals and plants, respectively. :''la .treid. ku'a la .traian. midju la .carlyt.''
Trade intersect Tryon is the center of Charlotte. :''lo rukygu'e cu xazdo joi ropno''
Russia is Asian together with European.
The ''ku'' is required by the
LALR parser In computer science, an LALR parser or Look-Ahead LR parser is a simplified version of a canonical LR parser, to parse a text according to a set of production rules specified by a formal grammar for a computer language. ("LR" means left-to-rig ...
, but not by the PEG parser, which however is not official yet.


Attitudinals

Attitudinals are a set of cmavo which allow the speakers to express their ''emotional state'' or ''source of knowledge'', or the ''present stage of discourse''. In natural languages, attitudes are expressed using
interjection An interjection is a word or expression that occurs as an utterance on its own and expresses a spontaneous feeling or reaction. It is a diverse category, encompassing many different parts of speech, such as exclamations ''(ouch!'', ''wow!''), curse ...
s, but also by the tone of voice when speaking, and (very imperfectly) by
punctuation Punctuation (or sometimes interpunction) is the use of spacing, conventional signs (called punctuation marks), and certain typographical devices as aids to the understanding and correct reading of written text, whether read silently or aloud. An ...
when writing; in Lojban, such information are extensively expressible in words. And the meanings are to be understood separately from the main predicate.
:''.iu'' (love) :''.ui'' (happy)
They may be "scaled" by suffixes:
:''.uinai'' (happy-not = unhappy) :''.uicai'' (happy-intense = very very happy) :''.uicu'i'' (happy-neutral)
Combination is possible, and highly productive as well as creative:
:''.uinaicai'' (happy-not-intense) :''.iu.uinai'' (love-happy-not = I am unhappily in love)
''Evidentials'', derived from those of
American Indian languages Over a thousand indigenous languages are spoken by the Indigenous peoples of the Americas. These languages cannot all be demonstrated to be related to each other and are classified into a hundred or so language families (including a large numb ...
and the constructed language Láadan, show how the speaker came to say the utterance, i.e. the source of the information or the idea:
:''ti'e la .uengas cu zergau''
''I hear!Wenga is-a-crime-doer.
I hear that Wenga is a crook. :''ba'acu'i le tuple be mi cu se cortu''
''I experience!The leg of me is-the-locus-of-pain.
My leg hurts. :''pe'i la .kartagos. .ei se daspo''
''I opine!Carthage bligationis-destroyed.
In my opinion, Carthage should be destroyed.


Prepositions

There are two kinds of prepositions (''sumtcita'', which refers to adpositions in general) in Lojban: tense markers and proper prepositions. The syntactic difference is that a proper preposition can be converted with ''se'', whereas a tense marker cannot. All proper prepositions (except the vague one ''do'e'') are formed from a brivla and mark their object semantically as being in a place of that brivla. Thus the following are equivalent:
:''mi pilno lo me'andi lo nu skagau lei kerfa''
I use henna to color the hair. :''mi skagau lei kerfa sepi'o lo me'andi''
I color the hair with henna.
Prepositions (including tense markers) can also be placed in ''.i ... bo'' to make sentence conjunctions. With most prepositions this makes no sense, but ''ki'u'', ''ja'e'', ''mu'i'' and ''ni'i'' are often used this way to express various kinds of "because" and "therefore": :''la .djan. cpacu le pamoi se jinga .iki'ubo ri jinga''
John got the first prize because he won.


Tense words

Lojban has 63 unique tense words to express various aspects of both ''space'' and ''time'' as well as ''event'': such a system is unusual among other languages, in that it deals with spatial and temporal aspects in the same term. Simply put, these tense words are used to show where and when the event is taking place, relative in space-time to when the sentence is spoken. They can be roughly subdivided as follows: The "converters" require further explanation: marks that the movement is towards (or away from, etc.) a point rather than at a point, so that , meaning "(occurring) in the north", becomes , "(heading) northward"; marks the number of times an event takes place, e.g. (once), (always), (never), indicated an ordinal, e.g. , first time; marks an interval in space rather than in time, so that, for example, is the "beginning point" of something physically extending in a given direction; is used to mark tense words that have been moved in a sentence, comparable to and for sumti; marks a "sticky tense", as explained below. Marking tenses is always optional in Lojban:
: (default: no temporal tense)
I went/have-gone/go/am-going/will-go/continually-go-to the-market. :
I have-gone-to the-market. :
I am-about-going-to the-market.
Where the tense information is not specified, the context resolves the interpretation. Tense words are usually put right before the selbri: : ( is the implicit separator between the first sumti and the selbri .) They may be placed elsewhere with the additional terminator :
: : : ( is elidable at the end of the bridi)
The terminator is used so that the tense word do not directly run into the following sumti and modify it. Compare the next sentences:
:
t some point in the futurethe festival will start. :
After the festival, omething unspecifiedwill start.
Tenses can be "layered up": :
I astgo-to the house. The house astastbe-destroyed.
I went to the house. The house had been destroyed.
Tenses can be "sticky" by being set with , continuing in effect over more than a single bridi, until it is unset: :
I ast et this tenseangry-kind-of become. The girl crying-kind-of start. I et this tenseregret.
(Earlier) I got angry. The girl started crying. (Now) I regret.
The second resets the tense to the implicit default time from the speaker's point of view, which is "now" (this means that may be used as a tense word by itself). Using , equivalents of the previous layering tenses can be produced: : The second is to be counted from the tense set by the last , so in effect it is equivalent to . Similar to the tense words are a collection of words to show potentiality: , "actually is"; , "is innately capable of"; , "can but has not"; , "can and has". These words operate in the same way as tense words.
:
The child is (actually, currently) singing. :
The child is naturally capable of singing.
The child can sing. :
The child has the undemonstrated potential for singing.
The child can sing, but hasn't sung yet. :
The child has the demonstrated potential for singing.
The child can sing, and has shown that it can.


Grouping

Semantic unifications and combinations are performed by word groupings, each include a starting and a , often elidable) terminating cmavo. Here basic example sentences with them are ordered by their terminating cmavo:


References


Bibliography

*


External links


The Lojban Reference Grammar with ErrataExample Tengwar mode
{{DEFAULTSORT:Lojban Grammar Lojban Grammars of constructed languages