HOME

TheInfoList



OR:

The
English language English is a West Germanic language that developed in early medieval England and has since become a English as a lingua franca, global lingua franca. The namesake of the language is the Angles (tribe), Angles, one of the Germanic peoples th ...
is sometimes described as the ''
lingua franca A lingua franca (; ; for plurals see ), also known as a bridge language, common language, trade language, auxiliary language, link language or language of wider communication (LWC), is a Natural language, language systematically used to make co ...
'' of computing. In comparison to other sciences, where
Latin Latin ( or ) is a classical language belonging to the Italic languages, Italic branch of the Indo-European languages. Latin was originally spoken by the Latins (Italic tribe), Latins in Latium (now known as Lazio), the lower Tiber area aroun ...
and Greek predominate,
computer science Computer science is the study of computation, information, and automation. Computer science spans Theoretical computer science, theoretical disciplines (such as algorithms, theory of computation, and information theory) to Applied science, ...
utilizes English as its primary conduit. The prevalence of English in computing is a consequence of both historical and technical factors. The
United States The United States of America (USA), also known as the United States (U.S.) or America, is a country primarily located in North America. It is a federal republic of 50 U.S. state, states and a federal capital district, Washington, D.C. The 48 ...
and the
United Kingdom The United Kingdom of Great Britain and Northern Ireland, commonly known as the United Kingdom (UK) or Britain, is a country in Northwestern Europe, off the coast of European mainland, the continental mainland. It comprises England, Scotlan ...
— both countries with English-speaking majorities — had prominent roles in the development and popularization of computer systems, computer networks,
software Software consists of computer programs that instruct the Execution (computing), execution of a computer. Software also includes design documents and specifications. The history of software is closely tied to the development of digital comput ...
and
information technology Information technology (IT) is a set of related fields within information and communications technology (ICT), that encompass computer systems, software, programming languages, data processing, data and information processing, and storage. Inf ...
. Moreover, the technical limitations of early computers, combined with the lack of international standardization on the
Internet The Internet (or internet) is the Global network, global system of interconnected computer networks that uses the Internet protocol suite (TCP/IP) to communicate between networks and devices. It is a internetworking, network of networks ...
, meant that English continued to dominate computing until innovations in internet infrastructure and increases in computer speed changed these earlier conditions. The domination of English, however, should not overshadow the technological innovations and additions to computer science that utilize other languages. Also, many widely used English-based technologies were not produced in the United States or the United Kingdom. For example,
Ruby Ruby is a pinkish-red-to-blood-red-colored gemstone, a variety of the mineral corundum ( aluminium oxide). Ruby is one of the most popular traditional jewelry gems and is very durable. Other varieties of gem-quality corundum are called sapph ...
was made in Japan. While English is still influential to various other languages with regard to computers and computing technologies, now most software products are localized in numerous languages and the invention of
Unicode Unicode or ''The Unicode Standard'' or TUS is a character encoding standard maintained by the Unicode Consortium designed to support the use of text in all of the world's writing systems that can be digitized. Version 16.0 defines 154,998 Char ...
character encoding has resolved problems with non-Latin alphabets.


History

Computer science has an ultimately mathematical foundation which was laid by non-English speaking cultures. The first mathematically literate societies in the Ancient Near East recorded methods for solving mathematical problems in steps. The word 'algorithm' comes from the name of a famous medieval Arabic mathematician who contributed to the spread of Hindu-Arabic numerals, al-Khwārizmī. The first systematic treatment of binary numbers was completed by
Leibniz Gottfried Wilhelm Leibniz (or Leibnitz; – 14 November 1716) was a German polymath active as a mathematician, philosopher, scientist and diplomat who is credited, alongside Sir Isaac Newton, with the creation of calculus in addition to many ...
, a German mathematician. Leibniz wrote his treatise on the topic in French, the lingua franca of science at the time, and innovations in what is now called
computer hardware Computer hardware includes the physical parts of a computer, such as the central processing unit (CPU), random-access memory (RAM), motherboard, computer data storage, graphics card, sound card, and computer case. It includes external devices ...
occurred outside of an English tradition, with Pascal inventing the first mechanical calculator, and Leibniz improving it. Interest in building computing machines first emerged in the 19th century, with the coming of the
Second Industrial Revolution The Second Industrial Revolution, also known as the Technological Revolution, was a phase of rapid Discovery (observation), scientific discovery, standardisation, mass production and industrialisation from the late 19th century into the early ...
. The origins of computing in an English tradition began in this era with
Charles Babbage Charles Babbage (; 26 December 1791 – 18 October 1871) was an English polymath. A mathematician, philosopher, inventor and mechanical engineer, Babbage originated the concept of a digital programmable computer. Babbage is considered ...
's conceptualization of the Difference and Analytical Engine,
George Boole George Boole ( ; 2 November 1815 – 8 December 1864) was a largely self-taught English mathematician, philosopher and logician, most of whose short career was spent as the first professor of mathematics at Queen's College, Cork in Ireland. H ...
's work on logic, and
Herman Hollerith Herman Hollerith (February 29, 1860 – November 17, 1929) was a German-American statistician, inventor, and businessman who developed an electromechanical tabulating machine for punched cards to assist in summarizing information and, later, in ...
's invention of the tabulating machine for specific use in the
1890 United States census The 1890 United States census was taken beginning June 2, 1890. The census determined the resident population of the United States to be 62,979,766, an increase of 25.5 percent over the 50,189,209 persons enumerated during the 1880 United States ...
. At the time, Britain enjoyed near complete hegemonic power in the West at the height of the ''
Pax Britannica ''Pax Britannica'' (Latin for , modelled after '' Pax Romana'') refers to the relative peace between the great powers in the time period roughly bounded by the Napoleonic Wars and World War I. During this time, the British Empire became the ...
'', and America was experiencing an economic and demographic boom. By the time of the interwar period in the early 20th century, the most important mathematics related to the development of computing was being done in English, which was also beginning to become the new lingua franca of science. In recent years, some scholars have begun to more critically study the dominance of the English language in computing. For instance, the historian Thomas Mullaney studies how Chinese computing responded to the problem of the QWERTY keyboard, which by nature of its limited key selection was not initially very accessible for Chinese computer users due to the difference between the English and Chinese languages. In his book The Chinese Computer: A Global History of the Information Age, he examines the innovative strategies that Chinese computer users have used to overcome this challenge, among others. Meanwhile, the scholar Mark Warschauer has studied internet usage across different global contexts, writing about how an over-abundance of digital information is available in English, but a significantly less amount of information is available in other languages, something that poses a problem for social inclusion. Despite English being the standard for programming languages, non-Western countries developed many widely used programming languages. Python, one of the most popular programming languages, was created in the Netherlands by Guido van Rossum in 1991.  Additionally, in Japan, Ruby, a programming language mainly used for web development, was created by Yukihiro "Matz" Matsumoto in the mid-1990s. English was considered more appropriate for the international development community. Based on French keywords, Langage Symbolique d'Enseignement – Symbolic Language for Teaching (LSE) sprouted out of France for beginner programmers, and its failure is attributed to that lack of French knowledge globally. As a result, PASCAL was developed in English while germinating in Switzerland.


Influence on other languages

The computing terminology of many languages borrows from English. Some language communities actively resist this trend, and in other cases English is used extensively and more directly. This section gives some examples of the use of English loans in other languages and mentions any notable differences.


Bulgarian

Both English and Russian have had influence over Bulgarian computing vocabulary. In many cases, however, the borrowed word is translated into Bulgarian rather than transcribed phonetically from English. Combined with the use of
Cyrillic The Cyrillic script ( ) is a writing system used for various languages across Eurasia. It is the designated national script in various Slavic, Turkic, Mongolic, Uralic, Caucasian and Iranic-speaking countries in Southeastern Europe, Ea ...
this can make it difficult to recognize loanwords. For example, the Bulgarian term for
motherboard A motherboard, also called a mainboard, a system board, a logic board, and informally a mobo (see #Nomenclature, "Nomenclature" section), is the main printed circuit board (PCB) in general-purpose computers and other expandable systems. It ho ...
is (), literally "bottom board". * () – computer * () – hard disk (i.e.,
hard drive A hard disk drive (HDD), hard disk, hard drive, or fixed disk is an electro-mechanical data storage device that stores and retrieves digital data using magnetic storage with one or more rigid rapidly rotating hard disk drive platter, pla ...
) * () –
floppy disk A floppy disk or floppy diskette (casually referred to as a floppy, a diskette, or a disk) is a type of disk storage composed of a thin and flexible disk of a magnetic storage medium in a square or nearly square plastic enclosure lined with a ...
; like the French * () – web site; but also (), literally "internet page"


Faroese

The
Faroese language Faroese ( ; ) is a North Germanic languages, North Germanic language spoken as a first language by about 69,000 Faroe Islanders, of whom 21,000 reside mainly in Denmark and elsewhere. It is one of five languages descended from Old Norse#Old West ...
has a sparse scientific vocabulary ''based on the language itself''. Many Faroese scientific words are borrowed and/or modified versions of especially Nordic and English equivalents. The vocabulary is constantly evolving and thus new words often die out, and only a few survive and become widely used. Examples of successful words include e.g. "" (computer), "" (at sign) and "" (server).


French

In French, there are some generally accepted English loanwords, but there is also a distinct effort to avoid them. In France, the is responsible for the standardisation of the language and often coins new technological terms. Some of them are accepted in practice, but oftentimes the English loans remain predominant. In Quebec, the has a similar function. * / (in Europe); (mainly in French-speaking Canada, but increasingly used in French-speaking Europe); (only used as an abbreviation, similar to ""); more formally * – spam * , phishing – phishing * – to download * – website * , – website hyper-link * – database * , , short – webcam * , , – to boot * , – to reboot * , – to shut down * , – bootable * , , overclocking – overclocking * – watercooling * –
case modding Case modification, commonly referred to as case modding, is the modification of a computer case or a video game console chassis. Modifying a computer case in any non-standard way is considered a case mod. Modding is done, particularly by Compute ...


German

In German, English words are very often used as well: * nouns: , , , , * verbs: , ,


Japanese

Japanese uses the
katakana is a Japanese syllabary, one component of the Japanese writing system along with hiragana, kanji and in some cases the Latin script (known as rōmaji). The word ''katakana'' means "fragmentary kana", as the katakana characters are derived fr ...
alphabet for foreign loanwords, a wide variety of which are in use today. English computing terms remain prevalent in modern Japanese vocabulary. * コンピューター () -
computer A computer is a machine that can be Computer programming, programmed to automatically Execution (computing), carry out sequences of arithmetic or logical operations (''computation''). Modern digital electronic computers can perform generic set ...
* コーダー () - coder * コーデック () -
codec A codec is a computer hardware or software component that encodes or decodes a data stream or signal. ''Codec'' is a portmanteau of coder/decoder. In electronic communications, an endec is a device that acts as both an encoder and a decoder o ...
* ダウンロード () -
download In computer networks, download means to ''receive'' data from a remote system, typically a server such as a web server, an FTP server, an email server, or other similar systems. This contrasts with uploading, where data is ''sent to'' a remote ...
* リンク () - link Utilizing a keyboard layout suitable for romanization of Japanese, a user may type in the
Latin script The Latin script, also known as the Roman script, is a writing system based on the letters of the classical Latin alphabet, derived from a form of the Greek alphabet which was in use in the ancient Greek city of Cumae in Magna Graecia. The Gree ...
in order to display Japanese, inclusive of
hiragana is a Japanese language, Japanese syllabary, part of the Japanese writing system, along with ''katakana'' as well as ''kanji''. It is a phonetic lettering system. The word ''hiragana'' means "common" or "plain" kana (originally also "easy", ...
,
katakana is a Japanese syllabary, one component of the Japanese writing system along with hiragana, kanji and in some cases the Latin script (known as rōmaji). The word ''katakana'' means "fragmentary kana", as the katakana characters are derived fr ...
, and Japanese kanji.
Usually when writing in Japanese on a computer keyboard, the text is input in roman transcription, optionally according to Hepburn, Kunrei, or Nippon romanization; the common Japanese word processing programs allow for all three. Long vowels are input according to how they are written in kana; for example, a long ''o'' is input as ''ou'', instead of an ''o'' with a circumflex or macron (''ô'' or ''ō''). As letters are keyed in, they are automatically converted, as specified, into either hiragana or katakana. And these kana phrases are in turn converted, as desired, into kanji.


Icelandic

The
Icelandic language Icelandic ( ; , ) is a North Germanic languages, North Germanic language from the Indo-European languages, Indo-European language family spoken by about 314,000 people, the vast majority of whom live in Iceland, where it is the national languag ...
has its own vocabulary of scientific terms. Still, English loans exist, and are mostly used in casual conversation, whereas the Icelandic words might be longer or not as widespread.


Norwegian

It's quite common to use English words with regards to computing in all Scandinavian languages. nouns: (referring to e-mail), , (from "blog"), verbs: , ,


Polish

Polish terminology derived from English: * '' dżojstik'':
joystick A joystick, sometimes called a flight stick, is an input device consisting of a stick that pivots on a base and reports its angle or direction to the device it is controlling. Also known as the control column, it is the principal control devic ...
* '' kartrydż, kartridż'': cartridge * '' interfejs'': interface * '' mejl'':
e-mail Electronic mail (usually shortened to email; alternatively hyphenated e-mail) is a method of transmitting and receiving Digital media, digital messages using electronics, electronic devices over a computer network. It was conceived in the ...


Russian

* History of computer hardware in Eastern Bloc countries * Computer Russification


Spanish

The English influence on the software industry and the internet in Latin America has borrowed significantly from the Castilian lexicon. ;Frequently untranslated, and their Spanish equivalent *email: *mouse (only in Latin America): (mainly in Spain) *messenger: (only in Spain) *webcam: , *website: , *blog: , *ban/banned: (Latin America), , *web: , ;Not translated *flog *descargar ;Undecided Many computing terms in Spanish share a common root with their English counterpart. In these cases, both terms are understood, but the Spanish is preferred for formal use: *link vs or *net vs


Character encoding

Early computer software and hardware had very little support for character sets other than the Latin alphabet. As a result, it was difficult or impossible to represent languages based on other scripts. The
ASCII ASCII ( ), an acronym for American Standard Code for Information Interchange, is a character encoding standard for representing a particular set of 95 (English language focused) printable character, printable and 33 control character, control c ...
character encoding, created in the 1960s, usually only supported 128 different characters in a 7 bit format. With the use of additional software it was possible to provide support for some languages, for instance those based on the Cyrillic alphabet. However, complex-script and
logographic In a written language, a logogram (from Ancient Greek 'word', and 'that which is drawn or written'), also logograph or lexigraph, is a written character that represents a semantic component of a language, such as a word or morpheme. Chinese c ...
languages like Chinese or Japanese need more characters than the 256 limit imposed by 8-bit character encodings. Some computers created in the former
USSR The Union of Soviet Socialist Republics. (USSR), commonly known as the Soviet Union, was a List of former transcontinental countries#Since 1700, transcontinental country that spanned much of Eurasia from 1922 until Dissolution of the Soviet ...
had native support for the Cyrillic alphabet. The widespread adoption of
Unicode Unicode or ''The Unicode Standard'' or TUS is a character encoding standard maintained by the Unicode Consortium designed to support the use of text in all of the world's writing systems that can be digitized. Version 16.0 defines 154,998 Char ...
, and
UTF-8 UTF-8 is a character encoding standard used for electronic communication. Defined by the Unicode Standard, the name is derived from ''Unicode Transformation Format 8-bit''. Almost every webpage is transmitted as UTF-8. UTF-8 supports all 1,112,0 ...
on the web, resolved most of these historical limitations. ASCII remains the de facto standard for command interpreters,
programming language A programming language is a system of notation for writing computer programs. Programming languages are described in terms of their Syntax (programming languages), syntax (form) and semantics (computer science), semantics (meaning), usually def ...
s and text-based communication protocols, but it is slowly dying out. *
Mojibake Mojibake (; , 'character transformation') is the garbled or gibberish text that is the result of text being decoded using an unintended character encoding. The result is a systematic replacement of symbols with completely unrelated ones, often ...
– Text presented as "unreadable" when software fails due to character encoding issues.


Programming language

The syntax of most programming languages uses English keywords, and therefore it could be argued some knowledge of English is required in order to use them. Some studies have shown that programmers nonnative to English self-report that English is their biggest obstacle to programming proficiency. However, it is important to recognize all
programming languages A programming language is a system of notation for writing computer programs. Programming languages are described in terms of their syntax (form) and semantics (meaning), usually defined by a formal language. Languages usually provide features ...
are in the class of
formal languages In logic, mathematics, computer science, and linguistics, a formal language is a set of string (computer science), strings whose symbols are taken from a set called "#Definition, alphabet". The alphabet of a formal language consists of symbol ...
. They are very different from any natural language, including English. Some examples of non-English programming languages: *
Arabic Arabic (, , or , ) is a Central Semitic languages, Central Semitic language of the Afroasiatic languages, Afroasiatic language family spoken primarily in the Arab world. The International Organization for Standardization (ISO) assigns lang ...
: ARLOGO, قلب * Bengali: BangaBhasha * Chinese: Chinese BASIC * Dutch: Superlogo * French: LSE, WinDev, Pascal (although the English version is more widespread) *
Hebrew Hebrew (; ''ʿÎbrit'') is a Northwest Semitic languages, Northwest Semitic language within the Afroasiatic languages, Afroasiatic language family. A regional dialect of the Canaanite languages, it was natively spoken by the Israelites and ...
: Hebrew Programming Language * Icelandic: Fjölnir * Indian Languages: Hindawi Programming System * Russian: Glagol * Portuguese: Portugol


Communication protocols

Many application protocols use text strings for requests and parameters, rather than the binary values commonly used in lower layer protocols. The request strings are generally based on English words, although in some cases the strings are contractions or acronyms of English expressions, which can render them somewhat cryptic to anyone not familiar with the protocol, whatever their proficiency in English. Nevertheless, the use of word-like strings is a convenient mnemonic device that allows a person skilled in the art (and with sufficient knowledge of English) to execute the protocol manually from a keyboard, usually for the purpose of finding a problem with the service. Examples: *
FTP The File Transfer Protocol (FTP) is a standard communication protocol used for the transfer of computer files from a server to a client on a computer network. FTP is built on a client–server model architecture using separate control and dat ...
: USER, PASS (password), PASV (passive), PORT, RETR (retrieve), STOR (store), QUIT *
SMTP The Simple Mail Transfer Protocol (SMTP) is an Internet standard communication protocol for electronic mail transmission. Mail servers and other message transfer agents use SMTP to send and receive mail messages. User-level email clients typi ...
: HELO (hello), MAIL, RCPT (recipient), DATA, QUIT *
HTTP HTTP (Hypertext Transfer Protocol) is an application layer protocol in the Internet protocol suite model for distributed, collaborative, hypermedia information systems. HTTP is the foundation of data communication for the World Wide Web, wher ...
: GET, PUT, POST, HEAD (headers), DELETE, TRACE, OPTIONS It is notable that response codes, that is, the strings sent back by the recipient of a request, are typically numeric: for instance, in HTTP (and some borrowed by other protocols) * 200 OK request succeeded * 301 Moved Permanently to redirect the request to a new address * 404 Not Found the requested page does not exist This is because response codes also need to convey unambiguous information, but can have various nuances that the requester may optionally use to vary its subsequent actions. To convey all such "sub-codes" with alphabetic words would be unwieldy, and negate the advantage of using pseudo-English words. Since responses are usually generated by software they do not need to be mnemonic. Numeric codes are also more easily analyzed and categorized when they are processed by software, instead of a human testing the protocol by manual input.


Localization


BIOS

Many personal computers have a
BIOS In computing, BIOS (, ; Basic Input/Output System, also known as the System BIOS, ROM BIOS, BIOS ROM or PC BIOS) is a type of firmware used to provide runtime services for operating systems and programs and to perform hardware initialization d ...
chip, displaying text in English during boot time.


Keyboard shortcut

Keyboard shortcut In computing, a keyboard shortcut (also hotkey/hot key or key binding) is a software-based assignment of an action to one or more keys on a computer keyboard. Most Operating system, operating systems and Application software, applications come ...
s are usually defined in terms of English keywords such as CTRL+F for find.


English on the World Wide Web

English is the largest language on the
World Wide Web The World Wide Web (WWW or simply the Web) is an information system that enables Content (media), content sharing over the Internet through user-friendly ways meant to appeal to users beyond Information technology, IT specialists and hobbyis ...
, with over 25% of internet users.


English speakers

Web user percentages usually focus on raw comparisons of the first language of those who access the web. Just as important is a consideration of second- and foreign-language users; i.e., the first language of a user does not necessarily reflect which language he or she regularly employs when using the web.


Native speakers

English-language users appear to be a plurality of web users, consistently cited as around one-third of the overall (near one billion). This reflects the relative affluence of English-speaking countries and high Internet penetration rates in them. This lead may be eroding due mainly to a rapid increase of Chinese users. First-language users among other relatively affluent countries appear generally stable, the two largest being German and Japanese, which each have between 5% and 10% of the overall share.


World Wide Web content

One widely quoted figure for the amount of
web content Web content is the text, visual or audio content that is made available online and user encountered as part of the online usage and experience on websites. It may include text, images, sounds and audio, online videos, among other items place ...
in English is 80%. Other sources show figures five to fifteen points lower, though still well over 50%. There are two notable facts about these percentages: The English web content is greater than the number of first-language English users by as much as 2 to 1. Given the enormous lead it already enjoys and its increasing use as a ''lingua franca'' in other spheres, English web content may continue to dominate even as English first-language Internet users decline. This is a classic positive feedback loop: new Internet users find it helpful to learn English and employ it online, thus reinforcing the language's prestige and forcing subsequent new users to learn English as well. Certain other factors (some predating the medium's appearance) have propelled English into a majority web-content position. Most notable in this regard is the tendency for researchers and professionals to publish in English to ensure maximum exposure. The largest database of medical bibliographical information, for example, shows English was the majority language choice for the past forty years and its share has continually increased over the same period. The fact that non-Anglophones regularly publish in English only reinforces the language's dominance. English has a rich technical vocabulary (largely because native and non-native speakers alike use it to communicate technical ideas) and many IT and technical professionals use English regardless of country of origin (
Linus Torvalds Linus Benedict Torvalds ( , ; born 28 December 1969) is a Finnish software engineer who is the creator and lead developer of the Linux kernel. He also created the distributed version control system Git. He was honored, along with Shinya Yam ...
, for instance, comments his code in English, despite being from Finland and having Swedish as his first language).


Notes

{{reflist computing in Computing and society Internet culture Natural language and computing
computing Computing is any goal-oriented activity requiring, benefiting from, or creating computer, computing machinery. It includes the study and experimentation of algorithmic processes, and the development of both computer hardware, hardware and softw ...