Google Translate
   HOME

TheInfoList



OR:

Google Translate is a
multilingual Multilingualism is the use of more than one language, either by an individual speaker or by a group of speakers. It is believed that multilingual speakers outnumber monolingual speakers in the world's population. More than half of all ...
neural In biology, the nervous system is the highly complex part of an animal that coordinates its actions and sensory information by transmitting signals to and from different parts of its body. The nervous system detects environmental changes ...
machine translation Machine translation, sometimes referred to by the abbreviation MT (not to be confused with computer-aided translation, machine-aided human translation or interactive translation), is a sub-field of computational linguistics that investigates ...
service developed by
Google Google LLC () is an American Multinational corporation, multinational technology company focusing on Search Engine, search engine technology, online advertising, cloud computing, software, computer software, quantum computing, e-commerce, ar ...
to
translate Translation is the communication of the meaning of a source-language text by means of an equivalent target-language text. The English language draws a terminological distinction (which does not exist in every language) between ''transl ...
text, documents and websites from one language into another. It offers a website interface, a
mobile app A mobile application or app is a computer program or software application designed to run on a mobile device such as a phone, tablet, or watch. Mobile applications often stand in contrast to desktop applications which are designed to run on d ...
for Android and iOS, and an API that helps developers build
browser extension A browser extension is a small software module for customizing a web browser. Browsers typically allow a variety of extensions, including user interface modifications, cookie management, ad blocking, and the custom scripting and styling of web ...
s and
software application Software is a set of computer programs and associated documentation and data. This is in contrast to hardware, from which the system is built and which actually performs the work. At the lowest programming level, executable code consists ...
s. As of , Google Translate supports languages at various levels, and , claimed over 500 million total users, with more than 100 billion words translated daily, after the company stated in May 2013 that it served over 200 million people daily. Launched in April 2006 as a statistical machine translation service, it used
United Nations The United Nations (UN) is an intergovernmental organization whose stated purposes are to maintain international peace and security, develop friendly relations among nations, achieve international cooperation, and be a centre for harmoni ...
and
European Parliament The European Parliament (EP) is one of the Legislature, legislative bodies of the European Union and one of its seven Institutions of the European Union, institutions. Together with the Council of the European Union (known as the Council and in ...
documents and transcripts to gather linguistic data. Rather than translating languages directly, it first translates text to English and then pivots to the target language in most of the language combinations it posits in its grid, with a few exceptions including Catalan-Spanish. During a translation, it looks for patterns in millions of documents to help decide which words to choose and how to arrange them in the target language. Its accuracy, which has been criticized on several occasions, has been measured to vary greatly across languages. In November 2016, Google announced that Google Translate would switch to a neural machine translation engine – Google Neural Machine Translation (GNMT) – which translates "whole sentences at a time, rather than just piece by piece. It uses this broader context to help it figure out the most relevant translation, which it then rearranges and adjusts to be more like a human speaking with proper grammar".


History

Google Translate is a web-based free-to-user translation service developed by Google in April 2006. It translates multiple forms of texts and media such as words, phrases and webpages. Originally, Google Translate was released as a statistical machine translation service. The input text had to be translated into English first before being translated into the selected language. Since SMT uses predictive
algorithm In mathematics and computer science, an algorithm () is a finite sequence of rigorous instructions, typically used to solve a class of specific problems or to perform a computation. Algorithms are used as specifications for performing ...
s to translate text, it had poor grammatical accuracy. Despite this, Google initially did not hire experts to resolve this limitation due to the ever-evolving nature of language. In January 2010, Google introduced an Android app and iOS version in February 2011 to serve as a portable personal interpreter. As of February 2010, it was integrated into browsers such as Chrome and was able to pronounce the translated text, automatically recognize words in a picture and spot unfamiliar text and languages. In May 2014, Google acquired Word Lens to improve the quality of visual and voice translation. It is able to scan text or a picture using the device and have it translated instantly. Moreover, the system automatically identifies foreign languages and translates speech without requiring individuals to tap the microphone button whenever speech translation is needed. In November 2016, Google transitioned its translating method to a system called neural machine translation. It uses
deep learning Deep learning (also known as deep structured learning) is part of a broader family of machine learning methods based on artificial neural networks with representation learning. Learning can be supervised, semi-supervised or unsupervised. ...
techniques to translate whole sentences at a time, which has been measured to be more accurate between English and French, German, Spanish, and Chinese. Retrieved May 14, 2017 No measurement results have been provided by Google researchers for GNMT from English to other languages, other languages to English, or between language pairs that do not include English. As of 2018, it translates more than 100 billion words a day. In 2017, Google Translate was used during a court hearing when court officials at Teesside Magistrates' Court failed to book an interpreter for the Chinese defendant. At the end of September 2022, Google Translate was discontinued in
mainland China "Mainland China" is a geopolitical term defined as the territory governed by the China, People's Republic of China (including islands like Hainan or Chongming Island, Chongming), excluding dependent territories of the PRC, and other territorie ...
, which Google said was due to "low usage" (see
Internet censorship in China Internet censorship in the People's Republic of China (PRC) affects both publishing and viewing online material. Many controversial events are censored from news coverage, preventing many Chinese citizens from knowing about the actions of th ...
).


Functions

Google Translate can translate multiple forms of text and media, which includes text, speech, and text within still or moving images. Specifically, its functions include: *Written Words Translation: a function that translates written words or text to a foreign language. *Website Translation: a function that translates a whole webpage to selected languages. *Document Translation: a function that translates a document uploaded by the users to selected languages. The documents should be in the form of: .doc, .docx, .odf, .pdf, .ppt, .pptx, .ps, .rtf, .txt, .xls, .xlsx. *Speech Translation: a function that instantly translates spoken language into the selected foreign language. *Mobile App Translation: in 2018, Google introduced its new Google Translate feature called "Tap to Translate", which made instant translation accessible inside any app without exiting or switching it. *Image Translation: a function that identifies text in a picture taken by the users and translates text on the screen instantly by images. *Handwritten Translation: a function that translates language that are handwritten on the phone screen or drawn on a virtual keyboard without the support of a keyboard. *Bilingual Conversation Translation: a function that translates conversations in multiple languages. *Transcription: a function that transcribes speech in different languages. For most of its features, Google Translate provides the pronunciation, dictionary, and listening to translation. Additionally, Google Translate has introduced its own Translate app, so translation is available with a mobile phone in offline mode.


Features


Web interface

Google Translate produces approximations across languages of multiple forms of text and media, including text, speech, websites, or text on display in still or live video images. For some languages, Google Translate can synthesize speech from text, and in certain pairs it is possible to highlight specific corresponding words and phrases between the source and target text. Results are sometimes shown with dictional information below the translation box, but it is not a dictionary and has been shown to invent translations in all languages for words it does not recognize. If "Detect language" is selected, text in an unknown language can be automatically identified. In the web interface, users can suggest alternate translations, such as for technical terms, or correct mistakes. These suggestions may be included in future updates to the translation process. If a user enters a URL in the source text, Google Translate will produce a
hyperlink In computing, a hyperlink, or simply a link, is a digital reference to data that the user can follow or be guided by clicking or tapping. A hyperlink points to a whole document or to a specific element within a document. Hypertext is text w ...
to a machine translation of the website. Users can save translation proposals in a "phrasebook" for later use, and a shareable URL is generated for each translation. For some languages, text can be entered via an
on-screen keyboard A virtual keyboard is a software component that allows the input of characters without the need for physical keys. The interaction with the virtual keyboard happens mostly via a touchscreen interface, but can also take place in a different form ...
, through handwriting recognition, or
speech recognition Speech recognition is an interdisciplinary subfield of computer science and computational linguistics that develops methodologies and technologies that enable the recognition and translation of spoken language into text by computers with the ...
. It is possible to enter searches in a source language that are first translated to a destination language allowing one to browse and interpret results from the selected destination language in the source language. Texts written in the
Arabic Arabic (, ' ; , ' or ) is a Semitic language spoken primarily across the Arab world.Semitic languages: an international handbook / edited by Stefan Weninger; in collaboration with Geoffrey Khan, Michael P. Streck, Janet C. E.Watson; Walter ...
,
Cyrillic The Cyrillic script ( ), Slavonic script or the Slavic script, is a writing system used for various languages across Eurasia. It is the designated national script in various Slavic, Turkic, Mongolic, Uralic, Caucasian and Iranic-speaking co ...
,
Devanagari Devanagari ( ; , , Sanskrit pronunciation: ), also called Nagari (),Kathleen Kuiper (2010), The Culture of India, New York: The Rosen Publishing Group, , page 83 is a left-to-right abugida (a type of segmental writing system), based on the ...
and Greek scripts can be transliterated automatically from phonetic equivalents written in the
Latin alphabet The Latin alphabet or Roman alphabet is the collection of letters originally used by the ancient Romans to write the Latin language. Largely unaltered with the exception of extensions (such as diacritics), it used to write English and the ...
. The browser version of Google Translate provides the option to show phonetic equivalents of text translated from Japanese to English. The same option is not available on the paid API version. Many of the more popular languages have a "text-to-speech" audio function that is able to read back a text in that language, up to a few dozen words or so. In the case of
pluricentric language A pluricentric language or polycentric language is a language with several interacting codified standard forms, often corresponding to different countries. Many examples of such languages can be found worldwide among the most-spoken languages, inc ...
s, the accent depends on the region: for English, in the
Americas The Americas, which are sometimes collectively called America, are a landmass comprising the totality of North America, North and South America. The Americas make up most of the land in Earth's Western Hemisphere and comprise the New World. ...
, most of the
Asia-Pacific Asia-Pacific (APAC) is the part of the world near the western Pacific Ocean. The Asia-Pacific region varies in area depending on context, but it generally includes East Asia, Russian Far East, South Asia, Southeast Asia, Australia and Paci ...
and
Western Asia Western Asia, West Asia, or Southwest Asia, is the westernmost subregion of the larger geographical region of Asia, as defined by some academics, UN bodies and other institutions. It is almost entirely a part of the Middle East, and includes A ...
, the audio uses a female
General American General American English or General American (abbreviated GA or GenAm) is the umbrella accent of American English spoken by a majority of Americans. In the United States it is often perceived as lacking any distinctly regional, ethnic, or so ...
accent, whereas in Europe,
Hong Kong Hong Kong ( (US) or (UK); , ), officially the Hong Kong Special Administrative Region of the People's Republic of China (abbr. Hong Kong SAR or HKSAR), is a List of cities in China, city and Special administrative regions of China, special ...
,
Malaysia Malaysia ( ; ) is a country in Southeast Asia. The federal constitutional monarchy consists of thirteen states and three federal territories, separated by the South China Sea into two regions: Peninsular Malaysia and Borneo's East Mal ...
,
Singapore Singapore (), officially the Republic of Singapore, is a sovereign island country and city-state in maritime Southeast Asia. It lies about one degree of latitude () north of the equator, off the southern tip of the Malay Peninsula, bor ...
,
Guyana Guyana ( or ), officially the Cooperative Republic of Guyana, is a country on the northern mainland of South America. Guyana is an indigenous word which means "Land of Many Waters". The capital city is Georgetown. Guyana is bordered by the ...
and all other parts of the world, a female British (
Received Pronunciation Received Pronunciation (RP) is the accent traditionally regarded as the standard and most prestigious form of spoken British English. For over a century, there has been argument over such questions as the definition of RP, whether it is geo ...
) accent is used, except for a special General Australian accent used in Australia, New Zealand and
Norfolk Island Norfolk Island (, ; Norfuk: ''Norf'k Ailen'') is an external territory of Australia located in the Pacific Ocean between New Zealand and New Caledonia, directly east of Australia's Evans Head and about from Lord Howe Island. Together wit ...
, and an
Indian English Indian English (IE) is a group of English dialects spoken in the republic of India and among the Indian diaspora. English is used by the Indian government for communication, along with Hindi, as enshrined in the Constitution of India. ...
accent used in India; for Spanish, in the
Americas The Americas, which are sometimes collectively called America, are a landmass comprising the totality of North America, North and South America. The Americas make up most of the land in Earth's Western Hemisphere and comprise the New World. ...
, a
Latin American Latin Americans ( es, Latinoamericanos; pt, Latino-americanos; ) are the citizens of Latin American countries (or people with cultural, ancestral or national origins in Latin America). Latin American countries and their diasporas are multi-e ...
accent is used, while in the other parts of the world, a Castilian accent is used; for Portuguese, a
São Paulo São Paulo (, ; Portuguese for ' Saint Paul') is the most populous city in Brazil, and is the capital of the state of São Paulo, the most populous and wealthiest Brazilian state, located in the country's Southeast Region. Listed by the GaW ...
accent is used around the world, except in Portugal, where their native accent is used instead; for
French French (french: français(e), link=no) may refer to: * Something of, from, or related to France ** French language, which originated in France, and its various dialects and accents ** French people, a nation and ethnic group identified with Franc ...
, a
Quebec Quebec ( ; )According to the Canadian government, ''Québec'' (with the acute accent) is the official name in Canadian French and ''Quebec'' (without the accent) is the province's official name in Canadian English is one of the thirte ...
accent is used in Canada, while in the other parts of the world, a standard European accent is used; for
Bengali Bengali or Bengalee, or Bengalese may refer to: *something of, from, or related to Bengal, a large region in South Asia * Bengalis, an ethnic and linguistic group of the region * Bengali language, the language they speak ** Bengali alphabet, the w ...
, a male Bangladeshi accent is used, except in India, where a special female Indian Bengali accent is used instead. Some less widely spoken languages use the open-source eSpeak synthesizer for their speech; producing a robotic, awkward voice that may be difficult to understand.


Browser integration

Google Translate is available in some
web browser A web browser is application software for accessing websites. When a user requests a web page from a particular website, the browser retrieves its files from a web server and then displays the page on the user's screen. Browsers are used o ...
s as an optional downloadable
extension Extension, extend or extended may refer to: Mathematics Logic or set theory * Axiom of extensionality * Extensible cardinal * Extension (model theory) * Extension (predicate logic), the set of tuples of values that satisfy the predicate * Ext ...
that can run the translation engine, which allow right-click command access to the translation service. In February 2010, Google Translate was integrated into the
Google Chrome Google Chrome is a cross-platform web browser developed by Google. It was first released in 2008 for Microsoft Windows, built with free software components from Apple WebKit and Mozilla Firefox. Versions were later released for Linux, macOS, ...
browser by default, for optional automatic webpage translation.


Mobile app

The Google Translate app for Android and iOS supports languages and can propose translations for 37 languages via photo, 32 via voice in "conversation mode", and 27 via live video imagery in "augmented reality mode". The Android app was released in January 2010, and for iOS on February 8, 2011, after an HTML5
web application A web application (or web app) is application software that is accessed using a web browser. Web applications are delivered on the World Wide Web to users with an active network connection. History In earlier computing models like client-serv ...
was released for iOS users in August 2008. The Android app is compatible with devices running at least Android 2.1, while the iOS app is compatible with
iPod Touch The iPod Touch (stylized as iPod touch) is a discontinued line of iOS-based mobile devices designed and marketed by Apple Inc. with a touchscreen-controlled user interface. As with other iPod models, the iPod Touch can be used as a musi ...
es,
iPad The iPad is a brand of iOS and iPadOS-based tablet computers that are developed by Apple Inc., Apple Inc. The iPad was conceived before the related iPhone but the iPhone was developed and released first. Speculation about the development, ...
s, and iPhones updated to iOS 7.0+. A January 2011 Android version experimented with a "Conversation Mode" that aims to allow users to communicate fluidly with a nearby person in another language. Originally limited to English and Spanish, the feature received support for 12 new languages, still in testing, the following October. The 'Camera input' functionality allows users to take a photograph of a document, signboard, etc. Google Translate recognises the text from the image using
optical character recognition Optical character recognition or optical character reader (OCR) is the electronic or mechanical conversion of images of typed, handwritten or printed text into machine-encoded text, whether from a scanned document, a photo of a document, a sc ...
(OCR) technology and gives the translation. Camera input is not available for all languages. In January 2015, the apps gained the ability to propose translations of physical signs in real time using the device's camera, as a result of Google's acquisition of the Word Lens app. The original January launch only supported seven languages, but a July update added support for 20 new languages, with the release of a new implementation that utilizes
convolutional neural network In deep learning, a convolutional neural network (CNN, or ConvNet) is a class of artificial neural network (ANN), most commonly applied to analyze visual imagery. CNNs are also known as Shift Invariant or Space Invariant Artificial Neural Netwo ...
s, and also enhanced the speed and quality of Conversation Mode translations (
augmented reality Augmented reality (AR) is an interactive experience that combines the real world and computer-generated content. The content can span multiple sensory modalities, including visual, auditory, haptic, somatosensory and olfactory. AR can be de ...
). The feature was subsequently renamed Instant Camera. The technology underlying Instant Camera combines image processing and optical character recognition, then attempts to produce cross-language equivalents using standard Google Translate estimations for the text as it is perceived. On May 11, 2016, Google introduced ''Tap to Translate'' for Google Translate for Android. Upon highlighting text in an app that is in a foreign language, Translate will pop up inside of the app and offer translations.


API

On May 26, 2011, Google announced that the Google Translate API for
software developer Software development is the process of conceiving, specifying, designing, programming, documenting, testing, and bug fixing involved in creating and maintaining applications, frameworks, or other software components. Software development invo ...
s had been deprecated and would cease functioning. The Translate API page stated the reason as "substantial economic burden caused by extensive abuse" with an end date set for December 1, 2011. In response to public pressure, Google announced in June 2011 that the API would continue to be available as a paid service. Because the API was used in numerous third-party websites and apps, the original decision to deprecate it led some developers to criticize Google and question the viability of using Google APIs in their products.


Google Assistant

Google Translate also provides translations for Google Assistant and the devices that Google Assistant runs on such as
Google Nest Google Nest is a line of smart home products including smart speakers, smart displays, streaming devices, thermostats, smoke detectors, routers and security systems including smart doorbells, cameras and smart locks. The Nest brand name was ...
and Pixel Buds.


Supported languages

As of , the following 133 languages are supported by Google Translate. #
Afrikaans Afrikaans (, ) is a West Germanic language that evolved in the Dutch Cape Colony from the Dutch vernacular of Holland proper (i.e., the Hollandic dialect) used by Dutch, French, and German settlers and their enslaved people. Afrikaans gr ...
# Albanian #
Amharic Amharic ( or ; (Amharic: ), ', ) is an Ethiopian Semitic language, which is a subgrouping within the Semitic branch of the Afroasiatic languages. It is spoken as a first language by the Amharas, and also serves as a lingua franca for all oth ...
#
Arabic Arabic (, ' ; , ' or ) is a Semitic language spoken primarily across the Arab world.Semitic languages: an international handbook / edited by Stefan Weninger; in collaboration with Geoffrey Khan, Michael P. Streck, Janet C. E.Watson; Walter ...
# Armenian # Assamese # Aymara #
Azerbaijani Azerbaijani may refer to: * Something of, or related to Azerbaijan * Azerbaijanis * Azerbaijani language See also * Azerbaijan (disambiguation) * Azeri (disambiguation) * Azerbaijani cuisine * Culture of Azerbaijan The culture of Azerbaijan ...
# Bambara # Basque # Belarusian #
Bengali Bengali or Bengalee, or Bengalese may refer to: *something of, from, or related to Bengal, a large region in South Asia * Bengalis, an ethnic and linguistic group of the region * Bengali language, the language they speak ** Bengali alphabet, the w ...
#
Bhojpuri Bhojpuri (;Bhojpuri entry, Oxford Dictionaries
, Oxford U ...
# Bosnian # Bulgarian # Burmese (Myanmar) #
Catalan Catalan may refer to: Catalonia From, or related to Catalonia: * Catalan language, a Romance language * Catalans, an ethnic group formed by the people from, or with origins in, Northern or southern Catalonia Places * 13178 Catalan, asteroid #1 ...
# Cebuano # Chewa (Chichewa) # Chinese ( Simplified) # Chinese (
Traditional A tradition is a belief or behavior (folk custom) passed down within a group or society with symbolic meaning or special significance with origins in the past. A component of cultural expressions and folklore, common examples include holidays or ...
) # Corsican # Croatian # Czech # Danish # Dogri # Dutch #
English English usually refers to: * English language * English people English may also refer to: Peoples, culture, and language * ''English'', an adjective for something of, from, or related to England ** English national ...
#
Esperanto Esperanto ( or ) is the world's most widely spoken constructed international auxiliary language. Created by the Warsaw-based ophthalmologist L. L. Zamenhof in 1887, it was intended to be a universal second language for international communic ...
#
Estonian Estonian may refer to: * Something of, from, or related to Estonia, a country in the Baltic region in northern Europe * Estonians, people from Estonia, or of Estonian descent * Estonian language * Estonian cuisine * Estonian culture See also * ...
# Ewe # Finnish #
French French (french: français(e), link=no) may refer to: * Something of, from, or related to France ** French language, which originated in France, and its various dialects and accents ** French people, a nation and ethnic group identified with Franc ...
# Galician # Georgian #
German German(s) may refer to: * Germany (of or related to) **Germania (historical use) * Germans, citizens of Germany, people of German ancestry, or native speakers of the German language ** For citizens of Germany, see also German nationality law **Ge ...
# Greek # Guarani #
Gujarati Gujarati may refer to: * something of, from, or related to Gujarat, a state of India * Gujarati people, the major ethnic group of Gujarat * Gujarati language, the Indo-Aryan language spoken by them * Gujarati languages, the Western Indo-Aryan sub- ...
#
Haitian Creole Haitian Creole (; ht, kreyòl ayisyen, links=no, ; french: créole haïtien, links=no, ), commonly referred to as simply ''Creole'', or ''Kreyòl'' in the Creole language, is a French-based creole language spoken by 10–12million people wor ...
# Hausa # Hawaiian #
Hebrew Hebrew (; ; ) is a Northwest Semitic language of the Afroasiatic language family. Historically, it is one of the spoken languages of the Israelites and their longest-surviving descendants, the Jews and Samaritans. It was largely preserved ...
#
Hindi Hindi (Devanāgarī: or , ), or more precisely Modern Standard Hindi (Devanagari: ), is an Indo-Aryan language spoken chiefly in the Hindi Belt region encompassing parts of northern, central, eastern, and western India. Hindi has been ...
# Hmong # Hungarian # Icelandic # Igbo # Ilocano # Indonesian # Irish # Italian # Japanese # Javanese #
Kannada Kannada (; ಕನ್ನಡ, ), originally romanised Canarese, is a Dravidian language spoken predominantly by the people of Karnataka in southwestern India, with minorities in all neighbouring states. It has around 47 million native s ...
# Kazakh # Khmer #
Kinyarwanda Kinyarwanda, Rwandan or Rwanda, officially known as Ikinyarwanda, is a Bantu language and a dialect of the Rwanda-Rundi language that is spoken in Rwanda and adjacent parts of Burundi, the Democratic Republic of the Congo, Uganda (where ther ...
# Konkani # Korean # Krio # Kurdish ( Kurmanji) # Kurdish ( Sorani) #
Kyrgyz Kyrgyz, Kirghiz or Kyrgyzstani may refer to: * Someone or something related to Kyrgyzstan *Kyrgyz people *Kyrgyz national games *Kyrgyz language *Kyrgyz culture *Kyrgyz cuisine *Yenisei Kirghiz *The Fuyü Gïrgïs language in Northeastern China ...
# Lao #
Latin Latin (, or , ) is a classical language belonging to the Italic languages, Italic branch of the Indo-European languages. Latin was originally a dialect spoken in the lower Tiber area (then known as Latium) around present-day Rome, but through ...
# Latvian #
Lingala Lingala (Ngala) (Lingala: ''Lingála'') is a Bantu language spoken in the northwest of the Democratic Republic of the Congo, the northern half of the Republic of the Congo, in their capitals, Kinshasa and Brazzaville, and to a lesser degree i ...
# Lithuanian #
Luganda The Ganda language or Luganda (, , ) is a Bantu language spoken in the African Great Lakes region. It is one of the major languages in Uganda and is spoken by more than 10 million Baganda and other people principally in central Uganda includin ...
#
Luxembourgish Luxembourgish ( ; also ''Luxemburgish'', ''Luxembourgian'', ''Letzebu(e)rgesch''; Luxembourgish: ) is a West Germanic language that is spoken mainly in Luxembourg. About 400,000 people speak Luxembourgish worldwide. As a standard form of th ...
#
Macedonian Macedonian most often refers to someone or something from or related to Macedonia. Macedonian(s) may specifically refer to: People Modern * Macedonians (ethnic group), a nation and a South Slavic ethnic group primarily associated with North Ma ...
# Maithili # Malagasy # Malay #
Malayalam Malayalam (; , ) is a Dravidian language spoken in the Indian state of Kerala and the union territories of Lakshadweep and Puducherry ( Mahé district) by the Malayali people. It is one of 22 scheduled languages of India. Malayalam wa ...
# Maldivian (Dhivehi) # Maltese #
Māori Māori or Maori can refer to: Relating to the Māori people * Māori people of New Zealand, or members of that group * Māori language, the language of the Māori people of New Zealand * Māori culture * Cook Islanders, the Māori people of the Co ...
(Maori) # Marathi # Meitei (Manipuri, Meiteilon) #
Mizo Mizo may refer to: *Mizo people, an ethnic group native to north-eastern India, western Myanmar (Burma) and eastern Bangladesh * Mizo language, a language spoken by the Mizo people *Mizoram, a state in Northeast India *Lusei people, an ethnic group ...
# Mongolian #
Nepali Nepali or Nepalese may refer to : Concerning Nepal * Anything of, from, or related to Nepal * Nepali people, citizens of Nepal * Nepali language, an Indo-Aryan language found in Nepal, the current official national language and a language spoken ...
# Northern Sotho (Sepedi) # Norwegian #
Odia Odia, also spelled Oriya or Odiya, may refer to: * Odia people in Odisha, India * Odia language, an Indian language, belonging to the Indo-Aryan branch of the Indo-European language family * Odia alphabet, a writing system used for the Odia languag ...
(Oriya) # Oromo #
Pashto Pashto (,; , ) is an Eastern Iranian language in the Indo-European language family. It is known in historical Persian literature as Afghani (). Spoken as a native language mostly by ethnic Pashtuns, it is one of the two official langua ...
# Persian # Polish # Portuguese #
Punjabi Punjabi, or Panjabi, most often refers to: * Something of, from, or related to Punjab, a region in India and Pakistan * Punjabi language * Punjabi people * Punjabi dialects and languages Punjabi may also refer to: * Punjabi (horse), a British Th ...
(
Gurmukhi Gurmukhī ( pa, ਗੁਰਮੁਖੀ, , Shahmukhi: ) is an abugida developed from the Laṇḍā scripts, standardized and used by the second Sikh guru, Guru Angad (1504–1552). It is used by Punjabi Sikhs to write the language, commonly ...
) # Quechua #
Romanian Romanian may refer to: *anything of, from, or related to the country and nation of Romania ** Romanians, an ethnic group **Romanian language, a Romance language ***Romanian dialects, variants of the Romanian language **Romanian cuisine, traditiona ...
# Russian # Samoan #
Sanskrit Sanskrit (; attributively , ; nominally , , ) is a classical language belonging to the Indo-Aryan languages, Indo-Aryan branch of the Indo-European languages. It arose in South Asia after its predecessor languages had Trans-cultural diffusion ...
#
Scottish Gaelic Scottish Gaelic ( gd, Gàidhlig ), also known as Scots Gaelic and Gaelic, is a Goidelic language (in the Celtic branch of the Indo-European language family) native to the Gaels of Scotland. As a Goidelic language, Scottish Gaelic, as well as ...
(Scots Gaelic) #
Serbian Serbian may refer to: * someone or something related to Serbia, a country in Southeastern Europe * someone or something related to the Serbs, a South Slavic people * Serbian language * Serbian names See also * * * Old Serbian (disambiguation ...
# Sesotho # Shona #
Sindhi Sindhi may refer to: *something from, or related to Sindh, a province of Pakistan * Sindhi people, an ethnic group from the Sindh region * Sindhi language, the Indo-Aryan language spoken by them People with the name * Sarkash Sindhi (1940–2012 ...
# Sinhala # Slovak #
Slovenian Slovene or Slovenian may refer to: * Something of, from, or related to Slovenia, a country in Central Europe * Slovene language, a South Slavic language mainly spoken in Slovenia * Slovenes, an ethno-linguistic group mainly living in Slovenia * Sl ...
#
Somali Somali may refer to: Horn of Africa * Somalis, an inhabitant or ethnicity associated with Greater Somali Region ** Proto-Somali, the ancestors of modern Somalis ** Somali culture ** Somali cuisine ** Somali language, a Cushitic language ** Somali ...
# Spanish # Sundanese # Swahili # Swedish #
Tagalog Tagalog may refer to: Language * Tagalog language, a language spoken in the Philippines ** Old Tagalog, an archaic form of the language ** Batangas Tagalog, a dialect of the language * Tagalog script, the writing system historically used for Taga ...
( Filipino) # Tajik #
Tamil Tamil may refer to: * Tamils, an ethnic group native to India and some other parts of Asia ** Sri Lankan Tamils, Tamil people native to Sri Lanka also called ilankai tamils **Tamil Malaysians, Tamil people native to Malaysia * Tamil language, na ...
#
Tatar The Tatars ()Tatar
in the Collins English Dictionary
is an umbrella term for different
#
Telugu Telugu may refer to: * Telugu language, a major Dravidian language of India *Telugu people, an ethno-linguistic group of India * Telugu script, used to write the Telugu language ** Telugu (Unicode block), a block of Telugu characters in Unicode ...
# Thai # Tigrinya # Tsonga # Turkish # Turkmen # Twi #
Ukrainian Ukrainian may refer to: * Something of, from, or related to Ukraine * Something relating to Ukrainians, an East Slavic people from Eastern Europe * Something relating to demographics of Ukraine in terms of demography and population of Ukraine * So ...
#
Urdu Urdu (;"Urdu"
'' Uyghur # Uzbek # Vietnamese #
Welsh Welsh may refer to: Related to Wales * Welsh, referring or related to Wales * Welsh language, a Brittonic Celtic language spoken in Wales * Welsh people People * Welsh (surname) * Sometimes used as a synonym for the ancient Britons (Celtic peopl ...
# West Frisian (Frisian) # Xhosa #
Yiddish Yiddish (, or , ''yidish'' or ''idish'', , ; , ''Yidish-Taytsh'', ) is a West Germanic language historically spoken by Ashkenazi Jews. It originated during the 9th century in Central Europe, providing the nascent Ashkenazi community with a ve ...
#
Yoruba The Yoruba people (, , ) are a West African ethnic group that mainly inhabit parts of Nigeria, Benin, and Togo. The areas of these countries primarily inhabited by Yoruba are often collectively referred to as Yorubaland. The Yoruba constitute ...
# Zulu


Stages

''(by chronological order of introduction)'' #1st stage ##English to and from
French French (french: français(e), link=no) may refer to: * Something of, from, or related to France ** French language, which originated in France, and its various dialects and accents ** French people, a nation and ethnic group identified with Franc ...
##English to and from
German German(s) may refer to: * Germany (of or related to) **Germania (historical use) * Germans, citizens of Germany, people of German ancestry, or native speakers of the German language ** For citizens of Germany, see also German nationality law **Ge ...
##English to and from Spanish #2nd stage ##English to and from Portuguese #3rd stage ##English to and from Italian #4th stage ##English to and from Chinese ( Simplified) ##English to and from Japanese ##English to and from Korean #5th stage (launched April 28, 2006) ##English to and from
Arabic Arabic (, ' ; , ' or ) is a Semitic language spoken primarily across the Arab world.Semitic languages: an international handbook / edited by Stefan Weninger; in collaboration with Geoffrey Khan, Michael P. Streck, Janet C. E.Watson; Walter ...
#6th stage (launched December 16, 2006) ##English to and from Russian #7th stage (launched February 9, 2007) ##English to and from Chinese (
Traditional A tradition is a belief or behavior (folk custom) passed down within a group or society with symbolic meaning or special significance with origins in the past. A component of cultural expressions and folklore, common examples include holidays or ...
) ## Chinese (( Simplified) to and from
Traditional A tradition is a belief or behavior (folk custom) passed down within a group or society with symbolic meaning or special significance with origins in the past. A component of cultural expressions and folklore, common examples include holidays or ...
) #8th stage (all 25 language pairs use Google's machine translation system) (launched October 22, 2007) ##English to and from Dutch ##English to and from Greek #9th stage ##English to and from
Hindi Hindi (Devanāgarī: or , ), or more precisely Modern Standard Hindi (Devanagari: ), is an Indo-Aryan language spoken chiefly in the Hindi Belt region encompassing parts of northern, central, eastern, and western India. Hindi has been ...
#10th stage (as of this stage, translation can be done between any two languages, using English as an intermediate step, if needed) (launched May 8, 2008) ## Bulgarian ## Croatian ## Czech ## Danish ## Finnish ## Norwegian (
Bokmål Bokmål () (, ; ) is an official written standard for the Norwegian language, alongside Nynorsk. Bokmål is the preferred written standard of Norwegian for 85% to 90% of the population in Norway. Unlike, for instance, the Italian language, there ...
) ## Polish ##
Romanian Romanian may refer to: *anything of, from, or related to the country and nation of Romania ** Romanians, an ethnic group **Romanian language, a Romance language ***Romanian dialects, variants of the Romanian language **Romanian cuisine, traditiona ...
## Swedish #11th stage (launched September 25, 2008) ##
Catalan Catalan may refer to: Catalonia From, or related to Catalonia: * Catalan language, a Romance language * Catalans, an ethnic group formed by the people from, or with origins in, Northern or southern Catalonia Places * 13178 Catalan, asteroid #1 ...
## Filipino (
Tagalog Tagalog may refer to: Language * Tagalog language, a language spoken in the Philippines ** Old Tagalog, an archaic form of the language ** Batangas Tagalog, a dialect of the language * Tagalog script, the writing system historically used for Taga ...
) ##
Hebrew Hebrew (; ; ) is a Northwest Semitic language of the Afroasiatic language family. Historically, it is one of the spoken languages of the Israelites and their longest-surviving descendants, the Jews and Samaritans. It was largely preserved ...
## Indonesian ## Latvian ## Lithuanian ##
Serbian Serbian may refer to: * someone or something related to Serbia, a country in Southeastern Europe * someone or something related to the Serbs, a South Slavic people * Serbian language * Serbian names See also * * * Old Serbian (disambiguation ...
## Slovak ## Slovene ##
Ukrainian Ukrainian may refer to: * Something of, from, or related to Ukraine * Something relating to Ukrainians, an East Slavic people from Eastern Europe * Something relating to demographics of Ukraine in terms of demography and population of Ukraine * So ...
## Vietnamese #12th stage (launched January 30, 2009) ## Albanian ##
Estonian Estonian may refer to: * Something of, from, or related to Estonia, a country in the Baltic region in northern Europe * Estonians, people from Estonia, or of Estonian descent * Estonian language * Estonian cuisine * Estonian culture See also * ...
## Galician ## Hungarian ## Maltese ## Thai ## Turkish #13th stage (launched June 19, 2009) ## Persian #14th stage (launched August 24, 2009) ##
Afrikaans Afrikaans (, ) is a West Germanic language that evolved in the Dutch Cape Colony from the Dutch vernacular of Holland proper (i.e., the Hollandic dialect) used by Dutch, French, and German settlers and their enslaved people. Afrikaans gr ...
## Belarusian ## Icelandic ## Irish ##
Macedonian Macedonian most often refers to someone or something from or related to Macedonia. Macedonian(s) may specifically refer to: People Modern * Macedonians (ethnic group), a nation and a South Slavic ethnic group primarily associated with North Ma ...
## Malay ## Swahili ##
Welsh Welsh may refer to: Related to Wales * Welsh, referring or related to Wales * Welsh language, a Brittonic Celtic language spoken in Wales * Welsh people People * Welsh (surname) * Sometimes used as a synonym for the ancient Britons (Celtic peopl ...
##
Yiddish Yiddish (, or , ''yidish'' or ''idish'', , ; , ''Yidish-Taytsh'', ) is a West Germanic language historically spoken by Ashkenazi Jews. It originated during the 9th century in Central Europe, providing the nascent Ashkenazi community with a ve ...
#15th stage (launched November 19, 2009) ##The Beta stage is finished. Users can now choose to have the
romanization Romanization or romanisation, in linguistics, is the conversion of text from a different writing system to the Roman (Latin) script, or a system for doing so. Methods of romanization include transliteration, for representing written text, a ...
written for Belarusian, Bulgarian, Chinese, Greek, Hindi, Japanese, Korean, Russian, Thai and Ukrainian. For translations from Arabic, Hindi and Persian, the user can enter a Latin transliteration of the text and the text will be transliterated to the native script for these languages as the user is typing. The text can now be read by a
text-to-speech Speech synthesis is the artificial production of human speech. A computer system used for this purpose is called a speech synthesizer, and can be implemented in software or hardware products. A text-to-speech (TTS) system converts normal langu ...
program in English, French, German and Italian. #16th stage (launched January 30, 2010) ##
Haitian Creole Haitian Creole (; ht, kreyòl ayisyen, links=no, ; french: créole haïtien, links=no, ), commonly referred to as simply ''Creole'', or ''Kreyòl'' in the Creole language, is a French-based creole language spoken by 10–12million people wor ...
#17th stage (launched April 2010) ##Speech program launched in Hindi and Spanish. #18th stage (launched May 5, 2010) ##Speech program launched in Afrikaans, Albanian, Catalan, Chinese (Mandarin), Croatian, Czech, Danish, Dutch, Finnish, Greek, Hungarian, Icelandic, Indonesian, Latvian, Macedonian, Norwegian, Polish, Portuguese, Romanian, Russian, Serbian, Slovak, Swahili, Swedish, Turkish, Vietnamese and Welsh (based on eSpeak) #19th stage (launched May 13, 2010) ## Armenian ##
Azerbaijani Azerbaijani may refer to: * Something of, or related to Azerbaijan * Azerbaijanis * Azerbaijani language See also * Azerbaijan (disambiguation) * Azeri (disambiguation) * Azerbaijani cuisine * Culture of Azerbaijan The culture of Azerbaijan ...
## Basque ## Georgian ##
Urdu Urdu (;"Urdu"
'' Latin Latin (, or , ) is a classical language belonging to the Italic languages, Italic branch of the Indo-European languages. Latin was originally a dialect spoken in the lower Tiber area (then known as Latium) around present-day Rome, but through ...
#22nd stage (launched December 2010) ##Romanization of Arabic removed. ##Spell check added. ##For some languages, Google replaced text-to-speech synthesizers from eSpeak's robot voice to native speaker's nature voice technologies made by SVOX (Chinese, Czech, Danish, Dutch, Finnish, Greek, Hungarian, Norwegian, Polish, Portuguese, Russian, Swedish and Turkish), and also the old versions of French, German, Italian and Spanish; Latin uses the same synthesizer as Italian. ##Speech program launched in Arabic, Japanese and Korean. #23rd stage (launched January 2011) ##Choice of different translations for a word. #24th stage (launched June 2011) ##5 new Indic languages (in alpha) and a transliterated input method: ##
Bengali Bengali or Bengalee, or Bengalese may refer to: *something of, from, or related to Bengal, a large region in South Asia * Bengalis, an ethnic and linguistic group of the region * Bengali language, the language they speak ** Bengali alphabet, the w ...
##
Gujarati Gujarati may refer to: * something of, from, or related to Gujarat, a state of India * Gujarati people, the major ethnic group of Gujarat * Gujarati language, the Indo-Aryan language spoken by them * Gujarati languages, the Western Indo-Aryan sub- ...
##
Kannada Kannada (; ಕನ್ನಡ, ), originally romanised Canarese, is a Dravidian language spoken predominantly by the people of Karnataka in southwestern India, with minorities in all neighbouring states. It has around 47 million native s ...
##
Tamil Tamil may refer to: * Tamils, an ethnic group native to India and some other parts of Asia ** Sri Lankan Tamils, Tamil people native to Sri Lanka also called ilankai tamils **Tamil Malaysians, Tamil people native to Malaysia * Tamil language, na ...
##
Telugu Telugu may refer to: * Telugu language, a major Dravidian language of India *Telugu people, an ethno-linguistic group of India * Telugu script, used to write the Telugu language ** Telugu (Unicode block), a block of Telugu characters in Unicode ...
#25th stage (launched July 2011) ##Translation rating introduced. #26th stage (launched January 2012) ##Dutch male voice synthesizer replaced with female. ##Elena by SVOX replaced the Slovak eSpeak voice. ##Transliteration of Yiddish added. #27th stage (launched February 2012) ##Speech program launched in Thai. ##
Esperanto Esperanto ( or ) is the world's most widely spoken constructed international auxiliary language. Created by the Warsaw-based ophthalmologist L. L. Zamenhof in 1887, it was intended to be a universal second language for international communic ...
#28th stage (launched September 2012) ## Lao #29th stage (launched October 2012) ##Transliteration of Lao added. (alpha status) #30th stage (launched October 2012) ##New speech program launched in English. #31st stage (launched November 2012) ##New speech program in French, German, Italian, Latin and Spanish. #32nd stage (launched March 2013) ##Phrasebook added. #33rd stage (launched April 2013) ## Khmer #34th stage (launched May 2013) ## Bosnian ## Cebuano ## Hmong ## Javanese ## Marathi #35th stage (launched May 2013) ##16 additional languages can be used with camera-input: Bulgarian, Catalan, Croatian, Danish, Estonian, Finnish, Hungarian, Indonesian, Icelandic, Latvian, Lithuanian, Norwegian, Romanian, Slovak, Slovenian and Swedish. #36th stage (launched December 2013) ## Hausa ## Igbo ## Maori ## Mongolian ##
Nepali Nepali or Nepalese may refer to : Concerning Nepal * Anything of, from, or related to Nepal * Nepali people, citizens of Nepal * Nepali language, an Indo-Aryan language found in Nepal, the current official national language and a language spoken ...
##
Punjabi Punjabi, or Panjabi, most often refers to: * Something of, from, or related to Punjab, a region in India and Pakistan * Punjabi language * Punjabi people * Punjabi dialects and languages Punjabi may also refer to: * Punjabi (horse), a British Th ...
(
Gurmukhi Gurmukhī ( pa, ਗੁਰਮੁਖੀ, , Shahmukhi: ) is an abugida developed from the Laṇḍā scripts, standardized and used by the second Sikh guru, Guru Angad (1504–1552). It is used by Punjabi Sikhs to write the language, commonly ...
) ##
Somali Somali may refer to: Horn of Africa * Somalis, an inhabitant or ethnicity associated with Greater Somali Region ** Proto-Somali, the ancestors of modern Somalis ** Somali culture ** Somali cuisine ** Somali language, a Cushitic language ** Somali ...
##
Yoruba The Yoruba people (, , ) are a West African ethnic group that mainly inhabit parts of Nigeria, Benin, and Togo. The areas of these countries primarily inhabited by Yoruba are often collectively referred to as Yorubaland. The Yoruba constitute ...
## Zulu #37th stage (launched June 2014) ##Definition of words added. #38th stage (launched December 2014) ## Burmese ## Chewa ## Kazakh ## Malagasy ##
Malayalam Malayalam (; , ) is a Dravidian language spoken in the Indian state of Kerala and the union territories of Lakshadweep and Puducherry ( Mahé district) by the Malayali people. It is one of 22 scheduled languages of India. Malayalam wa ...
## Sinhala ## Sotho ## Sundanese ## Tajik ## Uzbek #39th stage (launched October 2015) ##Transliteration of Arabic restored. #40th stage (launched November 2015) ## Aurebesh #41st stage (launched February 2016) ##Aurebesh removed. ##Speech program launched in Bengali. ##
Amharic Amharic ( or ; (Amharic: ), ', ) is an Ethiopian Semitic language, which is a subgrouping within the Semitic branch of the Afroasiatic languages. It is spoken as a first language by the Amharas, and also serves as a lingua franca for all oth ...
## Corsican ## Hawaiian ## Kurdish ( Kurmanji) ##
Kyrgyz Kyrgyz, Kirghiz or Kyrgyzstani may refer to: * Someone or something related to Kyrgyzstan *Kyrgyz people *Kyrgyz national games *Kyrgyz language *Kyrgyz culture *Kyrgyz cuisine *Yenisei Kirghiz *The Fuyü Gïrgïs language in Northeastern China ...
##
Luxembourgish Luxembourgish ( ; also ''Luxemburgish'', ''Luxembourgian'', ''Letzebu(e)rgesch''; Luxembourgish: ) is a West Germanic language that is spoken mainly in Luxembourg. About 400,000 people speak Luxembourgish worldwide. As a standard form of th ...
##
Pashto Pashto (,; , ) is an Eastern Iranian language in the Indo-European language family. It is known in historical Persian literature as Afghani (). Spoken as a native language mostly by ethnic Pashtuns, it is one of the two official langua ...
## Samoan ##
Scottish Gaelic Scottish Gaelic ( gd, Gàidhlig ), also known as Scots Gaelic and Gaelic, is a Goidelic language (in the Celtic branch of the Indo-European language family) native to the Gaels of Scotland. As a Goidelic language, Scottish Gaelic, as well as ...
## Shona ##
Sindhi Sindhi may refer to: *something from, or related to Sindh, a province of Pakistan * Sindhi people, an ethnic group from the Sindh region * Sindhi language, the Indo-Aryan language spoken by them People with the name * Sarkash Sindhi (1940–2012 ...
## West Frisian ## Xhosa #42nd stage (launched September 2016) ##Speech program launched in Ukrainian. #43rd stage (launched December 2016) ##Speech program launched in Khmer and Sinhala. #44th stage (launched June 2018) ##Speech program launched in Burmese, Malayalam, Marathi, Nepali and Telugu. #45th stage (launched September 2019) ##Speech program launched in Gujarati, Kannada and Urdu. #46th stage (launched February 2020) ##
Kinyarwanda Kinyarwanda, Rwandan or Rwanda, officially known as Ikinyarwanda, is a Bantu language and a dialect of the Rwanda-Rundi language that is spoken in Rwanda and adjacent parts of Burundi, the Democratic Republic of the Congo, Uganda (where ther ...
##
Odia Odia, also spelled Oriya or Odiya, may refer to: * Odia people in Odisha, India * Odia language, an Indian language, belonging to the Indo-Aryan branch of the Indo-European language family * Odia alphabet, a writing system used for the Odia languag ...
##
Tatar The Tatars ()Tatar
in the Collins English Dictionary
is an umbrella term for different
## Turkmen ## Uyghur #47th stage (launched February 2021) ##Speech program launched in Afrikaans, Bulgarian, Catalan, Icelandic, Latvian, and Serbian (changed from eSpeak to a natural voice). ##New speech system (WaveNet) for several languages. #48th stage (launched January 2022) ##Speech program launched in Hebrew. #49th stage (launched May 2022) ## Assamese ## Aymara ## Bambara ##
Bhojpuri Bhojpuri (;Bhojpuri entry, Oxford Dictionaries
, Oxford U ...
## Dogri ## Ewe ## Guarani ## Ilocano ## Konkani ## Krio ## Kurdish ( Sorani) ##
Lingala Lingala (Ngala) (Lingala: ''Lingála'') is a Bantu language spoken in the northwest of the Democratic Republic of the Congo, the northern half of the Republic of the Congo, in their capitals, Kinshasa and Brazzaville, and to a lesser degree i ...
##
Luganda The Ganda language or Luganda (, , ) is a Bantu language spoken in the African Great Lakes region. It is one of the major languages in Uganda and is spoken by more than 10 million Baganda and other people principally in central Uganda includin ...
## Maithili ## Maldivian ## Meitei ##
Mizo Mizo may refer to: *Mizo people, an ethnic group native to north-eastern India, western Myanmar (Burma) and eastern Bangladesh * Mizo language, a language spoken by the Mizo people *Mizoram, a state in Northeast India *Lusei people, an ethnic group ...
## Sepedi ## Oromo ## Quechua ##
Sanskrit Sanskrit (; attributively , ; nominally , , ) is a classical language belonging to the Indo-Aryan languages, Indo-Aryan branch of the Indo-European languages. It arose in South Asia after its predecessor languages had Trans-cultural diffusion ...
## Tigrinya ## Tsonga ## Twi ##eSpeak voice synthesizer removed from Armenian, Esperanto, Macedonian and Welsh. #50th stage (launched November 2022) ##New speech program launched for Albanian, Bosnian and Swahili.


Languages in development and beta version

The following languages are not yet supported by Google Translate, but are available in the Translate Community. As of , there are 103 languages in development, of which 9 are in
beta Beta (, ; uppercase , lowercase , or cursive ; grc, βῆτα, bē̂ta or ell, βήτα, víta) is the second letter of the Greek alphabet. In the system of Greek numerals, it has a value of 2. In Modern Greek, it represents the voiced labiod ...
version. The languages in
beta Beta (, ; uppercase , lowercase , or cursive ; grc, βῆτα, bē̂ta or ell, βήτα, víta) is the second letter of the Greek alphabet. In the system of Greek numerals, it has a value of 2. In Modern Greek, it represents the voiced labiod ...
version are closer to their public release and have an exclusive extra option to contribute that allows evaluating up to 4 translations of the
beta Beta (, ; uppercase , lowercase , or cursive ; grc, βῆτα, bē̂ta or ell, βήτα, víta) is the second letter of the Greek alphabet. In the system of Greek numerals, it has a value of 2. In Modern Greek, it represents the voiced labiod ...
version by translating an English text of up to 50 characters. There is currently a petition for Google to add
Cree The Cree ( cr, néhinaw, script=Latn, , etc.; french: link=no, Cri) are a North American Indigenous people. They live primarily in Canada, where they form one of the country's largest First Nations. In Canada, over 350,000 people are Cree o ...
to Google Translate, but as of , it is not one of the languages in development. # Acehnese # Adyghe # Afar # Aragonese # Avar (Avaric) # Bagheli # Balochi (Baluchi) # Bangala # Baoulé # Bashkir #
Berber Berber or Berbers may refer to: Ethnic group * Berbers, an ethnic group native to Northern Africa * Berber languages, a family of Afro-Asiatic languages Places * Berber, Sudan, a town on the Nile People with the surname * Ady Berber (1913–19 ...
(Tamazight) # Betawi #
Bodo Bodo may refer to: Ethnicity * Boro people, an ethno-linguistic group mainly from Northwest Assam, India * Bodo-Kachari people, an umbrella group from Nepal, India and Bangladesh that includes the Bodo people Culture and language * Boro cu ...
(Boro) # Breton #
Cantonese Cantonese ( zh, t=廣東話, s=广东话, first=t, cy=Gwóngdūng wá) is a language within the Chinese (Sinitic) branch of the Sino-Tibetan languages originating from the city of Guangzhou (historically known as Canton) and its surrounding a ...
# Chechen #
Cherokee The Cherokee (; chr, ᎠᏂᏴᏫᏯᎢ, translit=Aniyvwiyaʔi or Anigiduwagi, or chr, ᏣᎳᎩ, links=no, translit=Tsalagi) are one of the indigenous peoples of the Southeastern Woodlands of the United States. Prior to the 18th century, th ...
#
Chhattisgarhi Chhattisgarhi ( / ) is an Indo-Aryan language, spoken by approximately 16 million people from Chhattisgarh & other states. It is mostly spoken in the Indian states of Chhattisgarh, Odisha, Madhya Pradesh & Maharashtra. It is closely related ...
# Chittagonian # Chuvash # Deccani #
Dholuo The Dholuo dialect (pronounced ) or ''Nilotic Kavirondo'', is a dialect of the Luo group of Nilotic languages, spoken by about 4.2 million Luo people of Kenya and Tanzania, who occupy parts of the eastern shore of Lake Victoria and areas to th ...
# Dyula #
Dzongkha Dzongkha (; ) is a Sino-Tibetan language that is the official and national language of Bhutan. It is written using the Tibetan script. The word means "the language of the fortress", from ' "fortress" and ' "language". , Dzongkha had 171,080 ...
#
Edo Edo ( ja, , , "bay-entrance" or "estuary"), also romanized as Jedo, Yedo or Yeddo, is the former name of Tokyo. Edo, formerly a ''jōkamachi'' (castle town) centered on Edo Castle located in Musashi Province, became the ''de facto'' capital of ...
# Efik # Esan # Fon # Fula (Fulah) # Gagauz # Garhwali # Greenlandic (Kalaallisut) # Haryanvi # Hiligaynon #
Inuktitut Inuktitut (; , syllabics ; from , "person" + , "like", "in the manner of"), also Eastern Canadian Inuktitut, is one of the principal Inuit languages of Canada. It is spoken in all areas north of the tree line, including parts of the provinces o ...
# Isoko # Kamba # Kanuri # Kapampangan (Pampanga) #
Karachay-Balkar Karachay-Balkar (, ), or Mountain Turkic (, ), is a Turkic language spoken by the Karachays and Balkars in Kabardino-Balkaria and Karachay–Cherkessia, European Russia, as well as by an immigrant population in Afyonkarahisar Province, Tur ...
# Karakalpak (Kara-Kalpak) #
Kashmiri Kashmiri may refer to: * People or things related to the Kashmir Valley or the broader region of Kashmir * Kashmiris, an ethnic group native to the Kashmir Valley * Kashmiri language, their language People with the name * Kashmiri Saikia Baruah ...
#
Kedah Malay Kedah Malay or Kedahan (); also known as ''Pelat Utara'' or ''Loghat Utara'' ('Northern Dialect') or as it is known in Thailand, Syburi Malay () is a variety of the Malayic languages mainly spoken in the northwestern Malaysian states of Perl ...
#
Khakas The Khakas (also spelled Khakass; Khakas: , ''khakas'', , ''tadar'', , ''khakastar'', , ''tadarlar'') are a Turkic indigenous people of Siberia, who live in the republic of Khakassia, Russia. They speak the Khakas language. The Khakhassi ...
# Khandeshi (Ahirani) # Khorasani Turkic #
Kikuyu Kikuyu or Gikuyu (Gĩkũyũ) mostly refers to an ethnic group in Kenya or its associated language. It may also refer to: *Kikuyu people, a majority ethnic group in Kenya *Kikuyu language, the language of Kikuyu people *Kikuyu, Kenya, a town in Centr ...
# Kokborok (Tripuri) # Kumyk # Kʼicheʼ # Lakota # Lhasa Tibetan (Tibetan) #
Luba-Kasai Luba-Kasai, also known as Western Luba, ''Bena-Lulua, Cilubà/Tshilubà'', ''Luba-Lulua'' or ''Luva'', is a Bantu language ( Zone L) of Central Africa and a national language of the Democratic Republic of the Congo, alongside Lingala, Swahil ...
(Tshiluba) # Luba-Katanga # Madurese #
Magahi The Magahi language (), also known as Magadhi (), is a language spoken in Bihar, Jharkhand and West Bengal states of eastern India, and in the Terai of Nepal. Magadhi Prakrit was the ancestor of Magahi, from which the latter's name deriv ...
# Marwari # Mazanderani # Minangkabau # Montenegrin #
Mooré The Mossi language (Mooré) is a Gur language of the Oti–Volta branch and one of two official regional languages of Burkina Faso. It is the language of the Mossi people, spoken by approximately 8 million people in Burkina Faso, Ghana, Cote ...
(Mossi) #
Navajo The Navajo (; British English: Navaho; nv, Diné or ') are a Native Americans in the United States, Native American people of the Southwestern United States. With more than 399,494 enrolled tribal members , the Navajo Nation is the largest fe ...
#
Newar Newar (; new, नेवार, endonym: Newa; new, नेवा, Pracalit script:) or Nepami, are the historical inhabitants of the Kathmandu Valley and its surrounding areas in Nepal and the creators of its historic heritage and civilisat ...
(Nepalbhasa) # Nigerian Pidgin # Northern Sami # Occitan # Pattani Malay # Qashqai #Rajasthani languages, Rajasthani #Rangpuri language, Rangpuri (Kamtapuri) #Rohingya language, Rohingya #Romansh language, Romansh #Sadri language, Sadri #Salar language, Salar #Samogitian dialect, Samogitian #Sango language, Sango #Santali language, Santali #Saraiki language, Saraiki #Serrano language, Serrano #Shor language, Shor #Siberian Tatar language, Siberian Tatar #Sicilian language, Sicilian #Southern Altai language, Southern Altai #Southern Ndebele language, Southern Ndebele #Surjapuri language, Surjapuri #Swahili language, Swahili Congo #Sylheti language, Sylheti #Tiv language, Tiv #Toba Batak language, Toba Batak (Batak Toba) #Tok Pisin #Tonga language (Zambia and Zimbabwe), Tonga (Zambia and Zimbabwe) (Chitonga) #Tswana language, Tswana (Setswana) #Tswa language, Tswa #Tuvan language, Tuvan (Tuvinian) #Urhobo language, Urhobo #Urum language, Urum #Varhadi dialect, Varhadi (Varhadi-Nagpuri) #Venda language, Venda (Tshivenda) #Wolof language, Wolof #Yakut language, Yakut #Yucatec Maya language, Yucatec Maya (Yucateco) #Zaza language, Zazaki #Standard Zhuang, Zhuang


Translation methodology

In April 2006, Google Translate launched with a statistical machine translation engine. Google Translate does not apply grammar, grammatical rules, since its algorithms are based on statistical or pattern analysis rather than traditional rule-based analysis. The system's original creator, Franz Josef Och, has criticized the effectiveness of rule-based machine translation, rule-based
algorithm In mathematics and computer science, an algorithm () is a finite sequence of rigorous instructions, typically used to solve a class of specific problems or to perform a computation. Algorithms are used as specifications for performing ...
s in favor of statistical approaches. Original versions of Google Translate were based on a method called statistical machine translation, and more specifically, on research by Och who won the DARPA contest for speed machine translation in 2003. Och was the head of Google's machine translation group until leaving to join Human Longevity, Inc. in July 2014. Google Translate does not translate from one language to another (L1 → L2). Instead, it often translates first to English and then to the target language (L1 → EN → L2). However, because English, like all human languages, is ambiguous and depends on context, this can cause translation errors. For example, translating from French to Russian gives '' → you → '' OR '. If Google were using an unambiguous, artificial language as the intermediary, it would be '' → you → '' OR '' → thou → ''. Such a suffixing of words disambiguates their different meanings. Hence, publishing in English, using unambiguous words, providing context, using expressions such as "you all" may or may not make a better one-step translation depending on the target language. The following languages do not have a direct Google translation to or from English. These languages are translated through the indicated intermediate language (which in most cases is closely related to the desired language but more widely spoken) in addition to through English: * Belarusian (East Slavic languages, be ↔ ru ↔ en ↔ other); *
Catalan Catalan may refer to: Catalonia From, or related to Catalonia: * Catalan language, a Romance language * Catalans, an ethnic group formed by the people from, or with origins in, Northern or southern Catalonia Places * 13178 Catalan, asteroid #1 ...
(Iberian Romance languages, ca ↔ es ↔ en ↔ other); * Galician (Galician-Portuguese, gl ↔ pt ↔ en ↔ other); *
Haitian Creole Haitian Creole (; ht, kreyòl ayisyen, links=no, ; french: créole haïtien, links=no, ), commonly referred to as simply ''Creole'', or ''Kreyòl'' in the Creole language, is a French-based creole language spoken by 10–12million people wor ...
(French-based creole languages, ht ↔ fr ↔ en ↔ other); * Korean (Altaic languages, ko ↔ ja ↔ en ↔ other); * Slovak (Czech–Slovak languages, sk ↔ cs ↔ en ↔ other); *
Ukrainian Ukrainian may refer to: * Something of, from, or related to Ukraine * Something relating to Ukrainians, an East Slavic people from Eastern Europe * Something relating to demographics of Ukraine in terms of demography and population of Ukraine * So ...
(East Slavic languages, uk ↔ ru ↔ en ↔ other); *
Urdu Urdu (;"Urdu"
'' Statistical mathematical model, models from these data are then used to translate between those languages. To acquire this huge amount of linguistic data, Google used
United Nations The United Nations (UN) is an intergovernmental organization whose stated purposes are to maintain international peace and security, develop friendly relations among nations, achieve international cooperation, and be a centre for harmoni ...
and
European Parliament The European Parliament (EP) is one of the Legislature, legislative bodies of the European Union and one of its seven Institutions of the European Union, institutions. Together with the Council of the European Union (known as the Council and in ...
documents and transcripts. The UN typically publishes documents in all six official languages of the United Nations, official UN languages, which has produced a very large 6-language corpus. Google representatives have been involved with domestic conferences in Japan where it has solicited bilingual data from researchers.Google was an official sponsor of the annual Computational Linguistics in Japan Conference ("Gengoshorigakkai") in 2007. Google also sent a delegate from its headquarters to the meeting of the members of the Computational Linguistic Society of Japan in March 2005, promising funding to researchers who would be willing to share text data. When Google Translate generates a translation proposal, it looks for patterns in hundreds of millions of documents to help decide on the best translation. By detecting patterns in documents that have already been translated by human translators, Google Translate makes artificial intelligence, informed guesses (AI) as to what an appropriate translation should be. Before October 2007, for languages other than
Arabic Arabic (, ' ; , ' or ) is a Semitic language spoken primarily across the Arab world.Semitic languages: an international handbook / edited by Stefan Weninger; in collaboration with Geoffrey Khan, Michael P. Streck, Janet C. E.Watson; Walter ...
, Chinese and Russian, Google Translate was based on SYSTRAN, a software engine which is still used by several other online translation services such as Babel Fish (website), Babel Fish (now defunct). From October 2007, Google Translate used proprietary, in-house technology based on statistical machine translation instead, before transitioning to neural machine translation.


Google Translate Community

Google has crowdsourcing features for volunteers to be a part of its "Translate Community", intended to help improve Google Translate's accuracy. Volunteers can select up to five languages to help improve translation; users can verify translated phrases and translate phrases in their languages to and from English, helping to improve the accuracy of translating more rare and complex phrases. In August 2016, a Crowdsource (app), Google Crowdsource app was released for Android users, in which translation tasks are offered. There are three ways to contribute. First, Google will show a phrase that one should type in the translated version. Second, Google will show a proposed translation for a user to agree, disagree, or skip. Third, users can suggest translations for phrases where they think they can improve on Google's results. Tests in 44 languages show that the "suggest an edit" feature led to an improvement in a maximum of 40% of cases over four years.


Statistical machine translation

Although Google deployed a new system called neural machine translation for better quality translation, there are languages that still use the traditional translation method called statistical machine translation. It is a rule-based translation method that utilizes predictive algorithms to guess ways to translate texts in foreign languages. It aims to translate whole phrases rather than single words then gather overlapping phrases for translation. Moreover, it also analyzes bilingual text corpora to generate statistical model that translates texts from one language to another.


Google Neural Machine Translation

In September 2016, a research team at Google announced the development of the Google Neural Machine Translation system (GNMT) to increase fluency and accuracy in Google Translate and in November announced that Google Translate would switch to GNMT. Google Translate's neural machine translation system uses a large end-to-end principle, end-to-end artificial neural network that attempts to perform
deep learning Deep learning (also known as deep structured learning) is part of a broader family of machine learning methods based on artificial neural networks with representation learning. Learning can be supervised, semi-supervised or unsupervised. ...
, in particular, long short-term memory networks. GNMT improves the quality of translation over SMT in some instances because it uses an example-based machine translation (EBMT) method in which the system "learns from millions of examples." According to Google researchers, it translates "whole sentences at a time, rather than just piece by piece. It uses this broader context to help it figure out the most relevant translation, which it then rearranges and adjusts to be more like a human speaking with proper grammar". GNMT's "proposed architecture" of "system learning" has been implemented on over a hundred languages supported by Google Translate. With the end-to-end framework, Google states but does not demonstrate for most languages that "the system learns over time to create better, more natural translations." The GNMT network attempts interlingual machine translation, which encodes the "semantics of the sentence rather than simply memorizing phrase-to-phrase translations", and the system did not invent its own universal language, but uses "the commonality found in between many languages". GNMT was first enabled for eight languages: to and from English and Chinese, French, German, Japanese, Korean, Portuguese, Spanish and Turkish. In March 2017, it was enabled for Hindi, Russian and Vietnamese, followed by Bengali, Gujarati, Indonesian, Kannada, Malayalam, Marathi, Punjabi, Tamil and Telugu in April.


Accuracy

Google Translate is not as reliable as human translation. When text is well-structured, written using formal language, with simple sentences, relating to formal topics for which training data is ample, it often produces conversions similar to human translations between English and a number of high-resource languages. Accuracy decreases for those languages when fewer of those conditions apply, for example when sentence length increases or the text uses familiar or literary language. For many other languages vis-à-vis English, it can produce the gist of text in those formal circumstances. Human evaluation from English to all 102 languages shows that the main idea of a text is conveyed more than 50% of the time for 35 languages. For 67 languages, a minimally comprehensible result is not achieved 50% of the time or greater. A few studies have evaluated Chinese, French, German, and Spanish to English, but no systematic human evaluation has been conducted from most Google Translate languages to English. Speculative language-to-language scores extrapolated from English-to-other measurements indicate that Google Translate will produce translation results that convey the gist of a text from one language to another more than half the time in about 1% of language pairs, where neither language is English. Research conducted in 2011 showed that Google Translate got a slightly higher score than the UCLA minimum score for the English Proficiency Exam. Due to its identical choice of words without considering the flexibility of choosing alternative words or expressions, it produces a relatively similar translation to human translation from the perspective of formality, referential cohesion, and conceptual cohesion. Moreover, a number of languages are translated into a sentence structure and sentence length similar to a human translation. Furthermore, Google carried out a test that required native speakers of each language to rate the translation on a scale between 0 and 6, and Google Translate scored 5.43 on average. When used as a dictionary to translate single words, Google Translate is highly inaccurate because it must guess between polysemy, polysemic words. Among the top 100 words in the English language, which make up more than 50% of all written English, the average word has most common words in English, more than 15 senses, which makes the odds against a correct translation about 15 to 1 if each sense maps to a different word in the target language. Most common English words have at least two senses, which produces 50/50 odds in the likely case that the target language uses different words for those different senses. The odds are similar from other languages to English. Google Translate makes statistical guesses that raise the likelihood of producing the most frequent sense of a word, with the consequence that an accurate translation will be unobtainable in cases that do not match the majority or plurality text corpus, corpus occurrence. The accuracy of single-word predictions has not been measured for any language. Because almost all non-English language pairs pivot through English, the odds against obtaining accurate single-word translations from one non-English language to another can be estimated by multiplying the number of senses in the source language with the number of senses each of those terms have in English. When Google Translate does not have a word in its vocabulary, it makes up a result as part of its algorithm.


Limitations

Google Translate, like other automatic translation tools, has its limitations. The service limits the number of paragraphs and the range of technical terms that can be translated, and while it can help the reader understand the general content of a foreign language text, it does not always deliver accurate translations, and most times it tends to repeat verbatim the same word it is expected to translate. Grammatically, for example, Google Translate struggles to differentiate between ''imperfect'' and ''perfect'' grammatical aspect, aspects in Romance languages so habitual and continuous acts in the past often become single ''historical'' events. Although seemingly pedantic, this can often lead to incorrect results (to a native speaker of for example French and Spanish) which would have been avoided by a human translator. Knowledge of the ''subjunctive mood'' is virtually non-existent. Moreover, the formal second person () is often chosen, whatever the context or accepted usage. Since its English reference material contains only "you" forms, it has difficulty translating a language with "you all" or formal "you" variations. Due to differences between languages in investment, research, and the extent of digital resources, the accuracy of Google Translate varies greatly among languages. Some languages produce better results than others. Most languages from Africa, Asia, and the Pacific, tend to score poorly in relation to the scores of many well-financed European languages, Afrikaans and Chinese being the high-scoring exceptions from their continents. No languages indigenous to Australia are included within Google Translate. Higher scores for European can be partially attributed to the Europarl Corpus, a trove of documents from the
European Parliament The European Parliament (EP) is one of the Legislature, legislative bodies of the European Union and one of its seven Institutions of the European Union, institutions. Together with the Council of the European Union (known as the Council and in ...
that have been professionally translated by the mandate of the European Union into as many as 21 languages. A 2010 analysis indicated that French to English translation is relatively accurate, and 2011 and 2012 analyses showed that Italian to English translation is relatively accurate as well. However, if the source text is shorter, rule-based machine translations often perform better; this effect is particularly evident in Chinese to English translations. While edits of translations may be submitted, in Chinese specifically one cannot edit sentences as a whole. Instead, one must edit sometimes arbitrary sets of characters, leading to incorrect edits. A good example is Russian-to-English. Formerly one would use Google Translate to make a draft and then use a dictionary and common sense to correct the numerous mistakes. As of early 2018 Translate is sufficiently accurate to make the Russian Wikipedia accessible to those who can read English. The quality of Translate can be checked by adding it as an extension to Chrome or Firefox and applying it to the left language links of any Wikipedia article. It can be used as a dictionary by typing in words. One can translate from a book by using a scanner and an OCR like Google Drive, but this takes about five minutes per page. In its Written Words Translation function, there is a word limit on the amount of text that can be translated at once. Therefore, long text should be transferred to a document form and translated through its Document Translate function. Moreover, like all machine translation programs, Google Translate struggles with polysemy (the multiple meanings a word may have) and idiom#Multiword expression, multiword expressions (terms that have meanings that cannot be understood or translated by analyzing the individual word units that compose them). A word in a foreign language might have two different meanings in the translated language. This might lead to mistranslations. Additionally, grammatical errors remain a major limitation to the accuracy of Google Translate. International Journal of Linguistics, Literature and Translation (IJLLT) 2(3):196-200, 2019. Retrieved August 26, 2020


Open-source licenses and components

Irish language data from Foras na Gaeilge's New English-Irish Dictionary (English database designed and developed for Foras na Gaeilge by Lexicography MasterClass Ltd.) Welsh language data from Gweiadur by Gwerin. Certain content is copyright Oxford University Press USA. Some phrase translations come from Wikitravel.


Reviews

Shortly after launching the translation service for the first time, Google won an international competition for English–Arabic and English–Chinese machine translation.


Translation mistakes and oddities

Since Google Translate used statistical matching to translate, translated text can often include apparently nonsensical and obvious errors, often swapping common terms for similar but nonequivalent common terms in the other language, as well as inverting sentence meaning. Novelty websites like Translation Services USA#Bad Translator, Bad Translator and Translation Party have utilized the service to produce humorous text by translating back and forth between multiple languages, similar to the children's game Chinese whispers, telephone.


See also

* Apertium * Babel Fish (website), Babel Fish (discontinued; redirects to the main Yahoo! site) * Comparison of machine translation applications * DeepL Translator * Google Dictionary * Google Translator Toolkit * Jollo (discontinued) * List of Google products * Microsoft Translator * Reverso (language tools), Reverso * Smartcat * Speech Services * SYSTRAN * Word Lens (discontinued; merged into Google Translate app) * Yandex Translate


References


External links

*
Contribute
{{Authority control Google services, Translate Internet properties established in 2006 Machine translation software Natural language processing software Products introduced in 2006 Translation websites