Google Language Tools

Google Translate is a multilingual neural machine translation service developed by Google to translate text, documents and websites from one language into another. It offers a website interface, a mobile app for Android and iOS, as well as an API that helps developers build browser extensions and software applications. Google Translate supports languages and language varieties at various levels. It served over 200 million people daily in May 2013, and over 500 million total users, with more than 100 billion words translated daily.

Launched in April 2006 as a statistical machine translation service, it originally used United Nations and European Parliament documents and transcripts to gather linguistic data. Rather than translating languages directly, it first translated text to English and then pivoted to the target language in most of the language combinations offered in its grid, with a few exceptions including Catalan–Spanish. During a translation, it looked for patterns in millions of documents to help decide which words to choose and how to arrange them in the target language. In recent years, it has used a deep learning model to power its translations. Its accuracy, which has been criticized on several occasions, has been measured to vary greatly across languages.

In November 2016, Google announced that Google Translate would switch to a neural machine translation engine – Google Neural Machine Translation (GNMT) – which translated "whole sentences at a time, rather than just piece by piece. It uses this broader context to help it figure out the most relevant translation, which it then rearranges and adjusts to be more like a human speaking with proper grammar".
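The English-pivot routing described above for the original statistical system can be illustrated with a minimal sketch. The language codes, the `DIRECT_PAIRS` set and the function name are assumptions made for illustration; only the Catalan–Spanish exception comes from the text, and this is not Google's implementation.

```python
# Illustrative sketch of pivot routing; names and codes are hypothetical.
PIVOT = "en"

# Pairs translated directly without passing through English.
# The article names Catalan-Spanish as one such exception.
DIRECT_PAIRS = {frozenset({"ca", "es"})}


def translation_route(source: str, target: str) -> list:
    """Return the sequence of languages a text passes through."""
    if source == target:
        return [source]
    # No pivot is needed when English is already one end of the pair,
    # or when the pair is one of the direct exceptions.
    if PIVOT in (source, target) or frozenset({source, target}) in DIRECT_PAIRS:
        return [source, target]
    # Otherwise translate into English first, then into the target.
    return [source, PIVOT, target]
```

Under these assumptions, a French-to-German request would take two translation steps, while a Catalan-to-Spanish request would take one.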


History

Google Translate is a free-to-use, web-based translation service launched by Google in April 2006. It translates multiple forms of text and media, such as words, phrases and webpages.

Originally, Google Translate was released as a statistical machine translation (SMT) service. The input text had to be translated into English first before being translated into the selected language. Since SMT uses predictive algorithms to translate text, it had poor grammatical accuracy. Despite this, Google initially did not hire experts to resolve this limitation due to the ever-evolving nature of language.

In January 2010, Google introduced an Android app, followed by an iOS version in February 2011, to serve as a portable personal interpreter. As of February 2010, it was integrated into browsers such as Chrome and was able to pronounce the translated text, automatically recognize words in a picture and spot unfamiliar text and languages.

In May 2014, Google acquired Word Lens to improve the quality of visual and voice translation. It is able to scan text or a picture using the device and have it translated instantly. Moreover, the system automatically identifies foreign languages and translates speech without requiring individuals to tap the microphone button whenever speech translation is needed.

In November 2016, Google transitioned its translating method to a system called neural machine translation. It uses deep learning techniques to translate whole sentences at a time, which has been measured to be more accurate between English and French, German, Spanish, and Chinese. No measurement results have been provided by Google researchers for GNMT from English to other languages, other languages to English, or between language pairs that do not include English. As of 2018, it translates more than 100 billion words a day.

In 2017, Google Translate was used during a court hearing when court officials at Teesside Magistrates' Court failed to book an interpreter for the Chinese defendant.

A petition for Google to add Cree to Google Translate was created in 2021, but it was not one of the languages in development at the time of the Translate Community's closure.

At the end of September 2022, Google Translate was discontinued in mainland China, which Google said was due to "low usage".

In 2024, a record 110 languages, including Cantonese, Tok Pisin and some regional languages in Russia such as Bashkir, Chechen, Ossetian and Crimean Tatar, were added. The languages were added with the help of the PaLM 2 generative AI model.

In May 2025, users across the web found that if a Traditional Chinese phrase entered in the French section is translated to Traditional Chinese, Google Translate will provide strange translations. For example, entering a phrase glossed as "Xi Jinping in naked style" produces a translation that means "what Chinese love".


Functions

Google Translate can translate multiple forms of text and media, including text, speech, and text within still or moving images. Specifically, its functions include:
* Written Words Translation: a function that translates written words or text to a foreign language.
* Website Translation: a function that translates a whole webpage to selected languages.
* Document Translation: a function that translates a document uploaded by the user to selected languages. The document should be in one of the following formats: .doc, .docx, .odf, .pdf, .ppt, .pptx, .ps, .rtf, .txt, .xls, .xlsx.
* Speech Translation: a function that instantly translates spoken language into the selected foreign language.
* Mobile App Translation: in 2018, Google introduced a Google Translate feature called "Tap to Translate", which made instant translation accessible inside any app without exiting or switching it.
* Image Translation: a function that identifies text in a picture taken by the user and instantly translates it on the screen.
* Handwritten Translation: a function that translates text that is handwritten on the phone screen or drawn on a virtual keyboard without the support of a keyboard.
* Bilingual Conversation Translation: a function that translates conversations in multiple languages.
* Transcription: a function that transcribes speech in different languages.
For most of its features, Google Translate provides pronunciation, dictionary information, and audio playback of the translation. Additionally, Google Translate has introduced its own Translate app, so translation is available on a mobile phone in offline mode.
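As an illustration of the document formats listed above, the following sketch checks whether a file name carries one of those extensions before it would be submitted for document translation. The function name and the case-insensitive comparison are assumptions made for this example; the article does not describe how Google Translate validates uploads.

```python
from pathlib import Path

# Extensions taken from the document-translation list above.
SUPPORTED_EXTENSIONS = {
    ".doc", ".docx", ".odf", ".pdf", ".ppt", ".pptx",
    ".ps", ".rtf", ".txt", ".xls", ".xlsx",
}


def is_supported_document(filename: str) -> bool:
    """Return True if the file's final extension is in the supported set.

    Matching is case-insensitive (an assumption), so "REPORT.PDF" passes.
    """
    return Path(filename).suffix.lower() in SUPPORTED_EXTENSIONS
```

Only the last suffix is inspected, so a name such as `archive.tar.gz` is rejected because `.gz` is not in the list.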


Features


Web interface

Google Translate produces approximations across languages of multiple forms of text and media, including text, speech, websites, or text on display in still or live video images. For some languages, Google Translate can synthesize speech from text, and in certain pairs it is possible to highlight specific corresponding words and phrases between the source and target text. Results are sometimes shown with dictionary-style information below the translation box, but it is not a dictionary and has been shown to invent translations in all languages for words it does not recognize. If "Detect language" is selected, text in an unknown language can be automatically identified.

In the web interface, users can suggest alternate translations, such as for technical terms, or correct mistakes. These suggestions may be included in future updates to the translation process. If a user enters a URL in the source text, Google Translate will produce a hyperlink to a machine translation of the website. Users can save translation proposals in a "phrasebook" for later use, and a shareable URL is generated for each translation. For some languages, text can be entered via an on-screen keyboard, handwriting recognition or speech recognition. It is possible to enter searches in a source language that are first translated to a destination language, allowing one to browse and interpret results from the selected destination language in the source language.

Texts written in the Arabic, Cyrillic, Devanagari and Greek scripts can be automatically transliterated from their phonetic equivalents written in the Latin alphabet. The browser version of Google Translate provides the option to show phonetic equivalents of text translated from Japanese to English. The same option is not available on the paid API version.

Many of the more popular languages have a "text-to-speech" audio function that is able to read back a text in that language, up to several hundred words or so. In the case of pluricentric languages, the accent depends on the region: for English, in the Americas, most of the Asia–Pacific and West Asia, the audio uses a female General American accent, whereas in Europe, Hong Kong, Malaysia, Singapore, Guyana and all other parts of the world, a female British (Received Pronunciation) accent is used, except for a special General Australian accent used in Australia, New Zealand and Norfolk Island, and an Indian English accent used in India; for Spanish, in the Americas, a Latin American accent is used, while in other parts of the world, a Castilian accent is used; for French, a Quebec accent is used in Canada, while in other parts of the world, a standard European accent is used; for Bengali, a male Bangladeshi accent is used, except in India, where a special female Indian Bengali accent is used instead. Until March 2023, some less widely spoken languages used the open-source eSpeak synthesizer for their speech, producing a robotic, awkward voice that may be difficult to understand.
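The region-dependent accent rules above follow a "default plus regional override" pattern. The sketch below encodes only the two fully specified cases from the text, French and Bengali. The table layout, the function name and the use of ISO-style language and country codes are assumptions for illustration, not a description of how Google Translate selects voices.

```python
from typing import Optional

# Default accent per language, with region-specific overrides.
# Only rules stated unambiguously in the text are included.
ACCENT_RULES = {
    "fr": {"default": "standard European", "overrides": {"CA": "Quebec"}},
    "bn": {
        "default": "Bangladeshi (male)",
        "overrides": {"IN": "Indian Bengali (female)"},
    },
}


def pick_accent(language: str, region: str) -> Optional[str]:
    """Return the accent for a language in a region, or None if unknown."""
    rule = ACCENT_RULES.get(language)
    if rule is None:
        return None  # language not covered by this sketch
    # Use the regional override when one exists, else the default accent.
    return rule["overrides"].get(region, rule["default"])
```

English and Spanish are omitted because their rules depend on broad regions ("most of the Asia–Pacific", "the Americas") whose exact membership the text does not enumerate.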


Browser integration

Google Translate is available in some web browsers as an optional downloadable extension that can run the translation engine and allows right-click command access to the translation service. In February 2010, Google Translate was integrated into the Google Chrome browser by default, for optional automatic webpage translation.


Mobile app

The Google Translate app for Android and iOS can propose translations for 37 languages via photo, 32 via voice in "conversation mode", and 27 via live video imagery in "augmented reality mode".

The Android app was released in January 2010, and the iOS app on February 8, 2011, after an HTML5 web application was released for iOS users in August 2008. The Android app is compatible with devices running at least Android 2.1, while the iOS app is compatible with iPod Touches, iPads and iPhones updated to iOS 7.0+.

A January 2011 Android version experimented with a "Conversation Mode" that aims to allow users to communicate fluidly with a nearby person in another language. Originally limited to English and Spanish, the feature received support for 12 new languages, still in testing, the following October.

The "Camera input" functionality allows users to take a photograph of a document, signboard, etc. Google Translate recognises the text from the image using optical character recognition (OCR) technology and gives the translation. Camera input is not available for all languages.

In January 2015, the apps gained the ability to propose translations of physical signs in real time using the device's camera, as a result of Google's acquisition of the Word Lens app. The original January launch only supported seven languages, but a July update added support for 20 new languages, with the release of a new implementation that utilizes convolutional neural networks, and also enhanced the speed and quality of Conversation Mode translations (augmented reality). The feature was subsequently renamed Instant Camera. The technology underlying Instant Camera combines image processing and optical character recognition, then attempts to produce cross-language equivalents using standard Google Translate estimations for the text as it is perceived.

On May 11, 2016, Google introduced "Tap to Translate" for Google Translate for Android. Upon highlighting text in an app that is in a foreign language, Translate will pop up inside of the app and offer translations.


API

On May 26, 2011, Google announced that the Google Translate API for software developers had been deprecated and would cease functioning. The Translate API page stated the reason as "substantial economic burden caused by extensive abuse" with an end date set for December 1, 2011. In response to public pressure, Google announced in June 2011 that the API would continue to be available as a paid service. Because the API was used in numerous third-party websites and apps, the original decision to deprecate it led some developers to criticize Google and question the viability of using Google APIs in their products.


Google Assistant

Google Translate also provides translations for Google Assistant and the devices that Google Assistant runs on, such as Google Nest and Pixel Buds.


Supported languages

the following 249 languages, dialects and language varieties written in different scripts ( unique languages and dialects) are supported by Google Translate. # Abkhaz # Acehnese #
Acholi # Afar # Afrikaans # Albanian # Alur # Amharic # Arabic # Armenian # Assamese # Avar # Awadhi # Aymara # Azerbaijani # Balinese # Baluchi # Bambara # Baoulé # Bashkir # Basque # Batak Karo # Batak Simalungun # Batak Toba # Belarusian # Bemba # Bengali # Betawi # Bhojpuri # Bikol # Bosnian # Breton # Bulgarian # Buryat # Cantonese
# Catalan # Cebuano # Chamorro # Chechen # Chichewa # Chinese (Simplified) # Chinese (Traditional) # Chuukese # Chuvash # Corsican # Crimean Tatar (Cyrillic) # Crimean Tatar (Latin) # Croatian # Czech # Danish # Dari # Dhivehi # Dinka # Dogri # Dombe # Dutch # Dyula # Dzongkha # English # Esperanto # Estonian
# Ewe # Faroese # Fijian # Filipino # Finnish # Fon # French # French (Canada) # Frisian # Friulian # Fulani # Ga # Galician # Georgian # German # Greek # Guarani # Gujarati # Haitian Creole # Hakha Chin # Hausa # Hawaiian # Hebrew # Hiligaynon # Hindi # Hmong # Hungarian # Hunsrik # Iban # Icelandic # Igbo # Ilocano # Indonesian # Inuktut (Latin) # Inuktut (Syllabics) # Irish # Italian # Jamaican Patois # Japanese
# Javanese # Jingpo # Kalaallisut # Kannada # Kanuri # Kapampangan # Kazakh # Khasi # Khmer # Kiga # Kikongo # Kinyarwanda # Kituba # Kokborok # Komi # Konkani # Korean # Krio # Kurdish (Kurmanji) # Kurdish (Sorani) # Kyrgyz # Lao # Latgalian # Latin # Latvian # Ligurian # Limburgish # Lingala # Lithuanian # Lombard # Luganda # Luo # Luxembourgish # Macedonian # Madurese # Maithili # Makassar # Malagasy # Malay # Malay (Jawi) # Malayalam # Maltese # Mam # Manx # Maori # Marathi
# Marshallese # Marwadi # Mauritian Creole # Meadow Mari # Meiteilon (Manipuri) # Minang # Mizo # Mongolian # Myanmar (Burmese) # Nahuatl (Eastern Huasteca) # Ndau # Ndebele (South) # Nepalbhasa (Newari) # Nepali # NKo # Norwegian (Bokmål) # Nuer # Occitan # Odia (Oriya) # Oromo # Ossetian # Pangasinan # Papiamento # Pashto # Persian # Polish # Portuguese (Brazil) # Portuguese (Portugal) # Punjabi (Gurmukhi) # Punjabi (Shahmukhi) # Quechua # Qʼeqchiʼ # Romani # Romanian # Rundi # Russian # Sami (North) # Samoan # Sango # Sanskrit # Santali (Latin) # Santali (Ol Chiki) # Scots Gaelic # Sepedi # Serbian # Sesotho # Seychellois Creole # Shan # Shona # Sicilian # Silesian # Sindhi # Sinhala # Slovak # Slovenian # Somali # Spanish
# Sundanese # Susu # Swahili # Swati # Swedish # Tahitian # Tajik # Tamazight # Tamazight (Tifinagh) # Tamil # Tatar # Telugu # Tetum # Thai # Tibetan # Tigrinya # Tiv # Tok Pisin # Tongan # Tshiluba # Tsonga # Tswana # Tulu # Tumbuka # Turkish # Turkmen # Tuvan # Twi # Udmurt # Ukrainian # Urdu # Uyghur # Uzbek # Venda # Venetian # Vietnamese # Waray # Welsh # Wolof # Xhosa # Yakut # Yiddish # Yoruba # Yucatec Maya # Zapotec # Zulu


Languages with text-to-speech support

The following 68 languages, dialects and language varieties currently have text-to-speech support.
# Afrikaans # Albanian # Amharic # Arabic # Basque # Bengali # Bosnian # Bulgarian # Cantonese # Catalan # Chinese (Simplified) # Chinese (Traditional) # Croatian # Czech # Danish # Dutch # English # Estonian
# Filipino # Finnish # French # French (Canada) # Galician # German # Greek # Gujarati # Hausa # Hebrew # Hindi # Hungarian # Icelandic # Indonesian # Italian # Japanese # Javanese # Kannada # Khmer # Korean # Latin # Latvian # Lithuanian # Malay # Malayalam # Marathi # Myanmar (Burmese) # Nepali # Norwegian (Bokmål) # Polish # Portuguese (Brazil) # Portuguese (Portugal) # Punjabi (Gurmukhi) # Romanian # Russian # Serbian # Sinhala # Slovak # Spanish # Sundanese # Swahili # Swedish # Tamil # Telugu # Thai # Turkish # Ukrainian # Urdu # Vietnamese # Welsh


Languages with dictation support

The following 72 languages, dialects and language varieties currently have dictation support.
# Afrikaans # Albanian # Amharic # Arabic # Armenian # Basque # Bengali # Bulgarian # Catalan # Chichewa # Chinese (Simplified) # Chinese (Traditional)
# Croatian # Czech # Danish # Dutch # English # Estonian # Filipino # Finnish # French # German # Greek # Gujarati
# Hausa # Hebrew # Hindi # Hungarian # Igbo # Indonesian # Italian # Japanese # Kannada # Khmer # Korean # Latin
# Latvian # Malay # Malayalam # Marathi # Myanmar (Burmese) # Ndebele (South) # Nepali # Norwegian (Bokmål) # Oromo # Polish # Portuguese (Brazil) # Romanian
# Rundi # Russian # Serbian # Shona # Sinhala # Slovak # Somali # Spanish # Swahili # Swati # Swedish # Tamil
# Telugu # Thai # Tigrinya # Tswana # Turkish # Twi # Ukrainian # Urdu # Vietnamese # Welsh # Yoruba # Zulu


Stages

''(in chronological order of introduction)''
# 1st stage ## English to and from French ## English to and from German ## English to and from Spanish
# 2nd stage ## English to and from Portuguese (Brazil)
# 3rd stage ## English to and from Italian
# 4th stage ## English to and from Chinese (Simplified) ## English to and from Japanese ## English to and from Korean
# 5th stage (launched April 28, 2006) ## English to and from Arabic
# 6th stage (launched December 16, 2006) ## English to and from Russian
# 7th stage (launched February 9, 2007) ## English to and from Chinese (Traditional) ## Chinese (Simplified to and from Traditional)
# 8th stage (all 25 language pairs use Google's machine translation system) (launched October 22, 2007) ## English to and from Dutch ## English to and from Greek
# 9th stage ## English to and from Hindi
# 10th stage (as of this stage, translation can be done between any two languages, using English as an intermediate step, if needed) (launched May 8, 2008) ## Bulgarian ## Croatian ## Czech ## Danish ## Finnish ## Norwegian (Bokmål) ## Polish ## Romanian ## Swedish
# 11th stage (launched September 25, 2008) ## Catalan ## Filipino (Tagalog) ## Hebrew ## Indonesian ## Latvian ## Lithuanian ## Serbian ## Slovak ## Slovene ## Ukrainian ## Vietnamese
# 12th stage (launched January 30, 2009) ## Albanian ## Estonian ## Galician ## Hungarian ## Maltese ## Thai ## Turkish
# 13th stage (launched June 19, 2009) ## Persian
# 14th stage (launched August 24, 2009) ## Afrikaans ## Belarusian ## Icelandic ## Irish ## Macedonian ## Malay ## Swahili ## Welsh ## Yiddish
# 15th stage (launched November 19, 2009) ## The Beta stage is finished. Users can now choose to have the romanization displayed for Belarusian, Bulgarian, Chinese, Greek, Hindi, Japanese, Korean, Russian, Thai and Ukrainian. For translations from Arabic, Hindi and Persian, the user can enter a Latin transliteration of the text, which is converted to the native script as the user types. The text can now be read aloud by a text-to-speech program in English, French, German and Italian.
# 16th stage (launched January 30, 2010) ## Haitian Creole
# 17th stage (launched April 2010) ## Speech program launched in Hindi and Spanish.
# 18th stage (launched May 5, 2010) ## Speech program launched in Afrikaans, Albanian, Catalan, Chinese (Mandarin), Croatian, Czech, Danish, Dutch, Finnish, Greek, Hungarian, Icelandic, Indonesian, Latvian, Macedonian, Norwegian, Polish, Portuguese, Romanian, Russian, Serbian, Slovak, Swahili, Swedish, Turkish, Vietnamese and Welsh (based on eSpeak).
# 19th stage (launched May 13, 2010) ## Armenian ## Azerbaijani ## Basque ## Georgian ## Urdu
# 20th stage (launched June 2010) ## Provides romanization for Arabic.
# 21st stage (launched September 2010) ## Allows phonetic typing for Arabic, Greek, Hindi, Persian, Russian, Serbian and Urdu. ## Latin
# 22nd stage (launched December 2010) ## Romanization of Arabic removed. ## Spell check added. ## For some languages, Google replaced eSpeak's robotic text-to-speech voices with natural native-speaker voices made by SVOX (Chinese, Czech, Danish, Dutch, Finnish, Greek, Hungarian, Norwegian, Polish, Portuguese, Russian, Swedish and Turkish), as well as the older versions of the French, German, Italian and Spanish voices; Latin uses the same synthesizer as Italian. ## Speech program launched in Arabic, Japanese and Korean.
# 23rd stage (launched January 2011) ## Choice of different translations for a word.
# 24th stage (launched June 2011) #* 5 new Indic languages (in alpha) and a transliterated input method: ## Bengali ## Gujarati ## Kannada ## Tamil ## Telugu
# 25th stage (launched July 2011) ## Translation rating introduced.
# 26th stage (launched January 2012) ## Dutch male voice synthesizer replaced with female. ## Elena by SVOX replaced the Slovak eSpeak voice. ## Transliteration of Yiddish added.
# 27th stage (launched February 2012) ## Speech program launched in Thai. ## Esperanto
# 28th stage (launched September 2012) ## Lao
# 29th stage (launched October 2012) ## Transliteration of Lao added. (alpha status)
# 30th stage (launched October 2012) ## New speech program launched in English.
# 31st stage (launched November 2012) ## New speech program in French, German, Italian, Latin and Spanish.
# 32nd stage (launched March 2013) ## Phrasebook added.
# 33rd stage (launched April 2013) ## Khmer
# 34th stage (launched May 2013) ## Bosnian ## Cebuano ## Hmong ## Javanese ## Marathi
# 35th stage (launched May 2013) ## 16 additional languages can be used with camera input: Bulgarian, Catalan, Croatian, Danish, Estonian, Finnish, Hungarian, Indonesian, Icelandic, Latvian, Lithuanian, Norwegian, Romanian, Slovak, Slovenian and Swedish.
# 36th stage (launched December 2013) ## Hausa ## Igbo ## Maori ## Mongolian ## Nepali ## Punjabi (Gurmukhi) ## Somali ## Yoruba ## Zulu
# 37th stage (launched June 2014) ## Definition of words added.
# 38th stage (launched December 2014) ## Burmese ## Chewa ## Kazakh ## Malagasy ## Malayalam ## Sinhala ## Sotho ## Sundanese ## Tajik ## Uzbek
# 39th stage (launched October 2015) ## Transliteration of Arabic restored.
# 40th stage (launched November 2015) ## Aurebesh
# 41st stage (launched February 2016) ## Aurebesh removed. ## Speech program launched in Bengali. ## Amharic ## Corsican ## Hawaiian ## Kurdish (Kurmanji) ## Kyrgyz ## Luxembourgish ## Pashto ## Samoan ## Scottish Gaelic ## Shona ## Sindhi ## West Frisian ## Xhosa
# 42nd stage (launched September 2016) ## Speech program launched in Ukrainian.
# 43rd stage (launched December 2016) ## Speech program launched in Khmer and Sinhala.
# 44th stage (launched June 2018) ## Speech program launched in Burmese, Malayalam, Marathi, Nepali and Telugu.
# 45th stage (launched September 2019) ## Speech program launched in Gujarati, Kannada and Urdu.
# 46th stage (launched February 2020) ## Kinyarwanda ## Odia ## Tatar ## Turkmen ## Uyghur
# 47th stage (launched February 2021) ## Speech program launched in Afrikaans, Bulgarian, Catalan, Icelandic, Latvian, and Serbian (changed from eSpeak to a natural voice). ## New speech system (WaveNet) for several languages.
# 48th stage (launched January 2022) ## Speech program launched in Hebrew.
# 49th stage (launched May 2022) ## Assamese ## Aymara ## Bambara ## Bhojpuri ## Dogri ## Ewe ## Guarani ## Ilocano ## Konkani ## Krio ## Kurdish (Sorani) ## Lingala ## Luganda ## Maithili ## Maldivian ## Meitei ## Mizo ## Northern Sotho ## Oromo ## Quechua ## Sanskrit ## Tigrinya ## Tsonga ## Twi ## eSpeak voice synthesizer removed from Armenian, Esperanto, Macedonian and Welsh.
# 50th stage (launched November 2022) ## Speech program launched in Albanian, Bosnian and Swahili (changed from eSpeak to natural). ## New speech program launched in Malayalam, Marathi and Tamil.
# 51st stage (launched March 2023) ## Speech program launched in Croatian (changed from eSpeak to natural).
# 52nd stage (launched April 2023) ## Speech program launched in Welsh (only on Google search results). ## New speech programs launched in Chinese (simplified and traditional), German, Indonesian, Malay, Malayalam, Tamil, and Telugu (Chinese, German, Indonesian, Malayalam and Tamil reverted from WaveNet).
# 53rd stage (launched June 2023) ## Speech program launched in Lithuanian and Punjabi.
# 54th stage (launched June 2024) ## Abkhaz ## Acehnese ##
Acholi ## Afar ## Alur ## Avar ## Awadhi ## Balinese ## Baluchi ## Baoulé ## Bashkir ## Batak Karo ## Batak Simalungun ## Batak Toba ## Bemba ## Betawi ## Bikol ## Breton ## Buryat ## Cantonese ## Chamorro ## Chechen ## Chuukese ## Chuvash ## Crimean Tatar (Cyrillic) ## Dari ## Dinka ## Dombe ## Dyula ## Dzongkha ## Faroese ## Fijian ## Fon ## Friulian ## Fulani ## Ga ## Hakha Chin ## Hiligaynon ## Hunsrik ## Iban ## Jamaican Patois ## Jingpo ## Kalaallisut ## Kanuri ## Kapampangan ## Khasi ## Kiga ## Kikongo ## Kituba ## Kokborok ## Komi ## Latgalian ## Ligurian ## Limburgish ## Lombard ## Luo ## Madurese ## Makassar ## Malay (Jawi) ## Mam ## Manx ## Marshallese ## Marwadi ## Mauritian Creole ## Meadow Mari ## Minang ## Nahuatl (Eastern Huasteca) ## Ndau ## Ndebele (South) ## Nepalbhasa (Newari) ## NKo ## Nuer ##
Occitan Occitan may refer to: * Something of, from, or related to the Occitania territory in parts of France, Italy, Monaco and Spain. * Something of, from, or related to the Occitania administrative region of France. * Occitan language, spoken in parts o ...
## Ossetian ##
Pangasinan Pangasinan, officially the Province of Pangasinan (, ; ; ), is a coastal Provinces of the Philippines, province in the Philippines located in the Ilocos Region of Luzon. Its capital is Lingayen, Pangasinan, Lingayen while San Carlos, Pangasi ...
##
Papiamento Papiamento () or Papiamentu (; ) is a Portuguese-based creole language spoken in the Dutch Caribbean. It is the most widely spoken language on Aruba, Bonaire, and Curaçao ( ABC Islands). The language, spelled in Aruba and in Bonaire and ...
## Portuguese (
Portugal Portugal, officially the Portuguese Republic, is a country on the Iberian Peninsula in Southwestern Europe. Featuring Cabo da Roca, the westernmost point in continental Europe, Portugal borders Spain to its north and east, with which it share ...
) ## Punjabi (
Shahmukhi Shahmukhi (, , , ) is the right-to-left abjad-based script developed from the Perso-Arabic alphabet used for the Punjabi language varieties, predominantly in Punjab, Pakistan. It is generally written in the Nastaʿlīq calligraphic hand, whic ...
) ##
Qʼeqchiʼ Qʼeqchiʼ () (Kʼekchiʼ in the former orthography, or simply Kekchi in many English-language contexts, such as in Belize) are a Maya people Maya () are an ethnolinguistic group of Indigenous peoples of the Americas, Indigenous peoples o ...
## Romani ## Rundi ## Sami (North) ## Sango ## Santali ##
Seychellois Creole Seychellois Creole (), also known as Kreol, Seselwa Creole French, and Seselwa Creole is the French-based creole language spoken by the Seychellois Creole people, Seychelles Creole people of the Seychelles. It is one of the national language, na ...
## Shan ## Sicilian ## Silesian ## Susu ## Swati ## Tahitian ##
Tamazight The Berber languages, also known as the Amazigh languages or Tamazight, are a branch of the Afroasiatic language family. They comprise a group of closely related but mostly mutually unintelligible languages spoken by Berber communities, who ar ...
##
Tamazight The Berber languages, also known as the Amazigh languages or Tamazight, are a branch of the Afroasiatic language family. They comprise a group of closely related but mostly mutually unintelligible languages spoken by Berber communities, who ar ...
(
Tifinagh Tifinagh ( Tuareg Berber language: ; Neo-Tifinagh: ; Berber Latin alphabet: ; ) is a script used to write the Berber languages. Tifinagh is descended from the ancient Libyco-Berber alphabet. The traditional Tifinagh, sometimes called Tuareg Tifi ...
) ##
Tetum Tetum may refer to: * Tetum language, an Austronesian language ** Tetum alphabet, used to write the Tetum language * Tetum people, an ethnic group of East Timor and Indonesia {{disambiguation Language and nationality disambiguation pages ...
## Tibetan ## Tiv ##
Tok Pisin Tok Pisin ( ,Laurie Bauer, 2007, ''The Linguistics Student's Handbook'', Edinburgh ; ), often referred to by English speakers as New Guinea Pidgin or simply Pidgin, is an English-based creole languages, English creole language spoken throughou ...
## Tongan ##
Tswana Tswana may refer to: * Tswana people, the Bantu languages, Bantu speaking people in Botswana, South Africa, Namibia, Zimbabwe, Zambia, and other Southern Africa regions * Tswana language, the language spoken by the (Ba)Tswana people * Tswanaland, ...
## Tulu ## Tumbuka ## Tuvan ## Udmurt ##
Venda Venda ( ), officially the Republic of Venda (; ), was a Bantustan in northern South Africa. It was fairly close to the South African border with Zimbabwe to the north, while, to the south and east, it shared a long border with another black hom ...
## Venetian ## Waray ##
Wolof Wolof or Wollof may refer to: * Wolof people, an ethnic group found in Senegal, Gambia, and Mauritania * Wolof language, a language spoken in Senegal, Gambia, and Mauritania * The Wolof or Jolof Empire, a medieval West African successor of the Mal ...
## Yakut ##
Yucatec Maya Yucatec Maya ( ; referred to by its speakers as or ) is a Mayan languages, Mayan language spoken in the Yucatán Peninsula, including part of northern Belize. There is also a significant diasporic community of Yucatec Maya speakers in San Fra ...
## Zapotec ## Speech program launched in Amharic, Bulgarian, Cantonese, Galician, Hausa, and Welsh # 55th stage (launched October 2024) ## Crimean Tatar (
Latin Latin ( or ) is a classical language belonging to the Italic languages, Italic branch of the Indo-European languages. Latin was originally spoken by the Latins (Italic tribe), Latins in Latium (now known as Lazio), the lower Tiber area aroun ...
) ## French (
Canada Canada is a country in North America. Its Provinces and territories of Canada, ten provinces and three territories extend from the Atlantic Ocean to the Pacific Ocean and northward into the Arctic Ocean, making it the world's List of coun ...
) ##
Inuktut Inuktut is the collective name for the Inuit languages. It is used by Inuit Tapiriit Kanatami, the Inuit Circumpolar Council, and the Government of Nunavut throughout Inuit Nunaat and Inuit Nunangat. Usage Inuit Tapiriit Kanatami (ITK) says ...
(Latin) ##
Inuktut Inuktut is the collective name for the Inuit languages. It is used by Inuit Tapiriit Kanatami, the Inuit Circumpolar Council, and the Government of Nunavut throughout Inuit Nunaat and Inuit Nunangat. Usage Inuit Tapiriit Kanatami (ITK) says ...
( Syllabics) ## Santali ( Ol Chiki) ## Tshiluba


Translation methodology

In April 2006, Google Translate launched with a statistical machine translation engine. Google Translate does not apply grammatical rules, since its algorithms are based on statistical or pattern analysis rather than traditional rule-based analysis. The system's original creator, Franz Josef Och, has criticized the effectiveness of rule-based algorithms in favor of statistical approaches. Original versions of Google Translate were based on a method called statistical machine translation, and more specifically, on research by Och, who won the DARPA contest for speed machine translation in 2003. Och was the head of Google's machine translation group until leaving to join Human Longevity, Inc. in July 2014.

Google Translate does not always translate directly from one language to another (L1 → L2). Instead, it often translates first to English and then to the target language (L1 → EN → L2). However, because English, like all human languages, is ambiguous and depends on context, this can cause translation errors. For example, a second-person pronoun translated from French to Russian passes through the English ''you'', which can be rendered by more than one Russian pronoun. If Google were using an unambiguous, artificial language as the intermediary, the forms corresponding to ''you'' and ''thou'' would stay distinct, and each would map to a single Russian pronoun. Keeping the two forms distinct in the intermediary disambiguates their meanings. Hence, publishing in English, using unambiguous words, providing context, or using expressions such as "you all" may or may not make a better one-step translation depending on the target language.

The following languages do not have a direct Google translation to or from English. These languages are translated through the indicated intermediate language (which in most cases is closely related to the desired language but more widely spoken) in addition to through English:
* Belarusian (be ↔ ru ↔ en ↔ other);
* Catalan (ca ↔ es ↔ en ↔ other);
* Galician (gl ↔ pt ↔ en ↔ other);
* Haitian Creole (ht ↔ fr ↔ en ↔ other);
* Korean (ko ↔ ja ↔ en ↔ other);
* Slovak (sk ↔ cs ↔ en ↔ other);
* Ukrainian (uk ↔ ru ↔ en ↔ other);
* Urdu (ur ↔ hi ↔ en ↔ other).

According to Och, a solid base for developing a usable statistical machine translation system for a new pair of languages from scratch would consist of a bilingual text corpus (or parallel collection) of more than 150–200 million words, and two monolingual corpora each of more than a billion words. Statistical models from these data are then used to translate between those languages. To acquire this huge amount of linguistic data, Google used United Nations and European Parliament documents and transcripts. The UN typically publishes documents in all six official UN languages, which has produced a very large 6-language corpus. Google representatives have been involved with domestic conferences in Japan, where they have solicited bilingual data from researchers.

When Google Translate generates a translation proposal, it looks for patterns in hundreds of millions of documents to help decide on the best translation. By detecting patterns in documents that have already been translated by human translators, Google Translate makes informed guesses (AI) as to what an appropriate translation should be.

Before October 2007, for languages other than Arabic, Chinese and Russian, Google Translate was based on SYSTRAN, a software engine that was also used by several other online translation services such as Babel Fish (now defunct). From October 2007, Google Translate used proprietary, in-house technology based on statistical machine translation instead, before transitioning to neural machine translation.
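
The pivoting scheme described above (L1 → intermediate → EN → intermediate → L2) can be illustrated with a small sketch. This is only a naive model built from the list of intermediate languages in this section; the `PIVOT` table and the `route` function are hypothetical names, not part of any Google API, and the sketch ignores any shortcuts a real system might take (for example, translating Russian to Belarusian without going through English).

```python
# Hypothetical pivot table copied from the list in this section;
# it is an illustration, not an official data source.
PIVOT = {
    "be": "ru", "ca": "es", "gl": "pt", "ht": "fr",
    "ko": "ja", "sk": "cs", "uk": "ru", "ur": "hi",
}


def route(source: str, target: str) -> list[str]:
    """Return the chain of language codes a translation would pass through."""
    if source == target:
        return [source]

    # Leg from the source language up to English.
    up = [source]
    if source != "en":
        if source in PIVOT:
            up.append(PIVOT[source])
        up.append("en")

    # Leg from English down to the target, built backwards then reversed.
    down = [target]
    if target != "en":
        if target in PIVOT:
            down.append(PIVOT[target])
        down.append("en")
    down.reverse()

    # Both legs contain "en"; join them without repeating it.
    return up + down[1:]
```

Under these assumptions, `route("fr", "ru")` yields a single English hop, while `route("be", "uk")` passes through Russian twice, which shows how each extra hop adds another opportunity for ambiguity to creep in.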


Google Translate Community

Google used to have crowdsourcing features for volunteers to be a part of its "Translate Community", intended to help improve Google Translate's accuracy. Volunteers could select up to five languages to help improve translation; users could verify translated phrases and translate phrases in their languages to and from English, helping to improve the accuracy of translating rarer and more complex phrases. In August 2016, a Google Crowdsource app was released for Android users, in which translation tasks were offered. There were three ways to contribute. First, Google showed a phrase for which the user typed a translation. Second, Google showed a proposed translation for a user to agree with, disagree with, or skip. Third, users could suggest translations for phrases where they thought they could improve on Google's results. Tests in 44 languages showed that the "suggest an edit" feature led to an improvement in a maximum of 40% of cases over four years. Despite its role in improving translation quality and expanding language coverage, Google closed the Translate Community on March 28, 2024.


Statistical machine translation

Although Google has deployed a newer system called neural machine translation for better-quality translation, some languages still use the older method called statistical machine translation. Rather than applying hand-written grammatical rules, this method uses predictive algorithms to guess how to translate texts in foreign languages. It aims to translate whole phrases rather than single words, then gathers overlapping phrases for translation. It also analyzes bilingual text corpora to generate a statistical model that translates texts from one language to another.
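
As a rough illustration of the phrase-based statistical idea described above, the following toy sketch estimates translation probabilities by relative frequency from a handful of aligned phrase pairs and then translates greedily, preferring the longest known phrase. The aligned pairs, the function names, and the greedy decoder are all assumptions made for this example; production systems add word alignment models, language models, and far larger corpora.

```python
from collections import Counter, defaultdict

# Hypothetical aligned phrase pairs (English, Spanish), invented for illustration.
ALIGNED = [
    ("bank", "banco"), ("bank", "banco"), ("bank", "orilla"),
    ("the bank", "el banco"), ("river bank", "orilla del río"),
]


def build_phrase_table(pairs):
    """Estimate P(target | source) by relative frequency of aligned pairs."""
    counts = defaultdict(Counter)
    for src, tgt in pairs:
        counts[src][tgt] += 1
    return {
        src: {tgt: n / sum(c.values()) for tgt, n in c.items()}
        for src, c in counts.items()
    }


def translate(text, table, max_len=3):
    """Greedy longest-match decoding: whole phrases win over single words."""
    words, out, i = text.split(), [], 0
    while i < len(words):
        for n in range(min(max_len, len(words) - i), 0, -1):
            phrase = " ".join(words[i:i + n])
            if phrase in table:
                options = table[phrase]
                # Pick the most probable target phrase for this source phrase.
                out.append(max(options, key=options.get))
                i += n
                break
        else:
            out.append(words[i])  # unknown word: copy it through unchanged
            i += 1
    return " ".join(out)
```

With this table, the single word "bank" becomes the majority choice "banco", while the two-word phrase "river bank" is matched as a unit, which is the advantage of translating phrases rather than isolated words.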


Neural machine translation

In September 2016, a research team at Google announced the development of the Google Neural Machine Translation system (GNMT) to increase fluency and accuracy in Google Translate, and in November announced that Google Translate would switch to GNMT. Google Translate's neural machine translation system used a large end-to-end artificial neural network that performs deep learning, in particular long short-term memory networks. GNMT improved the quality of translation over SMT in some instances because it used an example-based machine translation (EBMT) method in which the system "learns from millions of examples." According to Google researchers, it translated "whole sentences at a time, rather than just piece by piece. It uses this broader context to help it figure out the most relevant translation, which it then rearranges and adjusts to be more like a human speaking with proper grammar". GNMT's "proposed architecture" of "system learning" has been implemented on over a hundred languages supported by Google Translate. With the end-to-end framework, Google states, but does not demonstrate for most languages, that "the system learns over time to create better, more natural translations." The GNMT network attempts interlingual machine translation, which encodes the "semantics of the sentence rather than simply memorizing phrase-to-phrase translations"; the system did not invent its own universal language, but uses "the commonality found in between many languages".

GNMT was first enabled for eight languages: to and from English and Chinese, French, German, Japanese, Korean, Portuguese, Spanish and Turkish. In March 2017, it was enabled for Hindi, Russian and Vietnamese, followed by Bengali, Gujarati, Indonesian, Kannada, Malayalam, Marathi, Punjabi, Tamil and Telugu in April. Since 2020, Google has phased out GNMT and has implemented deep learning networks based on transformers.


Accuracy

Google Translate is not as reliable as human translation. When text is well-structured, written using formal language, with simple sentences, relating to formal topics for which training data is ample, it often produces conversions similar to human translations between English and a number of high-resource languages. Accuracy decreases for those languages when fewer of those conditions apply, for example when sentence length increases or the text uses familiar or literary language. For many other languages vis-à-vis English, it can produce the gist of text in those formal circumstances. Human evaluation from English to all 102 languages shows that the main idea of a text is conveyed more than 50% of the time for 35 languages. For 67 languages, a minimally comprehensible result is not achieved 50% of the time or greater. A few studies have evaluated Chinese, French, German, and Spanish to English, but no systematic human evaluation has been conducted from most Google Translate languages to English. Speculative language-to-language scores extrapolated from English-to-other measurements indicate that Google Translate will produce translation results that convey the gist of a text from one language to another more than half the time in about 1% of language pairs, where neither language is English.

Research conducted in 2011 showed that Google Translate got a slightly higher score than the UCLA minimum score for the English Proficiency Exam. Due to its identical choice of words without considering the flexibility of choosing alternative words or expressions, it produces a relatively similar translation to human translation from the perspective of formality, referential cohesion, and conceptual cohesion. Moreover, a number of languages are translated into a sentence structure and sentence length similar to a human translation. Furthermore, Google carried out a test that required native speakers of each language to rate the translation on a scale between 0 and 6, and Google Translate scored 5.43 on average.

When used as a dictionary to translate single words, Google Translate is highly inaccurate because it must guess between polysemic words. Among the top 100 words in the English language, which make up more than 50% of all written English, the average word has more than 15 senses, which puts the chance of a correct translation at roughly 1 in 15 if each sense maps to a different word in the target language. Most common English words have at least two senses, which produces 50/50 odds in the likely case that the target language uses different words for those different senses. The odds are similar from other languages to English. Google Translate makes statistical guesses that raise the likelihood of producing the most frequent sense of a word, with the consequence that an accurate translation will be unobtainable in cases that do not match the majority or plurality corpus occurrence. The accuracy of single-word predictions has not been measured for any language. Because almost all non-English language pairs pivot through English, the odds against obtaining accurate single-word translations from one non-English language to another can be estimated by multiplying the number of senses in the source language by the number of senses each of those terms has in English. When Google Translate does not have a word in its vocabulary, it makes up a result as part of its algorithm.
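
The single-word arithmetic above can be made concrete with a short sketch. It assumes, as the paragraph's estimate does, that every sense is equally likely to be chosen at each hop; the function name is hypothetical, and the result is a back-of-the-envelope figure rather than a measured accuracy, since the service actually favours the most frequent sense.

```python
def chance_of_correct_sense(*senses_per_hop: int) -> float:
    """Chance of landing on the intended sense if each hop picks uniformly at random.

    Pass one number per hop, e.g. (senses in source word, senses in English pivot).
    """
    combos = 1
    for senses in senses_per_hop:
        if senses < 1:
            raise ValueError("each hop needs at least one sense")
        combos *= senses  # multiply the sense counts, as described in the text
    return 1 / combos
```

For a word with two senses this gives 0.5; for a word with 15 senses it gives 1/15; and for a source word with 3 senses routed through an English word with 15 senses it gives 1/45.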


Limitations

Google Translate, like other automatic translation tools, has its limitations. It struggles with polysemy (the multiple meanings a word may have) and multiword expressions (terms that have meanings that cannot be understood or translated by analyzing the individual word units that compose them). A word in a foreign language might have two different meanings in the translated language, which might lead to mistranslation. Additionally, grammatical errors remain a major limitation to the accuracy of Google Translate (International Journal of Linguistics, Literature and Translation (IJLLT) 2(3):196-200, 2019; retrieved August 26, 2020). Google Translate struggles to differentiate between ''imperfect'' and ''perfect'' aspects in Romance languages. The subjunctive mood is often erroneous. Moreover, the formal second person is often chosen, whatever the context. Since its English reference material contains only "you" forms, it has difficulty translating a language with "you all" or formal "you" variations.

Due to differences between languages in investment, research, and the extent of digital resources, the accuracy of Google Translate varies greatly among languages. Some languages produce better results than others. Most languages from Africa, Asia, and the Pacific tend to score poorly in relation to the scores of many well-financed European languages, Afrikaans and Chinese being the high-scoring exceptions from their continents. No languages indigenous to Australia are included within Google Translate. Higher scores for European languages can be partially attributed to the Europarl Corpus, a trove of documents from the European Parliament that have been professionally translated by the mandate of the European Union into as many as 21 languages. A 2010 analysis indicated that French to English translation is relatively accurate, and 2011 and 2012 analyses showed that Italian to English translation is relatively accurate as well. However, if the source text is shorter, rule-based machine translations often perform better; this effect is particularly evident in Chinese to English translations. While edits of translations may be submitted, in Chinese specifically one cannot edit sentences as a whole. Instead, one must edit sometimes arbitrary sets of characters, leading to incorrect edits.

The service can be used as a dictionary by typing in words. One can translate from a book by using a scanner and an OCR tool such as Google Drive. In its Written Words Translation function, there is a word limit on the amount of text that can be translated at once. Therefore, long text should be transferred to a document form and translated through its Document Translate function.


Open-source licenses and components

Irish language data from Foras na Gaeilge's New English-Irish Dictionary. (English database designed and developed for Foras na Gaeilge by Lexicography MasterClass Ltd.) Welsh language data from Gweiadur by Gwerin. Certain content is copyrighted by Oxford University Press, United States. Some phrase translations come from Wikitravel.


Reviews

Shortly after launching the translation service for the first time, Google won an international competition for English–Arabic and English–Chinese machine translation.


Translation mistakes and oddities

Since Google Translate uses statistical matching to translate, translated text can often include apparently nonsensical and obvious errors, often swapping common terms for similar but nonequivalent common terms in the other language, as well as inverting sentence meaning. Novelty websites like Bad Translator and Translation Party have used the service to produce humorous text by translating back and forth between multiple languages, similar to the children's game telephone.
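
The back-and-forth game played by such novelty sites can be sketched as a loop that keeps translating until the text stops changing. Everything here is an assumption for illustration: `round_trip` and `toy_translate` are hypothetical names, and the tiny lookup table stands in for a real translation service so that the example runs offline.

```python
from typing import Callable


def round_trip(text: str, translate: Callable[[str, str, str], str],
               a: str = "en", b: str = "ja", max_rounds: int = 10) -> list[str]:
    """Bounce text between languages a and b until it reaches a fixed point."""
    history = [text]
    for _ in range(max_rounds):
        there = translate(history[-1], a, b)
        back = translate(there, b, a)
        if back == history[-1]:
            break  # the round trip no longer changes the text
        history.append(back)
    return history


# Toy stand-in for a translation service: two English phrases collapse
# onto one Japanese phrase, so the return trip loses the original wording.
_TOY = {
    ("en", "ja"): {"big dog": "大きい犬", "large dog": "大きい犬"},
    ("ja", "en"): {"大きい犬": "large dog"},
}


def toy_translate(text: str, src: str, tgt: str) -> str:
    """Look the text up in the toy table; unknown text passes through unchanged."""
    return _TOY[(src, tgt)].get(text, text)
```

Calling `round_trip("big dog", toy_translate)` records each distinct English version seen along the way, showing how a many-to-one mapping in one direction causes drift on the way back.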


Replying to @sarah_mcdonald(s)

Certain texts in Japanese have been shown to be translated to "Replying to @sarah_mcdonald(s)" in English, often with no relation to the source text. Examples include "もーるるるるるるるる", "バチバチで草" and "絵にfう". This has been asked about on multiple platforms, including YouTube.


See also

* Apertium
* Babel Fish (discontinued; redirects to the main Yahoo! site)
* Baidu Fanyi
* Comparison of machine translation applications
* DeepL Translator
* Google Dictionary
* Google Translator Toolkit
* Jollo (discontinued)
* List of Google products
* Microsoft Translator
* PROMT
* Reverso
* Smartcat
* Speech Recognition & Synthesis
* SYSTRAN
* Translate (Apple)
* Word Lens (discontinued; merged into Google Translate app)
* Yandex Translate


External links

* Contribute