Microsoft Translator is a multilingual
machine translation cloud service provided by
Microsoft. Microsoft Translator is a part of Microsoft Cognitive Services and integrated across multiple consumer, developer, and enterprise products; including
Bing,
Microsoft Office,
SharePoint,
Microsoft Edge,
Microsoft Lync
Skype for Business (formerly Microsoft Lync and Office Communicator) is an enterprise software application for instant messaging and videotelephony developed by Microsoft as part of the Microsoft Office suite. It is designed for use with the on- ...
,
Yammer,
Skype Translator,
Visual Studio, and Microsoft Translator apps for
Windows,
Windows Phone,
iPhone and
Apple Watch, and
Android
Android may refer to:
Science and technology
* Android (robot), a humanoid robot or synthetic organism designed to imitate a human
* Android (operating system), Google's mobile operating system
** Bugdroid, a Google mascot sometimes referred to ...
phone and
Android Wear.
Microsoft Translator also offers text and speech translation through cloud services for businesses. Service for text translation via the Translator Text API ranges from a free tier supporting two million characters per month to paid tiers supporting billions of characters per month.
Speech translation via Microsoft Speech services is offered based on time of the audio stream.
The service supports text translation between languages and language varieties as of . It also supports several speech translation systems that currently power the Microsoft Translator live conversation feature, Skype Translator, and Skype for Windows Desktop, and the Microsoft Translator Apps for
iOS and
Android
Android may refer to:
Science and technology
* Android (robot), a humanoid robot or synthetic organism designed to imitate a human
* Android (operating system), Google's mobile operating system
** Bugdroid, a Google mascot sometimes referred to ...
.
Development
History
The first version of Microsoft's machine translation system was developed between 1999 and 2000 within
Microsoft Research
Microsoft Research (MSR) is the research subsidiary of Microsoft. It was created in 1991 by Richard Rashid, Bill Gates and Nathan Myhrvold with the intent to advance state-of-the-art computing and solve difficult world problems through technologi ...
. This system was based on semantic predicate-argument structures known as
logical forms (LF) and was spun from the grammar correction feature developed for
Microsoft Word. This system was eventually used to translate the entire Microsoft Knowledge Base into Spanish, French, German, and Japanese.
Microsoft's approach to machine translation, like most modern machine translation systems, is "
data driven":
rather than relying on writing explicit rules to translate
natural language
In neuropsychology, linguistics, and philosophy of language, a natural language or ordinary language is any language that has evolved naturally in humans through use and repetition without conscious planning or premeditation. Natural languages ...
, algorithms are trained to understand and interpret translated
parallel texts, allowing them to automatically learn how to translate new natural language text. Microsoft's experience with the LF system led directly to a treelet translation system that simplified the LF to
dependency trees and eventually to an order template model, significantly improving in speed and enabling the incorporation of new target languages.
The consumer-facing translation site known as Bing Translator (previously known as Windows Live translator) was launched in 2007 and provides free text and website translations on the web. Text is translated directly within the Bing Translator webpage while websites are translated through the Bilingual Viewer tools.
In 2011, the service was extended to include numerous Microsoft Translator products through a
cloud-based application programming interface
An application programming interface (API) is a way for two or more computer programs to communicate with each other. It is a type of software interface, offering a service to other pieces of software. A document or standard that describes how t ...
, which supports products available to both consumer and enterprise users. An additional speech translation capability was introduced in March 2016.
In May 2018, an update to the API was introduced. This new version offered neural machine translation as the default method of translating. In addition to translation, the new version features
transliteration and a bilingual dictionary to look up words to find alternative translations and to view examples in sentences.
Speech translation was integrated into Microsoft Speech services in September 2018, providing end-to-end speech, speech-to-text, and text-to-speech translation.
Translation methodology and research
Microsoft Translator uses machine translation to create instantaneous translations from one natural language to another. This system is based on four distinct areas of computer learning research seen below.
Accuracy
The quality of Microsoft Translator’s machine translation outputs are evaluated using a method called the
BLEU score.
BLEU (Bilingual Evaluation Understudy) is an algorithm for evaluating the quality of text which has been machine-translated from one
natural language
In neuropsychology, linguistics, and philosophy of language, a natural language or ordinary language is any language that has evolved naturally in humans through use and repetition without conscious planning or premeditation. Natural languages ...
to another. Quality is considered to be the correspondence between a machine's output and that of a human. BLEU was one of the first
metrics to achieve a high
correlation
In statistics, correlation or dependence is any statistical relationship, whether causal or not, between two random variables or bivariate data. Although in the broadest sense, "correlation" may indicate any type of association, in statistics ...
with human judgments of quality, and remains one of the most popular automated and inexpensive metrics.
Because machine translation is based on statistical algorithms rather than human translators, the automatic translations it produces are not always entirely accurate. Microsoft Translator has introduced various feedback features, such as the Collaborative Translation Framework, into its products to allow users to suggest alternative translations. These alternative translations are then integrated into the Microsoft Translator algorithms to improve future translations.
In November 2016, Microsoft Translator introduced translation using deep neural networks in nine of its highest traffic languages, including all of its speech languages and Japanese. Neural networks provide better translation than industry standard statistical machine learning.
Core products
Microsoft Translator is a cloud-based API that is integrated into numerous Microsoft products and services. The Translator API can be used on its own and can be customized for use in a pre-publishing or post-publishing environment. The API, which is available through subscription, is free for lower translation volumes, and is charged according to a tiered payment system for volumes exceeding two million characters per month.
The remaining core products are available for free.
Microsoft Translator cloud translation
The Microsoft Translator is a cloud-based automatic translation service that can be used to build applications, websites, and tools requiring multi-language support.
*Text translation: The Microsoft Translator Text API can be used to translate text into any of the languages supported by the service.
*Speech translation: Microsoft Translator is integrated into Microsoft Speech services which is an end-to-end REST based API that can be used to build applications, tools, or any solution requiring multi-languages speech translation. Speech to speech translation is available to or from any of the conversation languages, and speech to text translation is available from the conversation languages into any of the Microsoft Translator-supported language systems.
Custom Translator
Custom Translator is a feature of the Microsoft Translator services that allows enterprises, app developers, and language service providers to build neural translation systems that understand the terminology used in their own business and industry. The customized translations can then be delivered into existing applications, workflows and websites using a normal call to the Microsoft Translator API. Custom Translator can be used when translating text with the Microsoft Translator Text API and when translating speech with Microsoft Speech services.
Live feature
A personal universal translator that enables up to 500 people to have live, multi-device, multi-language, in person translated conversations. This feature is currently free and available in the Microsoft Translator apps (Android, iOS or Windows) and from the browser at:
Microsoft Translator Hub
The Microsoft Translator Hub allows enterprises and language service providers to build their own translation systems that understand business- and industry-specific terminology. The Hub can also be used in conjunction with the CTF, allowing administrators to approve CTF results and add them directly to the Hub. The Microsoft Translator Hub is only available for statistical machine translation and cannot be used with the newest version of the Microsoft Translator API.
The Hub has also been used for language preservation, allowing communities to create their own language translation systems for language and cultural preservation. The Hub has been used to create translation systems for languages such as Hmong, Mayan, Nepali, and Welsh.
Multilingual App Toolkit (MAT)
The Multilingual App Toolkit (MAT) is an integrated Visual Studio tool, which allows developers to streamline localization workflows of their Windows, Windows Phone and desktop apps. MAT improves localization of file management, translation support, and editing tools.
Microsoft Translator web app
Microsoft Translator (previously Live Search Translator, Windows Live Translator, and Bing Translator) is a user facing translation portal provided by
Microsoft as part of its
Bing services to translate texts or entire web pages into different languages. All translation pairs are powered by the Microsoft Translator, a
Neural machine translation platform and web service, developed by
Microsoft Research
Microsoft Research (MSR) is the research subsidiary of Microsoft. It was created in 1991 by Richard Rashid, Bill Gates and Nathan Myhrvold with the intent to advance state-of-the-art computing and solve difficult world problems through technologi ...
, as its backend translation software. Two transliteration pairs (between
Chinese (Simplified) and
Chinese (Traditional)
Traditional Chinese characters are one type of standard Chinese character sets of the contemporary written Chinese. The traditional characters had taken shapes since the clerical change and mostly remained in the same structure they took at ...
) are provided by Microsoft's Windows International team.
Bing Translator can translate phrases entered by the user or acquire a link to a web page and translate it entirely. When translating an entire web page, or when the user selects "Translate this page" in
Bing search results, the Bilingual Viewer is shown, which allows users to browse the original web page text and translation in parallel, supported by synchronized highlights, scrolling, and navigation.
Four Bilingual Viewer layouts are available:
*Side by side
*Top and bottom
*Original with hover translation
*Translation with hover original
Bing Translator integrates with several other Microsoft products. The following is a table of products into which Bing Translator is integrated or may be integrated:
Supported products
Through its core product offerings, Microsoft Translator supports the translation features of many Microsoft products at the consumer and enterprise levels. These products fall broadly into three categories — communication products, Microsoft Office, and apps.
Communication
*
Lync
*
SharePoint
*
Yammer
*
Skype Translator
Microsoft Office
*
Excel
ExCeL London (an abbreviation for Exhibition Centre London) is an exhibition centre, international convention centre and former hospital in the Custom House area of Newham, East London. It is situated on a site on the northern quay of the ...
*
OneNote
*
Outlook
*
PowerPoint
*
Publisher
*
Visio
*
Word
*Word Online
Apps
*Web app (translator.microsoft.com)
*
Windows and
Windows 10
*
Windows Phone
*
iPhone and
Apple Watch
*
Android
Android may refer to:
Science and technology
* Android (robot), a humanoid robot or synthetic organism designed to imitate a human
* Android (operating system), Google's mobile operating system
** Bugdroid, a Google mascot sometimes referred to ...
phone and
Android Wear
*
Kindle Fire
*
Skype Translator
*
Microsoft Edge
*
Microsoft SwiftKey
Deprecated products
Collaborative Translation Framework (CTF)
The Collaborative Translation Framework (CTF) is an extension of the Microsoft Translator API that allows post-publishing improvement of translated text. By using the CTF, readers have the ability to suggest alternative translations to those provided by the API, or vote on previously offered alternatives. This information is then delivered to the API to improve future translations.
Translator Web Widget
The Translator Web Widget is a translation tool that can be added to web pages by pasting a predefined snippet of JavaScript code into the page. The web widget is offered for free by Microsoft, and supports both pre-publishing customized translations using the Translator Hub, and post-publishing improvements using the Collaborative Translation Framework.
Supported languages
As of , Microsoft Translator supports
110 languages and language varieties.
The list of supported languages is available at the Microsoft Translator website and can also be retrieved programmatically through the cloud services.
#
Afrikaans
#
Albanian
Albanian may refer to:
*Pertaining to Albania in Southeast Europe; in particular:
**Albanians, an ethnic group native to the Balkans
**Albanian language
**Albanian culture
**Demographics of Albania, includes other ethnic groups within the country ...
#
Amharic
Amharic ( or ; (Amharic: ), ', ) is an Ethiopian Semitic language, which is a subgrouping within the Semitic branch of the Afroasiatic languages. It is spoken as a first language by the Amharas, and also serves as a lingua franca for all oth ...
#
Arabic
#
Armenian
#
Assamese
Assamese may refer to:
* Assamese people, a socio-ethnolinguistic identity of north-eastern India
* People of Assam, multi-ethnic, multi-linguistic and multi-religious people of Assam
* Assamese language, one of the easternmost Indo-Aryan language ...
#
Azerbaijani
Azerbaijani may refer to:
* Something of, or related to Azerbaijan
* Azerbaijanis
* Azerbaijani language
See also
* Azerbaijan (disambiguation)
* Azeri (disambiguation)
* Azerbaijani cuisine
* Culture of Azerbaijan
The culture of Azerbaijan ...
#
Bangla
Bangla (Bengali: বাংলা) may refer to:
*Bengali language, an eastern Indo-Aryan language
*The endonym of Bengal, a geographical and ethno-linguistic region in South Asia
*''Bangla-'', a prefix indicating Bangladesh
Businesses and organ ...
#
Bashkir
#
Basque
#
Bosnian
#
Bulgarian
#
Cantonese (Traditional)
#
Catalan
#
Chinese (Literary)
#
Chinese (Simplified)
#
Chinese (Traditional)
Traditional Chinese characters are one type of standard Chinese character sets of the contemporary written Chinese. The traditional characters had taken shapes since the clerical change and mostly remained in the same structure they took at ...
#
Croatian
#
Czech
#
Danish
#
Dari
#
Divehi
#
Dutch
#
English
#
Estonian
Estonian may refer to:
* Something of, from, or related to Estonia, a country in the Baltic region in northern Europe
* Estonians, people from Estonia, or of Estonian descent
* Estonian language
* Estonian cuisine
* Estonian culture
See also ...
#
Faroese
#
Fijian
#
Filipino
#
Finnish
#
French
French (french: français(e), link=no) may refer to:
* Something of, from, or related to France
** French language, which originated in France, and its various dialects and accents
** French people, a nation and ethnic group identified with Franc ...
#
French (Canada)
#
Galician
#
Georgian
#
German
#
Greek
#
Gujarati
#
Haitian Creole
Haitian Creole (; ht, kreyòl ayisyen, links=no, ; french: créole haïtien, links=no, ), commonly referred to as simply ''Creole'', or ''Kreyòl'' in the Creole language, is a French-based creole language spoken by 10–12million people wor ...
#
Hebrew
#
Hindi
#
Hmong Daw
Hmong / Mong (; RPA: ''Hmoob,'' ; Nyiakeng Puachue: ; Pahawh: , ) is a dialect continuum of the West Hmongic branch of the Hmongic languages spoken by the Hmong people of Sichuan, Yunnan, Guizhou, Guangxi, Hainan, northern Vietnam, Thailand, ...
#
Hungarian
#
Icelandic
#
Indonesian
#
Inuinnaqtun
#
Inuktitut
Inuktitut (; , syllabics ; from , "person" + , "like", "in the manner of"), also Eastern Canadian Inuktitut, is one of the principal Inuit languages of Canada. It is spoken in all areas north of the tree line, including parts of the provinces o ...
#
Inuktitut (Latin)
#
Irish
#
Italian
#
Japanese
#
Kannada
#
Kazakh
Kazakh, Qazaq or Kazakhstani may refer to:
* Someone or something related to Kazakhstan
*Kazakhs, an ethnic group
*Kazakh language
*The Kazakh Khanate
* Kazakh cuisine
* Qazakh Rayon, Azerbaijan
*Qazax, Azerbaijan
*Kazakh Uyezd, administrative dis ...
#
Khmer
#
Klingon (Latin)
#
Korean
#
Kurdish (Central)
#
Kurdish (Northern)
#
Kyrgyz Kyrgyz, Kirghiz or Kyrgyzstani may refer to:
* Someone or something related to Kyrgyzstan
*Kyrgyz people
*Kyrgyz national games
*Kyrgyz language
*Kyrgyz culture
*Kyrgyz cuisine
*Yenisei Kirghiz
*The Fuyü Gïrgïs language in Northeastern China
...
#
Lao
#
Latvian
#
Lithuanian
Lithuanian may refer to:
* Lithuanians
* Lithuanian language
* The country of Lithuania
* Grand Duchy of Lithuania
* Culture of Lithuania
* Lithuanian cuisine
* Lithuanian Jews as often called "Lithuanians" (''Lita'im'' or ''Litvaks'') by other Jew ...
#
Macedonian
Macedonian most often refers to someone or something from or related to Macedonia.
Macedonian(s) may specifically refer to:
People Modern
* Macedonians (ethnic group), a nation and a South Slavic ethnic group primarily associated with North M ...
#
Malagasy
#
Malay
#
Malayalam
#
Maltese
Maltese may refer to:
* Someone or something of, from, or related to Malta
* Maltese alphabet
* Maltese cuisine
* Maltese culture
* Maltese language, the Semitic language spoken by Maltese people
* Maltese people, people from Malta or of Malte ...
#
Marathi
Marathi may refer to:
*Marathi people, an Indo-Aryan ethnolinguistic group of Maharashtra, India
*Marathi language, the Indo-Aryan language spoken by the Marathi people
*Palaiosouda, also known as Marathi, a small island in Greece
See also
*
* ...
#
Mongolian (Cyrillic)
#
Mongolian (Traditional)
#
Myanmar (Burmese)
#
Māori
#
Nepali
Nepali or Nepalese may refer to :
Concerning Nepal
* Anything of, from, or related to Nepal
* Nepali people, citizens of Nepal
* Nepali language, an Indo-Aryan language found in Nepal, the current official national language and a language spoken ...
#
Norwegian
#
Odia
Odia, also spelled Oriya or Odiya, may refer to:
* Odia people in Odisha, India
* Odia language, an Indian language, belonging to the Indo-Aryan branch of the Indo-European language family
* Odia alphabet, a writing system used for the Odia languag ...
#
Pashto
#
Persian
#
Polish
#
Portuguese (Brazil)
#
Portuguese (Portugal)
#
Punjabi (Gurmukhi)
#
Querétaro Otomi
Northwestern Otomi is a Native American language of central Mexico.
Varieties
There are two varieties with limited (c. 78%) intelligibility, sometimes considered separate languages:
*Mezquital Otomi (''Otomí del Valle del Mezquital''). The aut ...
#
Romanian
#
Russian
#
Samoan
#
Serbian (Cyrillic)
#
Serbian (Latin)
#
Slovak
#
Slovenian
#
Somali
#
Spanish
#
Swahili
Swahili may refer to:
* Swahili language, a Bantu language official in Kenya, Tanzania and Uganda and widely spoken in the African Great Lakes
* Swahili people, an ethnic group in East Africa
* Swahili culture
Swahili culture is the culture of ...
#
Swedish
Swedish or ' may refer to:
Anything from or related to Sweden, a country in Northern Europe. Or, specifically:
* Swedish language, a North Germanic language spoken primarily in Sweden and Finland
** Swedish alphabet, the official alphabet used by ...
#
Tahitian
#
Tamil
#
Tatar
#
Telugu
Telugu may refer to:
* Telugu language, a major Dravidian language of India
*Telugu people, an ethno-linguistic group of India
* Telugu script, used to write the Telugu language
** Telugu (Unicode block), a block of Telugu characters in Unicode
S ...
#
Thai
#
Tibetan
#
Tigrinya
#
Tongan
#
Turkish
Turkish may refer to:
*a Turkic language spoken by the Turks
* of or about Turkey
** Turkish language
*** Turkish alphabet
** Turkish people, a Turkic ethnic group and nation
*** Turkish citizen, a citizen of Turkey
*** Turkish communities and mi ...
#
Turkmen
Turkmen, Türkmen, Turkoman, or Turkman may refer to:
Peoples Historical ethnonym
* Turkoman (ethnonym), ethnonym used for the Oghuz Turks during the Middle Ages
Ethnic groups
* Turkmen in Anatolia and the Levant (Seljuk and Ottoman-Turkish desc ...
#
Ukrainian
#
Upper Sorbian
Upper Sorbian (), occasionally referred to as "Wendish", is a minority language spoken by Sorbs in Germany in the historical province of Upper Lusatia, which is today part of Saxony. It is grouped in the West Slavic language branch, together ...
#
Urdu
#
Uyghur
#
Uzbek (Latin)
#
Vietnamese
#
Welsh
Welsh may refer to:
Related to Wales
* Welsh, referring or related to Wales
* Welsh language, a Brittonic Celtic language spoken in Wales
* Welsh people
People
* Welsh (surname)
* Sometimes used as a synonym for the ancient Britons (Celtic peopl ...
#
Yucatec Maya
#
Zulu
Community partners
Microsoft Translator has engaged with community partners to increase the number of languages and to improve overall language translation quality. Below is a list of community partners that Microsoft Translator has teamed with.
*CNGL Centre for Global Intelligent Content
*Hmong Language Partners -
Hmong Daw
Hmong / Mong (; RPA: ''Hmoob,'' ; Nyiakeng Puachue: ; Pahawh: , ) is a dialect continuum of the West Hmongic branch of the Hmongic languages spoken by the Hmong people of Sichuan, Yunnan, Guizhou, Guangxi, Hainan, northern Vietnam, Thailand, ...
*
Jawaharlal Nehru University -
Urdu
*
Senedd Cymru (the Welsh Parliament, partnered as the
National Assembly for Wales) -
Welsh
Welsh may refer to:
Related to Wales
* Welsh, referring or related to Wales
* Welsh language, a Brittonic Celtic language spoken in Wales
* Welsh people
People
* Welsh (surname)
* Sometimes used as a synonym for the ancient Britons (Celtic peopl ...
*Tilde -
Estonian
Estonian may refer to:
* Something of, from, or related to Estonia, a country in the Baltic region in northern Europe
* Estonians, people from Estonia, or of Estonian descent
* Estonian language
* Estonian cuisine
* Estonian culture
See also ...
,
Latvian,
Lithuanian
Lithuanian may refer to:
* Lithuanians
* Lithuanian language
* The country of Lithuania
* Grand Duchy of Lithuania
* Culture of Lithuania
* Lithuanian cuisine
* Lithuanian Jews as often called "Lithuanians" (''Lita'im'' or ''Litvaks'') by other Jew ...
*
Translators Without Borders
Translators without Borders (TWB) is a non-profit organization set up to provide translation services for humanitarian non-profits. It was established in 2010 as a sister organization of , founded in 1993 by Lori Thicke and Ros Smith-Thomas of ...
-
Swahili
Swahili may refer to:
* Swahili language, a Bantu language official in Kenya, Tanzania and Uganda and widely spoken in the African Great Lakes
* Swahili people, an ethnic group in East Africa
* Swahili culture
Swahili culture is the culture of ...
*
Appen -
Filipino (
Tagalog
Tagalog may refer to:
Language
* Tagalog language, a language spoken in the Philippines
** Old Tagalog, an archaic form of the language
** Batangas Tagalog, a dialect of the language
* Tagalog script, the writing system historically used for Tagal ...
),
Malagasy,
Samoan,
Tahitian,
Tongan
*
Government of Nunavut -
Inuktitut
Inuktitut (; , syllabics ; from , "person" + , "like", "in the manner of"), also Eastern Canadian Inuktitut, is one of the principal Inuit languages of Canada. It is spoken in all areas north of the tree line, including parts of the provinces o ...
*
Auckland University of Technology -
Māori
*
Waikato University -
Māori
*Government of the State of Querétaro -
Querétaro Otomi
Northwestern Otomi is a Native American language of central Mexico.
Varieties
There are two varieties with limited (c. 78%) intelligibility, sometimes considered separate languages:
*Mezquital Otomi (''Otomí del Valle del Mezquital''). The aut ...
*
Klingon Language Institute -
Klingon
The Klingons ( ; Klingon: ''tlhIngan'' ) are a fictional species in the science fiction franchise ''Star Trek''.
Developed by screenwriter Gene L. Coon in 1967 for the original ''Star Trek'' (''TOS'') series, Klingons were swarthy humanoids c ...
Additionally, Microsoft has teamed with the
Klingon Language Institute, which promotes the constructed language,
Klingon
The Klingons ( ; Klingon: ''tlhIngan'' ) are a fictional species in the science fiction franchise ''Star Trek''.
Developed by screenwriter Gene L. Coon in 1967 for the original ''Star Trek'' (''TOS'') series, Klingons were swarthy humanoids c ...
, which is used within the fictional
Star Trek
''Star Trek'' is an American science fiction media franchise created by Gene Roddenberry, which began with the eponymous 1960s television series and quickly became a worldwide pop-culture phenomenon. The franchise has expanded into vari ...
universe produced by
Paramount and
CBS Studios. Klingon has been supported by Microsoft Translator since May 2013.
See also
*
Machine translation
*
Speech synthesis
Speech synthesis is the artificial production of human speech. A computer system used for this purpose is called a speech synthesizer, and can be implemented in software or hardware products. A text-to-speech (TTS) system converts normal languag ...
*
Lernout & Hauspie
*
Babel Fish (discontinued; redirects to main Yahoo! site)
*
PlainTalk
*
NeoSpeech
*
Loquendo
*
Nuance Communications
*
Microsoft text-to-speech voices
*
Google Translate
References
External links
* (
copyright of Microsoft)
*
*
SW development
*
*
{{Authority control
Machine translation software
Natural language processing software
Translation websites
Universal Windows Platform apps
IOS software
WatchOS software
Android (operating system) software
Wear OS software
Microsoft websites
Microsoft software
Microsoft Edge extensions
Internet Explorer add-ons
Microsoft Bing