Post-editing
   HOME

TheInfoList



OR:

Post-editing (or postediting) is the process whereby humans amend machine-generated translation to achieve an acceptable final product. A person who post-edits is called a post-editor. The concept of post-editing is linked to that of
pre-editing Pre-editing is the process whereby a human prepares a document before applying machine translation. The main goal of pre-editing is to reduce the post-editing workload by adapting the source document to improve the raw output of the machine transla ...
. In the process of translating a text via
machine translation Machine translation, sometimes referred to by the abbreviation MT (not to be confused with computer-aided translation, machine-aided human translation or interactive translation), is a sub-field of computational linguistics that investigates ...
, best results may be gained by pre-editing the source textfor example by applying the principles of
controlled language Controlled natural languages (CNLs) are subsets of natural languages that are obtained by restricting the grammar and vocabulary in order to reduce or eliminate ambiguity and complexity. Traditionally, controlled languages fall into two major types ...
and then post-editing the machine output. It is distinct from editing, which refers to the process of improving human generated text (a process which is often known as revision in the field of translation). Post-edited text may afterwards be revised to ensure the quality of the language choices are proofread to correct simple mistakes. Post-editing involves the correction of
machine translation Machine translation, sometimes referred to by the abbreviation MT (not to be confused with computer-aided translation, machine-aided human translation or interactive translation), is a sub-field of computational linguistics that investigates ...
output to ensure that it meets a level of quality negotiated in advance between the client and the post-editor. Light post-editing aims at making the output simply understandable; full post-editing at making it also stylistically appropriate. With advances in machine translation full post-editing is becoming an alternative to manual translation. Practically all computer assisted translation (CAT) tools now support post-editing of machine translated output.


Post-editing and machine translation

Machine translation Machine translation, sometimes referred to by the abbreviation MT (not to be confused with computer-aided translation, machine-aided human translation or interactive translation), is a sub-field of computational linguistics that investigates ...
left the labs to start being used for its actual purpose in the late seventies at some big institutions such as the
European Commission The European Commission (EC) is the executive of the European Union (EU). It operates as a cabinet government, with 27 members of the Commission (informally known as "Commissioners") headed by a President. It includes an administrative body ...
and the
Pan-American Health Organization The Pan American Health Organization (PAHO) is an international public health agency working to improve the health and living standards of the people of the Americas. It is part of the United Nations system, serving as the Regional Office fo ...
, and then, later, at some corporations such as Caterpillar and
General Motors The General Motors Company (GM) is an American Multinational corporation, multinational Automotive industry, automotive manufacturing company headquartered in Detroit, Michigan, United States. It is the largest automaker in the United States and ...
. First studies on post-editing appeared in the eighties, linked to those implementations. To develop appropriate guidelines and training, members of the Association for Machine Translation in the Americas (AMTA) and the European Association for Machine Translation (EAMT) set a Post-editing Special Interest Group in 1999. After the nineties, advances in computer power and connectivity sped machine translation development and allowed for its deployment through the web browser, including as a free, useful adjunct to the main search engines (
Google Translate Google Translate is a multilingual neural machine translation service developed by Google to translate text, documents and websites from one language into another. It offers a website interface, a mobile app for Android and iOS, and an API ...
,
Bing Translator Microsoft Translator is a multilingual machine translation cloud service provided by Microsoft. Microsoft Translator is a part of Microsoft Cognitive Services and integrated across multiple consumer, developer, and enterprise products; including B ...
,
Yahoo! Babel Fish Yahoo! Babel Fish was a free Web-based multilingual translation application. In May 2012 it was replaced by Bing Translator (now Microsoft Translator), to which queries were redirected. Although Yahoo! has transitioned its Babel Fish translation ...
). A wider acceptance of less than perfect machine translation was accompanied also by a wider acceptance of post-editing. With the demand for localisation of goods and services growing at a pace that could not be met by human translation, not even assisted by translation memory and other translation management technologies, industry bodies such as the Translation Automation Users Society (TAUS) expect machine translation and post-editing to play a much bigger role within the next few years. The use of Machine Translation suggests sometimes
pre-editing Pre-editing is the process whereby a human prepares a document before applying machine translation. The main goal of pre-editing is to reduce the post-editing workload by adapting the source document to improve the raw output of the machine transla ...
.


Light and full post-editing

For many years, no widely accepted, standardized post-editing guidelines existed; however, in 2017, ISO standard
18587:2017: Translation services — Post-editing of machine translation output — Requirements
' was published. Studies in the eighties distinguished between degrees of post-editing which, in the context of the European Commission Translation Service, were first defined as conventional and rapid or full and rapid. Light and full post-editing seems the wording most used today. Light post-editing implies minimal intervention by the post-editor, with the aim of ensuring quality is "good enough" or "understandable"; the expectation is that the client will use it for inbound purposes only, often when the text is needed urgently, or has a short time span. Full post-editing involves a greater level of intervention to achieve a degree of quality to be negotiated between client and post-editor; the expectation is that the outcome will be a text that is not only understandable but presented in some stylistically appropriate way, so it can be used for assimilation and even for dissemination, for inbound and for outbound purposes. The quality is expected to be publishable and equivalent to that of a human translation. The assumption, however, has been that it takes less effort for translators to work directly from the source text than to post-edit the machine generated version. With advances in
machine translation Machine translation, sometimes referred to by the abbreviation MT (not to be confused with computer-aided translation, machine-aided human translation or interactive translation), is a sub-field of computational linguistics that investigates ...
, this may be changing. For some language pairs and for some tasks, and with engines that have been customised with domain specific good quality data, some clients are already requesting translators to post-edit instead of translating from scratch, in the belief that they will attain similar quality at a lower cost. The light/full classification, developed in the nineties when
machine translation Machine translation, sometimes referred to by the abbreviation MT (not to be confused with computer-aided translation, machine-aided human translation or interactive translation), is a sub-field of computational linguistics that investigates ...
still came on a CD-ROM, may not suit advances in machine translation at the light post-editing end either. For some language pairs and some tasks, particularly if the source has been pre-edited, raw machine output may be good enough for gisting purposes without requiring subsequent human intervention.


Post-editing efficiency

Post-editing is used when raw machine translation is not good enough and human translation not required. Industry advises post-editing to be used when it can at least double the productivity of manual translation, even fourfold it in the case of light post-editing (1000 words per hour vs. 250 wph). However, post-editing efficiency is difficult to predict. Various studies from both academia and industry have claimed that post-editing is generally faster than translating from scratch, regardless of language pairs or translators' experience. There is, however, no agreement about ''how much'' time can be saved through post-editing in practice (if any at all): While the industry reports on time savings around 40%, some academic studies suggest that time savings under actual working conditions are more likely to be between 0–20%, or that it may depend on the terminological proximity between the source and target languages. Professionals have also reported negative productivity gains where corrections require more time than to translate from scratch.


Post-editing and the language industry

After some thirty years, post-editing is still "a nascent profession". What the right profile of the post-editor is has not yet been fully studied. Post-editing overlaps with translating and editing, but only partially. Most think the ideal post-editor will be a translator keen to be trained on the specific skills required, but there are some who think a bilingual without a background in translation may be easier to train. Not much is known either on who the actual post-editors are, whether they tend to be professional translators, whether they work mostly as in-house employees or self-employed, and on which conditions. Many professional translators dislike post-editing, among other reasons because it tends to be paid at lower rates than conventional translations, with the International Association of Professional Translators and Interpreters (IAPTI) having been particularly vocal about it. The quality of machine translation output for post-editing is higher, and therefore requires less post-editing effort, when the
machine translation Machine translation, sometimes referred to by the abbreviation MT (not to be confused with computer-aided translation, machine-aided human translation or interactive translation), is a sub-field of computational linguistics that investigates ...
is provided by a neural, vertical or customised
machine translation Machine translation, sometimes referred to by the abbreviation MT (not to be confused with computer-aided translation, machine-aided human translation or interactive translation), is a sub-field of computational linguistics that investigates ...
engine. Translation efficiency gains can be measured by tracking time linguists need to correct the
machine translation Machine translation, sometimes referred to by the abbreviation MT (not to be confused with computer-aided translation, machine-aided human translation or interactive translation), is a sub-field of computational linguistics that investigates ...
in the same translation environment, such as XTM Cloud, a
Translation management system A translation management system (TMS), formerly globalization management system (GMS), is a type of software for automating many parts of the human language translation process and maximizing translator efficiency. The idea of a translation man ...
and
Computer-assisted translation Computer-aided translation (CAT), also referred to as computer-assisted translation or computer-aided human translation (CAHT), is the use of software to assist a human translator in the translation Translation is the communication of ...
tool, where post-editing times and linguistic quality assessment results of the post-edited texts can be compared. There are not clear figures on how big the post-editing pie is within the translation industry. A recent survey showed 50% of language service providers offered it, but for 85% of them it accounted less than 10% of their throughput. Memsource, a web-based translation tool, claims over 50 percent of translations between English and Spanish, French and other languages have been done in its platform combining translation memory with machine translation. Post-editing is also being done through translation crowdsourcing portals such as
Unbabel Unbabel is an artificial intelligence-powered human translation platform. The company has headquarters in Lisbon, Portugal, and San Francisco, California. Unbabel combines Neural Machine Translation with machine learning and a crowdsourced mo ...
which, by November 2014 claimed to have post-edited over 11 million words. Productivity and volume estimates are, in any case, moving targets since advances in machine translation, in a significant part driven by the post-edited text being fed back into its engines, will mean the more post-editing is done, the higher the quality of machine translation and the more widespread post-editing will become.


See also

*
Machine translation Machine translation, sometimes referred to by the abbreviation MT (not to be confused with computer-aided translation, machine-aided human translation or interactive translation), is a sub-field of computational linguistics that investigates ...
*
Controlled language Controlled natural languages (CNLs) are subsets of natural languages that are obtained by restricting the grammar and vocabulary in order to reduce or eliminate ambiguity and complexity. Traditionally, controlled languages fall into two major types ...
*
Translation memory A translation memory (TM) is a database that stores "segments", which can be sentences, paragraphs or sentence-like units (headings, titles or elements in a list) that have previously been translated, in order to aid human translators. The translat ...
*
Editing Editing is the process of selecting and preparing written, photographic, visual, audible, or cinematic material used by a person or an entity to convey a message or information. The editing process can involve correction, condensation, or ...
*
Proofreading Proofreading is the reading of a galley proof or an electronic copy of a publication to find and correct reproduction errors of text or art. Proofreading is the final step in the editorial cycle before publication. Professional Traditional ...
*
Computer-assisted translation Computer-aided translation (CAT), also referred to as computer-assisted translation or computer-aided human translation (CAHT), is the use of software to assist a human translator in the translation Translation is the communication of ...
*
Pre-editing Pre-editing is the process whereby a human prepares a document before applying machine translation. The main goal of pre-editing is to reduce the post-editing workload by adapting the source document to improve the raw output of the machine transla ...


References


Further reading

*
ISO 18587:2017: Translation services – Post-editing of machine translation output – Requirements
'
Preview + table of contents
* Ilona Wallberg: ''ISO 18587—A standard for the post-editing process.'' In: Jörg Porsiel (Ed.): ''Machine Translation—What Language Professionals Need to Know.'' BDÜ Weiterbildungs- und Fachverlagsgesellschaft mbH, Berlin 2017, . * Nitzke, Jean & Hansen-Schirra, Silvia. 2021. A short guide to post-editing. (Translation and Multilingual Natural Language Processing 16). Berlin: Language Science Press. DOI: 10.5281/zenodo.5646896. Open Access. https://langsci-press.org/catalog/book/319


External links


Translation Automation Users Society (TAUS)
{{Authority control Translation Machine translation Evaluation of machine translation