The Georgetown–IBM experiment was an influential demonstration of
machine translation
Machine translation is use of computational techniques to translate text or speech from one language to another, including the contextual, idiomatic and pragmatic nuances of both languages.
Early approaches were mostly rule-based or statisti ...
, which was performed on January 7, 1954. Developed jointly by
Georgetown University
Georgetown University is a private university, private Jesuit research university in Washington, D.C., United States. Founded by Bishop John Carroll (archbishop of Baltimore), John Carroll in 1789, it is the oldest Catholic higher education, Ca ...
and
IBM
International Business Machines Corporation (using the trademark IBM), nicknamed Big Blue, is an American Multinational corporation, multinational technology company headquartered in Armonk, New York, and present in over 175 countries. It is ...
, the experiment involved completely automatic translation of more than sixty
Russian
Russian(s) may refer to:
*Russians (), an ethnic group of the East Slavic peoples, primarily living in Russia and neighboring countries
*A citizen of Russia
*Russian language, the most widely spoken of the Slavic languages
*''The Russians'', a b ...
sentences into
English.
Background
Conceived and performed primarily in order to attract governmental and public interest and funding by showing the possibilities of machine translation, it was by no means a fully featured system: It had only six grammar rules and 250 lexical items in its vocabulary (of stems and endings).
Words in the vocabulary were in the fields of politics, law, mathematics, chemistry, metallurgy, communications and military affairs. Vocabulary was punched onto
punch cards.
This complete dictionary was never fully shown (only the extended one from Garvin's article). Apart from general topics, the system was specialized in the domain of
organic chemistry
Organic chemistry is a subdiscipline within chemistry involving the science, scientific study of the structure, properties, and reactions of organic compounds and organic matter, organic materials, i.e., matter in its various forms that contain ...
.
The translation was carried out using an
IBM 701
The IBM 701 Electronic Data Processing Machine, known as the Defense Calculator while in development, was IBM’s first commercial scientific computer and its first series production mainframe computer, which was announced to the public on May 2 ...
mainframe computer (launched in April 1953).
The Georgetown-IBM experiment is the best-known result of the
MIT
The Massachusetts Institute of Technology (MIT) is a private research university in Cambridge, Massachusetts, United States. Established in 1861, MIT has played a significant role in the development of many areas of modern technology and sc ...
conference in June 1952 to which all active researchers in the machine translation field were invited. At the conference,
Duncan Harkin from
US Department of Defense
The United States Department of Defense (DoD, USDOD, or DOD) is an executive department of the U.S. federal government charged with coordinating and supervising the six U.S. armed services: the Army, Navy, Marines, Air Force, Space Force, ...
suggested that his department would finance a new machine translation project. Jerome Weisner supported the idea and offered finance from the Research Laboratory of Electronics at MIT.
Leon Dostert had been invited to the project for his previous experience with the automatic correction of translations (back then 'mechanical translation'); his interpretation system had a strong impact on the
. The linguistics part of the demonstration was carried out for the most part by linguist
Paul Garvin who had also good knowledge of
Russian
Russian(s) may refer to:
*Russians (), an ethnic group of the East Slavic peoples, primarily living in Russia and neighboring countries
*A citizen of Russia
*Russian language, the most widely spoken of the Slavic languages
*''The Russians'', a b ...
.
Over 60
Romanized Russian statements from a wide range of political, legal, mathematical, and scientific topics were entered into the machine by a computer operator who knew no Russian, and the resulting English translations appeared on a printer.
The sentences to be translated were carefully selected. Many operations for the demonstration were fitted to specific words and sentences. In addition, there was no relational or sentence analysis which could recognize the sentence structure. The approach was mostly 'lexicographical' based on a dictionary where a specific word had a connection with specific rules and steps.
Algorithm
The algorithm first translates Russian words into numerical codes, then performs the following case-analysis on each numerical code to choose between possible English word translations, reorder the English words, or omit some English words.
The flowchart of the algorithm is reproduced in
(see Table 1 for the 6 rules).
Translation examples
How it analyzes ''Vyelyichyina ugla opryedyelyayetsya otnoshyenyiyem dlyini dugi k radyiusu'' (figure 2 of
).
Reception
Well publicized by journalists and perceived as a success, the experiment did encourage governments to invest in
computational linguistics
Computational linguistics is an interdisciplinary field concerned with the computational modelling of natural language, as well as the study of appropriate computational approaches to linguistic questions. In general, computational linguistics ...
. The authors claimed that within three or five years, machine translation could well be a solved problem. However, the real progress was much slower, and after the
ALPAC report in 1966, which found that the ten years of long research had failed to fulfill the expectations, funding was reduced dramatically.
The demonstration was given widespread coverage in the foreign press, but only a small fraction of journalists drew attention to previous machine translation attempts.
References
External links
A summary of the Georgetown–IBM experiment
{{DEFAULTSORT:Georgetown-IBM experiment
Computational linguistics
Machine translation