Telegraph code
   HOME

TheInfoList



OR:

A telegraph code is one of the
character encoding Character encoding is the process of assigning numbers to graphical characters, especially the written characters of human language, allowing them to be stored, transmitted, and transformed using digital computers. The numerical values tha ...
s used to transmit
information Information is an abstract concept that refers to that which has the power to inform. At the most fundamental level information pertains to the interpretation of that which may be sensed. Any natural process that is not completely random, ...
by
telegraphy Telegraphy is the long-distance transmission of messages where the sender uses symbolic codes, known to the recipient, rather than a physical exchange of an object bearing the message. Thus flag semaphore is a method of telegraphy, whereas ...
.
Morse code Morse code is a method used in telecommunication to encode text characters as standardized sequences of two different signal durations, called ''dots'' and ''dashes'', or ''dits'' and ''dahs''. Morse code is named after Samuel Morse, one ...
is the best-known such code. ''Telegraphy'' usually refers to the
electrical telegraph Electrical telegraphs were point-to-point text messaging systems, primarily used from the 1840s until the late 20th century. It was the first electrical telecommunications system and the most widely used of a number of early messaging systems ...
, but telegraph systems using the
optical telegraph An optical telegraph is a line of stations, typically towers, for the purpose of conveying textual information by means of visual signals. There are two main types of such systems; the semaphore telegraph which uses pivoted indicator arms and ...
were in use before that. A code consists of a number of
code point In character encoding terminology, a code point, codepoint or code position is a numerical value that maps to a specific character. Code points usually represent a single grapheme—usually a letter, digit, punctuation mark, or whitespace—but ...
s, each corresponding to a letter of the alphabet, a numeral, or some other character. In codes intended for machines rather than humans, code points for
control character In computing and telecommunication, a control character or non-printing character (NPC) is a code point (a number) in a character set, that does not represent a written symbol. They are used as in-band signaling to cause effects other than the ...
s, such as carriage return, are required to control the operation of the mechanism. Each code point is made up of a number of elements arranged in a unique way for that character. There are usually two types of element (a binary code), but more element types were employed in some codes not intended for machines. For instance,
American Morse code American Morse Code — also known as Railroad Morse—is the latter-day name for the original version of the Morse Code developed in the mid-1840s, by Samuel Morse and Alfred Vail for their electric telegraph. The "American" qualifier was added ...
had about five elements, rather than the two (dot and dash) of
International Morse Code Morse code is a method used in telecommunication to encode text characters as standardized sequences of two different signal durations, called ''dots'' and ''dashes'', or ''dits'' and ''dahs''. Morse code is named after Samuel Morse, one ...
. Codes meant for human interpretation were designed so that the characters that occurred most often had the fewest elements in the corresponding code point. For instance, Morse code for ''E'', the most common letter in English, is a single dot (), whereas ''Q'' is . These arrangements meant the message could be sent more quickly and it would take longer for the operator to become fatigued. Telegraphs were always operated by humans until late in the 19th century. When automated telegraph messages came in, codes with variable-length code points were inconvenient for machine design of the period. Instead, codes with a fixed length were used. The first of these was the
Baudot code The Baudot code is an early character encoding for telegraphy invented by Émile Baudot in the 1870s. It was the predecessor to the International Telegraph Alphabet No. 2 (ITA2), the most common teleprinter code in use until the advent of ASCII ...
, a five- bit code. Baudot has only enough code points to print in
upper case Letter case is the distinction between the letters that are in larger uppercase or capitals (or more formally ''majuscule'') and smaller lowercase (or more formally ''minuscule'') in the written representation of certain languages. The writing ...
. Later codes had more bits (
ASCII ASCII ( ), abbreviated from American Standard Code for Information Interchange, is a character encoding standard for electronic communication. ASCII codes represent text in computers, telecommunications equipment, and other devices. Because ...
has seven) so that both upper and lower case could be printed. Beyond the telegraph age, modern computers require a very large number of code points (
Unicode Unicode, formally The Unicode Standard,The formal version reference is is an information technology standard for the consistent encoding, representation, and handling of text expressed in most of the world's writing systems. The standard, ...
has 21 bits) so that multiple languages and alphabets (
character set Character encoding is the process of assigning numbers to graphical characters, especially the written characters of human language, allowing them to be stored, transmitted, and transformed using digital computers. The numerical values tha ...
s) can be handled without having to change the character encoding. Modern computers can easily handle variable-length codes such as
UTF-8 UTF-8 is a variable-length character encoding used for electronic communication. Defined by the Unicode Standard, the name is derived from ''Unicode'' (or ''Universal Coded Character Set'') ''Transformation Format 8-bit''. UTF-8 is capable of e ...
and UTF-16 which have now become ubiquitous.


Manual telegraph codes


Optical telegraph codes

Prior to the electrical telegraph, a widely used method of building national telegraph networks was the
optical telegraph An optical telegraph is a line of stations, typically towers, for the purpose of conveying textual information by means of visual signals. There are two main types of such systems; the semaphore telegraph which uses pivoted indicator arms and ...
consisting of a chain of towers from which signals could be sent by semaphore or shutters from tower to tower. This was particularly highly developed in France and had its beginnings during the
French Revolution The French Revolution ( ) was a period of radical political and societal change in France that began with the Estates General of 1789 and ended with the formation of the French Consulate in November 1799. Many of its ideas are conside ...
. The code used in France was the Chappe code, named after
Claude Chappe Claude Chappe (; 25 December 1763 – 23 January 1805) was a French inventor who in 1792 demonstrated a practical semaphore system that eventually spanned all of France. His system consisted of a series of towers, each within line of sight of ...
the inventor. The
British Admiralty The Admiralty was a department of the Government of the United Kingdom responsible for the command of the Royal Navy until 1964, historically under its titular head, the Lord High Admiral – one of the Great Officers of State. For much of i ...
also used the semaphore telegraph, but with their own code. The British code was necessarily different from that used in France because the British optical telegraph worked in a different way. The Chappe system had moveable arms, as if it were waving flags as in
flag semaphore Flag semaphore (from the Ancient Greek () 'sign' and - (-) '-bearer') is a semaphore system conveying information at a distance by means of visual signals with hand-held flags, rods, disks, paddles, or occasionally bare or gloved hands. Informa ...
. The British system used an array of shutters that could be opened or closed.


Chappe code

The Chappe system consisted of a large pivoted beam (the regulator) with an arm at each end (the indicators) which pivoted around the regulator on one extremity. The angles these components were allowed to take was limited to multiples of 45° to aid readability. This gave a code space of 8×4×8
code point In character encoding terminology, a code point, codepoint or code position is a numerical value that maps to a specific character. Code points usually represent a single grapheme—usually a letter, digit, punctuation mark, or whitespace—but ...
s, but the indicator position inline with the regulator was never used because it was hard to distinguish from the indicator being folded back on top of the regulator, leaving a code space of . Symbols were always formed with the regulator on either the left- or right-leaning diagonal (oblique) and only accepted as valid when the regulator moved to either the vertical or horizontal position. The left oblique was always used for messages, with the right oblique being used for control of the system. This further reduced the code space to 98, of which either four or six code points (depending on version) were
control character In computing and telecommunication, a control character or non-printing character (NPC) is a code point (a number) in a character set, that does not represent a written symbol. They are used as in-band signaling to cause effects other than the ...
s, leaving a code space for text of 94 or 92 respectively. The Chappe system mostly transmitted messages using a code book with a large number of set words and phrases. It was first used on an experimental chain of towers in 1793 and put into service from Paris to
Lille Lille ( , ; nl, Rijsel ; pcd, Lile; vls, Rysel) is a city in the northern part of France, in French Flanders. On the river Deûle, near France's border with Belgium, it is the capital of the Hauts-de-France region, the prefecture of the No ...
in 1794. The code book used this early is not known for certain, but an unidentified code book in the Paris Postal Museum may have been for the Chappe system. The arrangement of this code in columns of 88 entries led Holzmann & Pehrson to suggest that 88 code points might have been used. However, the proposal in 1793 was for ten code points representing the numerals 0–9, and Bouchet says this system was still in use as late as 1800 (Holzmann & Pehrson put the change at 1795). The code book was revised and simplified in 1795 to speed up transmission. The code was in two divisions, the first division was 94 alphabetic and numeric characters plus some commonly used letter combinations. The second division was a code book of 94 pages with 94 entries on each page. A code point was assigned for each number up to 94. Thus, only two symbols needed to be sent to transmit an entire sentence – the page and line numbers of the code book, compared to four symbols using the ten-symbol code. In 1799, three additional divisions were added. These had additional words and phrases, geographical places, and names of people. These three divisions required extra symbols to be added in front of the code symbol to identify the correct book. The code was revised again in 1809 and remained stable thereafter. In 1837 a horizontal only coding system was introduced by Gabriel Flocon which did not require the heavy regulator to be moved. Instead, an additional indicator was provided in the centre of the regulator to transmit that element of the code.


Edelcrantz code

The Edelcrantz system was used in Sweden and was the second largest network built after that of France. The telegraph consisted of a set of ten shutters. Nine of these were arranged in a 3×3 matrix. Each column of shutters represented a binary-coded octal digit with a closed shutter representing "1" and the most significant digit at the bottom. Each symbol of telegraph transmission was thus a three-digit octal number. The tenth shutter was an extra-large one at the top. Its meaning was that the codepoint should be preceded by "A". One use of the "A" shutter was that a numeral codepoint preceded by "A" meant add a zero (multiply by ten) to the digit. Larger numbers could be indicated by following the numeral with the code for hundreds (236), thousands (631) or a combination of these. This required fewer symbols to be transmitted than sending all the zero digits individually. However, the main purpose of the "A" codepoints was for a codebook of predetermined messages, much like the Chappe codebook. The symbols without "A" were a large set of numerals, letters, common syllables and words to aid code compaction. Around 1809, Edelcrantz introduced a new codebook with 5,120 codepoints, each requiring a two-symbol transmission to identify. There were many codepoints for error correction (272, error), flow control, and supervisory messages. Usually, messages were expected to be passed all the way down the line, but there were circumstances when individual stations needed to communicate directly, usually for managerial purposes. The most common, and simplest situation was communication between adjacent stations. Codepoints 722 and 227 was used for this purpose, to get the attention of the next station towards, or away from the sun respectively. For more remote stations codepoints 557 and 755 respectively were used, followed by the identification of the requesting and target stations.


Wig-wag

Flag signalling was widely used for point-to-point signalling prior to the optical telegraph, but it was difficult to construct a nationwide network with hand-held flags. The much larger mechanical apparatus of the semaphore telegraph towers was needed so that a greater distance between links could be achieved. However, an extensive network with hand-held flags was constructed during the
American Civil War The American Civil War (April 12, 1861 – May 26, 1865; also known by Names of the American Civil War, other names) was a civil war in the United States. It was fought between the Union (American Civil War), Union ("the North") and t ...
. This was the wig-wag system which used the code invented by
Albert J. Myer Albert James Myer (September 20, 1828 – August 24, 1880) was a surgeon and United States Army general. He is known as the father of the U.S. Army Signal Corps, as its first chief signal officer just prior to the American Civil War, the inventor ...
. Some of the towers used were enormous, up to 130 feet, to get a good range. Myer's code required only one flag using a ternary code. That is, each code element consisted of one of three distinct flag positions. However, the alphabetical codepoints required only two positions, the third position only being used in
control character In computing and telecommunication, a control character or non-printing character (NPC) is a code point (a number) in a character set, that does not represent a written symbol. They are used as in-band signaling to cause effects other than the ...
s. Using a ternary code in the alphabet would have resulted in shorter messages because fewer elements are required in each codepoint, but a binary system is easier to read at long distance since fewer flag positions need to be distinguished. Myer's manual also describes a ternary-coded alphabet with a fixed length of three elements for each codepoint.


Electrical telegraph codes


Cooke and Wheatstone and other early codes

Many different codes were invented during the early development of the
electrical telegraph Electrical telegraphs were point-to-point text messaging systems, primarily used from the 1840s until the late 20th century. It was the first electrical telecommunications system and the most widely used of a number of early messaging systems ...
. Virtually every inventor produced a different code to suit their particular apparatus. The earliest code used commercially on an electrical telegraph was the Cooke and Wheatstone telegraph five needle code (C&W5). This was first used on the
Great Western Railway The Great Western Railway (GWR) was a British railway company that linked London with the southwest, west and West Midlands of England and most of Wales. It was founded in 1833, received its enabling Act of Parliament on 31 August 1835 and ran ...
in 1838. C&W5 had the major advantage that the code did not need to be learned by the operator; the letters could be read directly off the display board. However, it had the disadvantage that it required too many wires. A one needle code, C&W1, was developed that required only one wire. C&W1 was widely used in the UK and the British Empire. Some other countries used C&W1, but it never became an international standard and generally each country developed their own code. In the US,
American Morse code American Morse Code — also known as Railroad Morse—is the latter-day name for the original version of the Morse Code developed in the mid-1840s, by Samuel Morse and Alfred Vail for their electric telegraph. The "American" qualifier was added ...
was used, whose elements consisted of dots and dashes distinguished from each other by the length of the pulse of current on the telegraph line. This code was used on the telegraph invented by
Samuel Morse Samuel Finley Breese Morse (April 27, 1791 – April 2, 1872) was an American inventor and painter. After having established his reputation as a portrait painter, in his middle age Morse contributed to the invention of a single-wire telegraph ...
and
Alfred Vail Alfred Lewis Vail (September 25, 1807 – January 18, 1859) was an American machinist and inventor. Along with Samuel Morse, Vail was central in developing and commercializing American telegraphy between 1837 and 1844. Vail and Morse were the f ...
and was first used commercially in 1844. Morse initially had code points only for numerals. He planned that numbers sent over the telegraph would be used as an index to a dictionary with a limited set of words. Vail invented an extended code that included code points for all the letters so that any desired word could be sent. It was Vail's code that became American Morse. In France, the telegraph used the Foy-Breguet telegraph, a two-needle telegraph that displayed the needles in Chappe code, the same code as the French optical telegraph, which was still more widely used than the electrical telegraph in France. To the French, this had the great advantage that they did not need to retrain their operators in a new code.


Standardisation—Morse code

In Germany in 1848, Friedrich Clemens Gerke developed a heavily modified version of American Morse for use on German railways. American Morse had three different lengths of dashes and two different lengths of space between the dots and dashes in a code point. The Gerke code had only one length of dash and all inter-element spaces within a code point were equal. Gerke also created code points for the German umlaut letters, which do not exist in English. Many central European countries belonged to the German-Austrian Telegraph Union. In 1851, the Union decided to adopt a common code across all its countries so that messages could be sent between them without the need for operators to recode them at borders. The Gerke code was adopted for this purpose. In 1865, a conference in Paris adopted the Gerke code as the international standard, calling it
International Morse Code Morse code is a method used in telecommunication to encode text characters as standardized sequences of two different signal durations, called ''dots'' and ''dashes'', or ''dits'' and ''dahs''. Morse code is named after Samuel Morse, one ...
. With some very minor changes, this is the
Morse code Morse code is a method used in telecommunication to encode text characters as standardized sequences of two different signal durations, called ''dots'' and ''dashes'', or ''dits'' and ''dahs''. Morse code is named after Samuel Morse, one ...
used today. The Cooke and Wheatstone telegraph needle instruments were capable of using Morse code since dots and dashes could be sent as left and right movements of the needle. By this time, the needle instruments were being made with end stops that made two distinctly different notes as the needle hit them. This enabled the operator to write the message without looking up at the needle which was much more efficient. This was a similar advantage to the Morse telegraph in which the operators could hear the message from the clicking of the relay armature. Nevertheless, after the British telegraph companies were nationalised in 1870 the
General Post Office The General Post Office (GPO) was the state postal system and telecommunications carrier of the United Kingdom until 1969. Before the Acts of Union 1707, it was the postal system of the Kingdom of England, established by Charles II in 1660. ...
decided to standardise on the Morse telegraph and get rid of the many different systems they had inherited from private companies. In the US, telegraph companies refused to use International Morse because of the cost of retraining operators. They opposed attempts by the government to make it law. In most other countries, the telegraph was state controlled so the change could simply be mandated. In the US, there was no single entity running the telegraph. Rather, it was a multiplicity of private companies. This resulted in international operators needing to be fluent in both versions of Morse and to recode both incoming and outgoing messages. The US continued to use American Morse on landlines (
radiotelegraphy Wireless telegraphy or radiotelegraphy is transmission of text messages by radio waves, analogous to electrical telegraphy using cables. Before about 1910, the term ''wireless telegraphy'' was also used for other experimental technologies for t ...
generally used International Morse) and this remained the case until the advent of teleprinters which required entirely different codes and rendered the issue moot.


Transmission speed

The speed of sending in a manual telegraph is limited by the speed the operator can send each code element. Speeds are typically stated in
words per minute Words per minute, commonly abbreviated wpm (sometimes uppercased WPM), is a measure of words processed in a minute, often used as a measurement of the speed of typing, reading or Morse code sending and receiving. Alphanumeric entry Since words ...
. Words are not all the same length, so literally counting the words will get a different result depending on message content. Instead, a word is defined as five characters for the purpose of measuring speed, regardless of how many words are actually in the message. Morse code, and many other codes, also do not have the same length of code for each character of the word, again introducing a content-related variable. To overcome this, the speed of the operator repeatedly transmitting a standard word is used. PARIS is classically chosen as this standard because that is the length of an average word in Morse. In American Morse, the characters are generally shorter than International Morse. This is partly because American Morse uses more dot elements, and partly because the most common dash, the short dash, is shorter than the International Morse dash—two dot elements against three dot elements long. In principle, American Morse will be transmitted faster than International Morse if all other variables are equal. In practice, there are two things that detract from this. Firstly, American Morse, with around five coding elements was harder to get the timings right when sent quickly. Inexperienced operators were apt to send garbled messages, an effect known as hog Morse. The second reason is that American Morse is more prone to
intersymbol interference In telecommunication, intersymbol interference (ISI) is a form of distortion of a signal in which one symbol interferes with subsequent symbols. This is an unwanted phenomenon as the previous symbols have a similar effect as noise, thus making ...
(ISI) because of the larger density of closely spaced dots. This problem was particularly severe on
submarine telegraph cable A submarine communications cable is a cable laid on the sea bed between land-based stations to carry telecommunication signals across stretches of ocean and sea. The first submarine communications cables laid beginning in the 1850s carried tel ...
s, making American Morse less suitable for international communications. The only solution an operator had immediately to hand to deal with ISI was to slow down the transmission speed.


Language character encodings

Morse code for non-Latin alphabets This is a summary of the use of Morse code to represent alphabets other than Latin. Greek The Greek Morse code alphabet is very similar to the Latin alphabet. It uses one extra letter for Greek letter and no longer uses the codes for Latin lette ...
, such as
Cyrillic The Cyrillic script ( ), Slavonic script or the Slavic script, is a writing system used for various languages across Eurasia. It is the designated national script in various Slavic, Turkic, Mongolic, Uralic, Caucasian and Iranic-speaking co ...
or
Arabic script The Arabic script is the writing system used for Arabic and several other languages of Asia and Africa. It is the second-most widely used writing system in the world by number of countries using it or a script directly derived from it, and th ...
, is achieved by constructing a
character encoding Character encoding is the process of assigning numbers to graphical characters, especially the written characters of human language, allowing them to be stored, transmitted, and transformed using digital computers. The numerical values tha ...
for the alphabet in question using the same, or nearly the same code points as used in the
Latin alphabet The Latin alphabet or Roman alphabet is the collection of letters originally used by the ancient Romans to write the Latin language. Largely unaltered with the exception of extensions (such as diacritics), it used to write English and the ...
.
Syllabaries In the linguistic study of written languages, a syllabary is a set of written symbols that represent the syllables or (more frequently) moras which make up words. A symbol in a syllabary, called a syllabogram, typically represents an (optiona ...
, such as Japanese
katakana is a Japanese syllabary, one component of the Japanese writing system along with hiragana, kanji and in some cases the Latin script (known as rōmaji). The word ''katakana'' means "fragmentary kana", as the katakana characters are derived f ...
, are also handled this way (
Wabun code is a form of Morse code used to send Japanese language in kana characters. Unlike International Morse Code, which represents letters of the Latin script, in Wabun each symbol represents a Japanese kana. For this reason, Wabun code is also someti ...
). The alternative of adding more code points to Morse code for each new character would result in code transmissions being very long in some languages. Languages that use
logogram In a written language, a logogram, logograph, or lexigraph is a written character that represents a word or morpheme. Chinese characters (pronounced '' hanzi'' in Mandarin, ''kanji'' in Japanese, ''hanja'' in Korean) are generally logograms, ...
s are more difficult to handle due to the much larger number of characters required. The
Chinese telegraph code The Chinese telegraph code, Chinese telegraphic code, or Chinese commercial code ( or ) is a four-digit decimal code (character encoding) for electrically telegraphing messages written with Chinese characters. Encoding and decoding A codebook ...
uses a codebook of around 9,800 characters (7,000 when originally launched in 1871) which are each assigned a four-digit number. It is these numbers that are transmitted, so Chinese Morse code consists entirely of numerals. The numbers must be looked up at the receiving end making this a slow process, but in the era when telegraph was widely used, skilled Chinese
telegrapher A telegraphist (British English), telegrapher (American English), or telegraph operator is an operator who uses a telegraph key to send and receive the Morse code in order to communicate by land lines or radio. During the Great War the Royal ...
s could recall many thousands of the common codes from memory. The Chinese telegraph code is still used by law enforcement because it is an unambiguous method of recording Chinese names in non-Chinese scripts.


Automatic telegraph codes


Baudot code

Early
printing telegraph The printing telegraph was invented by Royal Earl House in 1846. House's equipment could transmit around 40 instantly readable words per minute, but was difficult to manufacture in bulk. The printer could copy and print out up to 2,000 words per ...
s continued to use Morse code, but the operator no longer sent the dots and dashes directly with a single key. Instead they operated a piano keyboard with the characters to be sent marked on each key. The machine generated the appropriate Morse code point from the key press. An entirely new type of code was developed by Émile Baudot, patented in 1874. The
Baudot code The Baudot code is an early character encoding for telegraphy invented by Émile Baudot in the 1870s. It was the predecessor to the International Telegraph Alphabet No. 2 (ITA2), the most common teleprinter code in use until the advent of ASCII ...
was a 5-bit binary code, with the bits sent serially. Having a fixed length code greatly simplified the machine design. The operator entered the code from a small 5-key piano keyboard, each key corresponding to one bit of the code. Like Morse, Baudot code was organised to minimise operator fatigue with the code points requiring the fewest key presses assigned to the most common letters. Early printing telegraphs required mechanical synchronisation between the sending and receiving machine. The Hughes printing telegraph of 1855 achieved this by sending a Morse dash every revolution of the machine. A different solution was adopted in conjunction with the Baudot code. Start and stop bits were added to each character on transmission, which allowed asynchronous serial communication. This scheme of start and stop bits was followed on all the later major telegraph codes.


Murray code

On busy telegraph lines, a variant of the Baudot code was used with
punched paper tape Five- and eight-hole punched paper tape Paper tape reader on the Harwell computer with a small piece of five-hole tape connected in a circle – creating a physical program loop Punched tape or perforated paper tape is a form of data storage ...
. This was the Murray code, invented by Donald Murray in 1901. Instead of directly transmitting to the line, the keypresses of the operator punched holes in the tape. Each row of holes across the tape had five possible positions to punch, corresponding to the five bits of the Murray code. The tape was then run through a tape reader which generated the code and sent it down the telegraph line. The advantage of this system was that multiple messages could be sent to line very fast from one tape, making better use of the line than direct manual operation could. Murray completely rearranged the character encoding to minimise wear on the machine since operator fatigue was no longer an issue. Thus, the character sets of the original Baudot and the Murray codes are not compatible. The five bits of the Baudot code are insufficient to represent all the letters, numerals, and punctuation required in a text message. Further, additional characters are required by printing telegraphs to better control the machine. Examples of these
control character In computing and telecommunication, a control character or non-printing character (NPC) is a code point (a number) in a character set, that does not represent a written symbol. They are used as in-band signaling to cause effects other than the ...
s are
line feed Newline (frequently called line ending, end of line (EOL), next line (NEL) or line break) is a control character or sequence of control characters in character encoding specifications such as ASCII, EBCDIC, Unicode, etc. This character, or ...
and carriage return. Murray solved this problem by introducing shift codes. These codes instruct the receiving machine to change the character encoding to a different character set. Two shift codes were used in the Murray code; figure shift and letter shift. Another control character introduced by Murray was the delete character (DEL, code 11111) which punched out all five holes on the tape. Its intended purpose was to remove erroneous characters from the tape, but Murray also used multiple DELs to mark the boundary between messages. Having all the holes punched out made a perforation which was easy to tear into separate messages at the receiving end. A variant of the Baudot–Murray code became an international standard as International Telegraph Alphabet no. 2 (ITA 2) in 1924. The "2" in ITA 2 is because the original Baudot code became the basis for ITA 1. ITA 2 remained the standard telegraph code in use until the 1960s and was still in use in places well beyond then.


Computer age

The
teleprinter A teleprinter (teletypewriter, teletype or TTY) is an electromechanical device that can be used to send and receive typed messages through various communications channels, in both point-to-point and point-to-multipoint configurations. Init ...
was invented in 1915. This is a printing telegraph with a typewriter-like keyboard on which the operator types the message. Nevertheless,
telegram Telegraphy is the long-distance transmission of messages where the sender uses symbolic codes, known to the recipient, rather than a physical exchange of an object bearing the message. Thus flag semaphore is a method of telegraphy, whereas ...
s continued to be sent in
upper case Letter case is the distinction between the letters that are in larger uppercase or capitals (or more formally ''majuscule'') and smaller lowercase (or more formally ''minuscule'') in the written representation of certain languages. The writing ...
only because there was not room for a lower case character set in Baudot–Murray or ITA 2 codes. This changed with the arrival of computers and the desire to interface computer-generated messages or word processor composed documents with the telegraph system. An immediate problem was the use of shift codes which caused a difficulty with computer storage of text. If a part of a message, or only one character, was retrieved, it was not possible to tell which encoding shift should be applied without searching back through the rest of the message for the last shift control. This led to the introduction of the 6-bit TeleTypeSetter (TTS) code. In TTS, the additional bit was used to store the shift state, thus obviating the need for shift characters. TTS was also of some benefit to teleprinters as well as computers. Corruption of a TTS transmitted letter code just resulted in one wrong letter being printed, which could probably be corrected by the receiving user. On the other hand, corruption of an shift character resulted in all the message from that point onwards being garbled until the next shift character was sent.


ASCII

By the 1960s, improving teleprinter technology meant that longer codes were nowhere near as significant a factor in teleprinter costs as they once were. The computer users' wanted lowercase characters and additional punctuation and both teleprinter and computer manufacturers wished to get rid of ITA 2 and its shift codes. This led the
American Standards Association The American National Standards Institute (ANSI ) is a private non-profit organization that oversees the development of voluntary consensus standards for products, services, processes, systems, and personnel in the United States. The organ ...
to develop a 7-bit code, the American Standard Code for Information Interchange (
ASCII ASCII ( ), abbreviated from American Standard Code for Information Interchange, is a character encoding standard for electronic communication. ASCII codes represent text in computers, telecommunications equipment, and other devices. Because ...
). The final form of ASCII was published in 1964 and it rapidly became the standard teleprinter code. ASCII was the last major code developed explicitly with telegraphy equipment in mind. Telegraphy rapidly declined after this and was largely replaced by
computer networks A computer network is a set of computers sharing resources located on or provided by network nodes. The computers use common communication protocols over digital interconnections to communicate with each other. These interconnections are m ...
, especially the
Internet The Internet (or internet) is the global system of interconnected computer networks that uses the Internet protocol suite (TCP/IP) to communicate between networks and devices. It is a '' network of networks'' that consists of private, p ...
in the 1990s. ASCII had several features geared to aid computer programming. The letter characters were in numerical order of code point, so an alphabetical sort could be achieved simply by sorting the data numerically. The code point for corresponding upper and lower case letters differed only by the value of bit 6, allowing a mix of cases to be sorted alphabetically if this bit was ignored. Other codes were introduced, notably IBM's
EBCDIC Extended Binary Coded Decimal Interchange Code (EBCDIC; ) is an eight- bit character encoding used mainly on IBM mainframe and IBM midrange computer operating systems. It descended from the code used with punched cards and the corresponding ...
derived from the
punched card A punched card (also punch card or punched-card) is a piece of stiff paper that holds digital data represented by the presence or absence of holes in predefined positions. Punched cards were once common in data processing applications or to di ...
method of input, but it was ASCII and its derivatives that won out as the ''lingua franca'' of computer information exchange.


ASCII extension and Unicode

The arrival of the
microprocessor A microprocessor is a computer processor where the data processing logic and control is included on a single integrated circuit, or a small number of integrated circuits. The microprocessor contains the arithmetic, logic, and control circ ...
in the 1970s and the
personal computer A personal computer (PC) is a multi-purpose microcomputer whose size, capabilities, and price make it feasible for individual use. Personal computers are intended to be operated directly by an end user, rather than by a computer expert or te ...
in the 1980s with their 8-bit architecture led to the 8-bit
byte The byte is a unit of digital information that most commonly consists of eight bits. Historically, the byte was the number of bits used to encode a single character of text in a computer and for this reason it is the smallest addressable uni ...
becoming the standard unit of computer storage. Packing 7-bit data into 8-bit storage is inconvenient for data retrieval. Instead, most computers stored one ASCII character per byte. This left one bit over that was not doing anything useful. Computer manufacturers used this bit in
extended ASCII Extended ASCII is a repertoire of character encodings that include (most of) the original 96 ASCII character set, plus up to 128 additional characters. There is no formal definition of "extended ASCII", and even use of the term is sometimes critic ...
to overcome some of the limitations of standard ASCII. The main issue was that ASCII was geared to English, particularly American English, and lacked the
accented A diacritic (also diacritical mark, diacritical point, diacritical sign, or accent) is a glyph added to a letter or to a basic glyph. The term derives from the Ancient Greek (, "distinguishing"), from (, "to distinguish"). The word ''diacriti ...
vowels used in other European languages such as French. Currency symbols for other countries were also added to the character set. Unfortunately, different manufacturers implemented different extended ASCIIs making them incompatible across platforms. In 1987, the International Standards Organisation issued the standard
ISO 8859-1 ISO/IEC 8859-1:1998, ''Information technology — 8-bit single-byte coded graphic character sets — Part 1: Latin alphabet No. 1'', is part of the ISO/IEC 8859 series of ASCII-based standard character encodings, first edition published in ...
, for an 8-bit character encoding based on 7-bit ASCII which was widely taken up.
ISO 8859 ISO/IEC 8859 is a joint ISO and IEC series of standards for 8-bit character encodings. The series of standards consists of numbered parts, such as ISO/IEC 8859-1, ISO/IEC 8859-2, etc. There are 15 parts, excluding the abandoned ISO/IEC 8859-12. ...
character encodings were developed for non-
Latin script The Latin script, also known as Roman script, is an alphabetic writing system based on the letters of the classical Latin alphabet, derived from a form of the Greek alphabet which was in use in the ancient Greek city of Cumae, in southern ...
s such as
Cyrillic The Cyrillic script ( ), Slavonic script or the Slavic script, is a writing system used for various languages across Eurasia. It is the designated national script in various Slavic, Turkic, Mongolic, Uralic, Caucasian and Iranic-speaking co ...
,
Hebrew Hebrew (; ; ) is a Northwest Semitic language of the Afroasiatic language family. Historically, it is one of the spoken languages of the Israelites and their longest-surviving descendants, the Jews and Samaritans. It was largely preserved ...
,
Arabic Arabic (, ' ; , ' or ) is a Semitic language spoken primarily across the Arab world.Semitic languages: an international handbook / edited by Stefan Weninger; in collaboration with Geoffrey Khan, Michael P. Streck, Janet C. E.Watson; Walter ...
, and
Greek Greek may refer to: Greece Anything of, from, or related to Greece, a country in Southern Europe: *Greeks, an ethnic group. *Greek language, a branch of the Indo-European language family. **Proto-Greek language, the assumed last common ancestor ...
. This was still problematic if a document or data used more than one script. Multiple switches between character encodings was required. This was solved by the publication in 1991 of the standard for 16-bit
Unicode Unicode, formally The Unicode Standard,The formal version reference is is an information technology standard for the consistent encoding, representation, and handling of text expressed in most of the world's writing systems. The standard, ...
, in development since 1987. Unicode maintained ASCII characters at the same code points for compatibility. As well as support for non-Latin scripts, Unicode provided code points for logograms such as
Chinese characters Chinese characters () are logograms developed for the writing of Chinese. In addition, they have been adapted to write other East Asian languages, and remain a key component of the Japanese writing system where they are known as ''kanji ...
and many specialist characters such as astrological and mathematical symbols. In 1996, Unicode 2.0 allowed code points greater than 16-bit; up to 20-bit, and 21-bit with an additional private use area. 20-bit Unicode provided support for extinct languages such as
Old Italic script The Old Italic scripts are a family of similar ancient writing systems used in the Italian Peninsula between about 700 and 100 BC, for various languages spoken in that time and place. The most notable member is the Etruscan alphabet, whic ...
and many rarely used Chinese characters.


International Code of Signals (radiotelegraph)

In 1931, the International Code of Signals, originally created for ship communication by signalling using flags, was expanded by adding a collection of five-letter codes to be used by radiotelegraph operators.


Comparison of codes


Comparison of flag codes


Table 1 notes


Comparison of needle codes


Table 2 notes

An alternative representation of needle codes is to use the numeral "1" for needle left, and "3" for needle right. The numeral "2", which does not appear in most codes represents the needle in the neutral upright position. The codepoints using this scheme are marked on the face of some needle instruments, especially those used for training.


Comparison of dot-dash codes


Table 3 notes

When used with a
printing telegraph The printing telegraph was invented by Royal Earl House in 1846. House's equipment could transmit around 40 instantly readable words per minute, but was difficult to manufacture in bulk. The printer could copy and print out up to 2,000 words per ...
or
siphon recorder The syphon or siphon recorder is an obsolete electromechanical device used as a receiver for submarine telegraph cables invented by William Thomson, 1st Baron Kelvin in 1867. It automatically records an incoming telegraph message as a wiggling in ...
, the "dashes" of dot-dash codes are often made the same length as the "dot". Typically, the mark on the tape for a dot is made above the mark for a dash. An example of this can be seen in the 1837 Steinheil code, which is nearly identical to the 1849 Steinheil code, except that they are represented differently in the table. International Morse code was commonly used in this form on
submarine telegraph cable A submarine communications cable is a cable laid on the sea bed between land-based stations to carry telecommunication signals across stretches of ocean and sea. The first submarine communications cables laid beginning in the 1850s carried tel ...
s.Bright, pp. 601–606


Comparison of binary codes


Table 4 notes


See also

*
Commercial code (communications) In telecommunication, a commercial code is a code once used to save on cablegram costs. Telegraph (and telex) charged per word sent, so companies which sent large volumes of telegrams developed codes to save money on tolls. Elaborate commercial cod ...
*
Great Western Railway telegraphic codes Great Western Railway telegraphic codes were a commercial telegraph code used to shorten the telegraphic messages sent between the stations and offices of the railway. The codes listed below are taken from the 1939 edition of the ''Telegraph M ...


References


Bibliography

* Beauchamp, Ken, '' History of Telegraphy'', IET, 2001 . * Bouchet, Olivier, ''Wireless Optical Communications'', Wiley, 2012 . * Bright, Charles Tilston
''Submarine Telegraphs''
London: Crosby Lockwood, 1898 . * Burns, Russel W., ''Communications: An International History of the Formative Years'', IEE, 2004 . * Calvert, James B.

accessed an

13 October 2019. * Chesnoy, Jose, ''Undersea Fiber Communication Systems'', Academic Press, 2002 . * Coe, Lewis, ''The Telegraph: A History of Morse's Invention and Its Predecessors in the United States'', McFarland, 2003 . * Edelcrantz, Abraham Niclas, ''Afhandling om Telegrapher'' ("A Treatise on Telegraphs"), 1796, as translated in ch. 4 of Holzmann & Pehrson. * Gerke, Friedrich Clemens
''Der praktische Telegraphist, oder, Die electro-magnetische Telegraphie''
Hoffmann und Campe, 1851 . * Gillam, Richard, ''Unicode Demystified'', Addison-Wesley Professional, 2003 . * Gollings, Gus, "Multilingual Script Encoding", ch. 6 in, Cope, Bill; Gollings, Gus, ''Multilingual Book Production'', Common Ground, 2001 . * Guillemin, Amédée
''The Applications of Physical Forces''
Macmillan and Company, 1877 . * Hallas, Stuart, M.

accessed an

5 October 2019. * Highton, Edward, ''The Electric Telegraph: Its History and Progress'', J. Weale, 1852 . * Holzmann, Gerard J.; Pehrson, Björn, ''The Early History of Data Networks'', Wiley, 1995 . * Huurdeman, Anton A., ''The Worldwide History of Telecommunications'', John Wiley & Sons, 2003 . * Johnson, Rossiter (ed)
''Universal Cyclopædia and Atlas''
vol. 10, D. Appleton and Company, 1901 . * Kieve, Jeffrey L., ''The Electric Telegraph: A Social and Economic History'', David and Charles, 1973 . * King, Thomas W., ''Modern Morse Code in Rehabilitation and Education'', Allyn and Bacon, 2000 . * Lyall, Francis, ''International Communications: The International Telecommunication Union and the Universal Postal Union'', Routledge, 2016 . * Maver, William, Jr.
''American Telegraphy and Encyclopedia of the Telegraph''
Maver Publishing Company, 1909 . * Mullaney, Thomas S., "Semiotic Sovereignty: The 1871 Chinese Telegraph Code in Historical Perspective", pp. 153–184 in, Jing Tsu; Elman, Benjamin A. (eds), ''Science and Technology in Modern China, 1880s–1940s'', BRILL, 2014 . * Myer, Albert J., ''A New Sign Language for Deaf Mutes'', Jewett, Thomas & Co., 1851 . * Myer, Albert J.
''A Manual of Signals''
D. van Nostrand, 1866 . * Myer, Albert J.
''A Manual of Signals''
D. van Nostrand, 1872 . * Noll, A. Michael, ''The Evolution of Media'', Rowman & Littlefield, 2007 . * Raykoff, Ivan, "Piano, telegraph, typewriter: Listening to the language of touch", ch. 8 in, Colligan, Colette (ed); Linley, Margaret (ed), ''Media, Technology, and Literature in the Nineteenth Century'', Routledge, 2016 . * Salomon, David, ''Data Compression: The Complete Reference'', Springer Science & Business Media, 2007 . * Shaffner, Taliaferro Preston
''The Telegraph Manual''
Pudney & Russell, 1859 . * Shiers, George, ''The Electric Telegraph: An Historical Anthology'', Arno Press, 1977 , including reprints of parts of, ** Smithsonian Institution
''Annual Report of the Board of Regents of the Smithsonian Institution, 1878''
Smithsonian Institution, 1879 . * Toncich, Dario J., ''Data Communications and Networking for Manufacturing Industries'', Chrystobel Engineering, 1993 . * Wrixon, Fred B., ''Codes, Ciphers, Secrets and Cryptic Communication'', Black Dog & Leventhal Publishers, 2005 . * Wyatt, Allen L., ''Using Assembly Language'', Que Corporation, 1887 .


External links


Single-needle telegraph instrument
with Cooke and Wheatstone code marked on the dial and two-note endstops
Cooke and Wheatstone style single-needle instrument
with Morse code marked on the dial * James B. Calvert

shows several encodings including Schilling (1820), Gauss and Weber (1833), Steinheil (1837), C&W1 (1846), C&W2 (1843), Bregeut (1844), Russian Morse, and a comparison chart of Morse type codes including the Bain code. {{DEFAULTSORT:Telegraph Code Telegraphy