A six-bit character code is a
character encoding
Character encoding is the process of assigning numbers to graphical character (computing), characters, especially the written characters of human language, allowing them to be stored, transmitted, and transformed using computers. The numerical v ...
designed for use on computers with
word lengths a multiple of 6. Six bits can only encode 64 distinct characters, so these codes generally include only the upper-case letters, the numerals, some punctuation characters, and sometimes control characters. The
7-track magnetic tape
Magnetic tape is a medium for magnetic storage made of a thin, magnetizable coating on a long, narrow strip of plastic film. It was developed in Germany in 1928, based on the earlier magnetic wire recording from Denmark. Devices that use magnetic ...
format was developed to store data in such codes, along with an additional
parity bit.
Types of six-bit codes
An early six-bit binary code was used for
Braille
Braille ( , ) is a Tactile alphabet, tactile writing system used by blindness, blind or visually impaired people. It can be read either on embossed paper or by using refreshable braille displays that connect to computers and smartphone device ...
, the reading system for the blind that was developed in the 1820s.
The earliest computers dealt with numeric data only, and made no provision for character data.
Six-bit BCD, with several variants, was used by
IBM
International Business Machines Corporation (using the trademark IBM), nicknamed Big Blue, is an American Multinational corporation, multinational technology company headquartered in Armonk, New York, and present in over 175 countries. It is ...
on early computers such as the
IBM 702 in 1953 and the
IBM 704
The IBM 704 is the model name of a large digital computer, digital mainframe computer introduced by IBM in 1954. Designed by John Backus and Gene Amdahl, it was the first mass-produced computer with hardware for floating-point arithmetic. The I ...
in 1954.
Six-bit encodings were replaced by the 8-bit
EBCDIC
Extended Binary Coded Decimal Interchange Code (EBCDIC; ) is an eight- bit character encoding used mainly on IBM mainframe and IBM midrange computer operating systems. It descended from the code used with punched cards and the corresponding si ...
code starting in 1964, when
System/360
The IBM System/360 (S/360) is a family of mainframe computer systems announced by IBM on April 7, 1964, and delivered between 1965 and 1978. System/360 was the first family of computers designed to cover both commercial and scientific applicati ...
standardized on 8-bit
byte
The byte is a unit of digital information that most commonly consists of eight bits. Historically, the byte was the number of bits used to encode a single character of text in a computer and for this reason it is the smallest addressable un ...
s. There are some variants of this type of code (see
below).
Six-bit character codes generally succeeded the five-bit
Baudot code
The Baudot code () is an early character encoding for telegraphy invented by Émile Baudot in the 1870s. It was the predecessor to the International Telegraph Alphabet No. 2 (ITA2), the most common teleprinter code in use before ASCII. Each ch ...
and preceded seven-bit
ASCII
ASCII ( ), an acronym for American Standard Code for Information Interchange, is a character encoding standard for representing a particular set of 95 (English language focused) printable character, printable and 33 control character, control c ...
.
Six-bit codes could encode more than 64 characters by the use of
Shift Out and Shift In characters, essentially incorporating two distinct 62-character sets and switching between them. For example, the popular
IBM 2741 communications terminal supported a variety of character sets of up to 88 printing characters plus control characters.
Teletypesetter code
A special 6-level extension of the 5-level
International Telegraph Alphabet was used to remotely control
Linotype machine
The Linotype machine ( ) is a "line casting" machine used in printing which is manufactured and sold by the former Mergenthaler Linotype Company and related It was a hot metal typesetting system that cast lines of metal type for one-time use. Li ...
s beginning around 1930. By 1950 it was widely used by
wire service
A news agency is an organization that gathers news reports and sells them to subscribing news organizations, such as newspapers, magazines and All-news radio, radio and News broadcasting, television Broadcasting, broadcasters. A news agency ma ...
s to send preformatted news stories to participating newspapers. It supported the 90
printable character
In ISO/IEC 646 (commonly known as ASCII) and related standards including ISO 8859 and Unicode, a graphic character, also known as printing character (or printable character), is any character intended to be written, printed, or otherwise display ...
s characters of a Linotype machine, plus
whitespace characters.
The TTS code had two pairs of shift codes allowing a total of four shift states. The first operated much like a keyboard's shift key and selected between a lower-case and digits repertoire, and an upper-case and symbols one. A second pair of Linotype-specific "lower rail" and "upper rail" shift codes would select an alternate (usually italic) font.
BCD six-bit code
Six-bit
BCD code was the adaptation of the
punched card code to
binary code
A binary code represents plain text, text, instruction set, computer processor instructions, or any other data using a two-symbol system. The two-symbol system used is often "0" and "1" from the binary number, binary number system. The binary cod ...
.
IBM
International Business Machines Corporation (using the trademark IBM), nicknamed Big Blue, is an American Multinational corporation, multinational technology company headquartered in Armonk, New York, and present in over 175 countries. It is ...
applied the terms ''binary-coded decimal'' and ''BCD'' to the variations of BCD ''alphamerics'' used in most early IBM computers, including the
IBM 1620
The IBM 1620 was a model of scientific minicomputer produced by IBM. It was announced on October 21, 1959, and was then marketed as an inexpensive scientific computer. After a total production of about two thousand machines, it was withdrawn on N ...
,
IBM 1400 series
The IBM 1400 series are second-generation (transistor) mid-range business decimal computers that IBM marketed in the early 1960s. The computers were offered to replace tabulating machines like the IBM 407. The 1400-series machines stored infor ...
, and non-
decimal architecture members of the
IBM 700/7000 series
The IBM 700/7000 series is a series of large-scale (Mainframe computer, mainframe) computer systems that were made by IBM through the 1950s and early 1960s. The series includes several different, incompatible processor architectures. The 700s ...
.
COBOL databases six-bit code
A six-bit code was also used in COBOL databases, where end-of-record information was stored separately.
Magnetic stripe card six-bit code
A six-bit code, with added odd
parity bit, is used on Track 1 of
magnetic stripe cards, as specified in
ISO/IEC 7811-2.
DEC SIXBIT code
A popular six-bit code was
DEC SIXBIT. This is simply the ASCII character codes from 32 to 95 coded as 0 to 63 by subtracting 32 (i.e., columns 2, 3, 4, and 5 of the ASCII table (16 characters to a column), shifted to columns 0 through 3, by subtracting 2 from the high bits); it includes the space, punctuation characters, numbers, and capital letters, but no control characters. Since it included no control characters, not even end-of-line, it was not used for general text processing. However, six-character names such as
filename
A filename or file name is a name used to uniquely identify a computer file in a file system. Different file systems impose different restrictions on filename lengths.
A filename may (depending on the file system) include:
* name – base ...
s and
assembler symbol
A symbol is a mark, Sign (semiotics), sign, or word that indicates, signifies, or is understood as representing an idea, physical object, object, or wikt:relationship, relationship. Symbols allow people to go beyond what is known or seen by cr ...
s could be stored in a single
36-bit word of the
PDP-10
Digital Equipment Corporation (DEC)'s PDP-10, later marketed as the DECsystem-10, is a mainframe computer family manufactured beginning in 1966 and discontinued in 1983. 1970s models and beyond were marketed under the DECsystem-10 name, especi ...
, and three characters fit in each word of the
PDP-1 and two characters fit in each word of the
PDP-8.
Another, less common, variant is obtained by just stripping the high bit of an ASCII code in 32 - 95 range (codes 32 - 63 remain at their positions, higher values have 64 subtracted from them). Such variant was sometimes used on DEC's
PDP-8 (1965).
ECMA and ISO six-bit code
A six-bit code similar to DEC's, but replacing a few punctuation characters with the most useful control characters—including
SO/SI, allowing code extension—was specified as
ECMAbr>
1in 1963. Four years later, ISO Recommendation R 646-1967 (which later evolved into
ISO Standard 646) included an almost identical six-bit code, differing only in some of the alternative options permitted for a few characters. ECMA-1 was eventually withdrawn, and ISO 646-1973 explicitly removed the six-bit code, standardizing only its 7-bit code.
ICT/ICL 6-bit character set
The
ICT (later ICL) 1900-series of mainframes used a six-bit code derived from an early 1963 version of
ASCII
ASCII ( ), an acronym for American Standard Code for Information Interchange, is a character encoding standard for representing a particular set of 95 (English language focused) printable character, printable and 33 control character, control c ...
for internal storage and processing, referred to as the "ECMA character set" in its documentation.
AIS SixBit ASCII
FIELDATA six-bit code
FIELDATA was a seven-bit code (with optional parity) of which only 64 code positions (occupying six bits) were formally defined.
A variant was used by
UNIVAC
UNIVAC (Universal Automatic Computer) was a line of electronic digital stored-program computers starting with the products of the Eckert–Mauchly Computer Corporation. Later the name was applied to a division of the Remington Rand company and ...
's 1100-series computers.
Treating the code as a six-bit code these systems used a 36-bit word (capable of storing six such reduced FIELDATA characters).
Braille six-bit code
Braille
Braille ( , ) is a Tactile alphabet, tactile writing system used by blindness, blind or visually impaired people. It can be read either on embossed paper or by using refreshable braille displays that connect to computers and smartphone device ...
characters are represented using six dot positions, arranged in a rectangle. Each position may contain a raised dot or not, so Braille can be considered to be a six-bit binary code. Some more modern Braille systems add an extra two dots, making these systems an eight-bit code instead.
Six-bit codes for binary-to-text encoding
Transmission of binary data over systems which are designed for text only can sometimes introduce problems. For example,
email
Electronic mail (usually shortened to email; alternatively hyphenated e-mail) is a method of transmitting and receiving Digital media, digital messages using electronics, electronic devices over a computer network. It was conceived in the ...
historically supported only 7-bit ASCII codes and would strip the 8th bit, thus corrupting binary data sent directly through any troublesome mail server. Other systems can cause issues by improperly interpreting control characters during storage or transmission.
A number of schemes exist to pack 8-bit data into text-only representations which can pass through text mail systems, to be decoded at the destination. Examples of 6-bit character subsets used for packing binary data include
Uuencode and
Base64
In computer programming, Base64 is a group of binary-to-text encoding schemes that transforms binary data into a sequence of printable characters, limited to a set of 64 unique characters. More specifically, the source binary data is taken 6 bits ...
. These sets contain no control characters (only printable numbers, letters, some punctuation, and maybe space) and allow data to be transmitted over any medium which is also able to transmit human-readable text.
Examples of BCD six-bit codes
IBM, which dominated commercial data processing use a variety of six-bit codes, which were tied to the character set used on
punched card
A punched card (also punch card or punched-card) is a stiff paper-based medium used to store digital information via the presence or absence of holes in predefined positions. Developed over the 18th to 20th centuries, punched cards were widel ...
s, ''see''
BCD (character encoding).
Other vendor character codes are shown below, with their
Unicode
Unicode or ''The Unicode Standard'' or TUS is a character encoding standard maintained by the Unicode Consortium designed to support the use of text in all of the world's writing systems that can be digitized. Version 16.0 defines 154,998 Char ...
equivalents.
GOST 6-bit code
Example of six-bit Braille codes
The following table shows the arrangement of characters, with the hex value, corresponding ASCII character, Braille 6-bit codes (dot combinations), Braille
Unicode
Unicode or ''The Unicode Standard'' or TUS is a character encoding standard maintained by the Unicode Consortium designed to support the use of text in all of the world's writing systems that can be digitized. Version 16.0 defines 154,998 Char ...
glyph, and general meaning (the actual meaning may change depending on context).
See also
*
Binary-coded decimal
*
BCD (character encoding)
*
CDC display code
*
DEC RADIX 50 /
MOD40
*
IBM SQUOZE
*
IBM Transcode
*
ASCII
ASCII ( ), an acronym for American Standard Code for Information Interchange, is a character encoding standard for representing a particular set of 95 (English language focused) printable character, printable and 33 control character, control c ...
*
Baudot code
The Baudot code () is an early character encoding for telegraphy invented by Émile Baudot in the 1870s. It was the predecessor to the International Telegraph Alphabet No. 2 (ITA2), the most common teleprinter code in use before ASCII. Each ch ...
*
EBCDIC
Extended Binary Coded Decimal Interchange Code (EBCDIC; ) is an eight- bit character encoding used mainly on IBM mainframe and IBM midrange computer operating systems. It descended from the code used with punched cards and the corresponding si ...
*
Unicode
Unicode or ''The Unicode Standard'' or TUS is a character encoding standard maintained by the Unicode Consortium designed to support the use of text in all of the world's writing systems that can be digitized. Version 16.0 defines 154,998 Char ...
*
ANSI X3.64
*
UTF-8
UTF-8 is a character encoding standard used for electronic communication. Defined by the Unicode Standard, the name is derived from ''Unicode Transformation Format 8-bit''. Almost every webpage is transmitted as UTF-8.
UTF-8 supports all 1,112,0 ...
*
UTF-16
*
Teletypesetter code (TTS)
References
External links
*
*
* - Digital Equipment Corporation SIXBIT character coding system
{{Character encoding
Character encoding