telecommunication Telecommunication is the transmission of information by various types of technologies over wire, radio, optical, or other electromagnetic systems. It has its origin in the desire of humans for communication over a distance greater than that ...

, a line code is a pattern of voltage, current, or photons used to represent digital data transmitted down a

communication channel A communication channel refers either to a physical transmission medium such as a wire, or to a logical connection over a multiplexed medium such as a radio channel in telecommunications and computer networking. A channel is used for informa ...

or written to a

storage medium Data storage is the recording (storing) of information (data) in a storage medium. Handwriting, phonographic recording, magnetic tape, and optical discs are all examples of storage media. Biological molecules such as RNA and DNA are conside ...

. This repertoire of signals is usually called a constrained code in data storage systems. Some signals are more prone to error than others as the physics of the communication channel or storage medium constrains the repertoire of signals that can be used reliably. Common line encodings are unipolar,

polar Polar may refer to: Geography Polar may refer to: * Geographical pole, either of two fixed points on the surface of a rotating body or planet, at 90 degrees from the equator, based on the axis around which a body rotates *Polar climate, the cli ...

, bipolar, and

Manchester code In telecommunication and data storage, Manchester code (also known as phase encoding, or PE) is a line code in which the encoding of each data bit is either low then high, or high then low, for equal time. It is a self-clocking signal with no D ...

Transmission and storage

After line coding, the signal is put through a physical communication channel, either a

transmission medium A transmission medium is a system or substance that can mediate the propagation of signals for the purposes of telecommunication. Signals are typically imposed on a wave of some kind suitable for the chosen medium. For example, data can modulat ...

data storage medium Data storage is the recording (storing) of information (data) in a storage medium. Handwriting, phonographic recording, magnetic tape, and optical discs are all examples of storage media. Biological molecules such as RNA and DNA are conside ...

.Karl Paulsen
"Coding for Magnetic Storage Mediums"
.2007. The most common physical channels are: * the line-coded signal can directly be put on a

transmission line In electrical engineering, a transmission line is a specialized cable or other structure designed to conduct electromagnetic waves in a contained manner. The term applies when the conductors are long enough that the wave nature of the transmi ...

, in the form of variations of the voltage or current (often using

differential signaling Differential signalling is a method for electrically transmitting information using two complementary signals. The technique sends the same electrical signal as a differential pair of signals, each in its own conductor. The pair of conduc ...

). * the line-coded signal (the ''

baseband In telecommunications and signal processing, baseband is the range of frequencies occupied by a signal that has not been modulated to higher frequencies. Baseband signals typically originate from transducers, converting some other variable i ...

signal'') undergoes further

pulse shaping In electronics and telecommunications, pulse shaping is the process of changing the waveform of transmitted pulses to optimize the signal for its intended purpose or the communication channel. This is often done by limiting the bandwidth of the tra ...

(to reduce its frequency bandwidth) and then is modulated (to shift its frequency) to create an ''

RF signal Radio frequency (RF) is the oscillation rate of an alternating electric current or voltage or of a magnetic, electric or electromagnetic field or mechanical system in the frequency range from around to around . This is roughly between the upper ...

'' that can be sent through free space. * the line-coded signal can be used to turn on and off a light source in

free-space optical communication Free-space optical communication (FSO) is an optical communication technology that uses light propagating in free space to wirelessly transmit data for telecommunications or computer networking. "Free space" means air, outer space, vacuum, ...

, most commonly used in an infrared

remote control In electronics, a remote control (also known as a remote or clicker) is an electronic device used to operate another device from a distance, usually wirelessly. In consumer electronics, a remote control can be used to operate devices such ...

. * the line-coded signal can be printed on paper to create a

bar code A barcode or bar code is a method of representing data in a visual, machine-readable form. Initially, barcodes represented data by varying the widths, spacings and sizes of parallel lines. These barcodes, now commonly referred to as linear or on ...

. * the line-coded signal can be converted to magnetized spots on a

hard drive A hard disk drive (HDD), hard disk, hard drive, or fixed disk is an electro-mechanical data storage device that stores and retrieves digital data using magnetic storage with one or more rigid rapidly rotating platters coated with mag ...

tape drive A tape drive is a data storage device that reads and writes data on a magnetic tape. Magnetic tape data storage is typically used for offline, archival data storage. Tape media generally has a favorable unit cost and a long archival stability. ...

. * the line-coded signal can be converted to pits on an

optical disc In computing and optical disc recording technologies, an optical disc (OD) is a flat, usually circular disc that encodes binary data ( bits) in the form of pits and lands on a special material, often aluminum, on one of its flat surface ...

. Some of the more common binary line codes include: Digital signal encoding formats-en

Each line code has advantages and disadvantages. Line codes are chosen to meet one or more of the following criteria: * Minimize transmission hardware * Facilitate synchronization * Ease error detection and correction * Achieve a target

spectral density The power spectrum S_(f) of a time series x(t) describes the distribution of power into frequency components composing that signal. According to Fourier analysis, any physical signal can be decomposed into a number of discrete frequencies ...

* Eliminate a

DC component DC, D.C., D/C, Dc, or dc may refer to: Places * Washington, D.C. (District of Columbia), the capital and the federal territory of the United States * Bogotá, Distrito Capital, the capital city of Colombia * Dubai City, as distinct from t ...

Disparity

Most long-distance communication channels cannot reliably transport a

. The DC component is also called the ''disparity'', the ''bias'', or the DC coefficient. The disparity of a bit pattern is the difference in the number of one bits vs the number of zero bits. The ''running disparity'' is the running total of the disparity of all previously transmitted bits. The simplest possible line code, unipolar, gives too many errors on such systems, because it has an unbounded DC component. Most line codes eliminate the DC component such codes are called DC-balanced, zero-DC, or DC-free. There are three ways of eliminating the DC component: * Use a constant-weight code. Each transmitted

code word In communication, a code word is an element of a standardized code or protocol. Each code word is assembled in accordance with the specific rules of the code and assigned a unique meaning. Code words are typically used for reasons of reliability, ...

in a constant-weight code is designed such that every code word that contains some positive or negative levels also contains enough of the opposite levels, such that the average level over each code word is zero. Examples of constant-weight codes include

and

Interleaved 2 of 5 Interleaved 2 of 5 (ITF) is a continuous two-width barcode symbology encoding digits. It is used commercially on 135 film, for ITF-14 barcodes, and on cartons of some products, while the products inside are labeled with UPC or EAN. ITF enco ...

. * Use a

paired disparity code In telecommunication, a paired disparity code is a line code in which at least one of the data characters is represented by two codewords of opposite disparity that are used in sequence so as to minimize the total disparity of a longer sequence o ...

. Each code word in a paired disparity code that averages to a negative level is paired with another code word that averages to a positive level. The transmitter keeps track of the running DC buildup, and picks the code word that pushes the DC level back towards zero. The receiver is designed so that either code word of the pair decodes to the same data bits. Examples of paired disparity codes include

alternate mark inversion In telecommunication, bipolar encoding is a type of return-to-zero (RZ) line code, where two nonzero values are used, so that the three values are +, −, and zero. Such a signal is called a duobinary signal. Standard bipolar encodings are designed ...

8b/10b In telecommunications, 8b/10b is a line code that maps 8-bit words to 10-bit symbols to achieve DC balance and bounded disparity, and at the same time provide enough state changes to allow reasonable clock recovery. This means that the diffe ...

and 4B3T. * Use a scrambler. For example, the scrambler specified in for

64b/66b encoding In data networking and transmission, 64b/66b is a line code that transforms 64- bit data to 66-bit line code to provide enough state changes to allow reasonable clock recovery and alignment of the data stream at the receiver. It was defined by t ...

Polarity

Bipolar line codes have two polarities, are generally implemented as RZ, and have a radix of three since there are three distinct output levels (negative, positive and zero). One of the principle advantages of this type of code is that it can eliminate any DC component. This is important if the signal must pass through a transformer or a long transmission line. Unfortunately, several long-distance communication channels have polarity ambiguity. Polarity-insensitive line codes compensate in these channels. There are three ways of providing unambiguous reception of 0 and 1 bits over such channels: * Pair each code word with the polarity-inverse of that code word. The receiver is designed so that either code word of the pair decodes to the same data bits. Examples include

Differential Manchester encoding Differential Manchester encoding (DM) is a line code in digital frequency modulation in which data and clock signals are combined to form a single two-level self-synchronizing data stream. In various specific applications, this method is also call ...

, coded mark inversion and Miller encoding. *

differential coding In digital communications, differential coding is a technique used to provide ''unambiguous'' signal reception when using some types of modulation. It makes data to be transmitted to depend not only on the current signal state (or symbol), but also ...

each symbol relative to the previous symbol. Examples include MLT-3 encoding and NRZI. * Invert the whole stream when inverted syncwords are detected, perhaps using polarity switching

Run-length limited codes

For reliable

clock recovery In serial communication of digital data, clock recovery is the process of extracting timing information from a serial data stream itself, allowing the timing of the data in the stream to be accurately determined without separate clock information. ...

at the receiver, a run-length limitation may be imposed on the generated channel sequence, i.e., the maximum number of consecutive ones or zeros is bounded to a reasonable number. A clock period is recovered by observing transitions in the received sequence, so that a maximum run length guarantees sufficient transitions to assure clock recovery quality. RLL codes are defined by four main parameters: ''m'', ''n'', ''d'', ''k''. The first two, ''m''/''n'', refer to the rate of the code, while the remaining two specify the minimal ''d'' and maximal ''k'' number of zeroes between consecutive ones. This is used in both

and storage systems that move a medium past a fixed

recording head A recording head is the physical interface between a recording apparatus and a moving recording medium. Recording heads are generally classified according to the physical principle that allows them to impress their data upon their medium. A reco ...

. Specifically, RLL bounds the length of stretches (runs) of repeated bits during which the signal does not change. If the runs are too long, clock recovery is difficult; if they are too short, the high frequencies might be attenuated by the communications channel. By

modulating In music, modulation is the change from one tonality ( tonic, or tonal center) to another. This may or may not be accompanied by a change in key signature (a key change). Modulations articulate or create the structure or form of many pieces, a ...

the

data In the pursuit of knowledge, data (; ) is a collection of discrete values that convey information, describing quantity, quality, fact, statistics, other basic units of meaning, or simply sequences of symbols that may be further interpret ...

, RLL reduces the timing uncertainty in decoding the stored data, which would lead to the possible erroneous insertion or removal of bits when reading the data back. This mechanism ensures that the boundaries between bits can always be accurately found (preventing

bit slip In digital transmission, bit slip is the loss or gain of a bit or bits, caused by clock drift – variations in the respective clock rates of the transmitting and receiving devices. One cause of bit slippage is overflow of a receive buffer that ...

), while efficiently using the media to reliably store the maximal amount of data in a given space. Early disk drives used very simple encoding schemes, such as RLL (0,1) FM code, followed by RLL (1,3) MFM code which were widely used in

hard disk drive A hard disk drive (HDD), hard disk, hard drive, or fixed disk is an electro-mechanical data storage device that stores and retrieves digital data using magnetic storage with one or more rigid rapidly rotating platters coated with mag ...

s until the mid-1980s and are still used in digital optical discs such as CD, DVD, MD, Hi-MD and

Blu-ray The Blu-ray Disc (BD), often known simply as Blu-ray, is a digital optical disc data storage format. It was invented and developed in 2005 and released on June 20, 2006 worldwide. It is designed to supersede the DVD format, and capable of st ...

using EFM and EFMPLus codes. Higher density RLL (2,7) and RLL (1,7) codes became the

de facto standard A ''de facto'' standard is a custom or convention that has achieved a dominant position by public acceptance or market forces (for example, by early entrance to the market). is a Latin phrase (literally " in fact"), here meaning "in practice b ...

s for hard disks by the early 1990s.

Synchronization

Line coding should make it possible for the receiver to synchronize itself to the phase of the received signal. If the clock recovery is not ideal, then the signal to be decoded will not be sampled at the optimal times. This will increase the probability of error in the received data. Biphase line codes require at least one transition per bit time. This makes it easier to synchronize the transceivers and detect errors, however, the baud rate is greater than that of NRZ codes.

Other considerations

A line code will typically reflect technical requirements of the transmission medium, such as

optical fiber An optical fiber, or optical fibre in Commonwealth English, is a flexible, transparent fiber made by drawing glass ( silica) or plastic to a diameter slightly thicker than that of a human hair Hair is a protein filament that grows ...

shielded twisted pair Twisted pair cabling is a type of wiring used for communications in which two conductors of a single circuit are twisted together for the purposes of improving electromagnetic compatibility. Compared to a single conductor or an untwisted ba ...

. These requirements are unique for each medium, because each one has different behavior related to interference, distortion, capacitance and attenuation.

Common line codes

* 2B1Q * 4B3T *

4B5B In telecommunication, 4B5B is a form of data communications line code. 4B5B maps groups of 4 bits of data onto groups of 5 bits for transmission. These 5-bit words are pre-determined in a dictionary and they are chosen to ensure that there will b ...

6b/8b encoding In telecommunications, 6b/8b is a line code that expands 6- bit codes to 8-bit symbols for the purposes of maintaining DC-balance in a communications system. The 6b/8b encoding is a balanced code -- each 8-bit output symbol contains 4 zero bits a ...

8b/10b encoding In telecommunications, 8b/10b is a line code that maps 8-bit words to 10-bit symbols to achieve DC balance and bounded disparity, and at the same time provide enough state changes to allow reasonable clock recovery. This means that the diff ...

* 128b/130b encoding *

Alternate mark inversion In telecommunication, bipolar encoding is a type of return-to-zero (RZ) line code, where two nonzero values are used, so that the three values are +, −, and zero. Such a signal is called a duobinary signal. Standard bipolar encodings are designed ...

(AMI) * Coded mark inversion (CMI) * EFMPlus, used in DVDs *

Eight-to-fourteen modulation Eight-to-fourteen modulation (EFM) is a data encoding technique – formally, a ''line code'' – used by compact discs (CD), laserdiscs (LD) and pre-Hi-MD MiniDiscs. EFMPlus is a related code, used in DVDs and Super Audio CDs (SACDs). EFM and E ...

(EFM), used in

compact disc The compact disc (CD) is a digital optical disc data storage format that was co-developed by Philips and Sony to store and play digital audio recordings. In August 1982, the first compact disc was manufactured. It was then released in O ...

s *

Hamming code In computer science and telecommunication, Hamming codes are a family of linear error-correcting codes. Hamming codes can detect one-bit and two-bit errors, or correct one-bit errors without detection of uncorrected errors. By contrast, the sim ...

* Hybrid ternary code *

and differential Manchester * Mark and space * MLT-3 encoding * Modified AMI codes: B8ZS, B6ZS, B3ZS, HDB3 * Modified frequency modulation, Miller encoding and delay encoding *

Non-return-to-zero In telecommunication, a non-return-to-zero (NRZ) line code is a binary code in which ones are represented by one significant condition, usually a positive voltage, while zeros are represented by some other significant condition, usually a negat ...

(NRZ) * Non-return-to-zero, inverted (NRZI) * Pulse-position modulation * Return-to-zero (RZ) *

TC-PAM Trellis-coded pulse-amplitude modulation (TC-PAM) is the modulation format that is used in HDSL2 and G.SHDSL. It is a variant of trellis coded modulation (TCM) which uses a one-dimensional pulse-amplitude modulation (PAM) symbol space, as opposed ...

Optical line codes

Alternate-Phase Return-to-Zero Alternate-Phase Return-to-Zero (APRZ) is an optical line code. In APRZ the field intensity drops to zero between consecutive bits, and the field phase alternates between neighbouring bits, so that if the phase of the signal is, for example, 0 in ...

(APRZ) *

Carrier-Suppressed Return-to-Zero Carrier-Suppressed Return-to-Zero (CSRZ) is an optical line code. In CSRZ the field intensity drops to zero between consecutive bits ( RZ), and the field phase alternates by π radians between neighbouring bits, so that if the phase of the signal i ...

(CSRZ) * Three of Six, Fiber Optical (TS-FO)

References

External links

Line Coding Lecture No. 9

Line Coding in Digital Communication

CodSim 2.0: Open source simulator for Digital Data Communications Model at the University of Malaga written in HTML
{{Bit-encoding * Physical layer protocols Coding theory