Bus width
In contrast with standard SDRAM, used in stationary devices and laptops and usually connected over a 64-bit wide memory bus, LPDDR also permits 16- or 32-bit wide channels. The "E" and "X" versions mark enhanced versions of the specifications. They formalize overclocking the memory array by usually 33%. As with standard SDRAM, most generations double the internal fetch size and external transfer speed. (DDR4 and LPDDR5 being the exceptions.)Generations
LPDDR(1)
The original low-power DDR (sometimes retroactively called LPDDR1), released in 2006 is a slightly modified form of DDR SDRAM, with several changes to reduce overall power consumption. Most significantly, the supply voltage is reduced from 2.5 to 1.8 V. Additional savings come from temperature-compensated refresh (DRAM requires refresh less often at low temperatures), partial array self refresh, and a "deep power down" mode which sacrifices all memory contents. Additionally, chips are smaller, using less board space than their non-mobile equivalents. Samsung and Micron are two of the main providers of this technology, which is used in tablet and phone devices such as the iPhone 3GS, original iPad, Samsung Galaxy Tab 7.0 and Motorola Droid X.LPDDR2
LPDDR3
In May 2012, JEDEC published the JESD209-3 Low Power Memory Device Standard.JESD209-3 LPDDR3 Low Power Memory Device StandardLPDDR3E
An "enhanced" version of the specification called LPDDR3E increases the data rate to 2133 MT/s.LPDDR4
On 14 March 2012, JEDEC hosted a conference to explore how future mobile device requirements will drive upcoming standards like LPDDR4. On 30 December 2013, Samsung announced that it had developed the first 20 nm-class 8 gigabit (1 GB) LPDDR4 capable of transmitting data at 3,200 MT/s, thus providing 50 percent higher performance than the fastest LPDDR3 and consuming around 40 percent less energy at 1.1 volts. On 25 August 2014, JEDEC published the JESD209-4 LPDDR4 Low Power Memory Device Standard. Significant changes include: * Doubling of the interface speed, and numerous consequent electrical changes, including changing the I/O standard to low-voltage swing-terminated logic (LVSTL) * Doubling of the internal prefetch size, and minimum transfer size * Change from a 10-bit DDR command/address bus to a 6-bit SDR bus * Change from one 32-bit wide bus to two independent 16-bit wide buses * Self-refresh is enabled by dedicated commands, rather than being controlled by the CKE line The standard defines SDRAM packages containing two independent 16-bit access channels, each connected to up to two dies per package. Each channel is 16 data bits wide, has its own control/address pins, and allows access to 8 banks of DRAM. Thus, the package may be connected in three ways: * Data lines and control connected in parallel to a 16-bit data bus, and only chip selects connected independently per channel. * To two halves of a 32-bit wide data bus, and the control lines in parallel, including chip select. * To two independent 16-bit wide data buses Each die provides 4, 6, 8, 12, or 16 gigabits of memory, half to each channel. Thus, each bank is one sixteenth the device size. This is organized into the appropriate number (16 K to 64 K) of 16384-bit (2048-byte) rows. Extension to 24 and 32 gigabits is planned, but it is not yet decided if this will be done by increasing the number of rows, their width, or the number of banks. Larger packages providing double width (four channels) and up to four dies per pair of channels (8 dies total per package) are also defined. Data is accessed in bursts of either 16 or 32 transfers (256 or 512 bits, 32 or 64 bytes, 8 or 16 cycles DDR). Bursts must begin on 64-bit boundaries. Since the clock frequency is higher and the minimum burst length longer than earlier standards, control signals can be more highly multiplexed without the command/address bus becoming a bottleneck. LPDDR4 multiplexes the control and address lines onto a 6-bit single data rate CA bus. Commands require 2 clock cycles, and operations encoding an address (e.g., activate row, read or write column) require two commands. For example, to request a read from an idle chip requires four commands taking 8 clock cycles: Activate-1, Activate-2, Read, CAS-2. The chip select line (CS) is active-''high''. The first cycle of a command is identified by chip select being high; it is low during the second cycle. The CAS-2 command is used as the second half of all commands that perform a transfer across the data bus, and provides low-order column address bits: * Read commands must begin on a column address which is a multiple of 4; there is no provision for communicating a non-zero C0 or C1 address bit to the memory. * Write commands must begin on a column address which is a multiple of 16; C2 and C3 must be zero for a write command. * Mode register read and some multi-purpose commands must also be followed by a CAS-2 command, however all the column bits must be zero (low). The burst length can be configured to be 16, 32, or dynamically selectable by the BL bit of read and write operations. One DMI (data mask/invert) signal is associated with each 8 data lines, and can be used to minimize the number of bits driven high during data transfers. When high, the other 8 bits are complemented by both transmitter and receiver. If a byte contains five or more 1 bits, the DMI signal can be driven high, along with three or fewer data lines. As signal lines are terminated low, this reduces power consumption. (An alternative usage, where DMI is used to limit the number of data lines which ''toggle'' on each transfer to at most 4, minimises crosstalk. This may be used by the memory controller during writes, but is not supported by the memory devices.) Data bus inversion can be separately enabled for reads and writes. For masked writes (which have a separate command code), the operation of the DMI signal depends on whether write inversion is enabled. * If DBI on writes is disabled, a high level on DMI indicates that the corresponding data byte is to be ignored and not written * If DBI on writes is enabled, a ''low'' level on DMI, combined with a data byte with 5 or more bits set, indicates a data byte to be ignored and not written. LPDDR4 also includes a mechanism for "targeted row refresh" to avoid corruption due to " row hammer" on adjacent rows. A special sequence of three activate/precharge sequences specifies the row which was activated more often than a device-specified threshold (200,000 to 700,000 per refresh cycle). Internally, the device refreshes physically adjacent rows rather than the one specified in the activate command.LPDDR4X
Samsung Semiconductor proposed an LPDDR4 variant that it called LPDDR4X. LPDDR4X is identical to LPDDR4 except additional power is saved by reducing the I/O voltage (Vddq) from 1.1 V to 0.6 V. On 9 January 2017, SK Hynix announced 8 and 16 GB LPDDR4X packages. JEDEC published the LPDDR4X standard on 8 March 2017. Aside from the lower voltage, additional improvements include a single-channel die option for smaller applications, new MCP, PoP and IoT packages, and additional definition and timing improvements for the highest 4266 MT/s speed grade.LPDDR5
On 19 February 2019, JEDEC published the JESD209-5, Standard for Low Power Double Data Rate 5 (LPDDR5). Samsung announced it had working prototype LPDDR5 chips in July 2018. LPDDR5 introduces the following changes: * Data transfer rate is increased to 6400 Mbit/s per pin * Differential clocks are used (3200 MHz, DDR) * Prefetch is ''not'' doubled again, but remains 16''n'' * The number of banks is increased to 16, divided into four DDR4-like bank groups * Power-saving improvements: ** Data-Copy and Write-X (all one or all zero) commands to decrease data transfer ** Dynamic frequency and voltage scaling * A new clocking architecture, where commands use a quarter-speed master clock (CK), while data is transferred using full-speed Write Clock (WCK) & Read Strobe (RDQS) signals which are enabled only when necessary * One set of full-speed clocks per byte (vs. per 16 bits in LPDDR4) * Elimination of the Clock Enable (CKE) pin; instead low-power mode is entered by a command over the CA bus, and lasts until the chip select signal next goes high AMD Van Gogh, Intel Tiger Lake, Apple silicon (M1 Pro, M1 Max, M1 Ultra, M2 and A16 Bionic), Huawei Kirin 9000 and Snapdragon 888 memory controllers support LPDDR5. The doubling of the transfer rate, and the quarter-speed master clock, results in a master clock which is half the frequency of a similar LPDDR4 clock. The command (CA) bus is widened to 7 bits, and commands are transferred at double data rate, so commands end up being sent at the same rate as LPDDR4. * B''n'' = Burst address bit * C''n'' = Column address bit * R''n'' = Column address bit * BA''n'' = Bank address bit * BG''n'' = Bank group address bit * AB = All banks (ignore BG & BA) * AP = Auto-precharge * MA''n'' = Mode register address bit * OP''n'' = Operation, or mode register data * WS_''xx'' = WCK synchronization * WRX = Write X; do not transfer data, but fill with all-zero or all-one * WXSA, WXSB = Write X select, value to be written * PD = Power down * DSE = Deep sleep enable Compared to earlier standards, the nomenclature for column addresses has changed. Both LPDDR4 and LPDDR5 allow up to 10 bits of column address, but the names are different. LPDDR4's C0–C9 are renamed B0–B3 and C0–C5. As with LPDDR4, writes must start at a multiple-of-16 address with B0–B3 zero, but reads may request a burst be transferred in a different order by specifying a non-zero value for B3. As with LPDDR4, to read some data requires 4 commands: two activate commands to select a row, then a CAS and a read command to select a column. Unlike LPDDR4, the CAS command comes ''before'' the read or write command. In fact, it is something of a misnomer, in that it does not select a column at all. Instead, its primary function is to prepare the DRAM to synchronize with the imminent start of the high-speed WCK clock. The WS_FS, WS_RD and WS_WR bits select various timings, with the _RD and _WR options optimized for an immediately following read or write command, while the _FS option starts the clock immediately, and may be followed by multiple reads or writes, accessing multiple banks. CAS also specifies the "write X" option. If the WRX bit is set, writes do not transfer data, but rather fill the burst with all-zeros or all-ones, under the control of the WXS (write-X select) bit. This takes the same amount of time, but saves energy. In addition to the usual bursts of 16, there are commands for performing double-length bursts of 32. Reads (but not writes) may specify a starting position within the 32-word aligned burst using the C0 and B3 bits.LPDDR5X
On 28 July 2021, JEDEC published the JESD209-5B, Standard for Low Power Double Data Rate 5/5X (LPDDR5/5X) with the following changes: * Speed extension up to 8533 Mbit/s * Signal integrity improvements with tx/rx equalization * Reliability improvements via the new Adaptive Refresh Management feature On 9 November 2021, Samsung announced that the company has developed the industry's first LPDDR5X DRAM. Samsung's implementation involves 16-gigabit (2 GB) dies, on a 14 nm process node, with modules with up to 32 dies (64 GB) in a single package. According to the company, the new modules would use 20% less power than LPDDR5. According to Andrei Frumusanu of '' AnandTech'', LPDDR5X in SoCs and other products was expected for the 2023 generation of devices. On 19 November 2021, Micron announced that Mediatek has validated its LPDDR5X DRAM for Mediatek's Dimensity 9000 5G SoC. On 25 January 2023 SK Hynix announced "Low Power Double Data Rate 5 Turbo" (LPDDR5T) chips with a bandwidth of 9.6 Gbit/s. It operates in the ultra-low voltage range of set by JEDEC. It has been incorporated into the LPDDR5X standard as LPDDR5X-9600 making "LPDDR5T" a brand name.LPDDR6
The following standard is under development. Planned extensions include: * Speed extension to 8.8–17.6 Gbit/s/pin * CA bus further narrowed to 4 bits * Data bus width of 12 bits per channel * Bursts of 24 transfers × 12 pins = 288 bits: ** 256 data bits, plus ** 16 tag/ECC bits stored by array, plus ** 16 bits for data bus inversion ''or'' link ECC, not stored. * CAMM2Notes
References
External links