The Machine is an experimental computer made by Hewlett Packard Enterprise. It was created as part of a research project to develop a new type of computer architecture for servers. The design focused on a “memory-centric computing” architecture, where NVRAM replaced traditional DRAM and disks in the memory hierarchy. The NVRAM was byte-addressable and could be accessed from any CPU via a photonic interconnect. The aim of the project was to build and evaluate this new design.


Hardware overview

The Machine was a computer cluster with many individual nodes connected over a memory fabric. The fabric interconnect used VCSEL-based silicon photonics with a custom chip called the X1. Access to memory was non-uniform and could involve multiple hops. The Machine was envisioned as a rack-scale computer, initially with 80 processors and 320 TB of fabric-attached memory, with the potential to scale to more enclosures and up to 32 ZB. The fabric-attached memory was not cache coherent, and software had to be aware of this property. Since traditional locks need cache coherency, hardware was added to the bridges to perform atomic operations at that level. Each node also had a limited amount (256 GB) of local, private, cache-coherent memory. Storage and compute on each node had completely separate power domains.

The whole fabric-attached memory of The Machine was too large to be mapped into a processor's virtual address space, which was 48 bits wide, so a way was needed to map windows of the fabric-attached memory into processor memory. Communication between each node's SoC and the memory pool therefore went through an FPGA-based “Z-bridge” component that managed the memory mapping of the local SoC to the fabric-attached memory. The Z-bridge dealt with two kinds of addresses: 53-bit logical Z addresses and 75-bit Z addresses, which allow addressing 8 PB and 32 ZB respectively. Each Z-bridge also contained a firewall to enforce access control. The interconnect protocol was developed in-house and known as Next Generation Memory Interconnect (NGMI); it later evolved into the open Gen-Z standard. The Z-bridge connected to the SoC using PCIe, avoiding major software changes.

A half-rack prototype of The Machine was unveiled at HPE Discover in London in 2016. Each node contained ARMv8-A-based Broadcom/Cavium ThunderX2 SoCs; in total there were 40 32-core SoCs. Due to the unavailability of adequate memristor-based NVRAM or phase-change memory, the prototype used 160 TB of battery-backed DRAM. Despite this setback, software architect Keith Packard said this "can be used to prove the other parts of the design before switching". According to The Register, HPE's partnership with SK Hynix to develop memristor-based NVRAM ran into funding and directional problems, and HPE was working with SanDisk on resistive RAM (ReRAM) for The Machine. According to The Next Platform, HPE considered switching to Intel Optane DIMMs "when production quantities of are available on the market". The Next Platform estimated the rack prototype's power consumption at 24 kW to 36 kW.
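
The address widths given in this section can be checked with a little arithmetic. The following is a minimal sketch in Python, assuming binary prefixes (1 TB = 2^40 bytes, and so on), which the article does not state explicitly. The function and constant names are illustrative and are not taken from HPE's software.

```python
# Sizes of the address spaces mentioned in the text, assuming binary
# prefixes (1 TB = 2**40 bytes); the article does not say which it uses.
UNITS = ["B", "KB", "MB", "GB", "TB", "PB", "EB", "ZB"]


def addressable(bits: int) -> str:
    """Return the size of a byte-addressed space with `bits` address bits."""
    size = 2 ** bits
    unit = 0
    # Divide down by 1024 until the value fits the largest suitable unit.
    while size >= 1024 and unit < len(UNITS) - 1:
        size //= 1024
        unit += 1
    return f"{size} {UNITS[unit]}"


VIRTUAL_BITS = 48      # processor virtual address width
LOGICAL_Z_BITS = 53    # logical Z address width
Z_BITS = 75            # Z address width

print(addressable(VIRTUAL_BITS))    # 256 TB
print(addressable(LOGICAL_Z_BITS))  # 8 PB
print(addressable(Z_BITS))          # 32 ZB

# The planned 320 TB pool does not fit in a 48-bit virtual address space,
# which is why windows of it have to be mapped in.
PLANNED_POOL_TB = 320
print(PLANNED_POOL_TB * 2**40 > 2**VIRTUAL_BITS)  # True
```

Under this assumption, a 48-bit virtual address space covers 256 TB, less than the 320 TB planned for the first rack, which is consistent with the need for a bridge that maps windows of the pool rather than the whole of it.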


Software overview

Two major software projects were created for The Machine. The first was an experimental version of Linux called Linux++, with the enhancements needed to configure the hardware and work with traditional programming models. This included bridge configuration, access control and mapping using the DAX subsystem. In parallel, a new operating system (OS) called Carbon was announced, to be designed from first principles to take full advantage of an NVRAM-based computer. Primary workloads for The Machine included in-memory databases, Hadoop-style software, and real-time big data analytics. HPE claimed that a memory-driven computing design like The Machine could "improve speeds by up to 8000x compared to conventional systems".

In the prototype system, the fabric-attached memory was organised by a "top of rack" management server component called The Librarian. The Librarian divided the memory into "shelves" of 8 GB "books", and hardware protections could be configured on book boundaries. A finer-grained 64 KB "booklet" was also supported. The mapping of memory was handled by the OS, while the access controls for the memory were configured by the management infrastructure of The Machine system as a whole. Software needed to be aware that reads from fabric-attached memory could fail with synchronous errors, whilst writes could fail with asynchronous errors. On Linux, a memory error was reported to the process with the SIGBUS signal. Programming model and data structure changes were also explored, including changes to thread libraries and heap data structures to make them resilient to non-volatile memory failure modes.
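
The book and booklet granularities translate into simple index arithmetic. The sketch below is a hypothetical illustration in Python, not The Librarian's actual interface: it assumes binary prefixes, a flat byte offset into the fabric-attached memory, and invented names (`Location`, `locate`).

```python
from typing import NamedTuple

BOOK_SIZE = 8 * 2**30      # 8 GB "book" (binary prefix assumed)
BOOKLET_SIZE = 64 * 2**10  # 64 KB "booklet" (binary prefix assumed)


class Location(NamedTuple):
    book: int      # index of the book containing the byte
    booklet: int   # index of the booklet within that book
    offset: int    # byte offset within that booklet


def locate(byte_offset: int) -> Location:
    """Split a flat byte offset into (book, booklet, offset) indices."""
    if byte_offset < 0:
        raise ValueError("offset must be non-negative")
    book, within_book = divmod(byte_offset, BOOK_SIZE)
    booklet, offset = divmod(within_book, BOOKLET_SIZE)
    return Location(book, booklet, offset)


BOOKLETS_PER_BOOK = BOOK_SIZE // BOOKLET_SIZE
BOOKS_IN_PROTOTYPE = (160 * 2**40) // BOOK_SIZE  # 160 TB prototype pool

print(BOOKLETS_PER_BOOK)    # 131072
print(BOOKS_IN_PROTOTYPE)   # 20480
print(locate(BOOK_SIZE + BOOKLET_SIZE + 5))  # Location(book=1, booklet=1, offset=5)
```

With these assumed sizes, the 160 TB prototype pool would hold 20,480 books, each containing 131,072 booklets.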


History

A few years after HP’s rediscovery of the memristor, the newly appointed CTO of HP, Martin Fink, created an HP Labs project to build a computer system based on memristors to tackle the slowing of Moore's law. He announced the project at HP’s Discover event in the summer of 2014. Some of the ideas of The Machine also came from Dragonhawk system designs. Three-quarters of HP Labs’ 200 staff were focused on the hardware and software of The Machine. Speaking to Bloomberg, HP said it would commercialize The Machine within a few years, “or fall on its face trying.” Kirk Bresniker served as Chief Architect, Keith Packard was hired to work on the Linux enhancements, and Bdale Garbee was hired to manage open source development.

In 2015, Hewlett-Packard split into two companies, HP Inc. and Hewlett Packard Enterprise (HPE), with The Machine project assigned to the latter. In late 2016, Martin Fink retired as HPE CTO. Fink's retirement announcement also said that Hewlett Packard Labs staff would be moved into the Enterprise product group to "align our R&D work on The Machine with the business".

By early 2017, Hewlett Packard Labs had a slide saying that the project's aim was “to demonstrate progress, not develop products” and that they would “collaborate to deliver differentiating Machine value into existing architectures as well as disruptive architectures”. BleepingComputer said: "In other words, The Machine is no longer a product in its own right. Instead it will provide technologies that will be used in other HPE products going forward." HPE restructured its pure R&D organization and placed it in the products group. Yahoo! Finance reported that the Machine prototype "remains years away from being commercially available". In 2018, HPE stated that the project had reached the stage where it needed commercial applications from customers in the next step of its evolution.

