NASA Advanced Supercomputing Facility
   HOME

TheInfoList



OR:

The NASA Advanced Supercomputing (NAS) Division is located at
NASA Ames Research Center The Ames Research Center (ARC), also known as NASA Ames, is a major NASA research center at Moffett Federal Airfield in California's Silicon Valley. It was founded in 1939 as the second National Advisory Committee for Aeronautics (NACA) laborat ...
,
Moffett Field Moffett Federal Airfield , also known as Moffett Field, is a joint civil-military airport located in an unincorporated part of Santa Clara County, California, United States, between northern Mountain View and northern Sunnyvale. On November ...
in the heart of
Silicon Valley Silicon Valley is a region in Northern California that is a global center for high technology and innovation. Located in the southern part of the San Francisco Bay Area, it corresponds roughly to the geographical area of the Santa Clara Valley ...
in
Mountain View, California Mountain View is a city in Santa Clara County, California, United States, part of the San Francisco Bay Area. Named for its views of the Santa Cruz Mountains, the population was 82,376 at the 2020 United States census, 2020 census. Mountain V ...
. It has been the major supercomputing and modeling and simulation resource for NASA missions in aerodynamics, space exploration, studies in weather patterns and ocean currents, and space shuttle and aircraft design and development for almost forty years. The facility currently houses the
petascale Petascale computing refers to computing systems capable of performing at least 1 quadrillion (10^15) floating-point operations per second (FLOPS). These systems are often called petaflops systems and represent a significant leap from traditional ...
Pleiades The Pleiades (), also known as Seven Sisters and Messier 45 (M45), is an Asterism (astronomy), asterism of an open cluster, open star cluster containing young Stellar classification#Class B, B-type stars in the northwest of the constellation Tau ...
, Aitken, and Electra
supercomputer A supercomputer is a type of computer with a high level of performance as compared to a general-purpose computer. The performance of a supercomputer is commonly measured in floating-point operations per second (FLOPS) instead of million instruc ...
s, as well as the terascale Endeavour supercomputer. The systems are based on SGI and HPE architecture with
Intel Intel Corporation is an American multinational corporation and technology company headquartered in Santa Clara, California, and Delaware General Corporation Law, incorporated in Delaware. Intel designs, manufactures, and sells computer compo ...
processors. The main building also houses disk and archival tape storage systems with a capacity of over an
exabyte The byte is a unit of digital information that most commonly consists of eight bits. Historically, the byte was the number of bits used to encode a single character of text in a computer and for this reason it is the smallest addressable un ...
of data, the hyperwall visualization system, and one of the largest
InfiniBand InfiniBand (IB) is a computer networking communications standard used in high-performance computing that features very high throughput and very low latency. It is used for data interconnect both among and within computers. InfiniBand is also used ...
network fabrics in the world. The NAS Division is part of NASA's Exploration Technology Directorate and operates NASA's High-End Computing Capability (HECC) Project.


History


Founding

In the mid-1970s, a group of aerospace engineers at Ames Research Center began to look into transferring
aerospace Aerospace is a term used to collectively refer to the atmosphere and outer space. Aerospace activity is very diverse, with a multitude of commercial, industrial, and military applications. Aerospace engineering consists of aeronautics and astron ...
research and development from costly and time-consuming wind tunnel testing to simulation-based design and engineering using
computational fluid dynamics Computational fluid dynamics (CFD) is a branch of fluid mechanics that uses numerical analysis and data structures to analyze and solve problems that involve fluid dynamics, fluid flows. Computers are used to perform the calculations required ...
(CFD) models on supercomputers more powerful than those commercially available at the time. This endeavor was later named the Numerical Aerodynamic Simulator (NAS) Project and the first computer was installed at the Central Computing Facility at Ames Research Center in 1984. Groundbreaking on a state-of-the-art supercomputing facility took place on March 14, 1985 in order to construct a building where CFD experts, computer scientists, visualization specialists, and network and storage engineers could be under one roof in a collaborative environment. In 1986, NAS transitioned into a full-fledged NASA division and in 1987, NAS staff and equipment, including a second supercomputer, a
Cray-2 The Cray-2 is a supercomputer with four vector processors made by Cray Research starting in 1985. At 1.9 GFLOPS peak performance, it was the fastest machine in the world when it was released, replacing the Cray X-MP in that spot. It was, ...
named Navier, were relocated to the new facility, which was dedicated on March 9, 1987. In 1995, NAS changed its name to the Numerical Aerospace Simulation Division, and in 2001 to the name it has today.


Industry leading innovations

NAS has been one of the leading innovators in the supercomputing world, developing many tools and processes that became widely used in commercial supercomputing. Some of these firsts include: * Installed
Cray Cray Inc., a subsidiary of Hewlett Packard Enterprise, is an American supercomputer manufacturer headquartered in Seattle, Washington. It also manufactures systems for data storage and analytics. Several Cray supercomputer systems are listed ...
's first
UNIX Unix (, ; trademarked as UNIX) is a family of multitasking, multi-user computer operating systems that derive from the original AT&T Unix, whose development started in 1969 at the Bell Labs research center by Ken Thompson, Dennis Ritchie, a ...
-based supercomputer * Implemented a client/server model linking the supercomputers and workstations together to distribute computation and visualization * Developed and implemented a high-speed
wide area network A wide area network (WAN) is a telecommunications network that extends over a large geographic area. Wide area networks are often established with leased telecommunication circuits. Businesses, as well as schools and government entities, use ...
(WAN) connecting supercomputing resources to remote users (AEROnet) * Co-developed NASA's first method for dynamic distribution of production loads across supercomputing resources in geographically distant locations (NASA Metacenter) * Implemented
TCP/IP The Internet protocol suite, commonly known as TCP/IP, is a framework for organizing the communication protocols used in the Internet and similar computer networks according to functional criteria. The foundational protocols in the suite are ...
networking in a supercomputing environment * Developed a
batch-queuing system A job scheduler is a computer application for controlling unattended background program execution of jobs. This is commonly called batch scheduling, as execution of non-interactive jobs is often called batch processing, though traditional ''job' ...
for supercomputers (NQS) * Developed a UNIX-based hierarchical mass storage system (NAStore) * Co-developed (with SGI) the first
IRIX IRIX (, ) is a discontinued operating system developed by Silicon Graphics (SGI) to run on the company's proprietary MIPS architecture, MIPS workstations and servers. It is based on UNIX System V with Berkeley Software Distribution, BSD extensio ...
single system image In distributed computing, a single system image (SSI) cluster is a cluster (computing), cluster of machines that appears to be one single system. The concept is often considered synonymous with that of a distributed operating system, but a single i ...
256-, 512-, and 1,024-processor supercomputers * Co-developed (with SGI) the first
Linux Linux ( ) is a family of open source Unix-like operating systems based on the Linux kernel, an kernel (operating system), operating system kernel first released on September 17, 1991, by Linus Torvalds. Linux is typically package manager, pac ...
-based single-system image 512- and 1,024-processor supercomputers * A 2,048-processor shared memory environment


Software development

NAS develops and adapts software in order to "complement and enhance the work performed on its supercomputers, including software for systems support, monitoring systems, security, and scientific visualization," and often provides this software to its users through the NASA Open Source Agreement (NOSA). A few of the important software developments from NAS include: * NAS Parallel Benchmarks (NPB) were developed to evaluate highly parallel supercomputers and mimic the characteristics of large-scale CFD applications. *
Portable Batch System Portable Batch System (or simply PBS) is the name of computer software that performs job scheduling. Its primary task is to allocate computational tasks, i.e., batch jobs, among the available computing resources. It is often used in conjunction ...
(PBS) was the first batch queuing software for parallel and distributed systems. It was released commercially in 1998 and is still widely used in the industry. * PLOT3D was created in 1982 and is a computer graphics program still used today to visualize the grids and solutions of structured CFD datasets. The PLOT3D team was awarded the fourth largest prize ever given by the NASA Space Act Program for the development of their software, which revolutionized scientific visualization and analysis of 3D CFD solutions. * FAST (Flow Analysis Software Toolkit) is a software environment based on PLOT3D and used to analyze data from numerical simulations which, though tailored to CFD visualization, can be used to visualize almost any
scalar Scalar may refer to: *Scalar (mathematics), an element of a field, which is used to define a vector space, usually the field of real numbers *Scalar (physics), a physical quantity that can be described by a single element of a number field such a ...
and
vector Vector most often refers to: * Euclidean vector, a quantity with a magnitude and a direction * Disease vector, an agent that carries and transmits an infectious pathogen into another living organism Vector may also refer to: Mathematics a ...
data. It was awarded the NASA Software of the Year Award in 1995. * INS2D and INS3D are codes developed by NAS engineers to solve incompressible Navier-Stokes equations in two- and three-dimensional generalized coordinates, respectively, for steady-state and time varying flow. In 1994, INS3D won the NASA Software of the Year Award. * Cart3D is a high-fidelity analysis package for aerodynamic design which allows users to perform automated CFD simulations on complex forms. It is still used at NASA and other government agencies to test conceptual and preliminary air- and spacecraft designs. The Cart3D team won the NASA Software of the Year award in 2002. * OVERFLOW (Overset grid flow solver) is a software package developed to simulate fluid flow around solid bodies using Reynolds-averaged, Navier-Stokes CFD equations. It was the first general-purpose NASA CFD code for overset (Chimera) grid systems and was released outside of NASA in 1992. * Chimera Grid Tools (CGT) is a software package containing a variety of tools for the Chimera overset grid approach for solving CFD problems of surface and volume grid generation; as well as grid manipulation, smoothing, and projection. * HiMAP A three level (Intra/Inter discipline, multicase) parallel HIgh fidelity Multidisciplinary (Fluids, Structures, Controls) Analysis Process,


Supercomputing history

Since its construction in 1987, the NASA Advanced Supercomputing Facility has housed and operated some of the most powerful supercomputers in the world. Many of these computers include
testbed A testbed (also spelled test bed) is a platform for conducting rigorous, transparent, and replicable testing of scientific theories, computing tools, and new technologies. The term is used across many disciplines to describe experimental research ...
systems built to test new architecture, hardware, or networking set-ups that might be utilized on a larger scale. Peak performance is shown in Floating Point Operations Per Second (FLOPS).


Storage resources


Disk storage

In 1987, NAS partnered with the
Defense Advanced Research Projects Agency The Defense Advanced Research Projects Agency (DARPA) is a research and development agency of the United States Department of Defense responsible for the development of emerging technologies for use by the military. Originally known as the Adva ...
(DARPA) and the
University of California, Berkeley The University of California, Berkeley (UC Berkeley, Berkeley, Cal, or California), is a Public university, public Land-grant university, land-grant research university in Berkeley, California, United States. Founded in 1868 and named after t ...
in the Redundant Array of Inexpensive Disks (RAID) project, which sought to create a storage technology that combined multiple disk drive components into one logical unit. Completed in 1992, the RAID project lead to the distributed data storage technology used today. The NAS facility currently houses disk mass storage on an SGI parallel DMF cluster with high-availability software consisting of four 32-processor front-end systems, which are connected to the supercomputers and the archival tape storage system. The system has 192 GB of memory per front-end and 7.6 petabytes (PB) of disk cache. Data stored on disk is regularly migrated to the tape archival storage systems at the facility to free up space for other user projects being run on the supercomputers.


Archive and storage systems

In 1987, NAS developed the first UNIX-based hierarchical mass storage system, named NAStore. It contained two
StorageTek Storage Technology Corporation (StorageTek or STK, earlier STC) was a data storage technology company headquartered in Louisville, Colorado. New products include data retention systems, which it calls "information lifecycle management" (ILM). It ...
4400 cartridge tape robots, each with a storage capacity of approximately 1.1 terabytes, cutting tape retrieval time from 4 minutes to 15 seconds. With the installation of the Pleiades supercomputer in 2008, the StorageTek systems that NAS had been using for 20 years were unable to meet the needs of the greater number of users and increasing file sizes of each project's
dataset A data set (or dataset) is a collection of data. In the case of tabular data, a data set corresponds to one or more database tables, where every column of a table represents a particular variable, and each row corresponds to a given record o ...
s. In 2009, NAS brought in
Spectra Logic Spectra Logic Corporation is a computer data storage company based in Boulder, Colorado in the United States. The company builds backup and archive technology for secondary storage to protect data after it migrates from primary disk. Spectra Logic's ...
T950 robotic tape systems which increased the maximum capacity at the facility to 16 petabytes of space available for users to archive their data from the supercomputers. As of March 2019, the NAS facility increased the total archival storage capacity of the Spectra Logic tape libraries to 1,048 petabytes (or 1 exabyte) with 35% compression. SGI's Data Migration Facility (DMF) and OpenVault manage disk-to-tape data migration and tape-to-disk de-migration for the NAS facility. As of March 2019, there is over 110 petabytes of unique data stored in the NAS archival storage system.


Data visualization systems

In 1984, NAS purchased 25 SGI IRIS 1000 graphics terminals, the beginning of their long partnership with the Silicon Valley–based company, which made a significant impact on post-processing and visualization of CFD results run on the supercomputers at the facility. Visualization became a key process in the analysis of simulation data run on the supercomputers, allowing engineers and scientists to view their results spatially and in ways that allowed for a greater understanding of the CFD forces at work in their designs.


The hyperwall

In 2002, NAS visualization experts developed a visualization system called the "hyperwall" which included 49 linked
LCD A liquid-crystal display (LCD) is a flat-panel display or other electronically modulated optical device that uses the light-modulating properties of liquid crystals combined with polarizers to display information. Liquid crystals do not em ...
panels that allowed scientists to view complex
dataset A data set (or dataset) is a collection of data. In the case of tabular data, a data set corresponds to one or more database tables, where every column of a table represents a particular variable, and each row corresponds to a given record o ...
s on a large, dynamic seven-by-seven screen array. Each screen had its own processing power, allowing each one to display, process, and share datasets so that a single image could be displayed across all screens or configured so that data could be displayed in "cells" like a giant visual spreadsheet. The second generation "hyperwall-2" was developed in 2008 by NAS in partnership with Colfax International and is made up of 128 LCD screens arranged in an 8x16 grid 23 feet wide by 10 feet tall. It is capable of rendering one quarter billion
pixels In digital imaging, a pixel (abbreviated px), pel, or picture element is the smallest addressable element in a raster image, or the smallest addressable element in a dot matrix display device. In most digital display devices, pixels are the sma ...
, making it the highest resolution scientific visualization system in the world. It contains 128 nodes, each with two quad-core
AMD Advanced Micro Devices, Inc. (AMD) is an American multinational corporation and technology company headquartered in Santa Clara, California and maintains significant operations in Austin, Texas. AMD is a hardware and fabless company that de ...
Opteron Opteron is AMD's x86 former server and workstation Microprocessor, processor line, and was the first processor which supported the AMD64 instruction set architecture (known generically as x86-64). It was released on April 22, 2003, with the ''Sl ...
(
Barcelona Barcelona ( ; ; ) is a city on the northeastern coast of Spain. It is the capital and largest city of the autonomous community of Catalonia, as well as the second-most populous municipality of Spain. With a population of 1.6 million within c ...
) processors and a
Nvidia Nvidia Corporation ( ) is an American multinational corporation and technology company headquartered in Santa Clara, California, and incorporated in Delaware. Founded in 1993 by Jensen Huang (president and CEO), Chris Malachowsky, and Curti ...
GeForce 480 GTX
graphics processing unit A graphics processing unit (GPU) is a specialized electronic circuit designed for digital image processing and to accelerate computer graphics, being present either as a discrete video card or embedded on motherboards, mobile phones, personal ...
(GPU) for a dedicated peak processing power of 128 teraflops across the entire system—100 times more powerful than the original hyperwall. The hyperwall-2 is directly connected to the Pleiades supercomputer's filesystem over an InfiniBand network, which allows the system to read data directly from the filesystem without needing to copy files onto the hyperwall-2's memory. In 2014, the hyperwall was upgraded with new hardware: 256 Intel Xeon "Ivy Bridge" processors and 128 NVIDIA Geforce 780 Ti GPUs. The upgrade increased the system's peak processing power from 9 teraflops to 57 teraflops, and now has nearly 400 gigabytes of graphics memory. In 2020, the hyperwall was further upgraded with new hardware: 256 Intel Xeon Platinum 8268 (Cascade Lake) processors and 128 NVIDIA Quadro RTX 6000 GPUs with a total of 3.1 terabytes of graphics memory. The upgrade increased the system's peak processing power from 57 teraflops to 512 teraflops.


Concurrent visualization

An important feature of the hyperwall technology developed at NAS is that it allows for "concurrent visualization" of data, which enables scientists and engineers to analyze and interpret data while the calculations are running on the supercomputers. Not only does this show the current state of the calculation for runtime monitoring, steering, and termination, but it also "allows higher temporal resolution visualization compared to post-processing because I/O and storage space requirements are largely obviated... ndmay show features in a simulation that would otherwise not be visible." The NAS visualization team developed a configurable concurrent
pipeline A pipeline is a system of Pipe (fluid conveyance), pipes for long-distance transportation of a liquid or gas, typically to a market area for consumption. The latest data from 2014 gives a total of slightly less than of pipeline in 120 countries ...
for use with a massively parallel forecast model run on the Columbia supercomputer in 2005 to help predict the Atlantic hurricane season for the
National Hurricane Center The National Hurricane Center (NHC) is the division of the United States' NOAA/National Weather Service responsible for tracking and predicting tropical weather systems between the IERS Reference Meridian, Prime Meridian and the 140th meridian ...
. Because of the deadlines to submit each of the forecasts, it was important that the visualization process would not significantly impede the simulation or cause it to fail.


References


External links


NASA Advanced Supercomputing Resources


NASA Advanced Supercomputing (NAS) Division homepage













Other Online Resources


NASA Official Website



NASA's High-End Computing Capability Project homepage

TOP500 official website
{{authority control Ames Research Center Supercomputer sites