The Cornell box is a test scene designed to evaluate the accuracy of
rendering software
Software consists of computer programs that instruct the Execution (computing), execution of a computer. Software also includes design documents and specifications.
The history of software is closely tied to the development of digital comput ...
by comparing a
rendered image with a
photograph
A photograph (also known as a photo, or more generically referred to as an ''image'' or ''picture'') is an image created by light falling on a photosensitivity, photosensitive surface, usually photographic film or an electronic image sensor. Th ...
of a real-world model under the same
lighting
Lighting or illumination is the deliberate use of light to achieve practical or aesthetic effects. Lighting includes the use of both artificial light sources like lamps and light fixtures, as well as natural illumination by capturing daylight. ...
conditions. It has become a
commonly used 3D test model in
computer graphics
Computer graphics deals with generating images and art with the aid of computers. Computer graphics is a core technology in digital photography, film, video games, digital art, cell phone and computer displays, and many specialized applications. ...
research.
The box was created by Cindy M. Goral, Kenneth E. Torrance,
Donald P. Greenberg
Donald Peter Greenberg (born 1934) is the Jacob Gould Schurman Professor of Computer Graphics at Cornell University.
Early life
Greenberg earned his undergraduate and Ph.D. degrees from Cornell University, where he played on the tennis and soccer ...
, and Bennett Battaile at the ''
Cornell University
Cornell University is a Private university, private Ivy League research university based in Ithaca, New York, United States. The university was co-founded by American philanthropist Ezra Cornell and historian and educator Andrew Dickson W ...
Program of Computer Graphics'' as part of their research on
radiosity and
diffuse interreflection. Their findings were published in the paper ''Modeling the Interaction of Light Between Diffuse Surfaces'', presented at
SIGGRAPH
SIGGRAPH (Special Interest Group on Computer Graphics and Interactive Techniques) is an annual conference centered around computer graphics organized by ACM, starting in 1974 in Boulder, CO. The main conference has always been held in North ...
'84.
[History of the Cornell Box](_blank)
/ref>[Cindy M. Goral, Kenneth E. Torrance, Donald P. Greenberg, and Bennett Battaile. ]
Modeling the Interaction of Light Between Diffuse Surfaces
''. SIGGRAPH 1984.
Reference Model
A physical model of the Cornell box is constructed and photographed using calibrated equipment. Photographic image references shared via Cornell University website[Cornell Box Data](_blank)
/ref> were captured using a liquid-cooled Photometrics PXL1300L CCD camera
A charge-coupled device (CCD) is an integrated circuit containing an array of linked, or coupled, capacitors. Under the control of an external circuit, each capacitor can transfer its electric charge to a neighboring capacitor. CCD sensors are a ...
with a precision of 12 bits. Seven narrow-band filters
Filtration is a physical process that separates solid matter and fluid from a mixture.
Filter, filtering, filters or filtration may also refer to:
Science and technology
Computing
* Filter (higher-order function), in functional programming
* Fil ...
are employed to obtain a coarse sampling across the visible spectrum
The visible spectrum is the spectral band, band of the electromagnetic spectrum that is visual perception, visible to the human eye. Electromagnetic radiation in this range of wavelengths is called ''visible light'' (or simply light).
The optica ...
. To enhance accuracy, dark current is subtracted from the images, and flat-field correction is applied to compensate for cosine fall-off and lens fall-off. The precise settings of the scene are measured
Measurement is the quantification of attributes of an object or event, which can be used to compare with other objects or events.
In other words, measurement is a process of determining how large or small a physical quantity is as compared to ...
, including the emission spectrum
The emission spectrum of a chemical element or chemical compound is the Spectrum (physical sciences), spectrum of frequencies of electromagnetic radiation emitted due to electrons making a atomic electron transition, transition from a high energ ...
of the light source
Light, visible light, or visible radiation is electromagnetic radiation that can be visual perception, perceived by the human eye. Visible light spans the visible spectrum and is usually defined as having wavelengths in the range of 400– ...
, reflectance spectra of all surface
A surface, as the term is most generally used, is the outermost or uppermost layer of a physical object or space. It is the portion or region of the object that can first be perceived by an observer using the senses of sight and touch, and is ...
s, and the exact position and dimensions of objects, walls, light sources, and the camera.
A matching virtual 3D scene
This is a glossary of terms relating to computer graphics.
For more general computer hardware terms, see glossary of computer hardware terms.
0–9
A
B
...
is created, and a digital image
A digital image is an image composed of picture elements, also known as pixels, each with '' finite'', '' discrete quantities'' of numeric representation for its intensity or gray level that is an output from its two-dimensional functions f ...
is generated during the rendering process for comparison with the reference photograph. The comparison helps evaluate the accuracy of rendering algorithms, particularly in handling global illumination
Global illumination (GI), or indirect illumination, is a group of algorithms used in 3D computer graphics that are meant to add more realistic lighting
Lighting or illumination is the deliberate use of light to achieve practical or aest ...
, radiosity, and light transport. The Cornell box is designed to demonstrate '' diffuse interreflection''. Light reflecting off the red and green walls subtly tints the adjacent white walls, demonstrating complex global illumination effects.
Scene Configuration
The basic environment consists of:
* A single light source
Light, visible light, or visible radiation is electromagnetic radiation that can be visual perception, perceived by the human eye. Visible light spans the visible spectrum and is usually defined as having wavelengths in the range of 400– ...
* A green right wall
* A red left wall
* A white back wall, floor, and ceiling
Objects are commonly placed inside the box to study their interaction with light. The original configuration included two boxes, while subsequent versions introduced a reflective
Reflection is the change in direction of a wavefront at an interface between two different media so that the wavefront returns into the medium from which it originated. Common examples include the reflection of light, sound and water waves. The ...
mirror
A mirror, also known as a looking glass, is an object that Reflection (physics), reflects an image. Light that bounces off a mirror forms an image of whatever is in front of it, which is then focused through the lens of the eye or a camera ...
sphere
A sphere (from Ancient Greek, Greek , ) is a surface (mathematics), surface analogous to the circle, a curve. In solid geometry, a sphere is the Locus (mathematics), set of points that are all at the same distance from a given point in three ...
and a refractive
In physics, refraction is the redirection of a wave as it passes from one medium to another. The redirection can be caused by the wave's change in speed or by a change in the medium. Refraction of light is the most commonly observed phenome ...
glass
Glass is an amorphous (non-crystalline solid, non-crystalline) solid. Because it is often transparency and translucency, transparent and chemically inert, glass has found widespread practical, technological, and decorative use in window pane ...
sphere, commonly used in ray tracing research.
History
The Original Cornell Box
The original Cornell Box was described by Cindy M. Goral, Kenneth E. Torrance, and Donald P. Greenberg in their 1984 paper titled ''Modeling the Interaction of Light Between Diffuse Surfaces'', presented at SIGGRAPH '84.
In this initial version, the interior was painted in red, blue, and gray, and no occluding objects were placed inside the box. Rather than placing a light source inside, the box was illuminated indirectly using a set of lights and a white diffuse surface.
Cornell_box_render_SIGGRAPH'84.png , The rendered image is on the left, and photo reference on the right.
The ray tracing program was written in C on a VAX-11/780
The VAX-11 is a discontinued family of 32-bit superminicomputers, running the Virtual Address eXtension (VAX) instruction set architecture (ISA), developed and manufactured by Digital Equipment Corporation (DEC). Development began in 1976. In ad ...
superminicomputer
A superminicomputer, colloquially supermini, is a high-end minicomputer. The term is used to distinguish the emerging 32-bit architecture midrange computers introduced in the mid to late 1970s from the classical 16-bit systems that preceded them ...
.
Cornell_box_setup_1984.png , Sketch of the scene configuration from the same paper with Cornell box referred to as a ''test cube''.
----
Hemi-Cube Form Factors
This simulation of the Cornell Box was carried out by Michael F. Cohen and Donald P. Greenberg for their 1985 paper ''The Hemi-Cube: A Radiosity Solution for Complex Environments'' presented at SIGGRAPH '85.[Michael F. Cohen, Donald P. Greenberg. ]
The Hemi-Cube: A Radiosity Solution for Complex Environments
'. SIGGRAPH 1985, Vol. 19, No. 3, pp. 31-40. The hemi-cube technique allowed form factors to be calculated using scan conversion algorithms, which were supported by hardware at the time, and made it possible to calculate shadows from occluding objects inside the scene.
This version of the Cornell Box was the first to feature objects placed inside — a short block on the left side, a tall block on the right side, and a light source in the center of the 'ceiling'. This configuration matches the scene data shared by Cornell University discussed in Scene data (section). This layout became a standard reference and was widely reproduced, but in many rendered versions of this scene, the arrangement of the blocks is often mirrored.
----
Spherical Harmonics
François X. Sillion, James Arvo, Stephen Westin, and Donald P. Greenberg made significant contributions to global illumination in computer graphics, particularly through the use of spherical harmonics
In mathematics and physical science, spherical harmonics are special functions defined on the surface of a sphere. They are often employed in solving partial differential equations in many scientific fields. The table of spherical harmonics co ...
.
In their 1991 paper presented at SIGGRAPH '91, ''A Global Illumination Solution for General Reflectance Distributions'', they demonstrated a method that extended light transfer simulations to handle complex reflectance properties beyond ideal diffuse or specular surfaces. This approach utilized spherical harmonic decomposition to encode bidirectional reflectance distribution function
The bidirectional reflectance distribution function (BRDF), symbol f_(\omega_,\, \omega_), is a function of four real variables that defines how light from a source is reflected off an Opacity (optics), opaque surface. It is employed in the optic ...
s (BRDFs) and directional intensity distributions, allowing for more accurate and efficient rendering of materials with complex reflectance characteristics. In this version of the Cornell box, the blocks are arranged in a flipped configuration, and the tall block features a mirror-like (aluminum) surface instead of a diffuse one. This modification does not align with the scene data shared by Cornell University and could contribute to the misconception described in Scene data (section).
The photographic and synthetic image references found in the SIGGRAPH '91 paper, in a specific subpage of the Cornell University website named 'Cornell Box Comparison' and in the officially provided reproducible data are all unique, even though they look fairly identical. Common misconceptions are discussed in the section below, but it is worth noting that while the synthetic image found in this section is entirely different from the 'officially' provided synthetic reference, the photographic reference closely resembles the 'official' photographic image discussed in Photographic images (section). At first glance, it is easy to notice the changed camera position, as well as the addition of a square object obstructing and absorbing light — a so-called 'flag
A flag is a piece of textile, fabric (most often rectangular) with distinctive colours and design. It is used as a symbol, a signalling device, or for decoration. The term ''flag'' is also used to refer to the graphic design employed, and fla ...
', often used in photography to achieve the opposite effect of ' bounce cards'. It was likely used to block distracting effects caused by the strong light source being directly visible by the camera. This absorbing object is not present in the officially provided photographic image (section).
Cornell University data and common misconceptions
Publicly released resources from Cornell University contain mixed data that has led to some misconceptions. Over the years, edits were made to the website,[The Cornell Box website](_blank)
/ref> but few details about these changes are provided, and no documentation exists regarding the modifications. The website was created before web archiving services became widely available, meaning some updates may not be accessible or documented.
Synthetic image
A rendered image reference was uploaded by the ''Cornell University Program of Computer Graphics''. The camera position and Cornell box configuration match the provided scene data, but short and tall blocks are mirrored horizontally. The synthetic image contains a shading artifact near the left corner of the short block, making it easily recognizable. The website provides no details on the ray tracing or rendering methods used. No publicly available photograph of the original reference exists, aside from the photographic images discussed in the section below. As a result, a rendered image with artifacts, which is over 30-years-old, is still often used as a reference for recreating the scene, despite significant advancements in ray tracing software and hardware. Some examples that can be found on this page were path traced with the unbiased Octane Render
Octane Render is an unbiased rendering application with real-time capability developed by graphics software company OTOY Inc.
Octane Render was the first commercially available unbiased path-tracer that fully utilized the GPU, allowing users ...
rendering system, but intentional biased inaccuracies had to be introduced to mimic the limitations of older software. Without these adjustments, modern results would be too accurate and would deviate significantly from the available synthetic reference.
It's a bit of a paradox now: this reference image, the Cornell Box itself, and its role in research that helped to pave the way for the advancements seen in modern ray tracers. These very same modern ray tracers must now be deliberately constrained when recreating this image, because they are capable of producing results that are far more accurate and realistic than what was initially possible.
The synthetic Cornell Box image reference was created at a time when rendering technology was still a work-in-progress. The limitations of the source data and the absence of the original non-synthetic photographic image reference are key reasons why the Cornell Box, in its currently available form, is no longer suitable for modern applications. While it doesn't sound like a paradox, it's quite natural that some test tools, assets, or references become outdated over time because they fulfilled their task or newer solutions became available. However, the fact that people recreate an old, already rendered image of an asset — originally created to test and improve things — because the original reference is unavailable does sound like a little paradox. Of course, it was a gesture from Cornell University to share the data used in the research and it's usually used recreationally by people who find it interesting (apparently).
Over the years, advancements in software and hardware have led to significant improvements allowing more accurate reference gathering and measurement. Today, even an average social media profile picture has a higher resolution than the provided synthetic reference. Despite this, the Cornell Box remains an important historical artifact and an enduring icon. In many ways, it is the 'Hello, World!' of ray tracing world.
Photographic images
Cornell University shared data that must be manually processed before being assembled into an image (e.g., through multispectral imaging
Multispectral imaging captures image data within specific wavelength ranges across the electromagnetic spectrum. The wavelengths may be separated by filters or detected with the use of instruments that are sensitive to particular wavelengths, ...
). Few details are provided, except for the camera used, the implementation of seven narrow-band filters (to capture multiple wavelengths of the visible spectrum), and applied post-processing (noise reduction and lens correction). The captured wavelengths are briefly mentioned in the filename
A filename or file name is a name used to uniquely identify a computer file in a file system. Different file systems impose different restrictions on filename lengths.
A filename may (depending on the file system) include:
* name – base ...
s, and seven different monochromatic
A monochrome or monochromatic image, object or palette is composed of one color (or values of one color). Images using only shades of grey are called grayscale (typically digital) or black-and-white (typically analog). In physics, mon ...
sets are available. The specific configuration and capture time of the data are not listed, except for the reference to the data being taken 'in its current configuration' — which refers to the box's configuration at the time of capture, and is fairly self-explanatory. The data has been available on the website since at least 1998.
The 'final' processed image is not available for download, and no official or unofficial renditions have been released. To display it, the original data must first be interpreted and assembled. Because no examples are accessible online, an image was crafted specifically for this article and released on Wikimedia Commons
Wikimedia Commons, or simply Commons, is a wiki-based Digital library, media repository of Open content, free-to-use images, sounds, videos and other media. It is a project of the Wikimedia Foundation.
Files from Wikimedia Commons can be used ...
, along with some additional context.
Images found below were carefully recreated using available data, and they are for illustrative purposes only. Different methods and techniques can be used, leading to a variety of possible results. It is not difficult to notice that the compiled image (the result of digitally processed data) does not match the synthetic image reference[Synthetic image reference provided by Cornell University discussed in Synthetic image section] (e.g., the tall block has a highly specular surface) or the scene data[Reference scene data provided by Cornell University and discussed in Scene data section] (e.g., the camera position has been offset to reduce a distracting reflection on the tall block).
Seven monochromatic datasets were captured at different visible spectrum wavelengths. Using information obtained from the filenames, we can determine the captured wavelengths: 700 nm, 650 nm, 600 nm, 550 nm, 500 nm, 450 nm, and 400 nm. These datasets can be combined to create red, green, and blue channels, which are then used to produce a polychromatic image. One method to create these channels is by assigning the closest corresponding colors on the visible spectrum. The channels were assigned as follows:
* Red channel: 700 nm, 650 nm, 600 nm
* Green channel: 550 nm, 500 nm
* Blue channel: 450 nm, 400 nm
Basic corrections were not applied, so no adjustments were made to contrast
Contrast may refer to:
Science
* Contrast (vision), the contradiction in form, colour and light between parts of an image
* Contrast (statistics), a combination of averages whose coefficients add up to zero, or the difference between two means
* ...
, saturation
Saturation, saturated, unsaturation or unsaturated may refer to:
Chemistry
*Saturated and unsaturated compounds, a classification of compounds related to their ability to resist addition reactions
** Degree of unsaturation
**Saturated fat or satu ...
, white balance
In photography and image processing, color balance is the global adjustment of the intensities of the colors (typically red, green, and blue primary colors). An important goal of this adjustment is to render specific colors – particularly neu ...
, etc. No advanced corrections (such as lens corrections for chromatic aberration
In optics, chromatic aberration (CA), also called chromatic distortion, color aberration, color fringing, or purple fringing, is a failure of a lens to focus all colors to the same point. It is caused by dispersion: the refractive index of the ...
) were performed. Minimal tonemapping was applied, and no pixels were cropped. Defective pixel
A defective pixel or a dead pixel is a pixel on a liquid crystal display (LCD) that is not functioning properly. The ISO standard ISO 13406-2 distinguishes between three different types of defective pixels, while hardware companies tend to have ...
s and capture errors were retouched, but the unedited version is also available. The same principles apply to the synthetic renders, which were generated with a slightly offset camera, as the precise data for this particular shot is not provided. The camera was not properly calibrated and is only vaguely aligned with the short block.
Scene data
The original geometry data supplied by Cornell University clearly defines the positions of the objects, with the tall block on the right and the short block on the left, as shown in the Hemi-Cube Form Factors section. While 3D applications and algorithms may interpret axes differently, causing potential confusion, this does not affect the specific misconception in this case. We have direct access to the original data, and although we understand the correct configuration, it's unclear why a mirrored version (flipped along one axis) is more prominently used.
This change was first observed in the 'Spherical Harmonics
In mathematics and physical science, spherical harmonics are special functions defined on the surface of a sphere. They are often employed in solving partial differential equations in many scientific fields. The table of spherical harmonics co ...
' rendition of the box. Although the authors did not explicitly comment on this change, a plausible explanation could be that without the flipped layout and repositioned camera, the uneven reflection on the tall block would be too distracting. Additionally, if the green wall were visible in the reflection, it would not contrast as strongly as the red wall does on the white background.[Examples can be viewed in #Photographic images section.] Repositioning the objects and camera could be a more efficient solution than recreating and repainting the entire box.
An accurate method for setting up the camera according to the original specifications is to set the focal length to 35mm and the sensor size (film gate) to 25mm. Alternative setup methods, including the camera's position, rotation, and other relevant details, are provided in the description of the 3D Cornell Box model featured in this section. The alternative coordinates that could be used to reproduce the repositioned camera were never published.
Historical Context and Applications
The Cornell box was developed in the early 1980s as part of research into radiosity, one of the first rendering techniques capable of simulating diffuse interreflection. This work laid the foundation for physically based rendering
Physically based rendering (PBR) is a computer graphics approach that seeks to render images in a way that models the lights and surfaces with optics in the real world. It is often referred to as "Physically Based Lighting" or "Physically Based ...
methods, played a pivotal role in validating global illumination
Global illumination (GI), or indirect illumination, is a group of algorithms used in 3D computer graphics that are meant to add more realistic lighting
Lighting or illumination is the deliberate use of light to achieve practical or aest ...
and inspired advancements in ray tracing methods.
Later, the Cornell box was adapted for evaluating newer methods such as photon mapping
In computer graphics, photon mapping is a two-pass global illumination rendering algorithm developed by Henrik Wann Jensen between 1995 and 2001Jensen, H. (1996). ''Global Illumination using Photon Maps''. nlineAvailable at: http://graphics.sta ...
, introduced by Henrik Wann Jensen in the 1990s, which improved the simulation of caustics and indirect lighting. Modern applications of the Cornell box extend to testing Monte Carlo
Monte Carlo ( ; ; or colloquially ; , ; ) is an official administrative area of Monaco, specifically the Ward (country subdivision), ward of Monte Carlo/Spélugues, where the Monte Carlo Casino is located. Informally, the name also refers to ...
path tracing
Path tracing is a rendering algorithm in computer graphics that Simulation, simulates how light interacts with Physical object, objects, voxels, and Volumetric_path_tracing, participating media to generate realistic (''physically plausible'') R ...
, machine learning
Machine learning (ML) is a field of study in artificial intelligence concerned with the development and study of Computational statistics, statistical algorithms that can learn from data and generalise to unseen data, and thus perform Task ( ...
-based rendering techniques and other advanced approaches to rendering. It's also frequently used as a benchmark
Benchmark may refer to:
Business and economics
* Benchmarking, evaluating performance within organizations
* Benchmark price
* Benchmark (crude oil), oil-specific practices
Science and technology
* Experimental benchmarking, the act of defining a ...
and remains an essential test scene in many rendering engines
An engine or motor is a machine designed to convert one or more forms of energy into mechanical energy.
Available energy sources include potential energy (e.g. energy of the Earth's gravitational field as exploited in hydroelectric power gen ...
like Blender
A blender (sometimes called a mixer (from Latin ''mixus, the PPP of miscere eng. to Mix)'' or liquidiser in British English) is a kitchen and laboratory appliance used to mix, crush, purée or emulsify food and other substances. A stationary ...
, Unreal Engine
Unreal Engine (UE) is a 3D computer graphics game engine developed by Epic Games, first showcased in the 1998 first-person shooter video game '' Unreal''. Initially developed for PC first-person shooters, it has since been used in a variety of ...
, and Arnold
Arnold may refer to:
People
* Arnold (given name), a masculine given name
* Arnold (surname), a German and English surname
Places Australia
* Arnold, Victoria, a small town in the Australian state of Victoria
Canada
* Arnold, Nova Scotia
U ...
.
Since its creation in the previous millennium, many companies have developed their own updated and higher-quality references for internal use, as advancements in technology have allowed for more detailed and accurate measurements. Although primarily used for computer graphics, variations of the Cornell box have also been employed in acoustics research to model sound reflections and validate simulation methods.
See also
* Utah teapot
* Stanford bunny
* Stanford dragon
* List of common 3D test models
This is a list of models and meshes commonly used in 3D computer graphics for testing and demonstrating rendering algorithms and visual effects. Their use is important for comparing results, similar to the way standard test images are used in im ...
* Path tracing
Path tracing is a rendering algorithm in computer graphics that Simulation, simulates how light interacts with Physical object, objects, voxels, and Volumetric_path_tracing, participating media to generate realistic (''physically plausible'') R ...
* 3D modeling
In 3D computer graphics, 3D modeling is the process of developing a mathematical coordinate-based Computer representation of surfaces, representation of a surface of an object (inanimate or living) in Three-dimensional space, three dimensions vi ...
References
Annotations (𝓲)
External links
The Cornell Box website
{{Standard test item
3D graphics models
Test items
Cornell University