The Cornell box is a test scene designed to evaluate the accuracy of rendering

software Software consists of computer programs that instruct the Execution (computing), execution of a computer. Software also includes design documents and specifications. The history of software is closely tied to the development of digital comput ...

by comparing a rendered image with a

photograph A photograph (also known as a photo, or more generically referred to as an ''image'' or ''picture'') is an image created by light falling on a photosensitivity, photosensitive surface, usually photographic film or an electronic image sensor. Th ...

of a real-world model under the same

lighting Lighting or illumination is the deliberate use of light to achieve practical or aesthetic effects. Lighting includes the use of both artificial light sources like lamps and light fixtures, as well as natural illumination by capturing daylight. ...

conditions. It has become a commonly used 3D test model in

computer graphics Computer graphics deals with generating images and art with the aid of computers. Computer graphics is a core technology in digital photography, film, video games, digital art, cell phone and computer displays, and many specialized applications. ...

research. The box was created by Cindy M. Goral, Kenneth E. Torrance,

Donald P. Greenberg Donald Peter Greenberg (born 1934) is the Jacob Gould Schurman Professor of Computer Graphics at Cornell University. Early life Greenberg earned his undergraduate and Ph.D. degrees from Cornell University, where he played on the tennis and soccer ...

, and Bennett Battaile at the ''

Cornell University Cornell University is a Private university, private Ivy League research university based in Ithaca, New York, United States. The university was co-founded by American philanthropist Ezra Cornell and historian and educator Andrew Dickson W ...

Program of Computer Graphics'' as part of their research on radiosity and diffuse interreflection. Their findings were published in the paper ''Modeling the Interaction of Light Between Diffuse Surfaces'', presented at

SIGGRAPH SIGGRAPH (Special Interest Group on Computer Graphics and Interactive Techniques) is an annual conference centered around computer graphics organized by ACM, starting in 1974 in Boulder, CO. The main conference has always been held in North ...

'84.History of the Cornell Box
/ref>Cindy M. Goral, Kenneth E. Torrance, Donald P. Greenberg, and Bennett Battaile.
Modeling the Interaction of Light Between Diffuse Surfaces
''. SIGGRAPH 1984.

Reference Model

Cornell Box with 3 balls of different materials

A physical model of the Cornell box is constructed and photographed using calibrated equipment. Photographic image references shared via Cornell University websiteCornell Box Data
/ref> were captured using a liquid-cooled Photometrics PXL1300L

CCD camera A charge-coupled device (CCD) is an integrated circuit containing an array of linked, or coupled, capacitors. Under the control of an external circuit, each capacitor can transfer its electric charge to a neighboring capacitor. CCD sensors are a ...

with a precision of 12 bits. Seven narrow-band

filters Filtration is a physical process that separates solid matter and fluid from a mixture. Filter, filtering, filters or filtration may also refer to: Science and technology Computing * Filter (higher-order function), in functional programming * Fil ...

are employed to obtain a coarse sampling across the

visible spectrum The visible spectrum is the spectral band, band of the electromagnetic spectrum that is visual perception, visible to the human eye. Electromagnetic radiation in this range of wavelengths is called ''visible light'' (or simply light). The optica ...

. To enhance accuracy, dark current is subtracted from the images, and flat-field correction is applied to compensate for cosine fall-off and lens fall-off. The precise settings of the scene are

measured Measurement is the quantification of attributes of an object or event, which can be used to compare with other objects or events. In other words, measurement is a process of determining how large or small a physical quantity is as compared to ...

, including the

emission spectrum The emission spectrum of a chemical element or chemical compound is the Spectrum (physical sciences), spectrum of frequencies of electromagnetic radiation emitted due to electrons making a atomic electron transition, transition from a high energ ...

of the

light source Light, visible light, or visible radiation is electromagnetic radiation that can be visual perception, perceived by the human eye. Visible light spans the visible spectrum and is usually defined as having wavelengths in the range of 400– ...

, reflectance spectra of all

surface A surface, as the term is most generally used, is the outermost or uppermost layer of a physical object or space. It is the portion or region of the object that can first be perceived by an observer using the senses of sight and touch, and is ...

s, and the exact position and dimensions of objects, walls, light sources, and the camera. A matching virtual

3D scene This is a glossary of terms relating to computer graphics. For more general computer hardware terms, see glossary of computer hardware terms. 0–9 A B ...

is created, and a

digital image A digital image is an image composed of picture elements, also known as pixels, each with '' finite'', '' discrete quantities'' of numeric representation for its intensity or gray level that is an output from its two-dimensional functions f ...

is generated during the rendering process for comparison with the reference photograph. The comparison helps evaluate the accuracy of rendering algorithms, particularly in handling

global illumination Global illumination (GI), or indirect illumination, is a group of algorithms used in 3D computer graphics that are meant to add more realistic lighting Lighting or illumination is the deliberate use of light to achieve practical or aest ...

, radiosity, and light transport. The Cornell box is designed to demonstrate '' diffuse interreflection''. Light reflecting off the red and green walls subtly tints the adjacent white walls, demonstrating complex global illumination effects.

Scene Configuration

The basic environment consists of: * A single

* A green right wall * A red left wall * A white back wall, floor, and ceiling Objects are commonly placed inside the box to study their interaction with light. The original configuration included two boxes, while subsequent versions introduced a

reflective Reflection is the change in direction of a wavefront at an interface between two different media so that the wavefront returns into the medium from which it originated. Common examples include the reflection of light, sound and water waves. The ...

mirror A mirror, also known as a looking glass, is an object that Reflection (physics), reflects an image. Light that bounces off a mirror forms an image of whatever is in front of it, which is then focused through the lens of the eye or a camera ...

sphere A sphere (from Ancient Greek, Greek , ) is a surface (mathematics), surface analogous to the circle, a curve. In solid geometry, a sphere is the Locus (mathematics), set of points that are all at the same distance from a given point in three ...

and a

refractive In physics, refraction is the redirection of a wave as it passes from one medium to another. The redirection can be caused by the wave's change in speed or by a change in the medium. Refraction of light is the most commonly observed phenome ...

glass Glass is an amorphous (non-crystalline solid, non-crystalline) solid. Because it is often transparency and translucency, transparent and chemically inert, glass has found widespread practical, technological, and decorative use in window pane ...

sphere, commonly used in ray tracing research.

History

The Original Cornell Box

The original Cornell Box was described by Cindy M. Goral, Kenneth E. Torrance, and Donald P. Greenberg in their 1984 paper titled ''Modeling the Interaction of Light Between Diffuse Surfaces'', presented at SIGGRAPH '84. In this initial version, the interior was painted in red, blue, and gray, and no occluding objects were placed inside the box. Rather than placing a light source inside, the box was illuminated indirectly using a set of lights and a white diffuse surface.
Cornell_box_render_SIGGRAPH'84.png , The rendered image is on the left, and photo reference on the right.
The ray tracing program was written in C on a

VAX-11/780 The VAX-11 is a discontinued family of 32-bit superminicomputers, running the Virtual Address eXtension (VAX) instruction set architecture (ISA), developed and manufactured by Digital Equipment Corporation (DEC). Development began in 1976. In ad ...

superminicomputer A superminicomputer, colloquially supermini, is a high-end minicomputer. The term is used to distinguish the emerging 32-bit architecture midrange computers introduced in the mid to late 1970s from the classical 16-bit systems that preceded them ...

. Cornell_box_setup_1984.png , Sketch of the scene configuration from the same paper with Cornell box referred to as a ''test cube''. ----

Hemi-Cube Form Factors

This simulation of the Cornell Box was carried out by Michael F. Cohen and Donald P. Greenberg for their 1985 paper ''The Hemi-Cube: A Radiosity Solution for Complex Environments'' presented at SIGGRAPH '85.Michael F. Cohen, Donald P. Greenberg.
The Hemi-Cube: A Radiosity Solution for Complex Environments
'. SIGGRAPH 1985, Vol. 19, No. 3, pp. 31-40. The hemi-cube technique allowed form factors to be calculated using scan conversion algorithms, which were supported by hardware at the time, and made it possible to calculate shadows from occluding objects inside the scene. This version of the Cornell Box was the first to feature objects placed inside — a short block on the left side, a tall block on the right side, and a light source in the center of the 'ceiling'. This configuration matches the scene data shared by Cornell University discussed in Scene data (section). This layout became a standard reference and was widely reproduced, but in many rendered versions of this scene, the arrangement of the blocks is often mirrored.

----

Spherical Harmonics

François X. Sillion, James Arvo, Stephen Westin, and Donald P. Greenberg made significant contributions to global illumination in computer graphics, particularly through the use of

spherical harmonics In mathematics and physical science, spherical harmonics are special functions defined on the surface of a sphere. They are often employed in solving partial differential equations in many scientific fields. The table of spherical harmonics co ...

. In their 1991 paper presented at SIGGRAPH '91, ''A Global Illumination Solution for General Reflectance Distributions'', they demonstrated a method that extended light transfer simulations to handle complex reflectance properties beyond ideal diffuse or specular surfaces. This approach utilized spherical harmonic decomposition to encode

bidirectional reflectance distribution function The bidirectional reflectance distribution function (BRDF), symbol f_(\omega_,\, \omega_), is a function of four real variables that defines how light from a source is reflected off an Opacity (optics), opaque surface. It is employed in the optic ...

s (BRDFs) and directional intensity distributions, allowing for more accurate and efficient rendering of materials with complex reflectance characteristics. In this version of the Cornell box, the blocks are arranged in a flipped configuration, and the tall block features a mirror-like (aluminum) surface instead of a diffuse one. This modification does not align with the scene data shared by Cornell University and could contribute to the misconception described in Scene data (section). The photographic and synthetic image references found in the SIGGRAPH '91 paper, in a specific subpage of the Cornell University website named 'Cornell Box Comparison' and in the officially provided reproducible data are all unique, even though they look fairly identical. Common misconceptions are discussed in the section below, but it is worth noting that while the synthetic image found in this section is entirely different from the 'officially' provided synthetic reference, the photographic reference closely resembles the 'official' photographic image discussed in Photographic images (section). At first glance, it is easy to notice the changed camera position, as well as the addition of a square object obstructing and absorbing light — a so-called '

flag A flag is a piece of textile, fabric (most often rectangular) with distinctive colours and design. It is used as a symbol, a signalling device, or for decoration. The term ''flag'' is also used to refer to the graphic design employed, and fla ...

', often used in photography to achieve the opposite effect of ' bounce cards'. It was likely used to block distracting effects caused by the strong light source being directly visible by the camera. This absorbing object is not present in the officially provided photographic image (section).

Cornell University data and common misconceptions

Publicly released resources from Cornell University contain mixed data that has led to some misconceptions. Over the years, edits were made to the website,The Cornell Box website
/ref> but few details about these changes are provided, and no documentation exists regarding the modifications. The website was created before web archiving services became widely available, meaning some updates may not be accessible or documented.

Synthetic image

A rendered image reference was uploaded by the ''Cornell University Program of Computer Graphics''. The camera position and Cornell box configuration match the provided scene data, but short and tall blocks are mirrored horizontally. The synthetic image contains a shading artifact near the left corner of the short block, making it easily recognizable. The website provides no details on the ray tracing or rendering methods used. No publicly available photograph of the original reference exists, aside from the photographic images discussed in the section below. As a result, a rendered image with artifacts, which is over 30-years-old, is still often used as a reference for recreating the scene, despite significant advancements in ray tracing software and hardware. Some examples that can be found on this page were path traced with the unbiased

Octane Render Octane Render is an unbiased rendering application with real-time capability developed by graphics software company OTOY Inc. Octane Render was the first commercially available unbiased path-tracer that fully utilized the GPU, allowing users ...

rendering system, but intentional biased inaccuracies had to be introduced to mimic the limitations of older software. Without these adjustments, modern results would be too accurate and would deviate significantly from the available synthetic reference. It's a bit of a paradox now: this reference image, the Cornell Box itself, and its role in research that helped to pave the way for the advancements seen in modern ray tracers. These very same modern ray tracers must now be deliberately constrained when recreating this image, because they are capable of producing results that are far more accurate and realistic than what was initially possible. The synthetic Cornell Box image reference was created at a time when rendering technology was still a work-in-progress. The limitations of the source data and the absence of the original non-synthetic photographic image reference are key reasons why the Cornell Box, in its currently available form, is no longer suitable for modern applications. While it doesn't sound like a paradox, it's quite natural that some test tools, assets, or references become outdated over time because they fulfilled their task or newer solutions became available. However, the fact that people recreate an old, already rendered image of an asset — originally created to test and improve things — because the original reference is unavailable does sound like a little paradox. Of course, it was a gesture from Cornell University to share the data used in the research and it's usually used recreationally by people who find it interesting (apparently). Over the years, advancements in software and hardware have led to significant improvements allowing more accurate reference gathering and measurement. Today, even an average social media profile picture has a higher resolution than the provided synthetic reference. Despite this, the Cornell Box remains an important historical artifact and an enduring icon. In many ways, it is the 'Hello, World!' of ray tracing world.

Photographic images

Cornell University shared data that must be manually processed before being assembled into an image (e.g., through

multispectral imaging Multispectral imaging captures image data within specific wavelength ranges across the electromagnetic spectrum. The wavelengths may be separated by filters or detected with the use of instruments that are sensitive to particular wavelengths, ...

). Few details are provided, except for the camera used, the implementation of seven narrow-band filters (to capture multiple wavelengths of the visible spectrum), and applied post-processing (noise reduction and lens correction). The captured wavelengths are briefly mentioned in the

filename A filename or file name is a name used to uniquely identify a computer file in a file system. Different file systems impose different restrictions on filename lengths. A filename may (depending on the file system) include: * name – base ...

s, and seven different

monochromatic A monochrome or monochromatic image, object or palette is composed of one color (or values of one color). Images using only shades of grey are called grayscale (typically digital) or black-and-white (typically analog). In physics, mon ...

sets are available. The specific configuration and capture time of the data are not listed, except for the reference to the data being taken 'in its current configuration' — which refers to the box's configuration at the time of capture, and is fairly self-explanatory. The data has been available on the website since at least 1998. The 'final' processed image is not available for download, and no official or unofficial renditions have been released. To display it, the original data must first be interpreted and assembled. Because no examples are accessible online, an image was crafted specifically for this article and released on

Wikimedia Commons Wikimedia Commons, or simply Commons, is a wiki-based Digital library, media repository of Open content, free-to-use images, sounds, videos and other media. It is a project of the Wikimedia Foundation. Files from Wikimedia Commons can be used ...

, along with some additional context. Images found below were carefully recreated using available data, and they are for illustrative purposes only. Different methods and techniques can be used, leading to a variety of possible results. It is not difficult to notice that the compiled image (the result of digitally processed data) does not match the synthetic image referenceSynthetic image reference provided by Cornell University discussed in Synthetic image section (e.g., the tall block has a highly specular surface) or the scene dataReference scene data provided by Cornell University and discussed in Scene data section (e.g., the camera position has been offset to reduce a distracting reflection on the tall block). Seven monochromatic datasets were captured at different visible spectrum wavelengths. Using information obtained from the filenames, we can determine the captured wavelengths: 700 nm, 650 nm, 600 nm, 550 nm, 500 nm, 450 nm, and 400 nm. These datasets can be combined to create red, green, and blue channels, which are then used to produce a polychromatic image. One method to create these channels is by assigning the closest corresponding colors on the visible spectrum. The channels were assigned as follows: * Red channel: 700 nm, 650 nm, 600 nm * Green channel: 550 nm, 500 nm * Blue channel: 450 nm, 400 nm Basic corrections were not applied, so no adjustments were made to

contrast Contrast may refer to: Science * Contrast (vision), the contradiction in form, colour and light between parts of an image * Contrast (statistics), a combination of averages whose coefficients add up to zero, or the difference between two means * ...

saturation Saturation, saturated, unsaturation or unsaturated may refer to: Chemistry *Saturated and unsaturated compounds, a classification of compounds related to their ability to resist addition reactions ** Degree of unsaturation **Saturated fat or satu ...

white balance In photography and image processing, color balance is the global adjustment of the intensities of the colors (typically red, green, and blue primary colors). An important goal of this adjustment is to render specific colors – particularly neu ...

, etc. No advanced corrections (such as lens corrections for

chromatic aberration In optics, chromatic aberration (CA), also called chromatic distortion, color aberration, color fringing, or purple fringing, is a failure of a lens to focus all colors to the same point. It is caused by dispersion: the refractive index of the ...

) were performed. Minimal tonemapping was applied, and no pixels were cropped.

Defective pixel A defective pixel or a dead pixel is a pixel on a liquid crystal display (LCD) that is not functioning properly. The ISO standard ISO 13406-2 distinguishes between three different types of defective pixels, while hardware companies tend to have ...

s and capture errors were retouched, but the unedited version is also available. The same principles apply to the synthetic renders, which were generated with a slightly offset camera, as the precise data for this particular shot is not provided. The camera was not properly calibrated and is only vaguely aligned with the short block.

Scene data

The original geometry data supplied by Cornell University clearly defines the positions of the objects, with the tall block on the right and the short block on the left, as shown in the Hemi-Cube Form Factors section. While 3D applications and algorithms may interpret axes differently, causing potential confusion, this does not affect the specific misconception in this case. We have direct access to the original data, and although we understand the correct configuration, it's unclear why a mirrored version (flipped along one axis) is more prominently used. This change was first observed in the '

Spherical Harmonics In mathematics and physical science, spherical harmonics are special functions defined on the surface of a sphere. They are often employed in solving partial differential equations in many scientific fields. The table of spherical harmonics co ...

' rendition of the box. Although the authors did not explicitly comment on this change, a plausible explanation could be that without the flipped layout and repositioned camera, the uneven reflection on the tall block would be too distracting. Additionally, if the green wall were visible in the reflection, it would not contrast as strongly as the red wall does on the white background.Examples can be viewed in #Photographic images section. Repositioning the objects and camera could be a more efficient solution than recreating and repainting the entire box. An accurate method for setting up the camera according to the original specifications is to set the focal length to 35mm and the sensor size (film gate) to 25mm. Alternative setup methods, including the camera's position, rotation, and other relevant details, are provided in the description of the 3D Cornell Box model featured in this section. The alternative coordinates that could be used to reproduce the repositioned camera were never published.

Historical Context and Applications

The Cornell box was developed in the early 1980s as part of research into radiosity, one of the first rendering techniques capable of simulating diffuse interreflection. This work laid the foundation for

physically based rendering Physically based rendering (PBR) is a computer graphics approach that seeks to render images in a way that models the lights and surfaces with optics in the real world. It is often referred to as "Physically Based Lighting" or "Physically Based ...

methods, played a pivotal role in validating

and inspired advancements in ray tracing methods. Later, the Cornell box was adapted for evaluating newer methods such as

photon mapping In computer graphics, photon mapping is a two-pass global illumination rendering algorithm developed by Henrik Wann Jensen between 1995 and 2001Jensen, H. (1996). ''Global Illumination using Photon Maps''. nlineAvailable at: http://graphics.sta ...

, introduced by Henrik Wann Jensen in the 1990s, which improved the simulation of caustics and indirect lighting. Modern applications of the Cornell box extend to testing

Monte Carlo Monte Carlo ( ; ; or colloquially ; , ; ) is an official administrative area of Monaco, specifically the Ward (country subdivision), ward of Monte Carlo/Spélugues, where the Monte Carlo Casino is located. Informally, the name also refers to ...

path tracing Path tracing is a rendering algorithm in computer graphics that Simulation, simulates how light interacts with Physical object, objects, voxels, and Volumetric_path_tracing, participating media to generate realistic (''physically plausible'') R ...

machine learning Machine learning (ML) is a field of study in artificial intelligence concerned with the development and study of Computational statistics, statistical algorithms that can learn from data and generalise to unseen data, and thus perform Task ( ...

-based rendering techniques and other advanced approaches to rendering. It's also frequently used as a

benchmark Benchmark may refer to: Business and economics * Benchmarking, evaluating performance within organizations * Benchmark price * Benchmark (crude oil), oil-specific practices Science and technology * Experimental benchmarking, the act of defining a ...

and remains an essential test scene in many rendering

engines An engine or motor is a machine designed to convert one or more forms of energy into mechanical energy. Available energy sources include potential energy (e.g. energy of the Earth's gravitational field as exploited in hydroelectric power gen ...

Blender A blender (sometimes called a mixer (from Latin ''mixus, the PPP of miscere eng. to Mix)'' or liquidiser in British English) is a kitchen and laboratory appliance used to mix, crush, purée or emulsify food and other substances. A stationary ...

Unreal Engine Unreal Engine (UE) is a 3D computer graphics game engine developed by Epic Games, first showcased in the 1998 first-person shooter video game '' Unreal''. Initially developed for PC first-person shooters, it has since been used in a variety of ...

, and

Arnold Arnold may refer to: People * Arnold (given name), a masculine given name * Arnold (surname), a German and English surname Places Australia * Arnold, Victoria, a small town in the Australian state of Victoria Canada * Arnold, Nova Scotia U ...

. Since its creation in the previous millennium, many companies have developed their own updated and higher-quality references for internal use, as advancements in technology have allowed for more detailed and accurate measurements. Although primarily used for computer graphics, variations of the Cornell box have also been employed in acoustics research to model sound reflections and validate simulation methods.

References

Annotations (𝓲)

External links

The Cornell Box website
{{Standard test item 3D graphics models Test items Cornell University