Geometrical optics, or ray optics, is a model of
optics
Optics is the branch of physics that studies the behaviour and properties of light, including its interactions with matter and the construction of instruments that use or detect it. Optics usually describes the behaviour of visible, ultra ...
that describes
light
Light or visible light is electromagnetic radiation that can be perceived by the human eye. Visible light is usually defined as having wavelengths in the range of 400–700 nanometres (nm), corresponding to frequencies of 750–420 te ...
propagation in terms of ''
rays''. The ray in geometrical optics is an
abstraction
Abstraction in its main sense is a conceptual process wherein general rules and concepts are derived from the usage and classification of specific examples, literal ("real" or " concrete") signifiers, first principles, or other methods.
"An a ...
useful for approximating the paths along which light propagates under certain circumstances.
The simplifying assumptions of geometrical optics include that light rays:
* propagate in straight-line paths as they travel in a
homogeneous
Homogeneity and heterogeneity are concepts often used in the sciences and statistics relating to the uniformity of a substance or organism. A material or image that is homogeneous is uniform in composition or character (i.e. color, shape, siz ...
medium
* bend, and in particular circumstances may split in two, at the
interface
Interface or interfacing may refer to:
Academic journals
* ''Interface'' (journal), by the Electrochemical Society
* '' Interface, Journal of Applied Linguistics'', now merged with ''ITL International Journal of Applied Linguistics''
* '' Int ...
between two dissimilar
media
Media may refer to:
Communication
* Media (communication), tools used to deliver information or data
** Advertising media, various media, content, buying and placement for advertising
** Broadcast media, communications delivered over mass el ...
* follow curved paths in a medium in which the
refractive index
In optics, the refractive index (or refraction index) of an optical medium is a dimensionless number that gives the indication of the light bending ability of that medium.
The refractive index determines how much the path of light is bent, o ...
changes
* may be absorbed or reflected.
Geometrical optics does not account for certain optical effects such as
diffraction and
interference. This simplification is useful in practice; it is an excellent approximation when the wavelength is small compared to the size of structures with which the light interacts. The techniques are particularly useful in describing geometrical aspects of
imaging
Imaging is the representation or reproduction of an object's form; especially a visual representation (i.e., the formation of an image).
Imaging technology is the application of materials and methods to create, preserve, or duplicate images.
...
, including
optical aberrations.
Explanation
A light ray is a
line
Line most often refers to:
* Line (geometry), object with zero thickness and curvature that stretches to infinity
* Telephone line, a single-user circuit on a telephone communication system
Line, lines, The Line, or LINE may also refer to:
Art ...
or
curve
In mathematics, a curve (also called a curved line in older texts) is an object similar to a line, but that does not have to be straight.
Intuitively, a curve may be thought of as the trace left by a moving point. This is the definition that ...
that is
perpendicular
In elementary geometry, two geometric objects are perpendicular if they intersect at a right angle (90 degrees or π/2 radians). The condition of perpendicularity may be represented graphically using the ''perpendicular symbol'', ⟂. It can ...
to the light's
wavefront
In physics, the wavefront of a time-varying ''wave field'' is the set ( locus) of all points having the same '' phase''. The term is generally meaningful only for fields that, at each point, vary sinusoidally in time with a single temporal fre ...
s (and is therefore
collinear
In geometry, collinearity of a set of points is the property of their lying on a single line. A set of points with this property is said to be collinear (sometimes spelled as colinear). In greater generality, the term has been used for aligned o ...
with the
wave vector
In physics, a wave vector (or wavevector) is a vector used in describing a wave, with a typical unit being cycle per metre. It has a magnitude and direction. Its magnitude is the wavenumber of the wave (inversely proportional to the wavelength) ...
).
A slightly more rigorous definition of a light ray follows from
Fermat's principle
Fermat's principle, also known as the principle of least time, is the link between ray optics and wave optics. In its original "strong" form, Fermat's principle states that the path taken by a ray between two given points is the pat ...
, which states that the path taken between two points by a ray of light is the path that can be traversed in the least time.
Geometrical optics is often simplified by making the
paraxial approximation
In geometric optics, the paraxial approximation is a small-angle approximation used in Gaussian optics and ray tracing of light through an optical system (such as a lens).
A paraxial ray is a ray which makes a small angle (''θ'') to the opti ...
, or "small angle approximation". The mathematical behavior then becomes
linear
Linearity is the property of a mathematical relationship ('' function'') that can be graphically represented as a straight line. Linearity is closely related to '' proportionality''. Examples in physics include rectilinear motion, the linear ...
, allowing optical components and systems to be described by simple matrices. This leads to the techniques of
Gaussian optics
Gaussian optics is a technique in geometrical optics that describes the behaviour of light rays in optical systems by using the paraxial approximation, in which only rays which make small angles with the optical axis of the system are consider ...
and ''paraxial
ray tracing'', which are used to find basic properties of optical systems, such as approximate
image
An image is a visual representation of something. It can be two-dimensional, three-dimensional, or somehow otherwise feed into the visual system to convey information. An image can be an artifact, such as a photograph or other two-dimensio ...
and object positions and
magnification
Magnification is the process of enlarging the apparent size, not physical size, of something. This enlargement is quantified by a calculated number also called "magnification". When this number is less than one, it refers to a reduction in si ...
s.
Reflection
Glossy surfaces such as
mirror
A mirror or looking glass is an object that Reflection (physics), reflects an image. Light that bounces off a mirror will show an image of whatever is in front of it, when focused through the lens of the eye or a camera. Mirrors reverse the ...
s reflect light in a simple, predictable way. This allows for production of reflected images that can be associated with an actual (
real) or extrapolated (
virtual) location in space.
With such surfaces, the direction of the reflected ray is determined by the angle the incident ray makes with the
surface normal
In geometry, a normal is an object such as a line, ray, or vector that is perpendicular to a given object. For example, the normal line to a plane curve at a given point is the (infinite) line perpendicular to the tangent line to the curve ...
, a line perpendicular to the surface at the point where the ray hits. The incident and reflected rays lie in a single plane, and the angle between the reflected ray and the surface normal is the same as that between the incident ray and the normal.
[ Chapter 35.] This is known as the
Law of Reflection.
For
flat mirrors, the law of reflection implies that images of objects are upright and the same distance behind the mirror as the objects are in front of the mirror. The image size is the same as the object size. (The
magnification
Magnification is the process of enlarging the apparent size, not physical size, of something. This enlargement is quantified by a calculated number also called "magnification". When this number is less than one, it refers to a reduction in si ...
of a flat mirror is equal to one.) The law also implies that
mirror images are
parity inverted, which is perceived as a left-right inversion.
Mirrors with curved surfaces can be modeled by
ray tracing and using the law of reflection at each point on the surface. For
mirrors with parabolic surfaces, parallel rays incident on the mirror produce reflected rays that converge at a common
focus. Other curved surfaces may also focus light, but with aberrations due to the diverging shape causing the focus to be smeared out in space. In particular, spherical mirrors exhibit
spherical aberration. Curved mirrors can form images with magnification greater than or less than one, and the image can be upright or inverted. An upright image formed by reflection in a mirror is always virtual, while an inverted image is real and can be projected onto a screen.
Refraction
Refraction occurs when light travels through an area of space that has a changing index of refraction. The simplest case of refraction occurs when there is an interface between a uniform medium with index of refraction
and another medium with index of refraction
. In such situations,
Snell's Law
Snell's law (also known as Snell–Descartes law and ibn-Sahl law and the law of refraction) is a formula used to describe the relationship between the angles of incidence and refraction, when referring to light or other waves passing through ...
describes the resulting deflection of the light ray:
:
where
and
are the angles between the normal (to the interface) and the incident and refracted waves, respectively. This phenomenon is also associated with a changing speed of light as seen from the definition of index of refraction provided above which implies:
:
where
and
are the wave velocities through the respective media.
Various consequences of Snell's Law include the fact that for light rays traveling from a material with a high index of refraction to a material with a low index of refraction, it is possible for the interaction with the interface to result in zero transmission. This phenomenon is called
total internal reflection
Total internal reflection (TIR) is the optical phenomenon in which waves arriving at the interface (boundary) from one medium to another (e.g., from water to air) are not refracted into the second ("external") medium, but completely reflecte ...
and allows for
fiber optics
An optical fiber, or optical fibre in Commonwealth English, is a flexible, transparent fiber made by drawing glass (silica) or plastic to a diameter slightly thicker than that of a human hair. Optical fibers are used most often as a means ...
technology. As light signals travel down a fiber optic cable, they undergo total internal reflection allowing for essentially no light lost over the length of the cable. It is also possible to produce
polarized light rays using a combination of reflection and refraction: When a refracted ray and the reflected ray form a
right angle, the reflected ray has the property of "plane polarization". The angle of incidence required for such a scenario is known as
Brewster's angle.
Snell's Law can be used to predict the deflection of light rays as they pass through "linear media" as long as the indexes of refraction and the geometry of the media are known. For example, the propagation of light through a
prism results in the light ray being deflected depending on the shape and orientation of the prism. Additionally, since different frequencies of light have slightly different indexes of refraction in most materials, refraction can be used to produce
dispersion spectra that appear as rainbows. The discovery of this phenomenon when passing light through a prism is famously attributed to
Isaac Newton
Sir Isaac Newton (25 December 1642 – 20 March 1726/27) was an English mathematician, physicist, astronomer, alchemist, theologian, and author (described in his time as a " natural philosopher"), widely recognised as one of the g ...
.
Some media have an index of refraction which varies gradually with position and, thus, light rays curve through the medium rather than travel in straight lines. This effect is what is responsible for
mirage
A mirage is a naturally-occurring optical phenomenon in which light rays bend via refraction to produce a displaced image of distant objects or the sky. The word comes to English via the French ''(se) mirer'', from the Latin ''mirari'', meani ...
s seen on hot days where the changing index of refraction of the air causes the light rays to bend creating the appearance of specular reflections in the distance (as if on the surface of a pool of water). Material that has a varying index of refraction is called a gradient-index (GRIN) material and has many useful properties used in modern optical scanning technologies including
photocopiers and
scanners. The phenomenon is studied in the field of
gradient-index optics.
A device which produces converging or diverging light rays due to refraction is known as a
lens
A lens is a transmissive optical device which focuses or disperses a light beam by means of refraction. A simple lens consists of a single piece of transparent material, while a compound lens consists of several simple lenses (''elements'' ...
. Thin lenses produce focal points on either side that can be modeled using the
lensmaker's equation
A lens is a transmissive optical device which focuses or disperses a light beam by means of refraction. A simple lens consists of a single piece of transparent material, while a compound lens consists of several simple lenses (''elements ...
.
[ Chapters 5 & 6.] In general, two types of lenses exist:
convex lens
A lens is a transmissive optical device which focuses or disperses a light beam by means of refraction. A simple lens consists of a single piece of transparent material, while a compound lens consists of several simple lenses (''elements''), ...
es, which cause parallel light rays to converge, and
concave lens
A lens is a transmissive optical device which focuses or disperses a light beam by means of refraction. A simple lens consists of a single piece of transparent material, while a compound lens consists of several simple lenses (''elements' ...
es, which cause parallel light rays to diverge. The detailed prediction of how images are produced by these lenses can be made using ray-tracing similar to curved mirrors. Similarly to curved mirrors, thin lenses follow a simple equation that determines the location of the images given a particular focal length (
) and object distance (
):
:
where
is the distance associated with the image and is considered by convention to be negative if on the same side of the lens as the object and positive if on the opposite side of the lens.
The focal length f is considered negative for concave lenses.
Incoming parallel rays are focused by a convex lens into an inverted real image one focal length from the lens, on the far side of the lens.
Rays from an object at finite distance are focused further from the lens than the focal distance; the closer the object is to the lens, the further the image is from the lens. With concave lenses, incoming parallel rays diverge after going through the lens, in such a way that they seem to have originated at an upright virtual image one focal length from the lens, on the same side of the lens that the parallel rays are approaching on.
Rays from an object at finite distance are associated with a virtual image that is closer to the lens than the focal length, and on the same side of the lens as the object. The closer the object is to the lens, the closer the virtual image is to the lens.
Likewise, the magnification of a lens is given by
:
where the negative sign is given, by convention, to indicate an upright object for positive values and an inverted object for negative values. Similar to mirrors, upright images produced by single lenses are virtual while inverted images are real.
Lenses suffer from
aberrations that distort images and focal points. These are due to both to geometrical imperfections and due to the changing index of refraction for different wavelengths of light (
chromatic aberration
In optics, chromatic aberration (CA), also called chromatic distortion and spherochromatism, is a failure of a lens to focus all colors to the same point. It is caused by dispersion: the refractive index of the lens elements varies with the ...
).
Underlying mathematics
As a mathematical study, geometrical optics emerges as a short-
wavelength
In physics, the wavelength is the spatial period of a periodic wave—the distance over which the wave's shape repeats.
It is the distance between consecutive corresponding points of the same phase on the wave, such as two adjacent crests, tr ...
limit for solutions to
hyperbolic partial differential equations (Sommerfeld–Runge method) or as a property of propagation of field discontinuities according to Maxwell's equations (Luneburg method). In this short-wavelength limit, it is possible to approximate the solution locally by
:
where
satisfy a
dispersion relation
In the physical sciences and electrical engineering, dispersion relations describe the effect of dispersion on the properties of waves in a medium. A dispersion relation relates the wavelength or wavenumber of a wave to its frequency. Given t ...
, and the amplitude
varies slowly. More precisely, the
leading order The leading-order terms (or corrections) within a mathematical equation, expression or model are the terms with the largest order of magnitude.J.K.Hunter, ''Asymptotic Analysis and Singular Perturbation Theory'', 2004. http://www.math.ucdavis.edu/~ ...
solution takes the form
:
The phase
can be linearized to recover large wavenumber
, and frequency
. The amplitude
satisfies a
transport equation. The small parameter
enters the scene due to highly oscillatory initial conditions. Thus, when initial conditions oscillate much faster than the coefficients of the differential equation, solutions will be highly oscillatory, and transported along rays. Assuming coefficients in the differential equation are smooth, the rays will be too. In other words,
refraction
In physics, refraction is the redirection of a wave as it passes from one medium to another. The redirection can be caused by the wave's change in speed or by a change in the medium. Refraction of light is the most commonly observed phenomen ...
does not take place. The motivation for this technique comes from studying the typical scenario of light propagation where short wavelength light travels along rays that minimize (more or less) its travel time. Its full application requires tools from
microlocal analysis.
Sommerfeld–Runge method
The method of obtaining equations of geometrical optics by taking the limit of zero wavelength was first described by
Arnold Sommerfeld and J. Runge in 1911. Their derivation was based on an oral remark by
Peter Debye
Peter Joseph William Debye (; ; March 24, 1884 – November 2, 1966) was a Dutch-American physicist and physical chemistry, physical chemist, and List of Nobel laureates in Chemistry, Nobel laureate in Chemistry.
Biography
Early life
Born Petr ...
. Consider a monochromatic scalar field
, where
could be any of the components of
electric or
magnetic field
A magnetic field is a vector field that describes the magnetic influence on moving electric charges, electric currents, and magnetic materials. A moving charge in a magnetic field experiences a force perpendicular to its own velocity and t ...
and hence the function
satisfy the wave equation
:
where
with
being the
speed of light
The speed of light in vacuum, commonly denoted , is a universal physical constant that is important in many areas of physics. The speed of light is exactly equal to ). According to the special theory of relativity, is the upper limit fo ...
in vacuum. Here,
is the
refractive index
In optics, the refractive index (or refraction index) of an optical medium is a dimensionless number that gives the indication of the light bending ability of that medium.
The refractive index determines how much the path of light is bent, o ...
of the medium. Without loss of generality, let us introduce
to convert the equation to
:
Since the underlying principle of geometrical optics lies in the limit
, the following asymptotic series is assumed,
:
For large but finite value of
, the series diverges, and one has to be careful in keeping only appropriate first few terms. For each value of
,
one can find an optimum number of terms to be kept and adding more terms than the optimum number might result in a poorer approximation. Substituting the series into the equation and collecting terms of different orders, one finds
:
in general,
:
The first equation is known as the
eikonal equation, which determines the eikonal
is a
Hamilton–Jacobi equation, written for example in Cartesian coordinates becomes
:
The remaining equations determine the functions
.
Luneburg method
The method of obtaining equations of geometrical optics by analysing surfaces of discontinuities of solutions to Maxwell's equations was first described by
Rudolf Karl Luneburg in 1944. It does not restrict the electromagnetic field to have a special form required by the Sommerfeld-Runge method which assumes the amplitude
and phase
satisfy the equation
. This condition is satisfied by e.g. plane waves but is not additive.
The main conclusion of Luneburg's approach is the following:
Theorem. Suppose the fields
and
(in a linear isotropic medium described by dielectric constants
and
) have finite discontinuities along a (moving) surface in
described by the equation
. Then Maxwell's equations in the integral form imply that
satisfies the eikonal equation:
:
,
where
is the index of refraction of the medium (Gaussian units).
An example of such surface of discontinuity is the initial wave front emanating from a source that starts radiating at a certain instant of time.
The surfaces of field discontinuity thus become geometrical optics wave fronts with the corresponding geometrical optics fields defined as:
:
:
Those fields obey transport equations consistent with the transport equations of the Sommerfeld-Runge approach. Light rays in Luneburg's theory are defined as trajectories orthogonal to the discontinuity surfaces and with the right parametrisation they can be shown to obey Fermat's principle of least time thus establishing the identity of those rays with light rays of standard optics.
The above developments can be generalised to anisotropic media.
The proof of Luneburg's theorem is based on investigating how Maxwell's equations govern the propagation of discontinuities of solutions. The basic technical lemma is as follows:
A technical lemma. Let
be a hypersurface (a 3-dimensional manifold) in spacetime
on which one or more of:
,
,
,
, have a finite discontinuity. Then at each point of the hypersurface the following formulas hold:
:
:
:
:
where the
operator acts in the
-space (for every fixed
) and the square brackets denote the difference in values on both sides of the discontinuity surface (set up according to an arbitrary but fixed convention, e.g. the gradient
pointing in the direction of the quantities being subtracted ''from'').
Sketch of proof. Start with Maxwell's equations away from the sources (Gaussian units):
:
:
:
:
Using Stokes' theorem in
one can conclude from the first of the above equations that for any domain
in
with a piecewise smooth (3-dimensional) boundary
the following is true:
:
where
is the projection of the outward unit normal
of
onto the 3D slice
, and
is the volume 3-form on
. Similarly, one establishes the following from the remaining Maxwell's equations:
:
:
:
Now by considering arbitrary small sub-surfaces
of
and setting up small neighbourhoods surrounding
in
, and subtracting the above integrals accordingly, one obtains:
:
:
:
:
where
denotes the gradient in the 4D
-space. And since
is arbitrary, the integrands must be equal to 0 which proves the lemma.
It's now easy to show that as they propagate through a continuous medium, the discontinuity surfaces obey the eikonal equation. Specifically, if
and
are continuous, then the discontinuities of
and
satisfy: