Special Relativity
   HOME

TheInfoList



OR:

In
physics Physics is the scientific study of matter, its Elementary particle, fundamental constituents, its motion and behavior through space and time, and the related entities of energy and force. "Physical science is that department of knowledge whi ...
, the special theory of relativity, or special relativity for short, is a scientific theory of the relationship between space and time. In
Albert Einstein Albert Einstein (14 March 187918 April 1955) was a German-born theoretical physicist who is best known for developing the theory of relativity. Einstein also made important contributions to quantum mechanics. His mass–energy equivalence f ...
's 1905 paper, "On the Electrodynamics of Moving Bodies", the theory is presented as being based on just two postulates: # The laws of physics are invariant (identical) in all inertial frames of reference (that is, frames of reference with no
acceleration In mechanics, acceleration is the Rate (mathematics), rate of change of the velocity of an object with respect to time. Acceleration is one of several components of kinematics, the study of motion. Accelerations are Euclidean vector, vector ...
). This is known as the
principle of relativity In physics, the principle of relativity is the requirement that the equations describing the laws of physics have the same form in all admissible frames of reference. For example, in the framework of special relativity, the Maxwell equations ...
. # The
speed of light The speed of light in vacuum, commonly denoted , is a universal physical constant exactly equal to ). It is exact because, by international agreement, a metre is defined as the length of the path travelled by light in vacuum during a time i ...
in
vacuum A vacuum (: vacuums or vacua) is space devoid of matter. The word is derived from the Latin adjective (neuter ) meaning "vacant" or "void". An approximation to such vacuum is a region with a gaseous pressure much less than atmospheric pressur ...
is the same for all observers, regardless of the motion of light source or observer. This is known as the principle of light constancy, or the principle of light speed invariance. The first postulate was first formulated by
Galileo Galilei Galileo di Vincenzo Bonaiuti de' Galilei (15 February 1564 – 8 January 1642), commonly referred to as Galileo Galilei ( , , ) or mononymously as Galileo, was an Italian astronomer, physicist and engineer, sometimes described as a poly ...
(see '' Galilean invariance'').


Background

Special relativity builds upon important physics ideas. The non-technical ideas include: *
speed In kinematics, the speed (commonly referred to as ''v'') of an object is the magnitude of the change of its position over time or the magnitude of the change of its position per unit of time; it is thus a non-negative scalar quantity. Intro ...
or
velocity Velocity is a measurement of speed in a certain direction of motion. It is a fundamental concept in kinematics, the branch of classical mechanics that describes the motion of physical objects. Velocity is a vector (geometry), vector Physical q ...
, how the relative distance between an object and a reference point changes with time. *
speed of light The speed of light in vacuum, commonly denoted , is a universal physical constant exactly equal to ). It is exact because, by international agreement, a metre is defined as the length of the path travelled by light in vacuum during a time i ...
, the maximum speed of information, independent of the speed of the source and receiver, * event: something that happens at a definite place and time. For examples, an explosion or a flash of light from an atom; a generalization of a point in geometrical space, * clocks, relativity is all about time; in relativity observers read clocks. Two observers in relative motion receive information about two events via light signals traveling at constant speed, independent of either observer's speed. Their motion during the transit time causes them to get the information at different times on their local clock. The more technical background ideas include: * invariance: when physical laws do not change when a specific circumstance changes, such as observations at different uniform velocities; *
spacetime In physics, spacetime, also called the space-time continuum, is a mathematical model that fuses the three dimensions of space and the one dimension of time into a single four-dimensional continuum. Spacetime diagrams are useful in visualiz ...
: a union of geometrical space and time. * spacetime interval between two events: a measure of separation that generalizes distance: (\text)^2 = \left \text \right2 - \left \text \right2 *
coordinate system In geometry, a coordinate system is a system that uses one or more numbers, or coordinates, to uniquely determine and standardize the position of the points or other geometric elements on a manifold such as Euclidean space. The coordinates are ...
or reference frame: a mechanism to specify points in space with respect to common reference axes, * inertial reference frame: a limited region of spacetime within which all objects experience the same gravitational acceleration and, absent other forces, do not accelerate relative to each other, * relative velocity: the amount and direction of uniform, relative motion of objects in two reference frames, * coordinate transformation: a procedure to respecify a points against a different coordinate system. The spacetime interval is an invariant between inertial frames, demonstrating the physical unity of spacetime. Coordinate systems are not invariant between inertial frames and require transformations.


Overview


Basis

Unusual among modern topics in physics, the theory of special relativity needs only mathematics at high school level and yet it fundamentally alters our understanding, especially our understanding of the concept of
time Time is the continuous progression of existence that occurs in an apparently irreversible process, irreversible succession from the past, through the present, and into the future. It is a component quantity of various measurements used to sequ ...
. Built on just two postulates or assumptions, many interesting consequences follow. The two postulates both concern observers moving at a constant speed relative to each other. The first postulate, the , says the laws of physics do not depend on objects being at absolute rest: for example, an observer on a train sees natural phenomena on that train that look the same whether the train is moving or not. The second postulate, constant speed of light, says observers on a moving train or on in the train station see light travel at the same speed. A light signal from the station to the train has the same speed, no matter how fast a train goes. In the theory of special relativity, the two postulates combine to change the definition of "relative speed". Rather than the simple concept of distance traveled divided by time spent, the new theory incorporates the speed of light as the maximum possible speed. In special relativity, covering ten times more distance on the ground in the same amount of time according to a moving watch does not result in a speed up as seen from the ground by a factor of ten.


Consequences

Special relativity has a wide range of consequences that have been experimentally verified. The conceptual effects include: * The § relativity of simultaneity events that appear simultaneous to one observer may not be simultaneous to an observer in motion * time measured between two events by observers in motions differ * distances between two events by observers in motions differ * The velocities no longer simply add Combined with other laws of physics, the two postulates of special relativity predict the equivalence of
mass Mass is an Intrinsic and extrinsic properties, intrinsic property of a physical body, body. It was traditionally believed to be related to the physical quantity, quantity of matter in a body, until the discovery of the atom and particle physi ...
and
energy Energy () is the physical quantity, quantitative physical property, property that is transferred to a physical body, body or to a physical system, recognizable in the performance of Work (thermodynamics), work and in the form of heat and l ...
, as expressed in the
mass–energy equivalence In physics, mass–energy equivalence is the relationship between mass and energy in a system's rest frame. The two differ only by a multiplicative constant and the units of measurement. The principle is described by the physicist Albert Einstei ...
formula , where c is the
speed of light The speed of light in vacuum, commonly denoted , is a universal physical constant exactly equal to ). It is exact because, by international agreement, a metre is defined as the length of the path travelled by light in vacuum during a time i ...
in vacuum.The Feynman Lectures on Physics Vol. I Ch. 15-9: Equivalence of mass and energy
/ref> Special relativity replaced the conventional notion of an absolute, universal time with the notion of a time that is local to each observer. Information about distant objects can arrive no faster than the speed of light so visual observations always report events that have happened in the past. This effect makes visual descriptions of the effects of special relativity especially prone to mistakes. Special relativity also has profound technical consequences. A defining feature of special relativity is the replacement of
Euclidean geometry Euclidean geometry is a mathematical system attributed to ancient Greek mathematics, Greek mathematician Euclid, which he described in his textbook on geometry, ''Euclid's Elements, Elements''. Euclid's approach consists in assuming a small set ...
with Lorentzian geometry. Distances in Euclidean geometry are calculated with the
Pythagorean theorem In mathematics, the Pythagorean theorem or Pythagoras' theorem is a fundamental relation in Euclidean geometry between the three sides of a right triangle. It states that the area of the square whose side is the hypotenuse (the side opposite t ...
and only involved spatial coordinates. In Lorentzian geometry, 'distances' become 'intervals' and include a time coordinate with a minus sign. Unlike spatial distances, the interval between two events has the same value for all observers independent of their relative velocity. When comparing two sets of coordinates in relative motion is
Lorentz transformation In physics, the Lorentz transformations are a six-parameter family of Linear transformation, linear coordinate transformation, transformations from a Frame of Reference, coordinate frame in spacetime to another frame that moves at a constant vel ...
replace Galilean transformations of Newtonian mechanics. Other effects include the relativistic corrects to the Doppler effect and the Thomas precession. It also explains how electricity and magnetism are related.


History

The principle of relativity, forming one of the two postulates of special relativity, was described by
Galileo Galilei Galileo di Vincenzo Bonaiuti de' Galilei (15 February 1564 – 8 January 1642), commonly referred to as Galileo Galilei ( , , ) or mononymously as Galileo, was an Italian astronomer, physicist and engineer, sometimes described as a poly ...
in 1632 using a thought experiment involving observing natural phenomena on a moving ship. His conclusions were summarized as Galilean relativity and used as the basis of
Newtonian mechanics Newton's laws of motion are three physical laws that describe the relationship between the motion of an object and the forces acting on it. These laws, which provide the basis for Newtonian mechanics, can be paraphrased as follows: # A body r ...
. This principle can be expressed as a coordinate transformation, between two coordinate systems.
Isaac Newton Sir Isaac Newton () was an English polymath active as a mathematician, physicist, astronomer, alchemist, theologian, and author. Newton was a key figure in the Scientific Revolution and the Age of Enlightenment, Enlightenment that followed ...
noted that many transformations, such as those involving rotation or acceleration, will not preserve the observation of physical phenomena. Newton considered only those transformations involving motion with respect to an immovable absolute space, now called transformations between inertial frames.. In 1864
James Clerk Maxwell James Clerk Maxwell (13 June 1831 – 5 November 1879) was a Scottish physicist and mathematician who was responsible for the classical theory of electromagnetic radiation, which was the first theory to describe electricity, magnetism an ...
presented a theory of
electromagnetism In physics, electromagnetism is an interaction that occurs between particles with electric charge via electromagnetic fields. The electromagnetic force is one of the four fundamental forces of nature. It is the dominant force in the interacti ...
which did not obey Galilean relativity. The theory specifically predicted a constant speed of light in vacuum, no matter the motion (velocity, acceleration, etc.) of the light emitter or receiver or its frequency, wavelength, direction, polarization, or phase. This, as yet untested theory, was thought at the time to be only valid in inertial frames fixed in an aether. Numerous experiments followed, attempting to measure the speed of light as Earth moved through the proposed fixed aether, culminating in the 1887 Michelson–Morley experiment which only confirmed the constant speed of light. Several fixes to the aether theory were proposed, with those of George Francis Fitzgerald,
Hendrik Antoon Lorentz Hendrik Antoon Lorentz ( ; ; 18 July 1853 – 4 February 1928) was a Dutch theoretical physicist who shared the 1902 Nobel Prize in Physics with Pieter Zeeman for their discovery and theoretical explanation of the Zeeman effect. He derived ...
, and Jules Henri Poincare all pointing in the direction of a result similar to the theory of special relativity. The final important step was taken by Albert Einstein in a paper published on 26 September 1905 titled "On the Electrodynamics of Moving Bodies".
Albert Einstein Albert Einstein (14 March 187918 April 1955) was a German-born theoretical physicist who is best known for developing the theory of relativity. Einstein also made important contributions to quantum mechanics. His mass–energy equivalence f ...
(1905)
''Zur Elektrodynamik bewegter Körper''
, ''Annalen der Physik'' 17: 891; English translatio
On the Electrodynamics of Moving Bodies
by George Barker Jeffery and Wilfrid Perrett (1923); Another English translation On the Electrodynamics of Moving Bodies by Megh Nad Saha (1920).
Einstein applied the Lorentz transformations known to be compatible with
Maxwell's equations Maxwell's equations, or Maxwell–Heaviside equations, are a set of coupled partial differential equations that, together with the Lorentz force law, form the foundation of classical electromagnetism, classical optics, Electrical network, electr ...
for electrodynamics to the classical laws of mechanics. This changed Newton's mechanics situations involving all motions, especially velocities close to that of light (known as '). Another way to describe the advance made by the special theory is to say Einstein extended the Galilean principle so that it accounted for the constant speed of light, a phenomenon that had been observed in the Michelson–Morley experiment. He also postulated that it holds for all the laws of physics, including both the laws of mechanics and of electrodynamics. The theory became essentially complete in 1907, with Hermann Minkowski's papers on spacetime. Special relativity has proven to be the most accurate model of motion at any speed when gravitational and quantum effects are negligible. Even so, the Newtonian model remains accurate at low velocities relative to the speed of light, for example, everyday motion on Earth. When updating his 1911 book on relativity, to include general relativity in 1920, Robert Daniel Carmichael called the earlier work the "restricted theory" as a "special case" of the new general theory; he also used the phrase "special theory of relativity". In comparing to the general theory in 1923 Einstein specifically called his earlier work "the special theory of relativity", saying he meant a restriction to frames uniform motion. Just as Galilean relativity is accepted as an approximation of special relativity that is valid for low speeds, special relativity is considered an approximation of general relativity that is valid for weak
gravitational field In physics, a gravitational field or gravitational acceleration field is a vector field used to explain the influences that a body extends into the space around itself. A gravitational field is used to explain gravitational phenomena, such as ...
s, that is, at a sufficiently small scale (e.g., when tidal forces are negligible) and in conditions of
free fall In classical mechanics, free fall is any motion of a physical object, body where gravity is the only force acting upon it. A freely falling object may not necessarily be falling down in the vertical direction. If the common definition of the word ...
. But general relativity incorporates non-Euclidean geometry to represent gravitational effects as the geometric curvature of spacetime. Special relativity is restricted to the flat spacetime known as Minkowski space. As long as the universe can be modeled as a
pseudo-Riemannian manifold In mathematical physics, a pseudo-Riemannian manifold, also called a semi-Riemannian manifold, is a differentiable manifold with a metric tensor that is everywhere nondegenerate. This is a generalization of a Riemannian manifold in which the ...
, a Lorentz-invariant frame that abides by special relativity can be defined for a sufficiently small neighborhood of each point in this curved spacetime.


Traditional "two postulates" approach to special relativity

Einstein discerned two fundamental propositions that seemed to be the most assured, regardless of the exact validity of the (then) known laws of either mechanics or electrodynamics. These propositions were the constancy of the speed of light in vacuum and the independence of physical laws (especially the constancy of the speed of light) from the choice of inertial system. In his initial presentation of special relativity in 1905 he expressed these postulates as: * The
principle of relativity In physics, the principle of relativity is the requirement that the equations describing the laws of physics have the same form in all admissible frames of reference. For example, in the framework of special relativity, the Maxwell equations ...
– the laws by which the states of physical systems undergo change are not affected, whether these changes of state be referred to the one or the other of two systems in uniform translatory motion relative to each other. * The principle of invariant light speed – "... light is always propagated in empty space with a definite velocity peed''c'' which is independent of the state of motion of the emitting body" (from the preface). That is, light in vacuum propagates with the speed ''c'' (a fixed constant, independent of direction) in at least one system of inertial coordinates (the "stationary system"), regardless of the state of motion of the light source. The constancy of the speed of light was motivated by Maxwell's theory of electromagnetism and the lack of evidence for the luminiferous ether. There is conflicting evidence on the extent to which Einstein was influenced by the null result of the Michelson–Morley experiment. In any case, the null result of the Michelson–Morley experiment helped the notion of the constancy of the speed of light gain widespread and rapid acceptance. The derivation of special relativity depends not only on these two explicit postulates, but also on several tacit assumptions ( made in almost all theories of physics), including the isotropy and homogeneity of space and the independence of measuring rods and clocks from their past history.Einstein, "Fundamental Ideas and Methods of the Theory of Relativity", 1920


Principle of relativity


Reference frames and relative motion

Reference frames play a crucial role in relativity theory. The term reference frame as used here is an observational perspective in space that is not undergoing any change in motion (acceleration), from which a position can be measured along 3 spatial axes (so, at rest or constant velocity). In addition, a reference frame has the ability to determine measurements of the time of events using a "clock" (any reference device with uniform periodicity). An event is an occurrence that can be assigned a single unique moment and location in space relative to a reference frame: it is a "point" in
spacetime In physics, spacetime, also called the space-time continuum, is a mathematical model that fuses the three dimensions of space and the one dimension of time into a single four-dimensional continuum. Spacetime diagrams are useful in visualiz ...
. Since the speed of light is constant in relativity irrespective of the reference frame, pulses of light can be used to unambiguously measure distances and refer back to the times that events occurred to the clock, even though light takes time to reach the clock after the event has transpired. For example, the explosion of a firecracker may be considered to be an "event". We can completely specify an event by its four spacetime coordinates: The time of occurrence and its 3-dimensional spatial location define a reference point. Let's call this reference frame ''S''. In relativity theory, we often want to calculate the coordinates of an event from differing reference frames. The equations that relate measurements made in different frames are called ''transformation equations''.


Standard configuration

To gain insight into how the spacetime coordinates measured by observers in different reference frames compare with each other, it is useful to work with a simplified setup with frames in a ''standard configuration''. With care, this allows simplification of the math with no loss of generality in the conclusions that are reached. In Fig. 2-1, two Galilean reference frames (i.e., conventional 3-space frames) are displayed in relative motion. Frame S belongs to a first observer ''O'', and frame (pronounced "S prime" or "S dash") belongs to a second observer . * The ''x'', ''y'', ''z'' axes of frame S are oriented parallel to the respective primed axes of frame . * Frame moves, for simplicity, in a single direction: the ''x''-direction of frame S with a constant velocity ''v'' as measured in frame ''S''. * The origins of frames S and are coincident when time for frame S and for frame . Since there is no absolute reference frame in relativity theory, a concept of "moving" does not strictly exist, as everything may be moving with respect to some other reference frame. Instead, any two frames that move at the same speed in the same direction are said to be ''comoving''. Therefore, ''S'' and are not ''comoving''.


Lack of an absolute reference frame

The
principle of relativity In physics, the principle of relativity is the requirement that the equations describing the laws of physics have the same form in all admissible frames of reference. For example, in the framework of special relativity, the Maxwell equations ...
, which states that physical laws have the same form in each inertial reference frame, dates back to
Galileo Galileo di Vincenzo Bonaiuti de' Galilei (15 February 1564 – 8 January 1642), commonly referred to as Galileo Galilei ( , , ) or mononymously as Galileo, was an Italian astronomer, physicist and engineer, sometimes described as a poly ...
, and was incorporated into Newtonian physics. But in the late 19th century the existence of
electromagnetic waves In physics, electromagnetic radiation (EMR) is a self-propagating wave of the electromagnetic field that carries momentum and radiant energy through space. It encompasses a broad spectrum, classified by frequency or its inverse, wavelength, ran ...
led some physicists to suggest that the universe was filled with a substance they called " aether", which, they postulated, would act as the medium through which these waves, or vibrations, propagated (in many respects similar to the way sound propagates through air). The aether was thought to be an absolute reference frame against which all speeds could be measured, and could be considered fixed and motionless relative to Earth or some other fixed reference point. The aether was supposed to be sufficiently elastic to support electromagnetic waves, while those waves could interact with matter, yet offering no resistance to bodies passing through it (its one property was that it allowed electromagnetic waves to propagate). The results of various experiments, including the Michelson–Morley experiment in 1887 (subsequently verified with more accurate and innovative experiments), led to the theory of special relativity, by showing that the aether did not exist. Einstein's solution was to discard the notion of an aether and the absolute state of rest. In relativity, any reference frame moving with uniform motion will observe the same laws of physics. In particular, the speed of light in vacuum is always measured to be ''c'', even when measured by multiple systems that are moving at different (but constant) velocities.


Relativity without the second postulate

From the principle of relativity alone without assuming the constancy of the speed of light (i.e., using the isotropy of space and the symmetry implied by the principle of special relativity) it can be shown that the spacetime transformations between inertial frames are either Euclidean, Galilean, or Lorentzian. In the Lorentzian case, one can then obtain relativistic interval conservation and a certain finite limiting speed. Experiments suggest that this speed is the speed of light in vacuum.David Morin (2007) ''Introduction to Classical Mechanics'', Cambridge University Press, Cambridge, chapter 11, Appendix I, .


Lorentz invariance as the essential core of special relativity


Two- vs one- postulate approaches

In Einstein's own view, the two postulates of relativity and the invariance of the speed of light lead to a single postulate, the Lorentz transformation: Following Einstein's original presentation of special relativity in 1905, many different sets of postulates have been proposed in various alternative derivations, but Einstein stuck to his approach throughout work.Einstein, On the Relativity Principle and the Conclusions Drawn from It, 1907; "The Principle of Relativity and Its Consequences in Modern Physics", 1910; "The Theory of Relativity", 1911; Manuscript on the Special Theory of Relativity, 1912; Theory of Relativity, 1913; Einstein, Relativity, the Special and General Theory, 1916; The Principal Ideas of the Theory of Relativity, 1916; What Is The Theory of Relativity?, 1919; The Principle of Relativity (Princeton Lectures), 1921; Physics and Reality, 1936; The Theory of Relativity, 1949.
Henri Poincaré Jules Henri Poincaré (, ; ; 29 April 185417 July 1912) was a French mathematician, Theoretical physics, theoretical physicist, engineer, and philosophy of science, philosopher of science. He is often described as a polymath, and in mathemati ...
provided the mathematical framework for relativity theory by proving that Lorentz transformations are a subset of his Poincaré group of symmetry transformations. Einstein later derived these transformations from his axioms. While the traditional two-postulate approach to special relativity is presented in innumerable college textbooks and popular presentations, other treatments of special relativity base it on the single postulate of universal Lorentz covariance, or, equivalently, on the single postulate of Minkowski spacetime.Das, A. (1993) ''The Special Theory of Relativity, A Mathematical Exposition'', Springer, .Schutz, J. (1997) Independent Axioms for Minkowski Spacetime, Addison Wesley Longman Limited, . Textbooks starting with the single postulate of Minkowski spacetime include those by Taylor and Wheeler and by Callahan.


Lorentz transformation and its inverse

Define an event to have spacetime coordinates in system ''S'' and in a reference frame moving at a velocity ''v'' on the ''x''-axis with respect to that frame, . Then the
Lorentz transformation In physics, the Lorentz transformations are a six-parameter family of Linear transformation, linear coordinate transformation, transformations from a Frame of Reference, coordinate frame in spacetime to another frame that moves at a constant vel ...
specifies that these coordinates are related in the following way: \begin t' &= \gamma \ (t - vx/c^2) \\ x' &= \gamma \ (x - v t) \\ y' &= y \\ z' &= z , \end where \gamma = \frac is the
Lorentz factor The Lorentz factor or Lorentz term (also known as the gamma factor) is a dimensionless quantity expressing how much the measurements of time, length, and other physical properties change for an object while it moves. The expression appears in sev ...
and ''c'' is the
speed of light The speed of light in vacuum, commonly denoted , is a universal physical constant exactly equal to ). It is exact because, by international agreement, a metre is defined as the length of the path travelled by light in vacuum during a time i ...
in vacuum, and the velocity ''v'' of , relative to ''S'', is parallel to the ''x''-axis. For simplicity, the ''y'' and ''z'' coordinates are unaffected; only the ''x'' and ''t'' coordinates are transformed. These Lorentz transformations form a one-parameter group of linear mappings, that parameter being called
rapidity In special relativity, the classical concept of velocity is converted to rapidity to accommodate the limit determined by the speed of light. Velocities must be combined by Einstein's velocity-addition formula. For low speeds, rapidity and velo ...
. Solving the four transformation equations above for the unprimed coordinates yields the inverse Lorentz transformation: \begin t &= \gamma ( t' + v x'/c^2) \\ x &= \gamma ( x' + v t') \\ y &= y' \\ z &= z'. \end This shows that the unprimed frame is moving with the velocity −''v'', as measured in the primed frame. There is nothing special about the ''x''-axis. The transformation can apply to the ''y''- or ''z''-axis, or indeed in any direction parallel to the motion (which are warped by the ''γ'' factor) and perpendicular; see the article
Lorentz transformation In physics, the Lorentz transformations are a six-parameter family of Linear transformation, linear coordinate transformation, transformations from a Frame of Reference, coordinate frame in spacetime to another frame that moves at a constant vel ...
for details. A quantity that is invariant under Lorentz transformations is known as a
Lorentz scalar In a relativistic theory of physics, a Lorentz scalar is a scalar expression whose value is invariant under any Lorentz transformation. A Lorentz scalar may be generated from, e.g., the scalar product of vectors, or by contracting tensors. Whil ...
. Writing the Lorentz transformation and its inverse in terms of coordinate differences, where one event has coordinates and , another event has coordinates and , and the differences are defined as *    \Delta x' = x'_2-x'_1 \ , \ \Delta t' = t'_2-t'_1 \ . *    \Delta x = x_2-x_1 \ , \ \ \Delta t = t_2-t_1 \ . we get *    \Delta x' = \gamma \ (\Delta x - v \,\Delta t) \ ,\ \ \Delta t' = \gamma \ \left(\Delta t - v \ \Delta x / c^ \right) \ . *    \Delta x = \gamma \ (\Delta x' + v \,\Delta t') \ , \ \Delta t = \gamma \ \left(\Delta t' + v \ \Delta x' / c^ \right) \ . If we take differentials instead of taking differences, we get *    dx' = \gamma \ (dx - v\,dt) \ ,\ \ dt' = \gamma \ \left( dt - v \ dx / c^ \right) \ . *    dx = \gamma \ (dx' + v\,dt') \ , \ dt = \gamma \ \left(dt' + v \ dx' / c^ \right) \ .


Graphical representation of the Lorentz transformation

Spacetime diagrams ( Minkowski diagrams) are an extremely useful aid to visualizing how coordinates transform between different reference frames. Although it is not as easy to perform exact computations using them as directly invoking the Lorentz transformations, their main power is their ability to provide an intuitive grasp of the results of a relativistic scenario. To draw a spacetime diagram, begin by considering two Galilean reference frames, S and S′, in standard configuration, as shown in Fig. 2-1. Fig. 3-1a. Draw the x and t axes of frame S. The x axis is horizontal and the t (actually ct) axis is vertical, which is the opposite of the usual convention in kinematics. The ct axis is scaled by a factor of c so that both axes have common units of length. In the diagram shown, the gridlines are spaced one unit distance apart. The 45° diagonal lines represent the ''worldlines'' of two photons passing through the origin at time t = 0. The slope of these worldlines is 1 because the photons advance one unit in space per unit of time. Two events, \text and \text, have been plotted on this graph so that their coordinates may be compared in the S and S' frames. Fig. 3-1b. Draw the x' and ct' axes of frame S'. The ct' axis represents the worldline of the origin of the S' coordinate system as measured in frame S. In this figure, v = c/2. Both the ct' and x' axes are tilted from the unprimed axes by an angle \alpha = \tan^(\beta), where \beta = v/c. The primed and unprimed axes share a common origin because frames S and S' had been set up in standard configuration, so that t=0 when t'=0. Fig. 3-1c. Units in the primed axes have a different scale from units in the unprimed axes. From the Lorentz transformations, we observe that (x', ct') coordinates of (0, 1) in the primed coordinate system transform to (\beta \gamma, \gamma) in the unprimed coordinate system. Likewise, (x', ct') coordinates of (1, 0) in the primed coordinate system transform to (\gamma, \beta \gamma) in the unprimed system. Draw gridlines parallel with the ct' axis through points (k \gamma, k \beta \gamma) as measured in the unprimed frame, where k is an integer. Likewise, draw gridlines parallel with the x' axis through (k \beta \gamma, k \gamma) as measured in the unprimed frame. Using the Pythagorean theorem, we observe that the spacing between ct' units equals \sqrt times the spacing between ct units, as measured in frame S. This ratio is always greater than 1, and ultimately it approaches infinity as \beta \to 1. Fig. 3-1d. Since the speed of light is an invariant, the ''worldlines'' of two photons passing through the origin at time t' = 0 still plot as 45° diagonal lines. The primed coordinates of \text and \text are related to the unprimed coordinates through the Lorentz transformations and ''could'' be approximately measured from the graph (assuming that it has been plotted accurately enough), but the real merit of a Minkowski diagram is its granting us a geometric view of the scenario. For example, in this figure, we observe that the two timelike-separated events that had different x-coordinates in the unprimed frame are now at the same position in space. While the unprimed frame is drawn with space and time axes that meet at right angles, the primed frame is drawn with axes that meet at acute or obtuse angles. This asymmetry is due to unavoidable distortions in how spacetime coordinates map onto a Cartesian plane, but the frames are actually equivalent.


Consequences derived from the Lorentz transformation

The consequences of special relativity can be derived from the
Lorentz transformation In physics, the Lorentz transformations are a six-parameter family of Linear transformation, linear coordinate transformation, transformations from a Frame of Reference, coordinate frame in spacetime to another frame that moves at a constant vel ...
equations. These transformations, and hence special relativity, lead to different physical predictions than those of Newtonian mechanics at all relative velocities, and most pronounced when relative velocities become comparable to the speed of light. The speed of light is so much larger than anything most humans encounter that some of the effects predicted by relativity are initially counterintuitive.


Invariant interval

In Galilean relativity, the spatial separation, (), and the temporal separation, (), between two events are independent invariants, the values of which do not change when observed from different frames of reference. In special relativity, however, the interweaving of spatial and temporal coordinates generates the concept of an invariant interval, denoted as : \Delta s^2 \; \overset\text \; c^2 \Delta t^2 - (\Delta x^2 + \Delta y^2 + \Delta z^2) In considering the physical significance of , there are three cases to note: * Δs2 > 0: In this case, the two events are separated by more time than space, and they are hence said to be ''timelike'' separated. This implies that , and given the Lorentz transformation , it is evident that there exists a v less than c for which \Delta x' = 0 (in particular, ). In other words, given two events that are timelike separated, it is possible to find a frame in which the two events happen at the same place. In this frame, the separation in time, , is called the ''proper time''. * Δs2 < 0: In this case, the two events are separated by more space than time, and they are hence said to be ''spacelike'' separated. This implies that , and given the Lorentz transformation , there exists a v less than c for which \Delta t' = 0 (in particular, ). In other words, given two events that are spacelike separated, it is possible to find a frame in which the two events happen at the same time. In this frame, the separation in space, , is called the ''proper distance'', or ''proper length''. For values of v greater than and less than , the sign of \Delta t' changes, meaning that the temporal order of spacelike-separated events changes depending on the frame in which the events are viewed. But the temporal order of timelike-separated events is absolute, since the only way that v could be greater than c^2 \Delta t / \Delta x would be if . * Δs2 = 0: In this case, the two events are said to be ''lightlike'' separated. This implies that , and this relationship is frame independent due to the invariance of . From this, we observe that the speed of light is c in every inertial frame. In other words, starting from the assumption of universal Lorentz covariance, the constant speed of light is a derived result, rather than a postulate as in the two-postulates formulation of the special theory. The interweaving of space and time revokes the implicitly assumed concepts of absolute simultaneity and synchronization across non-comoving frames. The form of , being the ''difference'' of the squared time lapse and the squared spatial distance, demonstrates a fundamental discrepancy between Euclidean and spacetime distances. The invariance of this interval is a property of the ''general'' Lorentz transform (also called the Poincaré transformation), making it an isometry of spacetime. The general Lorentz transform extends the standard Lorentz transform (which deals with translations without rotation, that is, Lorentz boosts, in the x-direction) with all other translations, reflections, and rotations between any Cartesian inertial frame. In the analysis of simplified scenarios, such as spacetime diagrams, a reduced-dimensionality form of the invariant interval is often employed: \Delta s^2 \, = \, c^2 \Delta t^2 - \Delta x^2 Demonstrating that the interval is invariant is straightforward for the reduced-dimensionality case and with frames in standard configuration: \begin c^2 \Delta t^2 - \Delta x^2 &= c^2 \gamma ^2 \left(\Delta t' + \dfrac \right)^2 - \gamma ^2 \ (\Delta x' + v \Delta t')^2 \\ &= \gamma ^2 \left( c^2 \Delta t' ^ + 2 v \Delta x' \Delta t' + \dfrac \right) - \gamma ^2 \ (\Delta x' ^ + 2 v \Delta x' \Delta t' + v^2 \Delta t' ^ ) \\ &= \gamma ^2 c^2 \Delta t' ^ - \gamma ^2 v^2 \Delta t' ^ - \gamma ^2 \Delta x' ^ + \gamma ^2 \dfrac \\ &= \gamma ^2 c^2 \Delta t' ^ \left( 1 - \dfrac \right) - \gamma ^2 \Delta x' ^ \left( 1 - \dfrac \right) \\ &= c^2 \Delta t' ^ - \Delta x' ^ \end The value of \Delta s^2 is hence independent of the frame in which it is measured.


Relativity of simultaneity

Consider two events happening in two different locations that occur simultaneously in the reference frame of one inertial observer. They may occur non-simultaneously in the reference frame of another inertial observer (lack of absolute simultaneity). From (the forward Lorentz transformation in terms of coordinate differences) \Delta t' = \gamma \left(\Delta t - \frac \right) It is clear that the two events that are simultaneous in frame ''S'' (satisfying ), are not necessarily simultaneous in another inertial frame (satisfying ). Only if these events are additionally co-local in frame ''S'' (satisfying ), will they be simultaneous in another frame . The Sagnac effect can be considered a manifestation of the relativity of simultaneity. Since relativity of simultaneity is a first order effect in , instruments based on the Sagnac effect for their operation, such as
ring laser gyroscope A ring laser gyroscope (RLG) consists of a ring laser having two independent counter-propagating resonant modes over the same path; the difference in phase is used to detect rotation. It operates on the principle of the Sagnac effect which shi ...
s and fiber optic gyroscopes, are capable of extreme levels of sensitivity.


Time dilation

The time lapse between two events is not invariant from one observer to another, but is dependent on the relative speeds of the observers' reference frames. Suppose a
clock A clock or chronometer is a device that measures and displays time. The clock is one of the oldest Invention, human inventions, meeting the need to measure intervals of time shorter than the natural units such as the day, the lunar month, a ...
is at rest in the unprimed system ''S''. The location of the clock on two different ticks is then characterized by . To find the relation between the times between these ticks as measured in both systems, can be used to find: : \Delta t' = \gamma\, \Delta t for events satisfying\Delta x = 0 \ . This shows that the time (Δ) between the two ticks as seen in the frame in which the clock is moving (), is ''longer'' than the time (Δ''t'') between these ticks as measured in the rest frame of the clock (''S''). Time dilation explains a number of physical phenomena; for example, the lifetime of high speed
muon A muon ( ; from the Greek letter mu (μ) used to represent it) is an elementary particle similar to the electron, with an electric charge of −1 '' e'' and a spin of  ''ħ'', but with a much greater mass. It is classified as a ...
s created by the collision of cosmic rays with particles in the Earth's outer atmosphere and moving towards the surface is greater than the lifetime of slowly moving muons, created and decaying in a laboratory. Whenever one hears a statement to the effect that "moving clocks run slow", one should envision an inertial reference frame thickly populated with identical, synchronized clocks. As a moving clock travels through this array, its reading at any particular point is compared with a stationary clock at the same point. The measurements that we would get if we actually ''looked'' at a moving clock would, in general, not at all be the same thing, because the time that we would see would be delayed by the finite speed of light, i.e. the times that we see would be distorted by the Doppler effect. Measurements of relativistic effects must always be understood as having been made after finite speed-of-light effects have been factored out.


Langevin's light-clock

Paul Langevin, an early proponent of the theory of relativity, did much to popularize the theory in the face of resistance by many physicists to Einstein's revolutionary concepts. Among his numerous contributions to the foundations of special relativity were independent work on the mass–energy relationship, a thorough examination of the twin paradox, and investigations into rotating coordinate systems. His name is frequently attached to a hypothetical construct called a "light-clock" (originally developed by Lewis and Tolman in 1909), which he used to perform a novel derivation of the Lorentz transformation. A light-clock is imagined to be a box of perfectly reflecting walls wherein a light signal reflects back and forth from opposite faces. The concept of time dilation is frequently taught using a light-clock that is traveling in uniform inertial motion perpendicular to a line connecting the two mirrors. (Langevin himself made use of a light-clock oriented parallel to its line of motion.) Consider the scenario illustrated in Observer A holds a light-clock of length L as well as an electronic timer with which she measures how long it takes a pulse to make a round trip up and down along the light-clock. Although observer A is traveling rapidly along a train, from her point of view the emission and receipt of the pulse occur at the same place, and she measures the interval using a single clock located at the precise position of these two events. For the interval between these two events, observer A finds . A time interval measured using a single clock that is motionless in a particular reference frame is called a '' proper time interval''. Fig. 4-3B illustrates these same two events from the standpoint of observer B, who is parked by the tracks as the train goes by at a speed of . Instead of making straight up-and-down motions, observer B sees the pulses moving along a zig-zag line. However, because of the postulate of the constancy of the speed of light, the speed of the pulses along these diagonal lines is the same c that observer A saw for her up-and-down pulses. B measures the speed of the vertical component of these pulses as \pm \sqrt, so that the total round-trip time of the pulses is t_\text = 2L \big/ \sqrt = . Note that for observer B, the emission and receipt of the light pulse occurred at different places, and he measured the interval using two stationary and synchronized clocks located at two different positions in his reference frame. The interval that B measured was therefore ''not'' a proper time interval because he did not measure it with a single resting clock.


Reciprocal time dilation

In the above description of the Langevin light-clock, the labeling of one observer as stationary and the other as in motion was completely arbitrary. One could just as well have observer B carrying the light-clock and moving at a speed of v to the left, in which case observer A would perceive B's clock as running slower than her local clock. There is no paradox here, because there is no independent observer C who will agree with both A and B. Observer C necessarily makes his measurements from his own reference frame. If that reference frame coincides with A's reference frame, then C will agree with A's measurement of time. If C's reference frame coincides with B's reference frame, then C will agree with B's measurement of time. If C's reference frame coincides with neither A's frame nor B's frame, then C's measurement of time will disagree with ''both'' A's and B's measurement of time.


Twin paradox

The reciprocity of time dilation between two observers in separate inertial frames leads to the so-called twin paradox, articulated in its present form by Langevin in 1911. Langevin imagined an adventurer wishing to explore the future of the Earth. This traveler boards a projectile capable of traveling at 99.995% of the speed of light. After making a round-trip journey to and from a nearby star lasting only two years of his own life, he returns to an Earth that is two hundred years older. This result appears puzzling because both the traveler and an Earthbound observer would see the other as moving, and so, because of the reciprocity of time dilation, one might initially expect that each should have found the other to have aged less. In reality, there is no paradox at all, because in order for the two observers to perform side-by-side comparisons of their elapsed proper times, the symmetry of the situation must be broken: At least one of the two observers must change their state of motion to match that of the other. Knowing the general resolution of the paradox, however, does not immediately yield the ability to calculate correct quantitative results. Many solutions to this puzzle have been provided in the literature and have been reviewed in the Twin paradox article. We will examine in the following one such solution to the paradox. Our basic aim will be to demonstrate that, after the trip, both twins are in perfect agreement about who aged by how much, regardless of their different experiences. illustrates a scenario where the traveling twin flies at to and from a star distant. During the trip, each twin sends yearly time signals (measured in their own proper times) to the other. After the trip, the cumulative counts are compared. On the outward phase of the trip, each twin receives the other's signals at the lowered rate of . Initially, the situation is perfectly symmetric: note that each twin receives the other's one-year signal at two years measured on their own clock. The symmetry is broken when the traveling twin turns around at the four-year mark as measured by her clock. During the remaining four years of her trip, she receives signals at the enhanced rate of . The situation is quite different with the stationary twin. Because of light-speed delay, he does not see his sister turn around until eight years have passed on his own clock. Thus, he receives enhanced-rate signals from his sister for only a relatively brief period. Although the twins disagree in their respective measures of total time, we see in the following table, as well as by simple observation of the Minkowski diagram, that each twin is in total agreement with the other as to the total number of signals sent from one to the other. There is hence no paradox.


Length contraction

The dimensions (e.g., length) of an object as measured by one observer may be smaller than the results of measurements of the same object made by another observer (e.g., the ladder paradox involves a long ladder traveling near the speed of light and being contained within a smaller garage). Similarly, suppose a measuring rod is at rest and aligned along the ''x''-axis in the unprimed system ''S''. In this system, the length of this rod is written as Δ''x''. To measure the length of this rod in the system , in which the rod is moving, the distances ' to the end points of the rod must be measured simultaneously in that system . In other words, the measurement is characterized by , which can be combined with to find the relation between the lengths Δ''x'' and Δ: : \Delta x' = \frac for events satisfying\Delta t' = 0 \ . This shows that the length (Δ) of the rod as measured in the frame in which it is moving (), is ''shorter'' than its length (Δ''x'') in its own rest frame (''S''). Time dilation and length contraction are not merely appearances. Time dilation is explicitly related to our way of measuring ''time intervals'' between events that occur at the same place in a given coordinate system (called "co-local" events). These time intervals (which can be, and are, actually measured experimentally by relevant observers) are ''different'' in another coordinate system moving with respect to the first, unless the events, in addition to being co-local, are also simultaneous. Similarly, length contraction relates to our measured distances between separated but simultaneous events in a given coordinate system of choice. If these events are not co-local, but are separated by distance (space), they will ''not'' occur at the same ''spatial distance'' from each other when seen from another moving coordinate system.


Lorentz transformation of velocities

Consider two frames ''S'' and in standard configuration. A particle in ''S'' moves in the x direction with velocity vector . What is its velocity \mathbf in frame ? We can write Substituting expressions for dx' and dt' from into , followed by straightforward mathematical manipulations and back-substitution from yields the Lorentz transformation of the speed u to : The inverse relation is obtained by interchanging the primed and unprimed symbols and replacing v with . For \mathbf not aligned along the x-axis, we write: The forward and inverse transformations for this case are: and can be interpreted as giving the ''resultant'' \mathbf of the two velocities \mathbf and , and they replace the formula . which is valid in Galilean relativity. Interpreted in such a fashion, they are commonly referred to as the ''relativistic velocity addition (or composition) formulas'', valid for the three axes of ''S'' and being aligned with each other (although not necessarily in standard configuration). We note the following points: * If an object (e.g., a
photon A photon () is an elementary particle that is a quantum of the electromagnetic field, including electromagnetic radiation such as light and radio waves, and the force carrier for the electromagnetic force. Photons are massless particles that can ...
) were moving at the speed of light in one frame , then it would also be moving at the speed of light in any other frame, moving at . * The resultant speed of two velocities with magnitude less than ''c'' is always a velocity with magnitude less than ''c''. * If both and (and then also and ) are small with respect to the speed of light (that is, e.g., , then the intuitive Galilean transformations are recovered from the transformation equations for special relativity * Attaching a frame to a photon (''riding a light beam'' like Einstein considers) requires special treatment of the transformations. There is nothing special about the ''x'' direction in the standard configuration. The above formalism applies to any direction; and three orthogonal directions allow dealing with all directions in space by decomposing the velocity vectors to their components in these directions. See '' Velocity-addition formula'' for details.


Thomas rotation

The composition of two non-collinear Lorentz boosts (i.e., two non-collinear Lorentz transformations, neither of which involve rotation) results in a Lorentz transformation that is not a pure boost but is the composition of a boost and a rotation. Thomas rotation results from the relativity of simultaneity. In Fig. 4-5a, a rod of length L in its rest frame (i.e., having a proper length of ) rises vertically along the y-axis in the ground frame. In Fig. 4-5b, the same rod is observed from the frame of a rocket moving at speed v to the right. If we imagine two clocks situated at the left and right ends of the rod that are synchronized ''in the frame of the rod'', relativity of simultaneity causes the observer in the rocket frame to observe (not ''see'') the clock at the right end of the rod as being advanced in time by , and the rod is correspondingly observed as tilted. Unlike second-order relativistic effects such as length contraction or time dilation, this effect becomes quite significant even at fairly low velocities. For example, this can be seen in the spin of moving particles, where Thomas precession is a relativistic correction that applies to the spin of an elementary particle or the rotation of a macroscopic
gyroscope A gyroscope (from Ancient Greek γῦρος ''gŷros'', "round" and σκοπέω ''skopéō'', "to look") is a device used for measuring or maintaining Orientation (geometry), orientation and angular velocity. It is a spinning wheel or disc in ...
, relating the
angular velocity In physics, angular velocity (symbol or \vec, the lowercase Greek letter omega), also known as the angular frequency vector,(UP1) is a pseudovector representation of how the angular position or orientation of an object changes with time, i ...
of the spin of a particle following a curvilinear orbit to the angular velocity of the orbital motion. Thomas rotation provides the resolution to the well-known "meter stick and hole paradox".


Causality and prohibition of motion faster than light

In Fig. 4-6, the time interval between the events A (the "cause") and B (the "effect") is 'timelike'; that is, there is a frame of reference in which events A and B occur at the ''same location in space'', separated only by occurring at different times. If A precedes B in that frame, then A precedes B in all frames accessible by a Lorentz transformation. It is possible for matter (or information) to travel (below light speed) from the location of A, starting at the time of A, to the location of B, arriving at the time of B, so there can be a causal relationship (with A the cause and B the effect). The interval AC in the diagram is 'spacelike'; that is, there is a frame of reference in which events A and C occur simultaneously, separated only in space. There are also frames in which A precedes C (as shown) and frames in which C precedes A. But no frames are accessible by a Lorentz transformation, in which events A and C occur at the same location. If it were possible for a cause-and-effect relationship to exist between events A and C, paradoxes of causality would result. For example, if signals could be sent faster than light, then signals could be sent into the sender's past (observer B in the diagrams). A variety of causal paradoxes could then be constructed. Consider the spacetime diagrams in Fig. 4-7. A and B stand alongside a railroad track, when a high-speed train passes by, with C riding in the last car of the train and D riding in the leading car. The world lines of A and B are vertical (''ct''), distinguishing the stationary position of these observers on the ground, while the world lines of C and D are tilted forwards ('), reflecting the rapid motion of the observers C and D stationary in their train, as observed from the ground. # Fig. 4-7a. The event of "B passing a message to D", as the leading car passes by, is at the origin of D's frame. D sends the message along the train to C in the rear car, using a fictitious "instantaneous communicator". The worldline of this message is the fat red arrow along the -x' axis, which is a line of simultaneity in the primed frames of C and D. In the (unprimed) ground frame the signal arrives ''earlier'' than it was sent. # Fig. 4-7b. The event of "C passing the message to A", who is standing by the railroad tracks, is at the origin of their frames. Now A sends the message along the tracks to B via an "instantaneous communicator". The worldline of this message is the blue fat arrow, along the +x axis, which is a line of simultaneity for the frames of A and B. As seen from the spacetime diagram, in the primed frames of C and D, B will receive the message before it was sent out, a violation of causality. It is not necessary for signals to be instantaneous to violate causality. Even if the signal from D to C were slightly shallower than the x' axis (and the signal from A to B slightly steeper than the x axis), it would still be possible for B to receive his message before he had sent it. By increasing the speed of the train to near light speeds, the ct' and x' axes can be squeezed very close to the dashed line representing the speed of light. With this modified setup, it can be demonstrated that even signals only ''slightly'' faster than the speed of light will result in causality violation. Therefore, if causality is to be preserved, one of the consequences of special relativity is that no information signal or material object can travel faster than light in vacuum. This is not to say that ''all'' faster than light speeds are impossible. Various trivial situations can be described where some "things" (not actual matter or energy) move faster than light. For example, the location where the beam of a search light hits the bottom of a cloud can move faster than light when the search light is turned rapidly (although this does not violate causality or any other relativistic phenomenon).


Optical effects


Dragging effects

In 1850,
Hippolyte Fizeau Armand Hippolyte Louis Fizeau (; 23 September 1819 – 18 September 1896) was a French physicist who, in 1849, measured the speed of light to within 5% accuracy. In 1851, he measured the speed of light in moving water in an experiment known as t ...
and Léon Foucault independently established that light travels more slowly in water than in air, thus validating a prediction of Fresnel's wave theory of light and invalidating the corresponding prediction of Newton's corpuscular theory. The speed of light was measured in still water. What would be the speed of light in flowing water? In 1851, Fizeau conducted an experiment to answer this question, a simplified representation of which is illustrated in Fig. 5-1. A beam of light is divided by a beam splitter, and the split beams are passed in opposite directions through a tube of flowing water. They are recombined to form interference fringes, indicating a difference in optical path length, that an observer can view. The experiment demonstrated that dragging of the light by the flowing water caused a displacement of the fringes, showing that the motion of the water had affected the speed of the light. According to the theories prevailing at the time, light traveling through a moving medium would be a simple sum of its speed ''through'' the medium plus the speed ''of'' the medium. Contrary to expectation, Fizeau found that although light appeared to be dragged by the water, the magnitude of the dragging was much lower than expected. If u' = c/n is the speed of light in still water, and v is the speed of the water, and u_ is the water-borne speed of light in the lab frame with the flow of water adding to or subtracting from the speed of light, then u_ =\frac \pm v\left(1-\frac\right) \ . Fizeau's results, although consistent with Fresnel's earlier hypothesis of partial aether dragging, were extremely disconcerting to physicists of the time. Among other things, the presence of an index of refraction term meant that, since n depends on wavelength, ''the aether must be capable of sustaining different motions at the same time''. A variety of theoretical explanations were proposed to explain Fresnel's dragging coefficient, that were completely at odds with each other. Even before the Michelson–Morley experiment, Fizeau's experimental results were among a number of observations that created a critical situation in explaining the optics of moving bodies. From the point of view of special relativity, Fizeau's result is nothing but an approximation to , the relativistic formula for composition of velocities. : u_ = \frac = \frac \approx c \left( \frac \pm \frac \right) \left( 1 \mp \frac \right) \approx \frac \pm v \left( 1 - \frac \right)


Relativistic aberration of light

Because of the finite speed of light, if the relative motions of a source and receiver include a transverse component, then the direction from which light arrives at the receiver will be displaced from the geometric position in space of the source relative to the receiver. The classical calculation of the displacement takes two forms and makes different predictions depending on whether the receiver, the source, or both are in motion with respect to the medium. (1) If the receiver is in motion, the displacement would be the consequence of the aberration of light. The incident angle of the beam relative to the receiver would be calculable from the vector sum of the receiver's motions and the velocity of the incident light. (2) If the source is in motion, the displacement would be the consequence of light-time correction. The displacement of the apparent position of the source from its geometric position would be the result of the source's motion during the time that its light takes to reach the receiver. The classical explanation failed experimental test. Since the aberration angle depends on the relationship between the velocity of the receiver and the speed of the incident light, passage of the incident light through a refractive medium should change the aberration angle. In 1810, Arago used this expected phenomenon in a failed attempt to measure the speed of light, and in 1870, George Airy tested the hypothesis using a water-filled telescope, finding that, against expectation, the measured aberration was identical to the aberration measured with an air-filled telescope. A "cumbrous" attempt to explain these results used the hypothesis of partial aether-drag, but was incompatible with the results of the Michelson–Morley experiment, which apparently demanded ''complete'' aether-drag. Assuming inertial frames, the relativistic expression for the aberration of light is applicable to both the receiver moving and source moving cases. A variety of trigonometrically equivalent formulas have been published. Expressed in terms of the variables in Fig. 5-2, these include : \cos \theta ' = \frac   OR   \sin \theta ' = \frac   OR   \tan \frac = \left( \frac \right)^ \tan \frac


Relativistic Doppler effect


Relativistic longitudinal Doppler effect

The classical Doppler effect depends on whether the source, receiver, or both are in motion with respect to the medium. The relativistic Doppler effect is independent of any medium. Nevertheless, relativistic Doppler shift for the longitudinal case, with source and receiver moving directly towards or away from each other, can be derived as if it were the classical phenomenon, but modified by the addition of a
time dilation Time dilation is the difference in elapsed time as measured by two clocks, either because of a relative velocity between them (special relativity), or a difference in gravitational potential between their locations (general relativity). When unsp ...
term, and that is the treatment described here. Assume the receiver and the source are moving ''away'' from each other with a relative speed v as measured by an observer on the receiver or the source (The sign convention adopted here is that v is ''negative'' if the receiver and the source are moving ''towards'' each other). Assume that the source is stationary in the medium. Then f_ = \left(1 - \frac v \right) f_s where c_s is the speed of sound. For light, and with the receiver moving at relativistic speeds, clocks on the receiver are time dilated relative to clocks at the source. The receiver will measure the received frequency to be f_r = \gamma\left(1 - \beta\right) f_s = \sqrt\,f_s. where * \beta = v/c   and * \gamma = \frac is the
Lorentz factor The Lorentz factor or Lorentz term (also known as the gamma factor) is a dimensionless quantity expressing how much the measurements of time, length, and other physical properties change for an object while it moves. The expression appears in sev ...
. An identical expression for relativistic Doppler shift is obtained when performing the analysis in the reference frame of the ''receiver'' with a moving source.


Transverse Doppler effect

The transverse Doppler effect is one of the main novel predictions of the special theory of relativity. Classically, one might expect that if source and receiver are moving transversely with respect to each other with no longitudinal component to their relative motions, that there should be no Doppler shift in the light arriving at the receiver. Special relativity predicts otherwise. Fig. 5-3 illustrates two common variants of this scenario. Both variants can be analyzed using simple time dilation arguments. In Fig. 5-3a, the receiver observes light from the source as being blueshifted by a factor of . In Fig. 5-3b, the light is redshifted by the same factor.


Measurement versus visual appearance

Time dilation and length contraction are not optical illusions, but genuine effects. Measurements of these effects are not an artifact of
Doppler shift The Doppler effect (also Doppler shift) is the change in the frequency of a wave in relation to an observer who is moving relative to the source of the wave. The ''Doppler effect'' is named after the physicist Christian Doppler, who described t ...
, nor are they the result of neglecting to take into account the time it takes light to travel from an event to an observer. Scientists make a fundamental distinction between ''measurement'' or ''observation'' on the one hand, versus ''visual appearance'', or what one ''sees''. The measured shape of an object is a hypothetical snapshot of all of the object's points as they exist at a single moment in time. But the visual appearance of an object is affected by the varying lengths of time that light takes to travel from different points on the object to one's eye. For many years, the distinction between the two had not been generally appreciated, and it had generally been thought that a length contracted object passing by an observer would in fact actually be ''seen'' as length contracted. In 1959, James Terrell and
Roger Penrose Sir Roger Penrose (born 8 August 1931) is an English mathematician, mathematical physicist, Philosophy of science, philosopher of science and Nobel Prize in Physics, Nobel Laureate in Physics. He is Emeritus Rouse Ball Professor of Mathematics i ...
independently pointed out that differential time lag effects in signals reaching the observer from the different parts of a moving object result in a fast moving object's visual appearance being quite different from its measured shape. For example, a receding object would ''appear'' contracted, an approaching object would ''appear'' elongated, and a passing object would have a skew appearance that has been likened to a rotation. A sphere in motion retains the circular outline for all speeds, for any distance, and for all view angles, although the surface of the sphere and the images on it will appear distorted. Both Fig. 5-4 and Fig. 5-5 illustrate objects moving transversely to the line of sight. In Fig. 5-4, a cube is viewed from a distance of four times the length of its sides. At high speeds, the sides of the cube that are perpendicular to the direction of motion appear hyperbolic in shape. The cube is actually not rotated. Rather, light from the rear of the cube takes longer to reach one's eyes compared with light from the front, during which time the cube has moved to the right. At high speeds, the sphere in Fig. 5-5 takes on the appearance of a flattened disk tilted up to 45° from the line of sight. If the objects' motions are not strictly transverse but instead include a longitudinal component, exaggerated distortions in perspective may be seen. This illusion has come to be known as '' Terrell rotation'' or the ''Terrell–Penrose effect''.Even though it has been many decades since Terrell and Penrose published their observations, popular writings continue to conflate measurement versus appearance. For example, Michio Kaku wrote in ''Einstein's Cosmos'' (W. W. Norton & Company, 2004. p. 65): "... imagine that the speed of light is only 20 miles per hour. If a car were to go down the street, it might look compressed in the direction of motion, being squeezed like an accordion down to perhaps 1 inch in length." Another example where visual appearance is at odds with measurement comes from the observation of apparent superluminal motion in various
radio galaxies A radio galaxy is a galaxy with giant regions of radio emission extending well beyond its visible structure. These energetic radio lobes are powered by jets from its active galactic nucleus. They have luminosities up to 1039 watt, W at radio ...
, BL Lac objects,
quasars A quasar ( ) is an extremely Luminosity, luminous active galactic nucleus (AGN). It is sometimes known as a quasi-stellar object, abbreviated QSO. The emission from an AGN is powered by accretion onto a supermassive black hole with a mass rangi ...
, and other astronomical objects that eject relativistic-speed jets of matter at narrow angles with respect to the viewer. An apparent optical illusion results giving the appearance of faster than light travel. In Fig. 5-6, galaxy M87 streams out a high-speed jet of subatomic particles almost directly towards us, but Penrose–Terrell rotation causes the jet to appear to be moving laterally in the same manner that the appearance of the cube in Fig. 5-4 has been stretched out.


Dynamics

Section ' dealt strictly with
kinematics In physics, kinematics studies the geometrical aspects of motion of physical objects independent of forces that set them in motion. Constrained motion such as linked machine parts are also described as kinematics. Kinematics is concerned with s ...
, the study of the motion of points, bodies, and systems of bodies without considering the forces that caused the motion. This section discusses masses, forces, energy and so forth, and as such requires consideration of physical effects beyond those encompassed by the Lorentz transformation itself.


Equivalence of mass and energy

Mass–energy equivalence is a consequence of special relativity. The energy and momentum, which are separate in Newtonian mechanics, form a four-vector in relativity, and this relates the time component (the energy) to the space components (the momentum) in a non-trivial way. For an object at rest, the energy–momentum four-vector is : it has a time component, which is the energy, and three space components, which are zero. By changing frames with a Lorentz transformation in the x direction with a small value of the velocity v, the energy momentum four-vector becomes . The momentum is equal to the energy multiplied by the velocity divided by ''c''2. As such, the Newtonian mass of an object, which is the ratio of the momentum to the velocity for slow velocities, is equal to ''E''/''c''2. The energy and momentum are properties of matter and radiation, and it is impossible to deduce that they form a four-vector just from the two basic postulates of special relativity by themselves, because these do not talk about matter or radiation, they only talk about space and time. The derivation therefore requires some additional physical reasoning. In his 1905 paper, Einstein used the additional principles that Newtonian mechanics should hold for slow velocities, so that there is one energy scalar and one three-vector momentum at slow velocities, and that the conservation law for energy and momentum is exactly true in relativity. Furthermore, he assumed that the energy of light is transformed by the same Doppler-shift factor as its frequency, which he had previously shown to be true based on Maxwell's equations. The first of Einstein's papers on this subject was "Does the Inertia of a Body Depend upon its Energy Content?" in 1905.Does the inertia of a body depend upon its energy content?
A. Einstein, ''Annalen der Physik''. 18:639, 1905 (English translation by W. Perrett and G.B. Jeffery)
Although Einstein's argument in this paper is nearly universally accepted by physicists as correct, even self-evident, many authors over the years have suggested that it is wrong. Other authors suggest that the argument was merely inconclusive because it relied on some implicit assumptions. Einstein acknowledged the controversy over his derivation in his 1907 survey paper on special relativity. There he notes that it is problematic to rely on Maxwell's equations for the heuristic mass–energy argument. The argument in his 1905 paper can be carried out with the emission of any massless particles, but the Maxwell equations are implicitly used to make it obvious that the emission of light in particular can be achieved only by doing work. To emit electromagnetic waves, all you have to do is shake a charged particle, and this is clearly doing work, so that the emission is of energy.
A. Einstein, Annalen der Physik 23 (1907): 371–384
In a letter to Carl Seelig in 1955, Einstein wrote "I had already previously found that Maxwell's theory did not account for the micro-structure of radiation and could therefore have no general validity.", Einstein letter to Carl Seelig, 1955.


Einstein's 1905 demonstration of ''E'' = ''mc''2

In his fourth of his 1905 Annus mirabilis papers, Einstein presented a heuristic argument for the equivalence of mass and energy. Although, as discussed above, subsequent scholarship has established that his arguments fell short of a broadly definitive proof, the conclusions that he reached in this paper have stood the test of time. Einstein took as starting assumptions his recently discovered formula for relativistic Doppler shift, the laws of
conservation of energy The law of conservation of energy states that the total energy of an isolated system remains constant; it is said to be Conservation law, ''conserved'' over time. In the case of a Closed system#In thermodynamics, closed system, the principle s ...
and
conservation of momentum In Newtonian mechanics, momentum (: momenta or momentums; more specifically linear momentum or translational momentum) is the product of the mass and velocity of an object. It is a vector quantity, possessing a magnitude and a direction. ...
, and the relationship between the frequency of light and its energy as implied by
Maxwell's equations Maxwell's equations, or Maxwell–Heaviside equations, are a set of coupled partial differential equations that, together with the Lorentz force law, form the foundation of classical electromagnetism, classical optics, Electrical network, electr ...
. Fig. 6-1 (top). Consider a system of plane waves of light having frequency f traveling in direction \phi relative to the x-axis of reference frame ''S''. The frequency (and hence energy) of the waves as measured in frame that is moving along the x-axis at velocity v is given by the relativistic Doppler shift formula that Einstein had developed in his 1905 paper on special relativity: : \frac = \frac Fig. 6-1 (bottom). Consider an arbitrary body that is stationary in reference frame ''S''. Let this body emit a pair of equal-energy light-pulses in opposite directions at angle \phi with respect to the x-axis. Each pulse has energy . Because of conservation of momentum, the body remains stationary in ''S'' after emission of the two pulses. Let E_0 be the energy of the body before emission of the two pulses and E_1 after their emission. Next, consider the same system observed from frame that is moving along the x-axis at speed v relative to frame ''S''. In this frame, light from the forwards and reverse pulses will be relativistically Doppler-shifted. Let H_0 be the energy of the body measured in reference frame before emission of the two pulses and H_1 after their emission. We obtain the following relationships: : \begin E_0 &= E_1 + \tfracL + \tfracL = E_1 + L \\ muH_0 &= H_1 + \tfrac12 L \frac + \tfrac12 L \frac = H_1 + \frac \end From the above equations, we obtain the following: The two differences of form H - E seen in the above equation have a straightforward physical interpretation. Since H and E are the energies of the arbitrary body in the moving and stationary frames, H_0 - E_0 and H_1 - E_1 represents the kinetic energies of the bodies before and after the emission of light (except for an additive constant that fixes the zero point of energy and is conventionally set to zero). Hence, Taking a Taylor series expansion and neglecting higher order terms, he obtained Comparing the above expression with the classical expression for kinetic energy, ''K.E.'' = ''mv''2, Einstein then noted: "If a body gives off the energy ''L'' in the form of radiation, its mass diminishes by ''L''/''c''2." Rindler has observed that Einstein's heuristic argument suggested merely that energy ''contributes'' to mass. In 1905, Einstein's cautious expression of the mass–energy relationship allowed for the possibility that "dormant" mass might exist that would remain behind after all the energy of a body was removed. By 1907, however, Einstein was ready to assert that ''all'' inertial mass represented a reserve of energy. "To equate ''all'' mass with energy required an act of aesthetic faith, very characteristic of Einstein." Einstein's bold hypothesis has been amply confirmed in the years subsequent to his original proposal. For a variety of reasons, Einstein's original derivation is currently seldom taught. Besides the vigorous debate that continues until this day as to the formal correctness of his original derivation, the recognition of special relativity as being what Einstein called a "principle theory" has led to a shift away from reliance on electromagnetic phenomena to purely dynamic methods of proof.


How far can you travel from the Earth?

Since nothing can travel faster than light, one might conclude that a human can never travel farther from Earth than ~ 100 light years. You would easily think that a traveler would never be able to reach more than the few solar systems that exist within the limit of 100 light years from Earth. However, because of time dilation, a hypothetical spaceship can travel thousands of light years during a passenger's lifetime. If a spaceship could be built that accelerates at a constant 1''g'', it will, after one year, be travelling at almost the speed of light as seen from Earth. This is described by: v(t) = \frac , where ''v''(''t'') is the velocity at a time ''t'', ''a'' is the acceleration of the spaceship and ''t'' is the coordinate time as measured by people on Earth. Therefore, after one year of accelerating at 9.81 m/s2, the spaceship will be travelling at and after three years, relative to Earth. After three years of this acceleration, with the spaceship achieving a velocity of 94.6% of the speed of light relative to Earth, time dilation will result in each second experienced on the spaceship corresponding to 3.1 seconds back on Earth. During their journey, people on Earth will experience more time than they do – since their clocks (all physical phenomena) would really be ticking 3.1 times faster than those of the spaceship. A 5-year round trip for the traveller will take 6.5 Earth years and cover a distance of over 6 light-years. A 20-year round trip for them (5 years accelerating, 5 decelerating, twice each) will land them back on Earth having travelled for 335 Earth years and a distance of 331 light years. A full 40-year trip at 1''g'' will appear on Earth to last 58,000 years and cover a distance of 55,000 light years. A 40-year trip at will take years and cover about light years. A one-way 28 year (14 years accelerating, 14 decelerating as measured with the astronaut's clock) trip at 1''g'' acceleration could reach 2,000,000 light-years to the Andromeda Galaxy. This same time dilation is why a muon travelling close to ''c'' is observed to travel much farther than ''c'' times its
half-life Half-life is a mathematical and scientific description of exponential or gradual decay. Half-life, half life or halflife may also refer to: Film * Half-Life (film), ''Half-Life'' (film), a 2008 independent film by Jennifer Phang * ''Half Life: ...
(when at rest).


Elastic collisions

Examination of the collision products generated by particle accelerators around the world provides scientists evidence of the structure of the subatomic world and the natural laws governing it. Analysis of the collision products, the sum of whose masses may vastly exceed the masses of the incident particles, requires special relativity. In Newtonian mechanics, analysis of collisions involves use of the conservation laws for mass,
momentum In Newtonian mechanics, momentum (: momenta or momentums; more specifically linear momentum or translational momentum) is the product of the mass and velocity of an object. It is a vector quantity, possessing a magnitude and a direction. ...
and
energy Energy () is the physical quantity, quantitative physical property, property that is transferred to a physical body, body or to a physical system, recognizable in the performance of Work (thermodynamics), work and in the form of heat and l ...
. In relativistic mechanics, mass is not independently conserved, because it has been subsumed into the total relativistic energy. We illustrate the differences that arise between the Newtonian and relativistic treatments of particle collisions by examining the simple case of two perfectly elastic colliding particles of equal mass. (''Inelastic'' collisions are discussed in Spacetime#Conservation laws. Radioactive decay may be considered a sort of time-reversed inelastic collision.) Elastic scattering of charged elementary particles deviates from ideality due to the production of Bremsstrahlung radiation.


Newtonian analysis

Fig. 6-2 provides a demonstration of the result, familiar to billiard players, that if a stationary ball is struck elastically by another one of the same mass (assuming no sidespin, or "English"), then after collision, the diverging paths of the two balls will subtend a right angle. (a) In the stationary frame, an incident sphere traveling at 2v strikes a stationary sphere. (b) In the center of momentum frame, the two spheres approach each other symmetrically at ±v. After elastic collision, the two spheres rebound from each other with equal and opposite velocities ±u. Energy conservation requires that = . (c) Reverting to the stationary frame, the rebound velocities are . The dot product , indicating that the vectors are orthogonal.


Relativistic analysis

Consider the elastic collision scenario in Fig. 6-3 between a moving particle colliding with an equal mass stationary particle. Unlike the Newtonian case, the angle between the two particles after collision is less than 90°, is dependent on the angle of scattering, and becomes smaller and smaller as the velocity of the incident particle approaches the speed of light: The relativistic momentum and total relativistic energy of a particle are given by Conservation of momentum dictates that the sum of the momenta of the incoming particle and the stationary particle (which initially has momentum = 0) equals the sum of the momenta of the emergent particles: Likewise, the sum of the total relativistic energies of the incoming particle and the stationary particle (which initially has total energy mc2) equals the sum of the total energies of the emergent particles: Breaking down () into its components, replacing v with the dimensionless , and factoring out common terms from () and () yields the following: From these we obtain the following relationships: For the symmetrical case in which \phi = \theta and , () takes on the simpler form:


Beyond the basics


Rapidity

Lorentz transformations relate coordinates of events in one reference frame to those of another frame. Relativistic composition of velocities is used to add two velocities together. The formulas to perform the latter computations are nonlinear, making them more complex than the corresponding Galilean formulas. This nonlinearity is an artifact of our choice of parameters. We have previously noted that in an spacetime diagram, the points at some constant spacetime interval from the origin form an invariant hyperbola. We have also noted that the coordinate systems of two spacetime reference frames in standard configuration are hyperbolically rotated with respect to each other. The natural functions for expressing these relationships are the hyperbolic analogs of the trigonometric functions. Fig. 7-1a shows a
unit circle In mathematics, a unit circle is a circle of unit radius—that is, a radius of 1. Frequently, especially in trigonometry, the unit circle is the circle of radius 1 centered at the origin (0, 0) in the Cartesian coordinate system in the Eucli ...
with sin(''a'') and cos(''a''), the only difference between this diagram and the familiar unit circle of elementary trigonometry being that ''a'' is interpreted, not as the angle between the ray and the , but as twice the area of the sector swept out by the ray from the . Numerically, the angle and measures for the unit circle are identical. Fig. 7-1b shows a unit hyperbola with sinh(''a'') and cosh(''a''), where ''a'' is likewise interpreted as twice the tinted area. Fig. 7-2 presents plots of the sinh, cosh, and tanh functions. For the unit circle, the slope of the ray is given by : \text = \tan a = \frac. In the Cartesian plane, rotation of point into point by angle ''θ'' is given by : \begin x' \\ y' \\ \end = \begin \cos \theta & -\sin \theta \\ \sin \theta & \cos \theta \\ \end\begin x \\ y \\ \end. In a spacetime diagram, the velocity parameter \beta is the analog of slope. The ''rapidity'', ''φ'', is defined by : \beta \equiv \tanh \phi \equiv \frac, where : \tanh \phi = \frac = \frac. The rapidity defined above is very useful in special relativity because many expressions take on a considerably simpler form when expressed in terms of it. For example, rapidity is simply additive in the collinear velocity-addition formula; : \beta = \frac = \frac = \tanh(\phi_1 + \phi_2), or in other words, . The Lorentz transformations take a simple form when expressed in terms of rapidity. The ''γ'' factor can be written as : \gamma = \frac = \frac = \cosh \phi, : \gamma \beta = \frac = \frac = \sinh \phi. Transformations describing relative motion with uniform velocity and without rotation of the space coordinate axes are called ''boosts''. Substituting ''γ'' and ''γβ'' into the transformations as previously presented and rewriting in matrix form, the Lorentz boost in the may be written as : \begin c t' \\ x' \end = \begin \cosh \phi & -\sinh \phi \\ -\sinh \phi & \cosh \phi \end \begin ct \\ x \end, and the inverse Lorentz boost in the may be written as : \begin c t \\ x \end = \begin \cosh \phi & \sinh \phi \\ \sinh \phi & \cosh \phi \end \begin c t' \\ x' \end. In other words, Lorentz boosts represent hyperbolic rotations in Minkowski spacetime. The advantages of using hyperbolic functions are such that some textbooks such as the classic ones by Taylor and Wheeler introduce their use at a very early stage.Rapidity arises naturally as a coordinates on the pure boost generators inside the
Lie algebra In mathematics, a Lie algebra (pronounced ) is a vector space \mathfrak g together with an operation called the Lie bracket, an alternating bilinear map \mathfrak g \times \mathfrak g \rightarrow \mathfrak g, that satisfies the Jacobi ident ...
algebra of the Lorentz group. Likewise, rotation angles arise naturally as coordinates (modulo ) on the pure rotation generators in the Lie algebra. (Together they coordinatize the whole Lie algebra.) A notable difference is that the resulting rotations are periodic in the rotation angle, while the resulting boosts are not periodic in rapidity (but rather one-to-one). The similarity between boosts and rotations is formal resemblance.


4‑vectors

Four‑vectors have been mentioned above in context of the energy–momentum , but without any great emphasis. Indeed, none of the elementary derivations of special relativity require them. But once understood, , and more generally tensors, greatly simplify the mathematics and conceptual understanding of special relativity. Working exclusively with such objects leads to formulas that are ''manifestly'' relativistically invariant, which is a considerable advantage in non-trivial contexts. For instance, demonstrating relativistic invariance of
Maxwell's equations Maxwell's equations, or Maxwell–Heaviside equations, are a set of coupled partial differential equations that, together with the Lorentz force law, form the foundation of classical electromagnetism, classical optics, Electrical network, electr ...
in their usual form is not trivial, while it is merely a routine calculation, really no more than an observation, using the field strength tensor formulation. On the other hand, general relativity, from the outset, relies heavily on , and more generally tensors, representing physically relevant entities. Relating these via equations that do not rely on specific coordinates requires tensors, capable of connecting such even within a ''curved'' spacetime, and not just within a ''flat'' one as in special relativity. The study of tensors is outside the scope of this article, which provides only a basic discussion of spacetime.


Definition of 4-vectors

A 4-tuple, is a "4-vector" if its component ''Ai'' transform between frames according to the Lorentz transformation. If using coordinates, ''A'' is a if it transforms (in the ) according to : \begin A_0' &= \gamma \left( A_0 - (v/c) A_1 \right) \\ A_1' &= \gamma \left( A_1 - (v/c) A_0 \right)\\ A_2' &= A_2 \\ A_3' &= A_3 \end , which comes from simply replacing ''ct'' with ''A''0 and ''x'' with ''A''1 in the earlier presentation of the Lorentz transformation. As usual, when we write ''x'', ''t'', etc. we generally mean Δ''x'', Δ''t'' etc. The last three components of a must be a standard vector in three-dimensional space. Therefore, a must transform like under Lorentz transformations as well as rotations.


Properties of 4-vectors

* Closure under linear combination: If ''A'' and ''B'' are , then is also a . * Inner-product invariance: If ''A'' and ''B'' are , then their inner product (scalar product) is invariant, i.e. their inner product is independent of the frame in which it is calculated. Note how the calculation of inner product differs from the calculation of the inner product of a . In the following, \vec and \vec are : *: A \cdot B \equiv A_0 B_0 - A_1 B_1 - A_2 B_2 - A_3 B_3 \equiv A_0 B_0 - \vec \cdot \vec : In addition to being invariant under Lorentz transformation, the above inner product is also invariant under rotation in . : Two vectors are said to be ''orthogonal'' if . Unlike the case with , orthogonal are not necessarily at right angles to each other. The rule is that two are orthogonal if they are offset by equal and opposite angles from the 45° line, which is the world line of a light ray. This implies that a lightlike is orthogonal to ''itself''. * Invariance of the magnitude of a vector: The magnitude of a vector is the inner product of a with itself, and is a frame-independent property. As with intervals, the magnitude may be positive, negative or zero, so that the vectors are referred to as timelike, spacelike or null (lightlike). Note that a null vector is not the same as a zero vector. A null vector is one for which , while a zero vector is one whose components are all zero. Special cases illustrating the invariance of the norm include the invariant interval c^2 t^2 - x^2 and the invariant length of the relativistic momentum vector .


Examples of 4-vectors

* Displacement 4-vector: Otherwise known as the ''spacetime separation'', this is or for infinitesimal separations, . *: dS \equiv (dt, dx, dy, dz) * Velocity 4-vector: This results when the displacement is divided by d \tau, where d \tau is the proper time between the two events that yield ''dt'', ''dx'', ''dy'', and ''dz''. *: V \equiv \frac = \frac = \gamma \left(1, \frac, \frac, \frac \right) = (\gamma, \gamma \vec ) : The is tangent to the world line of a particle, and has a length equal to one unit of time in the frame of the particle. : An accelerated particle does not have an inertial frame in which it is always at rest. However, an inertial frame can always be found that is momentarily comoving with the particle. This frame, the ''momentarily comoving reference frame'' (MCRF), enables application of special relativity to the analysis of accelerated particles. : Since photons move on null lines, d \tau = 0 for a photon, and a cannot be defined. There is no frame in which a photon is at rest, and no MCRF can be established along a photon's path. * Energy–momentum 4-vector: *: P \equiv (E/c, \vec) = (E/c, p_x, p_y, p_z) : As indicated before, there are varying treatments for the energy–momentum so that one may also see it expressed as (E, \vec) or . The first component is the total energy (including mass) of the particle (or system of particles) in a given frame, while the remaining components are its spatial momentum. The energy–momentum is a conserved quantity. * Acceleration 4-vector: This results from taking the derivative of the velocity with respect to . *: A \equiv \frac = \frac (\gamma, \gamma \vec) = \gamma \left( \frac, \frac \right) * Force 4-vector: This is the derivative of the momentum with respect to \tau . *: F \equiv \frac = \gamma \left(\frac, \frac \right) = \gamma \left( \frac,\vec \right) As expected, the final components of the above are all standard corresponding to spatial , etc.


4-vectors and physical law

The first postulate of special relativity declares the equivalency of all inertial frames. A physical law holding in one frame must apply in all frames, since otherwise it would be possible to differentiate between frames. Newtonian momenta fail to behave properly under Lorentzian transformation, and Einstein preferred to change the definition of momentum to one involving rather than give up on conservation of momentum. Physical laws must be based on constructs that are frame independent. This means that physical laws may take the form of equations connecting scalars, which are always frame independent. However, equations involving require the use of tensors with appropriate rank, which themselves can be thought of as being built up from .


Acceleration

Special relativity does accommodate accelerations as well as accelerating frames of reference. It is a common misconception that special relativity is applicable only to inertial frames, and that it is unable to handle accelerating objects or accelerating reference frames. It is only when gravitation is significant that general relativity is required. Properly handling accelerating frames does require some care, however. The difference between special and general relativity is that (1) In special relativity, all velocities are relative, but acceleration is absolute. (2) In general relativity, all motion is relative, whether inertial, accelerating, or rotating. To accommodate this difference, general relativity uses curved spacetime. In this section, we analyze several scenarios involving accelerated reference frames.


Dewan–Beran–Bell spaceship paradox

The Dewan–Beran–Bell spaceship paradox ( Bell's spaceship paradox) is a good example of a problem where intuitive reasoning unassisted by the geometric insight of the spacetime approach can lead to issues. In Fig. 7-4, two identical spaceships float in space and are at rest relative to each other. They are connected by a string that is capable of only a limited amount of stretching before breaking. At a given instant in our frame, the observer frame, both spaceships accelerate in the same direction along the line between them with the same constant proper acceleration.In relativity theory, proper acceleration is the physical acceleration (i.e., measurable acceleration as by an accelerometer) experienced by an object. It is thus acceleration relative to a free-fall, or inertial, observer who is momentarily at rest relative to the object being measured. Will the string break? When the paradox was new and relatively unknown, even professional physicists had difficulty working out the solution. Two lines of reasoning lead to opposite conclusions. Both arguments, which are presented below, are flawed even though one of them yields the correct answer. # To observers in the rest frame, the spaceships start a distance ''L'' apart and remain the same distance apart during acceleration. During acceleration, ''L'' is a length contracted distance of the distance in the frame of the accelerating spaceships. After a sufficiently long time, ''γ'' will increase to a sufficiently large factor that the string must break. # Let ''A'' and ''B'' be the rear and front spaceships. In the frame of the spaceships, each spaceship sees the other spaceship doing the same thing that it is doing. ''A'' says that ''B'' has the same acceleration that he has, and ''B'' sees that ''A'' matches her every move. So the spaceships stay the same distance apart, and the string does not break. The problem with the first argument is that there is no "frame of the spaceships." There cannot be, because the two spaceships measure a growing distance between the two. Because there is no common frame of the spaceships, the length of the string is ill-defined. Nevertheless, the conclusion is correct, and the argument is mostly right. The second argument, however, completely ignores the relativity of simultaneity. A spacetime diagram (Fig. 7-5) makes the correct solution to this paradox almost immediately evident. Two observers in Minkowski spacetime accelerate with constant magnitude k acceleration for proper time \sigma (acceleration and elapsed time measured by the observers themselves, not some inertial observer). They are comoving and inertial before and after this phase. In Minkowski geometry, the length along the line of simultaneity A'B'' turns out to be greater than the length along the line of simultaneity . The length increase can be calculated with the help of the Lorentz transformation. If, as illustrated in Fig. 7-5, the acceleration is finished, the ships will remain at a constant offset in some frame . If x_ and x_=x_+L are the ships' positions in , the positions in frame S' are: : \begin x'_& = \gamma\left(x_-vt\right)\\ x'_& = \gamma\left(x_+L-vt\right)\\ L'& = x'_-x'_ =\gamma L \end The "paradox", as it were, comes from the way that Bell constructed his example. In the usual discussion of Lorentz contraction, the rest length is fixed and the moving length shortens as measured in frame . As shown in Fig. 7-5, Bell's example asserts the moving lengths AB and A'B' measured in frame S to be fixed, thereby forcing the rest frame length A'B'' in frame S' to increase.


Accelerated observer with horizon

Certain special relativity problem setups can lead to insight about phenomena normally associated with general relativity, such as event horizons. In the text accompanying Section "Invariant hyperbola" of the article Spacetime, the magenta hyperbolae represented actual paths that are tracked by a constantly accelerating traveler in spacetime. During periods of positive acceleration, the traveler's velocity just ''approaches'' the speed of light, while, measured in our frame, the traveler's acceleration constantly decreases. Fig. 7-6 details various features of the traveler's motions with more specificity. At any given moment, her space axis is formed by a line passing through the origin and her current position on the hyperbola, while her time axis is the tangent to the hyperbola at her position. The velocity parameter \beta approaches a limit of one as ct increases. Likewise, \gamma approaches infinity. The shape of the invariant hyperbola corresponds to a path of constant proper acceleration. This is demonstrable as follows: # We remember that . # Since , we conclude that . # \gamma = 1/\sqrt = \sqrt/s # From the relativistic force law, F = dp/dt = . # Substituting \beta(ct) from step 2 and the expression for \gamma from step 3 yields , which is a constant expression. Fig. 7-6 illustrates a specific calculated scenario. Terence (A) and Stella (B) initially stand together 100 light hours from the origin. Stella lifts off at time 0, her spacecraft accelerating at 0.01 ''c'' per hour. Every twenty hours, Terence radios updates to Stella about the situation at home (solid green lines). Stella receives these regular transmissions, but the increasing distance (offset in part by time dilation) causes her to receive Terence's communications later and later as measured on her clock, and she ''never'' receives any communications from Terence after 100 hours on his clock (dashed green lines). After 100 hours according to Terence's clock, Stella enters a dark region. She has traveled outside Terence's timelike future. On the other hand, Terence can continue to receive Stella's messages to him indefinitely. He just has to wait long enough. Spacetime has been divided into distinct regions separated by an ''apparent'' event horizon. So long as Stella continues to accelerate, she can never know what takes place behind this horizon.


Relativity and unifying electromagnetism

Theoretical investigation in classical electromagnetism led to the discovery of wave propagation. Equations generalizing the electromagnetic effects found that finite propagation speed of the E and B fields required certain behaviors on charged particles. The general study of moving charges forms the Liénard–Wiechert potential, which is a step towards special relativity. The Lorentz transformation of the
electric field An electric field (sometimes called E-field) is a field (physics), physical field that surrounds electrically charged particles such as electrons. In classical electromagnetism, the electric field of a single charge (or group of charges) descri ...
of a moving charge into a non-moving observer's reference frame results in the appearance of a mathematical term commonly called the
magnetic field A magnetic field (sometimes called B-field) is a physical field that describes the magnetic influence on moving electric charges, electric currents, and magnetic materials. A moving charge in a magnetic field experiences a force perpendicular ...
. Conversely, the ''magnetic'' field generated by a moving charge disappears and becomes a purely ''electrostatic'' field in a comoving frame of reference.
Maxwell's equations Maxwell's equations, or Maxwell–Heaviside equations, are a set of coupled partial differential equations that, together with the Lorentz force law, form the foundation of classical electromagnetism, classical optics, Electrical network, electr ...
are thus simply an empirical fit to special relativistic effects in a classical model of the Universe. As electric and magnetic fields are reference frame dependent and thus intertwined, one speaks of ''electromagnetic'' fields. Special relativity provides the transformation rules for how an electromagnetic field in one inertial frame appears in another inertial frame.
Maxwell's equations Maxwell's equations, or Maxwell–Heaviside equations, are a set of coupled partial differential equations that, together with the Lorentz force law, form the foundation of classical electromagnetism, classical optics, Electrical network, electr ...
in the 3D form are already consistent with the physical content of special relativity, although they are easier to manipulate in a manifestly covariant form, that is, in the language of
tensor In mathematics, a tensor is an algebraic object that describes a multilinear relationship between sets of algebraic objects associated with a vector space. Tensors may map between different objects such as vectors, scalars, and even other ...
calculus.


Theories of relativity and quantum mechanics

''Special'' relativity can be combined with
quantum mechanics Quantum mechanics is the fundamental physical Scientific theory, theory that describes the behavior of matter and of light; its unusual characteristics typically occur at and below the scale of atoms. Reprinted, Addison-Wesley, 1989, It is ...
to form relativistic quantum mechanics and
quantum electrodynamics In particle physics, quantum electrodynamics (QED) is the Theory of relativity, relativistic quantum field theory of electrodynamics. In essence, it describes how light and matter interact and is the first theory where full agreement between quant ...
. How ''general'' relativity and quantum mechanics can be unified is one of the unsolved problems in physics;
quantum gravity Quantum gravity (QG) is a field of theoretical physics that seeks to describe gravity according to the principles of quantum mechanics. It deals with environments in which neither gravitational nor quantum effects can be ignored, such as in the v ...
and a "
theory of everything A theory of everything (TOE), final theory, ultimate theory, unified field theory, or master theory is a hypothetical singular, all-encompassing, coherent theoretical physics, theoretical framework of physics that fully explains and links togeth ...
", which require a unification including general relativity too, are active and ongoing areas in theoretical research. The early Bohr–Sommerfeld atomic model explained the fine structure of
alkali metal The alkali metals consist of the chemical elements lithium (Li), sodium (Na), potassium (K),The symbols Na and K for sodium and potassium are derived from their Latin names, ''natrium'' and ''kalium''; these are still the origins of the names ...
atoms using both special relativity and the preliminary knowledge on
quantum mechanics Quantum mechanics is the fundamental physical Scientific theory, theory that describes the behavior of matter and of light; its unusual characteristics typically occur at and below the scale of atoms. Reprinted, Addison-Wesley, 1989, It is ...
of the time. In 1928,
Paul Dirac Paul Adrien Maurice Dirac ( ; 8 August 1902 – 20 October 1984) was an English mathematician and Theoretical physics, theoretical physicist who is considered to be one of the founders of quantum mechanics. Dirac laid the foundations for bot ...
constructed an influential relativistic wave equation, now known as the
Dirac equation In particle physics, the Dirac equation is a relativistic wave equation derived by British physicist Paul Dirac in 1928. In its free form, or including electromagnetic interactions, it describes all spin-1/2 massive particles, called "Dirac ...
in his honour, that is fully compatible both with special relativity and with the final version of quantum theory existing after 1926. This equation not only described the intrinsic angular momentum of the electrons called '' spin'', it also led to the prediction of the
antiparticle In particle physics, every type of particle of "ordinary" matter (as opposed to antimatter) is associated with an antiparticle with the same mass but with opposite physical charges (such as electric charge). For example, the antiparticle of the ...
of the electron (the
positron The positron or antielectron is the particle with an electric charge of +1''elementary charge, e'', a Spin (physics), spin of 1/2 (the same as the electron), and the same Electron rest mass, mass as an electron. It is the antiparticle (antimatt ...
), and fine structure could only be fully explained with special relativity. It was the first foundation of '' relativistic quantum mechanics''. On the other hand, the existence of antiparticles leads to the conclusion that relativistic quantum mechanics is not enough for a more accurate and complete theory of particle interactions. Instead, a theory of particles interpreted as quantized fields, called ''
quantum field theory In theoretical physics, quantum field theory (QFT) is a theoretical framework that combines Field theory (physics), field theory and the principle of relativity with ideas behind quantum mechanics. QFT is used in particle physics to construct phy ...
'', becomes necessary; in which particles can be created and destroyed throughout space and time.


Status

Special relativity in its Minkowski spacetime is accurate only when the
absolute value In mathematics, the absolute value or modulus of a real number x, is the non-negative value without regard to its sign. Namely, , x, =x if x is a positive number, and , x, =-x if x is negative (in which case negating x makes -x positive), ...
of the
gravitational potential In classical mechanics, the gravitational potential is a scalar potential associating with each point in space the work (energy transferred) per unit mass that would be needed to move an object to that point from a fixed reference point in the ...
is much less than ''c''2 in the region of interest. In a strong gravitational field, one must use
general relativity General relativity, also known as the general theory of relativity, and as Einstein's theory of gravity, is the differential geometry, geometric theory of gravitation published by Albert Einstein in 1915 and is the current description of grav ...
. General relativity becomes special relativity at the limit of a weak field. At very small scales, such as at the Planck length and below, quantum effects must be taken into consideration resulting in
quantum gravity Quantum gravity (QG) is a field of theoretical physics that seeks to describe gravity according to the principles of quantum mechanics. It deals with environments in which neither gravitational nor quantum effects can be ignored, such as in the v ...
. But at macroscopic scales and in the absence of strong gravitational fields, special relativity is experimentally tested to extremely high degree of accuracy (10−20) and thus accepted by the physics community. Experimental results that appear to contradict it are not reproducible and are thus widely believed to be due to experimental errors. Special relativity is mathematically self-consistent, and it is an organic part of all modern physical theories, most notably
quantum field theory In theoretical physics, quantum field theory (QFT) is a theoretical framework that combines Field theory (physics), field theory and the principle of relativity with ideas behind quantum mechanics. QFT is used in particle physics to construct phy ...
, string theory, and general relativity (in the limiting case of negligible gravitational fields). Newtonian mechanics mathematically follows from special relativity at small velocities (compared to the speed of light) – thus Newtonian mechanics can be considered as a special relativity of slow moving bodies. See ''
Classical mechanics Classical mechanics is a Theoretical physics, physical theory describing the motion of objects such as projectiles, parts of Machine (mechanical), machinery, spacecraft, planets, stars, and galaxies. The development of classical mechanics inv ...
'' for a more detailed discussion. Several experiments predating Einstein's 1905 paper are now interpreted as evidence for relativity. Of these it is known Einstein was aware of the Fizeau experiment before 1905, and historians have concluded that Einstein was at least aware of the Michelson–Morley experiment as early as 1899 despite claims he made in his later years that it played no role in his development of the theory. * The Fizeau experiment (1851, repeated by Michelson and Morley in 1886) measured the speed of light in moving media, with results that are consistent with relativistic addition of colinear velocities. * The famous Michelson–Morley experiment (1881, 1887) gave further support to the postulate that detecting an absolute reference velocity was not achievable. It should be stated here that, contrary to many alternative claims, it said little about the invariance of the speed of light with respect to the source and observer's velocity, as both source and observer were travelling together at the same velocity at all times. * The Trouton–Noble experiment (1903) showed that the torque on a capacitor is independent of position and inertial reference frame. * The Experiments of Rayleigh and Brace (1902, 1904) showed that length contraction does not lead to birefringence for a co-moving observer, in accordance with the relativity principle.
Particle accelerator A particle accelerator is a machine that uses electromagnetic fields to propel electric charge, charged particles to very high speeds and energies to contain them in well-defined particle beam, beams. Small accelerators are used for fundamental ...
s accelerate and measure the properties of particles moving at near the speed of light, where their behavior is consistent with relativity theory and inconsistent with the earlier
Newtonian mechanics Newton's laws of motion are three physical laws that describe the relationship between the motion of an object and the forces acting on it. These laws, which provide the basis for Newtonian mechanics, can be paraphrased as follows: # A body r ...
. These machines would simply not work if they were not engineered according to relativistic principles. In addition, a considerable number of modern experiments have been conducted to test special relativity. Some examples: * Tests of relativistic energy and momentum – testing the limiting speed of particles * Ives–Stilwell experiment – testing relativistic Doppler effect and time dilation * Experimental testing of time dilation – relativistic effects on a fast-moving particle's half-life * Kennedy–Thorndike experiment – time dilation in accordance with Lorentz transformations * Hughes–Drever experiment – testing isotropy of space and mass * Modern searches for Lorentz violation – various modern tests * Experiments to test emission theory demonstrated that the speed of light is independent of the speed of the emitter. * Experiments to test the
aether drag hypothesis In the 19th century, the theory of the luminiferous aether as the hypothetical Transmission medium, medium for the propagation of light waves was widely discussed. The aether hypothesis arose because physicists of that era could not conceive of lig ...
– no "aether flow obstruction".


Technical discussion of spacetime


Geometry of spacetime


Comparison between flat Euclidean space and Minkowski space

Special relativity uses a "flat" 4-dimensional Minkowski space – an example of a
spacetime In physics, spacetime, also called the space-time continuum, is a mathematical model that fuses the three dimensions of space and the one dimension of time into a single four-dimensional continuum. Spacetime diagrams are useful in visualiz ...
. Minkowski spacetime appears to be very similar to the standard 3-dimensional
Euclidean space Euclidean space is the fundamental space of geometry, intended to represent physical space. Originally, in Euclid's ''Elements'', it was the three-dimensional space of Euclidean geometry, but in modern mathematics there are ''Euclidean spaces ...
, but there is a crucial difference with respect to time. In 3D space, the differential of distance (line element) ''ds'' is defined by ds^2 = d\mathbf \cdot d\mathbf = dx_1^2 + dx_2^2 + dx_3^2, where are the differentials of the three spatial dimensions. In Minkowski geometry, there is an extra dimension with coordinate ''X''0 derived from time, such that the distance differential fulfills ds^2 = -dX_0^2 + dX_1^2 + dX_2^2 + dX_3^2, where are the differentials of the four spacetime dimensions. This suggests a deep theoretical insight: special relativity is simply a
rotational symmetry Rotational symmetry, also known as radial symmetry in geometry, is the property a shape (geometry), shape has when it looks the same after some rotation (mathematics), rotation by a partial turn (angle), turn. An object's degree of rotational s ...
of our spacetime, analogous to the rotational symmetry of Euclidean space (see Fig. 10-1). Just as Euclidean space uses a Euclidean metric, so spacetime uses a
Minkowski metric In physics, Minkowski space (or Minkowski spacetime) () is the main mathematical description of spacetime in the absence of general_relativity, gravitation. It combines inertial space and time manifolds into a four-dimensional model. The model ...
. Basically, special relativity can be stated as the ''invariance of any spacetime interval'' (that is the 4D distance between any two events) when viewed from ''any inertial reference frame''. All equations and effects of special relativity can be derived from this rotational symmetry (the Poincaré group) of Minkowski spacetime. The actual form of ''ds'' above depends on the metric and on the choices for the ''X''0 coordinate. To make the time coordinate look like the space coordinates, it can be treated as imaginary: (this is called a
Wick rotation In physics, Wick rotation, named after Italian physicist Gian Carlo Wick, is a method of finding a solution to a mathematical problem in Minkowski space from a solution to a related problem in Euclidean space by means of a transformation that sub ...
). According to Misner, Thorne and Wheeler (1971, §2.3), ultimately the deeper understanding of both special and general relativity will come from the study of the Minkowski metric (described below) and to take , rather than a "disguised" Euclidean metric using ''ict'' as the time coordinate. Some authors use , with factors of ''c'' elsewhere to compensate; for instance, spatial coordinates are divided by ''c'' or factors of ''c''±2 are included in the metric tensor. These numerous conventions can be superseded by using natural units where . Then space and time have equivalent units, and no factors of ''c'' appear anywhere.


3D spacetime

If we reduce the spatial dimensions to 2, so that we can represent the physics in a 3D space ds^2 = dx_1^2 + dx_2^2 - c^2 dt^2, we see that the
null Null may refer to: Science, technology, and mathematics Astronomy *Nuller, an optical tool using interferometry to block certain sources of light Computing *Null (SQL) (or NULL), a special marker and keyword in SQL indicating that a data value do ...
geodesics lie along a dual-cone (see Fig. 10-2) defined by the equation; ds^2 = 0 = dx_1^2 + dx_2^2 - c^2 dt^2 or simply dx_1^2 + dx_2^2 = c^2 dt^2, which is the equation of a circle of radius ''c dt''.


4D spacetime

If we extend this to three spatial dimensions, the null geodesics are the 4-dimensional cone: ds^2 = 0 = dx_1^2 + dx_2^2 + dx_3^2 - c^2 dt^2 so dx_1^2 + dx_2^2 + dx_3^2 = c^2 dt^2. As illustrated in Fig. 10-3, the null geodesics can be visualized as a set of continuous concentric spheres with radii = ''c dt''. This null dual-cone represents the "line of sight" of a point in space. That is, when we look at the
star A star is a luminous spheroid of plasma (physics), plasma held together by Self-gravitation, self-gravity. The List of nearest stars and brown dwarfs, nearest star to Earth is the Sun. Many other stars are visible to the naked eye at night sk ...
s and say "The light from that star that I am receiving is ''X'' years old", we are looking down this line of sight: a null geodesic. We are looking at an event a distance d = \sqrt away and a time ''d''/''c'' in the past. For this reason the null dual cone is also known as the "light cone". (The point in the lower left of the Fig. 10-2 represents the star, the origin represents the observer, and the line represents the null geodesic "line of sight".) The cone in the −''t'' region is the information that the point is "receiving", while the cone in the +''t'' section is the information that the point is "sending". The geometry of Minkowski space can be depicted using Minkowski diagrams, which are useful also in understanding many of the
thought experiment A thought experiment is an imaginary scenario that is meant to elucidate or test an argument or theory. It is often an experiment that would be hard, impossible, or unethical to actually perform. It can also be an abstract hypothetical that is ...
s in special relativity.


Physics in spacetime


Transformations of physical quantities between reference frames

Above, the Lorentz transformation for the time coordinate and three space coordinates illustrates that they are intertwined. This is true more generally: certain pairs of "timelike" and "spacelike" quantities naturally combine on equal footing under the same Lorentz transformation. The Lorentz transformation in standard configuration above, that is, for a boost in the ''x''-direction, can be recast into matrix form as follows: \begin ct'\\ x'\\ y'\\ z' \end = \begin \gamma & -\beta\gamma & 0 & 0\\ -\beta\gamma & \gamma & 0 & 0\\ 0 & 0 & 1 & 0\\ 0 & 0 & 0 & 1 \end \begin ct\\ x\\ y\\ z \end = \begin \gamma ct- \gamma\beta x\\ \gamma x - \beta \gamma ct \\ y\\ z \end. In Newtonian mechanics, quantities that have magnitude and direction are mathematically described as 3d vectors in Euclidean space, and in general they are parametrized by time. In special relativity, this notion is extended by adding the appropriate timelike quantity to a spacelike vector quantity, and we have 4d vectors, or " four-vectors", in Minkowski spacetime. The components of vectors are written using tensor index notation, as this has numerous advantages. The notation makes it clear the equations are manifestly covariant under the Poincaré group, thus bypassing the tedious calculations to check this fact. In constructing such equations, we often find that equations previously thought to be unrelated are, in fact, closely connected being part of the same tensor equation. Recognizing other
physical quantities A physical quantity (or simply quantity) is a property of a material or system that can be quantified by measurement. A physical quantity can be expressed as a ''value'', which is the algebraic multiplication of a '' numerical value'' and a '' ...
as tensors simplifies their transformation laws. Throughout, upper indices (superscripts) are contravariant indices rather than exponents except when they indicate a square (this should be clear from the context), and lower indices (subscripts) are covariant indices. For simplicity and consistency with the earlier equations, Cartesian coordinates will be used. The simplest example of a four-vector is the position of an event in spacetime, which constitutes a timelike component ''ct'' and spacelike component , in a contravariant position four-vector with components: X^\nu = (X^0, X^1, X^2, X^3)= (ct, x, y, z) = (ct, \mathbf ). where we define so that the time coordinate has the same dimension of distance as the other spatial dimensions; so that space and time are treated equally. Now the transformation of the contravariant components of the position 4-vector can be compactly written as: X^=\Lambda^_\nu X^\nu where there is an implied summation on \nu from 0 to 3, and \Lambda^_ is a
matrix Matrix (: matrices or matrixes) or MATRIX may refer to: Science and mathematics * Matrix (mathematics), a rectangular array of numbers, symbols or expressions * Matrix (logic), part of a formula in prenex normal form * Matrix (biology), the m ...
. More generally, all contravariant components of a four-vector T^\nu transform from one frame to another frame by a
Lorentz transformation In physics, the Lorentz transformations are a six-parameter family of Linear transformation, linear coordinate transformation, transformations from a Frame of Reference, coordinate frame in spacetime to another frame that moves at a constant vel ...
: T^ = \Lambda^_ T^\nu Examples of other 4-vectors include the four-velocity , defined as the derivative of the position 4-vector with respect to
proper time In relativity, proper time (from Latin, meaning ''own time'') along a timelike world line is defined as the time as measured by a clock following that line. The proper time interval between two events on a world line is the change in proper time ...
: U^\mu = \frac = \gamma(v)( c , v_x , v_y, v_z ) = \gamma(v) (c, \mathbf ). where the Lorentz factor is: \gamma(v)= \frac \qquad v^2 = v_x^2 + v_y^2 + v_z^2. The relativistic energy E = \gamma(v)mc^2 and relativistic momentum \mathbf = \gamma(v)m \mathbf of an object are respectively the timelike and spacelike components of a contravariant
four-momentum In special relativity, four-momentum (also called momentum–energy or momenergy) is the generalization of the classical three-dimensional momentum to four-dimensional spacetime. Momentum is a vector in three dimensions; similarly four-momentum i ...
vector: P^\mu = m U^\mu = m\gamma(v)(c,v_x,v_y,v_z)= \left (\frac,p_x,p_y,p_z \right ) = \left (\frac, \mathbf \right ). where ''m'' is the
invariant mass The invariant mass, rest mass, intrinsic mass, proper mass, or in the case of bound systems simply mass, is the portion of the total mass of an object or system of objects that is independent of the overall motion of the system. More precisely, ...
. The four-acceleration is the proper time derivative of 4-velocity: A^\mu = \frac. The transformation rules for ''three''-dimensional velocities and accelerations are very awkward; even above in standard configuration the velocity equations are quite complicated owing to their non-linearity. On the other hand, the transformation of ''four''-velocity and ''four''-acceleration are simpler by means of the Lorentz transformation matrix. The four-gradient of a
scalar field In mathematics and physics, a scalar field is a function associating a single number to each point in a region of space – possibly physical space. The scalar may either be a pure mathematical number ( dimensionless) or a scalar physical ...
φ transforms covariantly rather than contravariantly: \begin \dfrac \dfrac & \dfrac & \dfrac & \dfrac \end = \begin \dfrac \dfrac & \dfrac & \dfrac & \dfrac \end \begin \gamma & +\beta\gamma & 0 & 0\\ +\beta\gamma & \gamma & 0 & 0\\ 0 & 0 & 1 & 0\\ 0 & 0 & 0 & 1 \end , which is the transpose of: (\partial_ \phi) = \Lambda_^ (\partial_\nu \phi)\qquad \partial_ \equiv \frac. only in Cartesian coordinates. It is the
covariant derivative In mathematics and physics, covariance is a measure of how much two variables change together, and may refer to: Statistics * Covariance matrix, a matrix of covariances between a number of variables * Covariance or cross-covariance between ...
that transforms in manifest covariance, in Cartesian coordinates this happens to reduce to the partial derivatives, but not in other coordinates. More generally, the ''co''variant components of a 4-vector transform according to the ''inverse'' Lorentz transformation: T_ = \Lambda_^ T_\nu, where \Lambda_^ is the reciprocal matrix of . The postulates of special relativity constrain the exact form the Lorentz transformation matrices take. More generally, most physical quantities are best described as (components of)
tensor In mathematics, a tensor is an algebraic object that describes a multilinear relationship between sets of algebraic objects associated with a vector space. Tensors may map between different objects such as vectors, scalars, and even other ...
s. So to transform from one frame to another, we use the well-known tensor transformation law T^_ = \Lambda^_ \Lambda^_ \cdots \Lambda^_ \Lambda_^ \Lambda_^ \cdots \Lambda_^ T^_ where \Lambda_^ is the reciprocal matrix of . All tensors transform by this rule. An example of a four-dimensional second order antisymmetric tensor is the relativistic angular momentum, which has six components: three are the classical
angular momentum Angular momentum (sometimes called moment of momentum or rotational momentum) is the rotational analog of Momentum, linear momentum. It is an important physical quantity because it is a Conservation law, conserved quantity – the total ang ...
, and the other three are related to the boost of the center of mass of the system. The derivative of the relativistic angular momentum with respect to proper time is the relativistic torque, also second order antisymmetric tensor. The
electromagnetic field tensor In electromagnetism, the electromagnetic tensor or electromagnetic field tensor (sometimes called the field strength tensor, Faraday tensor or Maxwell bivector) is a mathematical object that describes the electromagnetic field in spacetime. T ...
is another second order antisymmetric tensor field, with six components: three for the
electric field An electric field (sometimes called E-field) is a field (physics), physical field that surrounds electrically charged particles such as electrons. In classical electromagnetism, the electric field of a single charge (or group of charges) descri ...
and another three for the
magnetic field A magnetic field (sometimes called B-field) is a physical field that describes the magnetic influence on moving electric charges, electric currents, and magnetic materials. A moving charge in a magnetic field experiences a force perpendicular ...
. There is also the stress–energy tensor for the electromagnetic field, namely the electromagnetic stress–energy tensor.


Metric

The
metric tensor In the mathematical field of differential geometry, a metric tensor (or simply metric) is an additional structure on a manifold (such as a surface) that allows defining distances and angles, just as the inner product on a Euclidean space allows ...
allows one to define the
inner product In mathematics, an inner product space (or, rarely, a Hausdorff pre-Hilbert space) is a real vector space or a complex vector space with an operation called an inner product. The inner product of two vectors in the space is a scalar, ofte ...
of two vectors, which in turn allows one to assign a magnitude to the vector. Given the four-dimensional nature of spacetime the
Minkowski metric In physics, Minkowski space (or Minkowski spacetime) () is the main mathematical description of spacetime in the absence of general_relativity, gravitation. It combines inertial space and time manifolds into a four-dimensional model. The model ...
''η'' has components (valid with suitably chosen coordinates), which can be arranged in a matrix: \eta_ = \begin -1 & 0 & 0 & 0\\ 0 & 1 & 0 & 0\\ 0 & 0 & 1 & 0\\ 0 & 0 & 0 & 1 \end , which is equal to its reciprocal, , in those frames. Throughout we use the signs as above, different authors use different conventions – see
Minkowski metric In physics, Minkowski space (or Minkowski spacetime) () is the main mathematical description of spacetime in the absence of general_relativity, gravitation. It combines inertial space and time manifolds into a four-dimensional model. The model ...
alternative signs. The Poincaré group is the most general group of transformations that preserves the Minkowski metric: \eta_ = \eta_ \Lambda^_\alpha \Lambda^_\beta and this is the physical symmetry underlying special relativity. The metric can be used for raising and lowering indices on vectors and tensors. Invariants can be constructed using the metric, the inner product of a 4-vector ''T'' with another 4-vector ''S'' is: T^S_=T^\eta_S^ = T_\eta^S_ = \text Invariant means that it takes the same value in all inertial frames, because it is a scalar (0 rank tensor), and so no appears in its trivial transformation. The magnitude of the 4-vector ''T'' is the positive square root of the inner product with itself: , \mathbf, = \sqrt One can extend this idea to tensors of higher order, for a second order tensor we can form the invariants: T^_,T^_T^_,T^_T^_T^_ = \text, similarly for higher order tensors. Invariant expressions, particularly inner products of 4-vectors with themselves, provide equations that are useful for calculations, because one does not need to perform Lorentz transformations to determine the invariants.


Relativistic kinematics and invariance

The coordinate differentials transform also contravariantly: dX^=\Lambda^_\nu dX^\nu so the squared length of the differential of the position four-vector ''dXμ'' constructed using d\mathbf^2 = dX^\mu \,dX_\mu = \eta_\,dX^\mu \,dX^\nu = -(c\,dt)^2+(dx)^2+(dy)^2+(dz)^2 is an invariant. Notice that when the line element ''d''X2 is negative that is the differential of
proper time In relativity, proper time (from Latin, meaning ''own time'') along a timelike world line is defined as the time as measured by a clock following that line. The proper time interval between two events on a world line is the change in proper time ...
, while when ''d''X2 is positive, is differential of the proper distance. The 4-velocity ''U''μ has an invariant form: \mathbf U^2 = \eta_ U^\nu U^\mu = -c^2 \,, which means all velocity four-vectors have a magnitude of ''c''. This is an expression of the fact that there is no such thing as being at coordinate rest in relativity: at the least, you are always moving forward through time. Differentiating the above equation by ''τ'' produces: 2\eta_A^\mu U^\nu = 0. So in special relativity, the acceleration four-vector and the velocity four-vector are orthogonal.


Relativistic dynamics and invariance

The invariant magnitude of the momentum 4-vector generates the energy–momentum relation: \mathbf^2 = \eta^P_\mu P_\nu = -\left (\frac \right )^2 + p^2 . We can work out what this invariant is by first arguing that, since it is a scalar, it does not matter in which reference frame we calculate it, and then by transforming to a frame where the total momentum is zero. \mathbf^2 = - \left (\frac \right )^2 = - (m c)^2 . We see that the rest energy is an independent invariant. A rest energy can be calculated even for particles and systems in motion, by translating to a frame in which momentum is zero. The rest energy is related to the mass according to the celebrated equation discussed above: E_\text = m c^2. The mass of systems measured in their center of momentum frame (where total momentum is zero) is given by the total energy of the system in this frame. It may not be equal to the sum of individual system masses measured in other frames. To use
Newton's third law of motion Newton's laws of motion are three physical laws that describe the relationship between the motion of an object and the forces acting on it. These laws, which provide the basis for Newtonian mechanics, can be paraphrased as follows: # A body r ...
, both forces must be defined as the rate of change of momentum with respect to the same time coordinate. That is, it requires the 3D force defined above. Unfortunately, there is no tensor in 4D that contains the components of the 3D force vector among its components. If a particle is not traveling at ''c'', one can transform the 3D force from the particle's co-moving reference frame into the observer's reference frame. This yields a 4-vector called the four-force. It is the rate of change of the above energy momentum four-vector with respect to proper time. The covariant version of the four-force is: F_\nu = \frac = m A_\nu In the rest frame of the object, the time component of the four-force is zero unless the "
invariant mass The invariant mass, rest mass, intrinsic mass, proper mass, or in the case of bound systems simply mass, is the portion of the total mass of an object or system of objects that is independent of the overall motion of the system. More precisely, ...
" of the object is changing (this requires a non-closed system in which energy/mass is being directly added or removed from the object) in which case it is the negative of that rate of change of mass, times ''c''. In general, though, the components of the four-force are not equal to the components of the three-force, because the three force is defined by the rate of change of momentum with respect to coordinate time, that is, ''dp''/''dt'' while the four-force is defined by the rate of change of momentum with respect to proper time, that is, ''dp''/''dτ''. In a continuous medium, the 3D ''density of force'' combines with the ''density of power'' to form a covariant 4-vector. The spatial part is the result of dividing the force on a small cell (in 3-space) by the volume of that cell. The time component is −1/''c'' times the power transferred to that cell divided by the volume of the cell. This will be used below in the section on electromagnetism.


See also

; People : *
Max Planck Max Karl Ernst Ludwig Planck (; ; 23 April 1858 – 4 October 1947) was a German Theoretical physics, theoretical physicist whose discovery of energy quantum, quanta won him the Nobel Prize in Physics in 1918. Planck made many substantial con ...
* Hermann Minkowski * Max von Laue * Arnold Sommerfeld *
Max Born Max Born (; 11 December 1882 – 5 January 1970) was a German-British theoretical physicist who was instrumental in the development of quantum mechanics. He also made contributions to solid-state physics and optics, and supervised the work of a ...
* Mileva Marić ; Relativity : * History of special relativity * Doubly special relativity * Bondi k-calculus * Einstein synchronisation * Rietdijk–Putnam argument * Special relativity (alternative formulations) * Relativity priority dispute ; Physics : * Einstein's thought experiments *
physical cosmology Physical cosmology is a branch of cosmology concerned with the study of cosmological models. A cosmological model, or simply cosmology, provides a description of the largest-scale structures and dynamics of the universe and allows study of fu ...
* Relativistic Euler equations * Lorentz ether theory * Moving magnet and conductor problem * Shape waves * Relativistic heat conduction * Relativistic disk * Born rigidity * Born coordinates ; Mathematics : * Lorentz group * Relativity in the APS formalism ; Philosophy : * actualism * conventionalism ; Paradoxes : * Ehrenfest paradox * Bell's spaceship paradox * Velocity composition paradox * Lighthouse paradox


Notes


Primary sources


References


Further reading


Texts by Einstein and text about history of special relativity

* Einstein, Albert (1920). '' Relativity: The Special and General Theory''. * Einstein, Albert (1996). ''The Meaning of Relativity''. Fine Communications. * Logunov, Anatoly A. (2005).
Henri Poincaré and the Relativity Theory
' (transl. from Russian by G. Pontocorvo and V. O. Soloviev, edited by V. A. Petrov). Nauka, Moscow.


Textbooks

* Charles Misner, Kip Thorne, and
John Archibald Wheeler John Archibald Wheeler (July 9, 1911April 13, 2008) was an American theoretical physicist. He was largely responsible for reviving interest in general relativity in the United States after World War II. Wheeler also worked with Niels Bohr to e ...
(1971) ''Gravitation''. W. H. Freeman & Co. * Post, E.J., 1997 (1962) ''Formal Structure of Electromagnetics: General Covariance and Electromagnetics''. Dover Publications. * Wolfgang Rindler (1991). ''Introduction to Special Relativity'' (2nd ed.), Oxford University Press. ; * Harvey R. Brown (2005). Physical relativity: space–time structure from a dynamical perspective, Oxford University Press, ; * * * Silberstein, Ludwik (1914). ''The Theory of Relativity''. * * * * Taylor, Edwin, and
John Archibald Wheeler John Archibald Wheeler (July 9, 1911April 13, 2008) was an American theoretical physicist. He was largely responsible for reviving interest in general relativity in the United States after World War II. Wheeler also worked with Niels Bohr to e ...
(1992). ''Spacetime Physics'' (2nd ed.). W. H. Freeman & Co. . * Tipler, Paul, and Llewellyn, Ralph (2002). ''Modern Physics'' (4th ed.). W. H. Freeman & Co. .


Journal articles

* * *
Special Relativity
Scholarpedia *


External links


Original works


''Zur Elektrodynamik bewegter Körper''
Einstein's original work in German, Annalen der Physik,
Bern Bern (), or Berne (), ; ; ; . is the ''de facto'' Capital city, capital of Switzerland, referred to as the "federal city".; ; ; . According to the Swiss constitution, the Swiss Confederation intentionally has no "capital", but Bern has gov ...
1905
''On the Electrodynamics of Moving Bodies''
English Translation as published in the 1923 book ''The Principle of Relativity''.


Special relativity for a general audience (no mathematical knowledge required)


Einstein Light
A
award
winning, non-technical introduction (film clips and demonstrations) supported by dozens of pages of further explanations and animations, at levels with or without mathematics.

Introduction to relativity theory, from the Max Planck Institute for Gravitational Physics. * Audio: Cain/Gay (2006)
Astronomy Cast
Einstein's Theory of Special Relativity


Special relativity explained (using simple or more advanced mathematics)



– A simple introduction to the special theory of relativity.

.
The Hogg Notes on Special Relativity
A good introduction to special relativity at the undergraduate level, using calculus.
Relativity Calculator: Special Relativity
– An algebraic and integral calculus derivation for .

A complete online book on relativity with an extensive bibliography.
Special Relativity
An introduction to special relativity at the undergraduate level. * , by
Albert Einstein Albert Einstein (14 March 187918 April 1955) was a German-born theoretical physicist who is best known for developing the theory of relativity. Einstein also made important contributions to quantum mechanics. His mass–energy equivalence f ...

Special Relativity Lecture Notes
is a standard introduction to special relativity containing illustrative explanations based on drawings and spacetime diagrams from Virginia Polytechnic Institute and State University.

The theory of special relativity in an easily understandable way.
An Introduction to the Special Theory of Relativity
(1964) by Robert Katz, "an introduction ... that is accessible to any student who has had an introduction to general physics and some slight acquaintance with the calculus" (130 pp; pdf format).
Lecture Notes on Special Relativity
by J D Cresser Department of Physics Macquarie University.
SpecialRelativity.net
– An overview with visualizations and minimal mathematics.
Relativity 4-ever?
The problem of superluminal motion is discussed in an entertaining manner.


Visualization



Software visualizing several scenarios under the influence of special relativity.
Real Time Relativity
The Australian National University. Relativistic visual effects experienced through an interactive program.
Spacetime travel
A variety of visualizations of relativistic effects, from relativistic motion to black holes.
Through Einstein's Eyes
The Australian National University. Relativistic visual effects explained with movies and images.
Warp Special Relativity Simulator
A computer program to show the effects of traveling close to the speed of light. * visualizing the Lorentz transformation.

from John de Pillis illustrating Lorentz and Galilean frames, Train and Tunnel Paradox, the Twin Paradox, Wave Propagation, Clock Synchronization, etc.
lightspeed
An OpenGL-based program developed to illustrate the effects of special relativity on the appearance of moving objects.

showing the stars near Earth, as seen from a spacecraft accelerating rapidly to light speed. {{DEFAULTSORT:Special Relativity Albert Einstein