The General Theory of Relativity of Einstein

Derivations, Applications and Considerations – by Albert Prins

Appendix 9 — Special Relativity

In Special Relativity, Einstein considered only coordinate systems that move uniformly, i.e., with constant velocity relative to each other. The influence of masses, and thus gravity, was not taken into account.

The assumptions on which Special Relativity (SR) is based are:

  • The maximum possible speed, in any coordinate system, is the speed of light \(c = 299\,792\,458\ \text{m/s}\).
  • The laws of physics are valid in every uniformly moving coordinate system.

In Newton’s approach, time intervals were identical in the “rest frame” and in the moving frame. However, Special Relativity showed that:

  • time intervals in a moving frame are smaller than in a rest frame (time dilation),
  • the length of an object decreases in the direction of motion (length contraction).

Both effects follow from the observation that the speed of light in vacuum is always the same in every frame, regardless of the velocity of the frame.

In this chapter we summarize several points that are frequently used in SR and that are relevant for applications in General Relativity (GR).

We begin by establishing the relationship between two coordinate systems that move with constant velocity relative to each other. This relationship is known as the Lorentz transformation.

Appendix 9.1 — Simple Derivation of the Lorentz Transformation

vector_9_1_1
Fig. 1
Coordinate system k’ moves uniformly with velocity v relative to coordinate system k.

We consider two coordinate systems whose origins move with constant velocity \(v\) relative to each other, along the \(x\)- and \(x'\)-directions respectively.

Although the coordinate systems are four-dimensional \((t, x, y, z)\), only the \(t\)- and \(x\)-axes are shown for simplicity, since there is no motion in the \(y\)- and \(z\)-directions.

A light signal emitted at time \(t = t' = 0\) in the positive \(x\)-direction satisfies in frame \(k\):

\begin{align} x = ct \quad\Rightarrow\quad x - ct = 0 \label{eq:R01} \end{align}

Since the same light signal also propagates with speed \(c\) in frame \(k'\), we have:

\begin{align} x' = ct' \quad\Rightarrow\quad x' - ct' = 0 \label{eq:R02} \end{align}

All spacetime points (events) that satisfy (\ref{eq:R01}) must also satisfy (\ref{eq:R02}). This is guaranteed if:

\begin{align} x' - ct' = \lambda (x - ct) \label{eq:R03} \end{align}

where \(\lambda\) is a constant. If \(x - ct = 0\), then automatically \(x' - ct' = 0\), regardless of the value of \(\lambda\).

Light signal in the negative direction

For a light signal moving along the negative \(x\)-axis, we have in \(k\):

\begin{align} x + ct = 0 \label{eq:R04} \end{align}

and in \(k'\):

\begin{align} x' + ct' = 0 \label{eq:R05} \end{align}

Therefore:

\begin{align} x' + ct' = \mu (x + ct) \label{eq:R06} \end{align}

with \(\mu\) a second constant.

Linear combination of the two conditions

By adding and subtracting (\ref{eq:R03}) and (\ref{eq:R06}), and introducing the constants

\begin{align} a = \frac{\lambda + \mu}{2}, \qquad b = \frac{\lambda - \mu}{2} \label{eq:R07} \end{align}

we obtain:

\begin{align} x' = a x - b c t, \qquad ct' = a ct - b x \label{eq:R08} \end{align}

This is the general linear form of the Lorentz transformation, where the constants \(a\) and \(b\) still need to be determined.

Determination of the ratio \(b/a\)

For the origin of \(k'\) we always have:

\begin{align} x' = 0. \end{align}

Substituting into the first equation of (\ref{eq:R08}) gives:

\begin{align} 0 = a x - b c t \quad\Rightarrow\quad x = \frac{b c}{a}\, t. \end{align}

The origin of \(k'\) therefore moves in \(k\) with velocity:

\begin{align} v = \frac{b c}{a} \label{eq:R11} \end{align}

This value \(v\) is the relative velocity of the two frames. The same value is obtained when calculating the velocity of any other point of \(k'\) relative to \(k\), or vice versa.

The principle of relativity tells us that — as observed from \(k\) — the length of a measuring rod at rest in \(k'\) must be exactly equal to the length of a measuring rod at rest in \(k\), as observed from \(k'\).

To see how the points on the \(x'\)-axis appear from \(k\), we take a snapshot of \(k'\) from \(k\). This means we choose a fixed value of \(t\), for example:

\begin{align} t = 0. \end{align}

For this value, the first equation of (\ref{eq:R08}) gives:

\begin{align} x' = a x. \end{align}

Two points on the \(x'\)-axis that are separated by a distance \(x' = L\) in \(k'\), are in our snapshot separated by:

\begin{align} \Delta x = \frac{L}{a} \label{eq:R14} \end{align}

Snapshot from \(k'\): \(t' = 0\)

If the snapshot is taken from \(k'\), i.e., at \(t' = 0\), then from the second equation of (\ref{eq:R08}) we obtain:

\begin{align} 0 = a c t - b x \quad\Rightarrow\quad t = \frac{b}{a c} x. \end{align}

Substituting into the first equation of (\ref{eq:R08}):

\begin{align} x' = a x - b c t = a x - b c \left(\frac{b}{a c} x\right) = a x \left(1 - \frac{b^{2}}{a^{2}}\right). \end{align}

From equation (\ref{eq:R11}):

\begin{align} \frac{b}{a} = \frac{v}{c}, \end{align}
thus:

\begin{align} x' = a \left(1 - \frac{v^{2}}{c^{2}}\right) x \label{eq:R18} \end{align}

Therefore, two points on the \(x\)-axis, separated by a distance \(L\) in \(k\), are represented in the snapshot from \(k'\) by:

\begin{align} \Delta x' = a\left(1 - \frac{v^{2}}{c^{2}}\right) L \label{eq:R19} \end{align}

Equality of the two snapshots

According to the principle of relativity, both snapshots must be identical:

\begin{align} \Delta x = \Delta x'. \end{align}

Thus, according to equations (\ref{eq:R14}) and (\ref{eq:R19}):

\begin{align} \frac{L}{a} = a\left(1 - \frac{v^{2}}{c^{2}}\right) L. \end{align}

After simplification:

\begin{align} \frac{1}{a} = a\left(1 - \frac{v^{2}}{c^{2}}\right) \quad\Rightarrow\quad a^{2} = \frac{1}{1 - v^{2}/c^{2}} \label{eq:R22} \end{align}

Thus:

\begin{align} a = \gamma = \frac{1}{\sqrt{1 - v^{2}/c^{2}}}, \qquad b = \gamma \frac{v}{c}. \end{align}

The Lorentz transformation

By substituting these values of \(a\) and \(b\) into (\ref{eq:R08}), we obtain:

\begin{align} x' = a x - b c t = \gamma x - \gamma v t = \gamma (x - v t) \label{eq:R24} \end{align}

And:

\begin{align} ct' = a c t - b x = \gamma c t - \gamma \frac{v}{c} x = \gamma\left(ct - \frac{v}{c} x\right) \label{eq:R25} \end{align}

Thus, the full Lorentz transformation is:

\begin{align} \boxed{ \begin{aligned} x' &= \gamma (x - vt), \\ t' &= \gamma\left(t - \frac{v}{c^{2}}x\right) \label{eq:R26} \end{aligned} } \end{align}

Lorentz transformation for events on the x-axis

From the earlier derivation, the Lorentz transformation follows:

\begin{align} x' = \frac{x - vt}{\sqrt{1 - v^{2}/c^{2}}}, \qquad t' = \frac{t - \frac{v}{c^{2}}x}{\sqrt{1 - v^{2}/c^{2}}} \label{eq:R27} \end{align}

This transformation satisfies the invariance of the spacetime interval:

\begin{align} x'^{2} - c^{2}t'^{2} = x^{2} - c^{2}t^{2} \label{eq:R28} \end{align}

Extension to events off the x-axis

For motion purely along the x-axis, the transformations for the other coordinates remain unchanged:

\begin{align} y' = y, \qquad z' = z \label{eq:R29} \end{align}

With (\ref{eq:R27}) and (\ref{eq:R29}), we satisfy the postulate that the speed of light in vacuum has the same value in every inertial frame.

Verification with a light signal

A light signal emitted from the origin of \(k\) at time \(t = 0\) satisfies:

\begin{align} r = \sqrt{x^{2} + y^{2} + z^{2}} = ct. \end{align}

Squaring gives:

\begin{align} x^{2} + y^{2} + z^{2} - c^{2}t^{2} = 0 \label{eq:R31} \end{align}

According to the principle of relativity, the same signal in \(k'\) must satisfy:

\begin{align} x'^{2} + y'^{2} + z'^{2} - c^{2}t'^{2} = 0 \label{eq:R32} \end{align}

To make (\ref{eq:R32}) a consequence of (\ref{eq:R31}), we must have:

\begin{align} x'^{2} + y'^{2} + z'^{2} - c^{2}t'^{2} = \sigma\, \left( x^{2} + y^{2} + z^{2} - c^{2}t^{2} \right) \label{eq:R33} \end{align}

But for points on the x-axis, we already have (8a), thus:

\begin{align} \sigma = 1. \end{align}

This shows that the Lorentz transformation (\ref{eq:R27})(\ref{eq:R29}) leaves the speed of light invariant.

General form of the Lorentz transformation

The Lorentz transformation derived above applies to the case where:

  • the axes of \(k\) and \(k'\) are parallel,
  • the relative velocity \(v\) lies along the x-axis.

However, this is not a restriction. In general, any Lorentz transformation can be constructed from:

  • a Lorentz transformation in the specific sense (translation along one axis),
  • followed by a purely spatial rotation of the coordinate system.

This corresponds to replacing the rectangular coordinate system with a new system whose axes point in different directions.

In this way, we obtain the full Lorentz group, consisting of all combinations of transformations and rotations.

The generalized Lorentz transformation

Mathematically, the generalized Lorentz transformation can be characterized as follows: it expresses \(x', y', z', t'\) as linear homogeneous functions of \(x, y, z, t\), such that the relation

\begin{align} x'^2 + y'^2 + z'^2 - c^2 t'^2 = x^2 + y^2 + z^2 - c^2 t^2 \label{eq:R35} \end{align}

is identically satisfied. That is: when we substitute the expressions for \(x', y', z', t'\) in terms of \(x, y, z, t\) into the left-hand side, it becomes identical to the right-hand side.

Use of an imaginary time coordinate

We can characterize the Lorentz transformation even more simply by introducing the imaginary quantity \(i\), where \(i\) denotes \(\sqrt{-1}\). Define:

\begin{align} x_1 = x,\qquad x_2 = y,\qquad x_3 = z,\qquad x_4 = i\,ct. \end{align}

And analogously for the primed system \(k'\). Then the transformation condition becomes:

\begin{align} x_1'^2 + x_2'^2 + x_3'^2 + x_4'^2 = x_1^2 + x_2^2 + x_3^2 + x_4^2 \label{eq:R37} \end{align}

With this choice of “coordinates”, equation (\ref{eq:R35}) is transformed into (\ref{eq:R37}).

We see that the imaginary time coordinate \(x_4\) appears in the transformation condition in exactly the same way as the spatial coordinates \(x_1, x_2, x_3\). This reflects the relativistic insight that time and space are treated on equal footing in the laws of nature.

Minkowski space

A four-dimensional continuum described by the coordinates \((x_1, x_2, x_3, x_4)\) was called the world by Minkowski. An event is called a world point.

The four-dimensional “world” shows a strong analogy with three-dimensional Euclidean space. In Euclidean geometry, a rotation satisfies:

\begin{align} x_1'^2 + x_2'^2 + x_3'^2 = x_1^2 + x_2^2 + x_3^2. \end{align}

The analogy with (\ref{eq:R37}) is complete: the Lorentz transformation corresponds to a “rotation” in four-dimensional Minkowski space, where the time coordinate carries an imaginary component.

Appendix 9.2 — Alternative derivation of time dilation and length contraction

To illustrate the effects of Special Relativity on time and length, we use a light signal in a rapidly moving object, for example a rocket. We consider two reference frames:

  • our stationary reference frame with time \(t\),
  • the reference frame of the rocket with time \(t'\).

Time dilation

First, we send a light pulse perpendicular to the direction of motion of the rocket. In the rocket frame, the light pulse moves from the bottom to the top of the rocket. The height of the rocket in that frame is \(h\), and the light pulse travels this distance in time \(t'\):

\begin{align} h = ct' \end{align}
vector_9_2_1

From our stationary reference frame, the rocket appears to move to the right with velocity \(v\). The light pulse still travels the vertical distance \(h\), but because the rocket moves horizontally, the light follows a diagonal path in our frame. The horizontal displacement of the rocket is \(vt\), while the vertical displacement of the light remains \(h\). The total distance of the light in our frame is \(ct\). From the Pythagorean relation it follows:

\begin{align} c^2 t^2 = c^2 t'^2 + v^2 t^2 \end{align}
\begin{align} c^2 t^2 - v^2 t^2 = c^2 t'^2 \end{align}
\begin{align} t^2 (c^2 - v^2) = c^2 t'^2 \end{align}
\begin{align} t^2 \left(1 - \frac{v^2}{c^2}\right) = t'^2 \end{align}
\begin{align} t' = t \sqrt{1-\frac{v^2}{c^2}} \end{align}

This shows that a clock in the moving frame runs slower: the time \(t'\) in the rocket is shorter than the time \(t\) in our stationary frame. This is the phenomenon of time dilation.

Length contraction

When measuring a length in a reference frame, the positions of the two endpoints must be determined at the same time in that frame. In our stationary frame, the positions of the rear and front of the rocket are therefore recorded simultaneously at time \(t\). In the rocket frame itself, the positions of the two endpoints are recorded at the same moment \(t'\).

Starting from the Lorentz transformations:

\begin{align} x'=\frac{x-vt}{\sqrt{1-\frac{v^2}{c^2}}} \end{align}
\begin{align} t'=\frac{t-\frac{v}{c^2}x}{\sqrt{1-\frac{v^2}{c^2}}} \end{align}

Since \(x'\) and \(t'\) are functions of \(x\) and \(t\), it also follows that:

\begin{align} dx'=\frac{dx-v\,dt}{\sqrt{1-\frac{v^2}{c^2}}} \end{align}
\begin{align} dt'=\frac{dt-\frac{v}{c^2}dx}{\sqrt{1-\frac{v^2}{c^2}}} \end{align}

To measure the length in our frame, we must keep \(t\) constant, so \(dt=0\), which gives:

\begin{align} dx'=\frac{dx}{\sqrt{1-\frac{v^2}{c^2}}} \end{align}

The length in the rocket is \(L_0\), since it is at rest in that frame, and we then observe the length in our frame as:

\begin{align} L=L_0\sqrt{1-\frac{v^2}{c^2}} \end{align}

Due to the relativity of simultaneity, events that are simultaneous in one frame are not necessarily simultaneous in another frame. This explains why a moving rocket appears shorter from our perspective, with length \(L\), than the proper length \(L_0\) measured in the rocket frame.

Summary of the results

\begin{align} t' = t\sqrt{1-\frac{v^2}{c^2}}, \qquad L = L_0\sqrt{1-\frac{v^2}{c^2}} \end{align}
  • A clock in a moving frame runs slower (time dilation).
  • A moving object is shorter in the direction of motion (length contraction).
  • Length contraction arises from the combination of motion and the relativity of simultaneity.

Appendix 9.3 Symmetry of Lorentz transformations in spacetime

When a Lorentz transformation is applied due to a constant velocity \(v\) in the x-direction, formally only the coordinates \(x\) and \(t\) are directly affected:

\begin{align} x'=\gamma (x-vt) \end{align}
\begin{align} t'=\gamma \left(t-\frac{v}{c^2}x\right) \end{align}
\begin{align} y'=y \end{align}
\begin{align} z'=z \end{align}

At first glance, it appears that only the \(x - t\) plane changes, while the other spatial coordinates \(y\) and \(z\) remain unchanged. However, this is only partially correct.

When we consider spacetime as a four-dimensional structure, we see that all processes involving the time component \(t\) are indirectly affected. For example:

  • A clock located at a fixed \(y\)-position runs slower for a stationary observer, just as a clock at a fixed \(x\)-position does.
  • Similarly, events in the \(y - t\) or \(z - t\) planes are affected, because the time \(t\) is transformed.

Conversely, in planes such as \(y - x\) or \(z - x\), the coordinate \(x\) is directly transformed. Here too, spacetime is altered, although the effect is less directly visible.

The key insight is that the Lorentz transformation is not limited to the plane of motion. Every four-dimensional combination involving \(x\) or \(t\) is affected. This reveals a structural symmetry: although \(y\) and \(z\) themselves do not change, all processes in these directions evolve according to a transformed time or spatial component. The entire spacetime is restructured, and with it the physical descriptions within that frame.

Appendix 9.4 Trigonometric Tools

Since trigonometric formulas are frequently used in special relativity, we provide a brief overview of several of them and how they can be derived easily.

vector_9_2_4

By definition:

\begin{align} e^{i\theta} = \cos\theta + i\sin\theta \label{eq:R56} \end{align}

Where:

\begin{align} i=\sqrt{-1} \end{align}

Justification of this equation:

We first consider a function:

\begin{align} F(x) = e^{\alpha x}. \end{align}

Its derivative is:

\begin{align} \frac{dF(x)}{dx} = \alpha F(x). \end{align}

Thus, the derivative of an exponential function is a factor \(\alpha\) times the function itself.

Complex trigonometric function

Now consider the function:

\begin{align} F(x) = \cos(\alpha x) + i\sin(\alpha x). \end{align}

Its derivative is:

\begin{align} \frac{d}{dx}\left[\cos(\alpha x) + i\sin(\alpha x)\right] = -\alpha\sin(\alpha x) + i\alpha\cos(\alpha x) = i\alpha\left[\cos(\alpha x) + i\sin(\alpha x)\right]. \end{align}

Thus:

\begin{align} \frac{dF(x)}{dx} = i\alpha F(x). \end{align}

From this it follows that:

\begin{align} F(x) = e^{i\alpha x} = \cos(\alpha x) + i\sin(\alpha x). \end{align}

For \(\alpha = 1\), we obtain the well-known Euler equation:

\begin{align} \boxed{ e^{i\theta} = \cos\theta + i\sin\theta } \label{eq:R64} \end{align}

Derived trigonometric formulas

From (\ref{eq:R64}) it follows directly:

\begin{align} e^{-i\theta} = \cos\theta - i\sin\theta. \label{eq:R65} \end{align}

By adding (\ref{eq:R64}) and (\ref{eq:R65}):

\begin{align} \cos\theta = \frac{e^{i\theta} + e^{-i\theta}}{2}. \end{align}

By subtracting (\ref{eq:R64}) and (\ref{eq:R65}):

\begin{align} \sin\theta = \frac{e^{i\theta} - e^{-i\theta}}{2i}. \end{align}

Furthermore:

\begin{align} e^{i\theta} \cdot e^{-i\theta} = e^{i\theta - i\theta} = e^{0} = 1, \end{align}

and:

\begin{align} (\cos\theta + i\sin\theta)(\cos\theta - i\sin\theta) = \cos^{2}\theta + \sin^{2}\theta = 1. \end{align}

Hyperbolic functions

We define:

\begin{align} \cosh x = \frac{e^{x} + e^{-x}}{2}, \qquad \sinh x = \frac{e^{x} - e^{-x}}{2}. \end{align}

From this it follows:

\begin{align} \cosh(x) = \cosh(-x), \qquad \sinh(x) = -\sinh(-x). \end{align}

Furthermore:

\begin{align} \cosh(ix) = \cos x, \qquad \sinh(ix) = i\sin x. \end{align}

Thus:

\begin{align} \boxed{ \cosh(ix) = \cos x,\qquad \sinh(ix) = i\sin x. } \end{align}

These relations form the basis for the use of hyperbolic functions in special relativity, particularly in describing Lorentz boosts via rapidity.

Appendix 9.5 — Velocity addition

We consider two coordinate systems \(A\) and \(B\) moving with constant velocity \(v\) relative to each other. The axes are chosen such that the relative motion occurs along the \(x\)-axes.

In system \(A\), an object moves with velocity \(V'_x, V'_y, V'_z\). We now want to determine the velocity of this object relative to system \(B\).

According to Newton, the velocity in the \(x\)-direction would simply be:

\begin{align} V_x = V'_x + v. \end{align}
But according to special relativity, this is not correct.

Lorentz transformation

We begin with the Lorentz transformations:

\begin{align} ct' = \gamma(ct - \beta x), \label{eq:R75} \end{align}
\begin{align} x' = \gamma(x - \beta ct), \label{eq:R76} \end{align}
\begin{align} y' = y,\qquad z' = z, \end{align}

where:

\begin{align} \gamma = \frac{1}{\sqrt{1 - v^{2}/c^{2}}},\qquad \beta = \frac{v}{c}. \end{align}

The inverse transformation is:

\begin{align} ct = \gamma(ct' + \beta x'), \label{eq:R79} \end{align}
\begin{align} x = \gamma(x' + \beta ct'), \label{eq:R80} \end{align}
\begin{align} y = y',\qquad z = z'. \end{align}

Velocity in the \(x'\)-direction

Take the derivative of (\ref{eq:R76}):

\begin{align} V'_x = \frac{dx'}{dt'} = \gamma\frac{dx}{dt}\frac{dt}{dt'} - \beta c \frac{dt}{dt'} = \gamma(V_x - \beta c)\frac{dt}{dt'}. \label{eq:R82} \end{align}

Now take the derivative of (\ref{eq:R75}):

\begin{align} c = \gamma\left(c\frac{dt}{dt'} - \beta \frac{dx}{dt}\frac{dt}{dt'}\right) = \gamma\left(c - \beta V_x\right)\frac{dt}{dt'}. \end{align}

Thus:

\begin{align} \frac{dt}{dt'} = \frac{1}{\gamma\left(1 - \beta\frac{V_x}{c}\right)}. \label{eq:R84} \end{align}

Substitute (\ref{eq:R84}) into (\ref{eq:R82}):

\begin{align} V'_x = \gamma(V_x - \beta c) \cdot \frac{1}{\gamma\left(1 - \beta\frac{V_x}{c}\right)} = \frac{V_x - v}{1 - \frac{v V_x}{c^{2}}}. \label{eq:R85} \end{align}

This is the relativistic velocity addition in the \(x\)-direction.

Result

\begin{align} \boxed{ V'_x = \frac{V_x - v}{1 - \frac{v V_x}{c^{2}}} } \label{eq:R86} \end{align}

This replaces the Newtonian addition \(V'_x = V_x - v\).

The velocities in the other directions become:

\begin{align} V'_y = \frac{V_y}{\gamma\left(1 - \frac{v V_x}{c^{2}}\right)}, \qquad V'_z = \frac{V_z}{\gamma\left(1 - \frac{v V_x}{c^{2}}\right)}. \label{eq:R87} \end{align}

These formulas show that velocities do not simply add in relativity, but are influenced by both the Lorentz factor and the projection of the velocity onto the direction of motion.

From equation (\ref{eq:R86}) we already had:

\begin{align} V'_x = \frac{\gamma(V_x - \beta c)}{\gamma\left(1 - \beta \frac{V_x}{c}\right)} = \frac{V_x - \beta c}{1 - \beta \frac{V_x}{c}} = \frac{V_x - v}{1 - \frac{v V_x}{c^{2}}}. \label{eq:R88} \end{align}

Velocity in the \(y'\)-direction

\begin{align} V'_y = \frac{\partial y'}{\partial t'} = \frac{\partial y}{\partial t'} = \frac{\partial y}{\partial t}\frac{dt}{dt'} = V_y \frac{dt}{dt'}. \end{align}

From equation (\ref{eq:R84}):

\begin{align} \frac{dt}{dt'} = \gamma\left(1 - \beta\frac{V_x}{c}\right), \end{align}

thus:

\begin{align} V'_y = \frac{V_y}{\gamma\left(1 - \beta\frac{V_x}{c}\right)}. \label{eq:R91} \end{align}

Velocity in the \(z'\)-direction

In an identical manner, we obtain:

\begin{align} V'_z = \frac{V_z}{\gamma\left(1 - \beta\frac{V_x}{c}\right)}. \label{eq:R92} \end{align}

Interpretation of equation (\ref{eq:R84})

From (\ref{eq:R84}):

\begin{align} \frac{dt}{dt'} = \frac{1}{\gamma\left(1 - \beta\frac{V_x}{c}\right)} = \frac{1 - v^{2}/c^{2}}{1 - \frac{v V_x}{c^{2}}}. \end{align}

In the special case where \(V'_x = 0\), we have \(V_x = v\). Then:

\begin{align} \frac{dt}{dt'} = \frac{1 - v^{2}/c^{2}}{1 - v^{2}/c^{2}} = \frac{1}{1 - v^{2}/c^{2} } = \gamma^{2}. \end{align}

Thus:

\begin{align} dt' = \sqrt{1 - \frac{v^{2}}{c^{2}}}\, dt, \end{align}

and therefore:

\begin{align} dt' \ll dt. \end{align}

This is again time dilation.

Back to the general case

From (\ref{eq:R86}):

\begin{align} V'_x = \frac{V_x - v}{1 - \frac{v V_x}{c^{2}}}. \end{align}

Solving for \(V_x\) gives:

\begin{align} V_x = \frac{V'_x + v}{1 + \frac{v V'_x}{c^{2}}}. \end{align}

This is the inverse relativistic velocity addition.

In compact form:

\begin{align} \boxed{ V_x = \frac{V'_x + v}{1 + \beta \frac{V'_x}{c}} } \label{eq:R99} \end{align}

For the other components, we obtain analogously:

\begin{align} V_y = \frac{V'_y}{\gamma\left(1 + \beta\frac{V'_x}{c}\right)}, \label{eq:R100} \end{align}

and:

\begin{align} V_z = \frac{V'_z}{\gamma\left(1 + \beta\frac{V'_x}{c}\right)}. \label{eq:R101} \end{align}

Summary

The relativistic velocity addition is:

\begin{align} \boxed{ V'_x = \frac{V_x - v}{1 - \frac{v V_x}{c^{2}}} } \end{align}

\begin{align} \boxed{ V'_y = \frac{V_y}{\gamma\left(1 - \frac{v V_x}{c^{2}}\right)} } \end{align}

\begin{align} \boxed{ V'_z = \frac{V_z}{\gamma\left(1 - \frac{v V_x}{c^{2}}\right)} } \end{align}

and the inverse transformation:

\begin{align} \boxed{ V_x = \frac{V'_x + v}{1 + \frac{v V'_x}{c^{2}}} } \end{align}

\begin{align} \boxed{ V_y = \frac{V'_y}{\gamma\left(1 + \frac{v V'_x}{c^{2}}\right)} } \end{align}

\begin{align} \boxed{ V_z = \frac{V'_z}{\gamma\left(1 + \frac{v V'_x}{c^{2}}\right)} } \end{align}

For the \(z\)-component, we obtain in the same way:

\begin{align} V'_z = \frac{V_z}{\gamma\left(1 + \beta \frac{V'_x}{c}\right)}. \label{eq:R108} \end{align}

According to Newton, in the \(x\)-direction we would simply have an added velocity:

\begin{align} V_x = V'_x + v. \end{align}

But according to special relativity, this is corrected to:

\begin{align} V_x = \frac{V'_x + v}{1 + \frac{v V'_x}{c^{2}}}. \end{align}

In general, when the term \(\frac{v V'_x}{c^{2}}\) is very small, the relativistic result can be approximated by the Newtonian result:

\begin{align} V_x \approx V'_x + v. \end{align}

Appendix 9.6 Collisions

Consider a perfectly elastic collision between two identical particles; an elastic collision is a collision without loss of kinetic energy. The initial velocities of the particles are \(\vec{u_1}\) and \(\vec{u_2}\), and after the collision \(\vec{v_1}\) and \(\vec{v_2}\). Due to momentum conservation:

\begin{align} m_{1u}u_1+m_{2u}u_2=m_{1v}v_1+m_{2v}v_2 \end{align}
Here, \(m_{1u}\) and \(m_{2u}\) are the masses before the collision, and \(m_{1v}\) and \(m_{2v}\) the masses after the collision.

vector_9_6_1

First, we consider the collision from a coordinate system moving with particle one. Then particle 1 moves upward with velocity \(w_1\) and downward with \(w_2\). These velocities are equal in magnitude but opposite in direction. Particle 2 has velocity \(V\) with an x-component \(u\) and a y-component \(v\).

vector_9_6_2
Left: Collision between two identical particles in a coordinate system \(S\) moving with particle 1. Right: The same, but now \(S'\) moving with particle 2.

Relation between the y-components of momentum

We now want to find the relation between the y-components of the momentum of particles 1 and 2 in system \(S\), i.e., between \(w\) and \(v\).

In the previous section, we found the relation:

\begin{align} V'_y = \frac{V_y}{\gamma\left(1 - \beta \frac{V_x}{c}\right)}. \end{align}

Since in this case:

\begin{align} V_y = w,\qquad V_x = 0, \end{align}
we obtain:

\begin{align} v = \frac{w}{\gamma} \end{align}

By symmetry, \(w\) is the velocity of particle 1 in system \(S\) and the velocity of particle 2 in system \(S'\). Conversely, \(v\) is the y-component of particle 2 in \(S\) and of particle 1 in \(S'\).

Total velocity

The total velocity of the moving particle in \(S\) and in \(S'\) is the same:

\begin{align} V = \sqrt{v^{2} + u^{2}}. \end{align}

Momentum conservation in the y-direction

Momentum conservation in the y-direction gives:

\begin{align} m_w w - m_V v = -m_w w + m_V v. \end{align}

From this follows:

\begin{align} m_w w = m_V v. \end{align}

Thus:

\begin{align} \frac{m_V}{m_w} = \frac{w}{v} = \frac{w}{w/\gamma} = \gamma. \label{eq:R119} \end{align}

This result shows that the Lorentz factor \(\gamma\) arises directly from momentum conservation in a collision viewed symmetrically from two inertial frames.

Limit of small velocities

Now suppose that the velocity \(w\) is very small. In this limit:

\begin{align} \lim_{w \to 0} v = 0, \qquad \lim_{w \to 0} V = u. \end{align}

In that case, relativistic effects can be neglected and the classical expression for momentum is recovered.

Since:

\begin{align} \lim_{w \to 0} m_w = m, \end{align}
we substitute this into equation (\ref{eq:R119}):

\begin{align} \lim_{w \to 0} m_V = \gamma m = \frac{m}{\sqrt{1 - \frac{u^{2}}{c^{2}}}}. \end{align}

Due to momentum conservation, the definition of momentum must be modified. The relativistic momentum is therefore:

\begin{align} \boxed{ \vec{p} = \gamma m \vec{v} } \end{align}

Appendix 9.7 — The Energy of a Moving Object

Using a thought experiment, Einstein showed that energy and mass are equivalent via the relation \(E=mc^2\). We have shown that for an object moving with velocity, momentum must be adapted to the relativistic description:

\begin{align} \vec{p}=\gamma m \vec{v} \end{align}
Thus, the energy of an object can be written as:
\begin{align} E=\gamma m c^2 \end{align}
Thus:
\begin{align} E=\frac{mc^2}{\sqrt{1-\frac{u^2}{c^2}}} \end{align}
With the Taylor series expansion:
\begin{align} E=\gamma m c^2 \approx mc^2\left(1+\frac{v^2}{2c^2}-\frac{3v^4}{8c^4}\text{.....} \right) \end{align}
If \(v\) is much smaller than \(c\), the third and higher-order terms in the parentheses can be neglected. This leads to:
\begin{align} E\approx mc^2+\frac{1}{2}mv^2 \end{align}
Thus, this is the kinetic energy \(\frac{1}{2}mv^2\) plus a constant \(mc^2\).

Appendix 9.8 — Energy-Momentum Vector

As found by Minkowski, the spacetime interval is:

\begin{align} c^{2} d\tau^{2} = c^{2} dt^{2} - dx^{2} - dy^{2} - dz^{2}. \label{eq:R129} \end{align}

We write this as:

\begin{align} c^{2} d\tau^{2} = c^{2} dt^{2} \left( 1 - \frac{dx^{2} + dy^{2} + dz^{2}}{c^{2} dt^{2}} \right) = c^2dt^{2}\left(1 - \frac{v^{2}}{c^{2}}\right). \end{align}

Since:

\begin{align} \gamma = \frac{1}{\sqrt{1 - v^{2}/c^{2}}}, \end{align}
it follows that:

\begin{align} 1 - \frac{v^{2}}{c^{2}} = \frac{1}{\gamma^{2}}, \qquad d\tau^{2} = \frac{dt^{2}}{\gamma^{2}}, \qquad \gamma = \frac{dt}{d\tau}. \end{align}

Derivation of the energy–momentum relation

Start again from (\ref{eq:R129}):

\begin{align} c^{2} = c^{2}\left(\frac{dt}{d\tau}\right)^{2} - \left(\frac{dx}{dt}\right)^{2}\left(\frac{dt}{d\tau}\right)^{2} - \left(\frac{dy}{dt}\right)^{2}\left(\frac{dt}{d\tau}\right)^{2} - \left(\frac{dz}{dt}\right)^{2}\left(\frac{dt}{d\tau}\right)^{2} \end{align}

Multiply by the rest mass \(m_{0}^{2}\):

\begin{align} m_{0}^{2} c^{2} = m_{0}^{2} c^{2}\left(\frac{dt}{d\tau}\right)^{2} - m_{0}^{2}\left(\frac{dx}{dt}\right)^{2}\left(\frac{dt}{d\tau}\right)^{2} - m_{0}^{2}\left(\frac{dy}{dt}\right)^{2}\left(\frac{dt}{d\tau}\right)^{2} - m_{0}^{2}\left(\frac{dz}{dt}\right)^{2}\left(\frac{dt}{d\tau}\right)^{2} \end{align}

Since:

\begin{align} \frac{dt}{d\tau} = \gamma, \end{align}
we obtain:

\begin{align} m_{0}^{2} c^{2} = \gamma^{2} m_{0}^{2} c^{2} - \gamma^{2} m_{0}^{2} v_x^{2} - \gamma^{2} m_{0}^{2} v_y^{2} - \gamma^{2} m_{0}^{2} v_z^{2}. \end{align}

Now define the four-momentum:

\begin{align} p_{0} = \frac{E}{c}, \qquad p_{1} = p_x, \qquad p_{2} = p_y, \qquad p_{3} = p_z, \end{align}

with:

\begin{align} p_i = \gamma m_0 v_i. \end{align}

Then the Minkowski norm becomes:

\begin{align} p_{\tau}^{2} = \left(\frac{E}{c}\right)^{2} - p_x^{2} - p_y^{2} - p_z^{2} = m_{0}^{2} c^{2}. \end{align}

Or more compactly:

\begin{align} E^{2} - c^{2} |\vec{p}|^{2} = m_{0}^{2} c^{4}. \end{align}

Thus:

\begin{align} \boxed{ E^{2} = m_{0}^{2} c^{4} + c^{2} p^{2} } \end{align}

and for positive energy:

\begin{align} \boxed{ E = +\sqrt{m_{0}^{2} c^{4} + c^{2} p^{2}}. } \end{align}

We previously found that:

\begin{align} p = \gamma m_0 v = \frac{m_0 v}{\sqrt{1 - \frac{v^{2}}{c^{2}}}, } \end{align}
where \(m_0\) is the rest mass (the mass at zero velocity).

From the relation:

\begin{align} E = \frac{m_0 c^{2}}{\sqrt{1 - \frac{v^{2}}{c^{2}}} } \end{align}
it follows that:

\begin{align} E^{2} = \frac{m_0^{2} c^{4}}{1 - \frac{v^{2}}{c^{2}}}. \end{align}

After further division:

\begin{align} E^{2} = m_0^{2} c^{4} + \frac{m_0^{2} c^{4} \frac{v^{2}}{c^{2}}}{1 - \frac{v^{2}}{c^{2}}} = m_0^{2} c^{4} + \frac{m_0^{2} v^{2} c^{2}}{1 - \frac{v^{2}}{c^{2}}}. \end{align}

But:

\begin{align} p = \frac{m_0 v}{\sqrt{1 - \frac{v^{2}}{c^{2}}} } \quad\Rightarrow\quad p^{2} = \frac{m_0^{2} v^{2}}{1 - \frac{v^{2}}{c^{2}}}. \end{align}

Thus:

\begin{align} E^{2} = m_0^{2} c^{4} + p^{2} c^{2}. \end{align}

Or, as commonly written:

\begin{align} \boxed{ E^{2} = p^{2} c^{2} + m_{0}^{2} c^{4} } \label{eq:R149} \end{align}

where:

\begin{align} p = \frac{m_0 v}{\sqrt{1 - \frac{v^{2}}{c^{2}}}. } \end{align}

Appendix 9.8.1 Alternative derivation of the energy–momentum–mass relation

We had:

\begin{align} p=mv \end{align}
\begin{align} p=\gamma m_0 v=\gamma m_0 c^2 \frac{v}{c^2} \end{align}
\begin{align} pc=\gamma m_0 c^2\frac{v}{c}=\beta \gamma m_0 c^2 \end{align}

Here:

\begin{align} \gamma=\frac{1}{\sqrt{1-\beta^2}} \quad \text{and} \quad \beta=\frac{v}{c} \end{align}

Now, using the above, we examine what happens:

\begin{align} \left( pc \right)^2+\left(m_0c^2 \right)^2=\left(\beta \gamma m_0 c^2 \right)^2+\left(m_0c^2 \right)^2 \end{align}
\begin{align} \left( pc \right)^2+\left(m_0c^2 \right)^2=\left(m_0 c^2 \right)^2\left(\beta^2\gamma^2+1\right)^2 \end{align}
\begin{align} \left( pc \right)^2+\left(m_0c^2 \right)^2=\left(m_0 c^2 \right)^2\left(1+\beta^2 \frac{1}{1-\beta^2}\right)^2 \end{align}
\begin{align} \left( pc \right)^2+\left(m_0c^2 \right)^2 =\left(m_0 c^2 \right)^2\left(\frac{1-\beta^2+\beta^2}{1-\beta^2}\right)^2 =\left(m_0 c^2 \right)^2\left(1+\beta^2 \frac{1}{1-\beta^2}\right)^2=\left(m_0 c^2\right)^2 \gamma^2 \end{align}
\begin{align} \left( pc \right)^2+\left(m_0c^2 \right)^2=\left(m_0 c^2\right)^2 \gamma^2 =\left(\gamma m_0 c^2\right)^2=E^2 \end{align}

Thus:

\begin{align} \boxed{ E^{2} = p^{2} c^{2} + m_{0}^{2} c^{4} } \end{align}

Appendix 9.8.2 — Classical proof of energy conservation

The total mechanical energy of a particle is the sum of the kinetic energy \(K\) and the potential energy \(U\):

\begin{align} E = \frac{1}{2} m v^{2} + U(x). \end{align}

Take the time derivative (one-dimensional motion):

\begin{align} \frac{dE}{dt} = m v \frac{dv}{dt} + \frac{dU}{dx}\frac{dx}{dt} = m v a + v \frac{dU}{dx}. \end{align}

The force associated with a potential energy \(U(x)\) is:

\begin{align} F = -\frac{dU}{dx}. \end{align}

Thus:

\begin{align} \frac{dE}{dt} = v\left( m a + \frac{dU}{dx} \right) = v\left( m a - F \right). \end{align}

According to Newton’s second law:

\begin{align} F = m a. \end{align}

Thus:

\begin{align} \frac{dE}{dt} = 0. \end{align}

Therefore:

\begin{align} \boxed{E = \text{constant}} \end{align}

The total mechanical energy is thus conserved.

Appendix 9.9 Derivation of \(E=mc^2\)

Einstein’s thought experiment with a light pulse in a box

Einstein derived the equation \(E = mc^{2}\) through an elegant thought experiment. Consider a stationary box floating freely in space, without the influence of gravity or external forces.

On the left side of the box, a photon is emitted moving to the right. Due to conservation of momentum, the box moves slightly to the left. When the photon reaches the right wall, it transfers its entire momentum to the box, causing the box to stop moving.

The photon has moved, and the box has also moved, but there are no external forces present. Therefore, the center of mass of the total system must remain constant.

Relativistic energy of the photon

From Appendix 9.6 (equation (\ref{eq:R149})) we know:

\begin{align} E^{2} = p^{2} c^{2} + m_{0}^{2} c^{4}. \end{align}

For a photon \(m_{0} = 0\), so:

\begin{align} E = pc. \end{align}

The momentum of the photon is therefore:

\begin{align} p_{\text{photon}} = \frac{E}{c}. \end{align}

Momentum of the box

The box with mass \(M\) moves slightly to the left with velocity \(v\). The momentum of the box is:

\begin{align} p_{\text{box}} = M v. \end{align}

During the time \(\Delta t\) it takes the photon to reach the right side, the box moves a distance \(\Delta x\). The velocity of the box is:

\begin{align} v = -\frac{\Delta x}{\Delta t}. \end{align}

Due to conservation of momentum:

\begin{align} p_{\text{photon}} + p_{\text{box}} = 0 \quad\Rightarrow\quad p_{\text{box}} = -p_{\text{photon}}. \end{align}

Thus:

\begin{align} M \frac{\Delta x}{\Delta t} = \frac{E}{c}. \end{align}

The length of the box is \(L\), so the time it takes the photon to reach the other side is:

\begin{align} \Delta t = \frac{L}{c}. \end{align}

Thus:

\begin{align} M \Delta x = \frac{E L}{c^{2}}. \end{align}

Center of mass of the system

Now suppose hypothetically that the photon has a small mass \(m\). Then we can determine the center of mass of the system. If the position of the box is \(x_{1}\) and the position of the photon is \(x_{2}\), then the center of mass is:

\begin{align} \bar{x} = \frac{M x_{1} + m x_{2}}{M + m}. \end{align}

Since there are no external forces, this center of mass must remain constant:

\begin{align} \frac{Mx_1+mx_2}{M+m}=\frac{M(x_1-\Delta x)+mL}{M+m} \end{align}

The photon starts at \(x_2 = 0\), so we obtain:

\begin{align} mL = M\Delta x \end{align}

Now we get:

\begin{align} mL=\frac{EL}{c^2} \end{align}

With some rearrangement we obtain the famous relation:

\begin{align} \boxed{ E = mc^{2}. } \end{align}

Remark — precise treatment of the photon path

In the original derivation it is assumed that the photon travels a distance L. But in reality, the box moves a small distance \(\Delta x\) in the opposite direction during the photon’s flight. The effective path of the photon is therefore:

\begin{align} L\sqrt{1 - \frac{v^{2}}{c^{2}}}\; - \Delta x. \end{align}

This leads to an adjusted travel time:

\begin{align} \Delta t = \frac{L\sqrt{1 - \frac{v^{2}}{c^{2}}} - \Delta x}{c}. \end{align}

The momentum balance gives:

\begin{align} M\,\frac{\Delta x}{\Delta t} = \frac{E}{c} \quad\Rightarrow\quad M\Delta x = \frac{E}{c}\,\Delta t. \end{align}

Substituting \(\Delta t\) gives:

\begin{align} M\Delta x = \frac{E}{c^{2}} \left( L\sqrt{1 - \frac{v^{2}}{c^{2}}} - \Delta x \right). \end{align}

Center-of-mass condition

The center of mass of box + photon must remain constant:

\begin{align} \frac{M x_{1} + m x_{2}}{M + m} = \frac{M(x_{1} - \Delta x) + m\left(L\sqrt{1 - \frac{v^{2}}{c^{2}}} - \Delta x\right)} {M + m}. \end{align}

This gives:

\begin{align} -M\Delta x + m\left(L\sqrt{1 - \frac{v^{2}}{c^{2}}} - \Delta x\right) = 0. \end{align}

Thus:

\begin{align} m\left(L\sqrt{1 - \frac{v^{2}}{c^{2}}} - \Delta x\right) = M\Delta x. \end{align}

But earlier we found:

\begin{align} M\Delta x = \frac{E}{c^{2}} \left( L\sqrt{1 - \frac{v^{2}}{c^{2}}} - \Delta x \right). \end{align}

Therefore:

\begin{align} \frac{E}{c^{2}} \left( L\sqrt{1 - \frac{v^{2}}{c^{2}}} - \Delta x \right) = m\left( L\sqrt{1 - \frac{v^{2}}{c^{2}}} - \Delta x \right). \end{align}

The factor in parentheses is nonzero, so:

\begin{align} \boxed{E = m c^{2}}. \end{align}

Conclusion

Even when we:

  • include the displacement of the box \(\Delta x\),
  • use the shortened photon path,
  • include Lorentz contraction,

the derivation still leads exactly to:

\begin{align} \boxed{E = m c^{2}}. \end{align}

Appendix 9.10 — Applications

Appendix 9.10.1 — Nuclear Fusion and Fission

When a proton \(p\) and a neutron \(n\) are brought together, they can fuse into a deuterium nucleus \(d\). The masses of the particles involved are:

\begin{align} m_{p} = 938.27231\ \text{MeV}/c^{2},\qquad m_{n} = 939.56563\ \text{MeV}/c^{2},\qquad m_{d} = 1875.61339\ \text{MeV}/c^{2}. \end{align}

Unit: MeV/\(c^{2}\)

From the relation \(E = mc^{2}\) it follows that mass can be expressed as energy divided by \(c^{2}\). In particle physics, the electronvolt (eV) is therefore often used:

\begin{align} 1\ \text{eV} = 1.6\times 10^{-19}\ \text{J},\qquad 1\ \text{MeV} = 10^{6}\ \text{eV}. \end{align}

The unit MeV/\(c^{2}\) is thus a practical measure of mass.

Energy released in fusion

Since the mass of the deuteron is smaller than the sum of the masses of the proton and neutron, energy must have been released. If \(p\) and \(n\) combine with negligible velocity:

\begin{align} E = m_{p}c^{2} + m_{n}c^{2} - m_{d}c^{2} = 2.22455\ \text{MeV}. \end{align}
vector_9_10_1_1

This energy is released in the form of a photon:

\begin{align} p + n \;\rightarrow\; d + \gamma. \end{align}

A photon is massless and carries energy and momentum. To ensure conservation of momentum, the formed deuteron moves in the opposite direction of the photon. Since the mass of \(d\) is large, the kinetic energy of \(d\) is very small:

\begin{align} E = \sqrt{p^{2}c^{2} + m^{2}c^{4}} \approx mc^{2} \qquad\text{if }\quad pc \ll mc^{2}. \end{align}

Nuclear fusion

The reaction described above is an example of nuclear fusion. Light nuclei can fuse into heavier nuclei while releasing energy. All nuclei up to iron (\(^{56}\text{Fe}\)) can be formed via fusion with net energy production.

Nuclear fission

For very heavy nuclei, such as uranium, the opposite holds: the total mass of the nucleus is greater than the sum of the masses of the individual nucleons. Therefore, energy is released when such heavy nuclei are split:

nuclear fission.

This explains why:

  • fusion releases energy for light elements,
  • fission releases energy for heavy elements.

Appendix 9.10.2 — Driving an Electric Car on 1 gram of Hydrogen via Nuclear Fusion

Here we investigate how much energy is released during hydrogen fusion as in the Sun, where four hydrogen atoms fuse into one helium atom. A small fraction of the mass disappears and is converted into energy according to \(E = mc^{2}\). We then determine how many kilometers an electric car could theoretically drive using this energy.

1. Energy yield of nuclear fusion

Fusion in the Sun proceeds via the proton–proton cycle. The net reaction is:

\begin{align} 4\,{}^{1}_1\!H \;\rightarrow\; {}^{4}_2\!He + 2e^{+} + 2\nu_{e} + 2\gamma. \end{align}

The mass of four hydrogen atoms is greater than that of one helium nucleus. The mass difference is released as energy. Per fusion of four hydrogen atoms, approximately:

\begin{align} 26.7\ \text{MeV} \end{align}
released.

In 1 gram of hydrogen there are approximately \(6.022\times 10^{23}\) (Avogadro’s number) hydrogen atoms (1 mole). Thus, in 1 gram of hydrogen we have

\begin{align}\frac{6.022 \times 10^{23}}{4}\approx1.505 \times 10^{23}\end{align}
fusion reactions.

Each fusion reaction yields 26.7 MeV of energy, so total energy:

\begin{align} E_{\text{fusion}} = 1.505\times 10^{23} \times 26.7\ \text{MeV}. \end{align}

2. Conversion from MeV to Joule

A Joule is equal to moving a charge of 1 Coulomb through a potential of 1 Volt. Thus:

\begin{align}Joule=qV\end{align}
The charge of an electron \(e\) is \(1.60218 \times 10^{-19}\, \text{Coulomb}\).
Then:

\begin{align} 1\ \text{eV} = 1.60218\times 10^{-19}\ \text{J},\qquad 1\ \text{MeV} = 1.60218\times 10^{-13}\ \text{J}. \end{align}

Thus per 1 gram of hydrogen, the total energy in Joules is:

\begin{align} E_{\text{tot}} = 1.505\times 10^{23} \times 26.7 \times 1.60218\times 10^{-13} \approx 6.43\times 10^{11}\ \text{J}. \end{align}

3. Calculation of the energy:

\begin{align} E_{\text{tot}} \approx 6.43\times 10^{11}\ \text{Joules per gram of hydrogen}. \end{align}

This is the energy released in this process, where a small portion of the mass is converted into energy.

For comparison, we can consider the theoretical calculation where 1 gram of matter is fully converted according to \(E=mc^2\):

\begin{align} E=\frac{1}{1000}\times \left(3\times 10^8\right)^2 \approx 9 \times 10^{13}\, \text{Joules} \end{align}
Thus this differs by about a factor of 140 (or in percentage terms, fusion is about 0.7% of the energy from total conversion of 1 gram of mass).

4. Alternative mass defect calculation

In fusion, 4 moles of hydrogen are converted into 1 mole of helium:

\begin{align} m_{H,4} = 4\times 1.00784 = 4.03136\ \text{g}, \qquad m_{He} = 4.0026\ \text{g}. \end{align}

Mass difference:

\begin{align} \Delta m = 0.02876\ \text{g} = 2.876\times 10^{-5}\ \text{kg}. \end{align}

Energy released:

\begin{align} E = \Delta m\,c^{2} = 2.876\times 10^{-5}(3\times 10^{8})^{2} \approx 2.588\times 10^{12}\ \text{J} \end{align}
for 4.03136 g hydrogen, thus:
\begin{align} E \approx 6.42\times 10^{11}\ \text{J per gram}. \end{align}

5. Energy consumption of an electric car

Electric cars consume on average:

\begin{align} 17\ \text{kWh per 100 km}. \end{align}
\begin{align} 1\ \text{kWh} = 3.6\times 10^{6}\ \text{J}, \qquad 17\ \text{kWh} = 61.2\times 10^{6}\ \text{J per 100 km}. \end{align}

6. Theoretical driving range (100% efficiency)

\begin{align} \text{Distance} = \frac{6.43\times 10^{11}}{61.2\times 10^{6}} \times 100\ \text{km} \approx 1.05\times 10^{6}\ \text{km}. \end{align}

7. Realistic efficiency

  • Fusion → electricity efficiency: 40%
  • Electric drivetrain efficiency: 90%

Total efficiency:

\begin{align} \eta = 0.4 \times 0.9 = 0.36. \end{align}

Usable energy:

\begin{align} E_{\text{usable}} = 6.43\times 10^{11} \times 0.36 = 2.31\times 10^{11}\ \text{J}. \end{align}

8. Practical driving range

\begin{align} \text{Distance} = \frac{2.31\times 10^{11}}{61.2\times 10^{6}} \times 100\ \text{km} \approx 3.77\times 10^{5}\ \text{km}. \end{align}

Thus, an electric car can theoretically:

\begin{align} \boxed{ \text{approximately } 3.77\times 10^{5}\ \text{km} } \end{align}
drive on the energy from nuclear fusion of 1 gram of hydrogen.

With an average annual mileage of 15,000 km, this means:

\begin{align} \frac{3.77\times 10^{5}}{1.5\times 10^{4}} \approx 25\ \text{years}. \end{align}

In other words: with 1 gram of hydrogen you can drive electrically for about 25 years.

Appendix 9.11 — Relativistic Electromagnetism

(Calculations based on Richard Feynman, Feynman Lectures on Physics, Vol. II, Chapter 13 )

Appendix 9.11.1 — Introduction

The term electromagnetism suggests that there are two types of fields: an electric field and a magnetic field, each with its own sources. In reality, we know only one fundamental source: electric charge.

Electric charges — electrons with charge \(-e\) and protons with charge \(+e\) — are the only known sources of the electric field. To date, no magnetic monopoles have been found that could serve as a source of a magnetic field.

It appears that magnetic fields always arise from:

  • moving electric charges (current), or
  • time variations in the electric field.

Even at the quantum level, magnetic fields result from electrical phenomena, such as the spins of electrons and atoms.

Therefore, one can argue that the magnetic field model is an extremely useful mathematical tool for describing electromagnetic phenomena, but that the underlying physical phenomenon is entirely electrical in nature: an electric field and its variation in space and time.

Appendix 9.11.2 — Calculations

When analyzing a current-carrying wire, we normally use the Maxwell equations to determine both the electric and magnetic fields.

An alternative — and in a relativistic context very insightful — approach is to perform the entire calculation based solely on the electric field, and to treat the magnetic field as a relativistic byproduct.

This idea forms the core of Feynman’s treatment of electromagnetism: the magnetic field is what an electric field looks like when viewed from another inertial frame.

vector_9_11_2_1
Fig. 1. The interaction between a current carrier and a charge.

We consider a wire carrying an electric current and a test charge \(q\). The situation is observed in two different inertial frames:

  • Frame S — the wire is at rest, the charge is moving.
  • Frame S′ — the charge is at rest, the wire is moving.

Although the physical situation is the same, the fields in both frames will be observed differently. This is precisely where relativity and electromagnetism meet.

In the following sections, we will derive several fundamental formulas that show how electric fields and charge densities transform under Lorentz transformations, and how the magnetic field follows automatically from this.

Current density and charge distribution

The current density is the average flow velocity of the charges. Suppose there is a distribution of charges with an average velocity \(\vec{v}\). The charge \(\Delta q\) that passes through a surface element \(\Delta S\) in a time interval \(\Delta t\) is:

\begin{align} \Delta q = \rho\, \vec{v}\cdot\vec{n}\, \Delta S\, \Delta t. \label{eq:R219} \end{align}

Here \(\rho\) is the charge density: charge per unit volume. The term \(\vec{v}\Delta t \cdot \Delta S\) can be interpreted as a volume. Thus, the charge is the charge density times the volume.

The charge per unit time is then:

\begin{align} \rho\, \vec{v}\cdot\vec{n}\, \Delta S. \end{align}

Therefore we define the current density:

\begin{align} \boxed{\vec{j} = \rho\, \vec{v}} \label{eq:R221} \end{align}

The total current through a surface \(S\) is:

\begin{align} i = \int_{S} \vec{j}\cdot d\vec{S}. \label{eq:R222} \end{align}

Current carrier at rest

We now consider a wire that is at rest. The electrons (negative charges) move with velocity \(v\) to the right. The protons (positive charges) remain at rest in the wire.

A test particle with negative charge \(q^{-}\) moves with the same velocity as the electrons to the right. We observe everything in the frame where the wire is at rest.

The wire is electrically neutral:

\begin{align} \rho_{+} + \rho_{-} = 0. \end{align}

Force on the test particle

The force on a charge is given by the Lorentz force:

\begin{align} \vec{F} = q(\vec{E} + \vec{v} \times \vec{B}). \end{align}

The magnetic field around a long straight wire is:

\begin{align} H = \frac{i}{2\pi r}, \qquad B = \mu_{0} H. \end{align}

Since the wire is neutral, the electric field outside the wire is zero:

\begin{align} \vec{E} = 0. \end{align}

The force on the test particle is then:

\begin{align} F = q\, v B \sin\varphi. \end{align}

Since \(\vec{v}\) is perpendicular to \(\vec{B}\), we have \(\sin\varphi = 1\), so:

\begin{align} F = q v B = q v \mu_{0} H = q v \mu_{0} \frac{i}{2\pi r}. \end{align}

Charge density

The charge density is defined as:

\begin{align} \rho = \frac{q}{V}. \end{align}

If \(A\) is the cross-sectional area of the wire and \(L\) an arbitrary length along the wire, then the volume is:

\begin{align} V = A L, \end{align}
and thus:
\begin{align} q = \rho A L. \end{align}

When the wire is at rest:

\begin{align} \rho_{+} + \rho_{-} = 0. \end{align}

This forms the basis for the relativistic analysis that follows: in one frame the wire is neutral, but in another frame — due to Lorentz contraction — the charge density changes, and an electric field arises that exactly corresponds to the magnetic effect in the original frame.

Relativistic analysis from the perspective of the test particle

We now consider the situation from the frame in which the test particle is at rest. In this frame, the wire moves to the left with velocity \(v\). The volume is determined by the cross-sectional area \(A\) and the length \(L\).

The length of a moving volume relative to a volume at rest is:

\begin{align} L_{\text{moving}} = L_{\text{rest}} \sqrt{1 - \frac{v^{2}}{c^{2}}}. \end{align}

Since the electrons have the same velocity as the test particle, they are at rest in this frame. Thus:

\begin{align} L_{\text{rest}} = \frac{L_{\text{moving}}}{\sqrt{1 - \frac{v^{2}}{c^{2}}}}. \end{align}

The positive ions now move with velocity \(v\) to the left. Their length is Lorentz-contracted by the factor:

\begin{align} \frac{1}{\sqrt{1 - \frac{v^{2}}{c^{2}}}}. \end{align}

New charge density

In the rest frame of the wire, the external electric field was zero:

\begin{align} \rho_{+} + \rho_{-} = 0. \end{align}

But in the frame of the test particle, the moving length is smaller, thus the moving volume is smaller, and therefore the charge density is larger.

The charge density of the electrons becomes:

\begin{align} \rho_{-}' = \rho_{-} \sqrt{1 - \frac{v^{2}}{c^{2}}}. \end{align}

The positive charge density becomes:

\begin{align} \rho_{+}' = \frac{\rho_{+}}{\sqrt{1 - \frac{v^{2}}{c^{2}}}}. \end{align}

The total charge density is therefore:

\begin{align} \rho_{\text{net}} = \rho_{+}' + \rho_{-}' = \frac{\rho_{+}}{\sqrt{1 - \frac{v^{2}}{c^{2}}}} + \rho_{-} \sqrt{1 - \frac{v^{2}}{c^{2}}}. \end{align}

Since \(\rho_{-} = -\rho_{+}\):

\begin{align} \rho_{\text{net}} = \rho_{+} \left( \frac{1}{\sqrt{1 - \frac{v^{2}}{c^{2}}}} - \sqrt{1 - \frac{v^{2}}{c^{2}}} \right). \end{align}

Rewrite this as:

\begin{align} \rho_{\text{net}} = \rho_{+} \frac{ 1 - \left(1 - \frac{v^{2}}{c^{2}}\right) }{ \sqrt{1 - \frac{v^{2}}{c^{2}}} } = \rho_{+} \frac{\frac{v^{2}}{c^{2}}}{ \sqrt{1 - \frac{v^{2}}{c^{2}}} }. \end{align}

Thus:

\begin{align} \boxed{ \rho_{\text{net}} = \rho_{+}\, \frac{v^{2}/c^{2}}{\sqrt{1 - v^{2}/c^{2}}} } \end{align}

Charge in a length \(L\)

The volume of a length \(L\) of the wire is:

\begin{align} V = A L. \end{align}

The total charge in this volume is:

\begin{align} q = \rho_{\text{net}} A L = \rho_{+}\, \frac{v^{2}/c^{2}}{\sqrt{1 - v^{2}/c^{2}}} A L. \end{align}

Since \(\rho_{\text{net}} \neq 0\), the electric field outside the wire is no longer zero. It is perpendicular to the wire and behaves like the field of a charged line.

Volume of a cylindrical Gaussian surface

Consider a cylindrical tube around the wire, with:

  • length \(L\),
  • radius \(r\).

The lateral surface area is:

\begin{align} S = 2\pi r L. \end{align}

This will be used to determine the electric field via Gauss’s law:

\begin{align} \oiint \vec{E}\cdot d\vec{S} = \frac{q}{\varepsilon_{0}}. \end{align}

In the next step, this leads to an electric field that exactly corresponds to the magnetic force in the original frame — a beautiful example of how magnetism is a relativistic effect.

Electric field in the rest frame of the test particle

From Gauss’s law, the electric field outside the wire follows:

\begin{align} E = \frac{\rho_{+}\, v^{2}/c^{2}}{2\pi \varepsilon_{0} rL} \frac{A L}{\sqrt{1 - v^{2}/c^{2}}} = \frac{\rho_{+} v^{2}}{2\pi \varepsilon_{0} r c^{2}} \frac{1}{\sqrt{1 - v^{2}/c^{2}}}A \end{align}

Thus, the force on the test particle in this frame is:

\begin{align} F' = qE = q\, \frac{\rho_{+} v^{2}}{2\pi \varepsilon_{0} r c^{2}} \frac{A}{\sqrt{1 - v^{2}/c^{2}}}. \label{eq:R248} \end{align}

For \(v \ll c\), this becomes:

\begin{align} F' \approx q\, \frac{\rho_{+}}{2\pi \varepsilon_{0} r} \frac{v^{2}}{c^{2}} A. \end{align}

Force in the original frame (magnetic)

In the rest frame of the wire, the force was:

\begin{align} F = q v B = q v \mu_{0} H = q v \mu_{0} \frac{i}{2\pi r}. \label{eq:R250} \end{align}

Since the current density:

\begin{align} J = \rho v, \end{align}
we obtain:

\begin{align} F = q v \mu_{0} \frac{J A}{2\pi r} = q v \mu_{0} \frac{\rho v A}{2\pi r}. \end{align}

Now use:

\begin{align} c^{2} = \frac{1}{\varepsilon_{0}\mu_{0}} \quad\Rightarrow\quad \mu_{0} = \frac{1}{\varepsilon_{0} c^{2}}, \end{align}
thus:

\begin{align} F = q\, \frac{\rho v^{2} A}{2\pi r\, \varepsilon_{0} c^{2}}. \label{eq:R254} \end{align}

Comparison of the two forces

From (\ref{eq:R248}) and (\ref{eq:R254}) it follows:

\begin{align} F' = \frac{F}{\sqrt{1 - v^{2}/c^{2}}}. \end{align}

Or:

\begin{align} \boxed{ F' = \gamma F } \end{align}

This is exactly what we expect: the force in the transverse plane (y-direction) transforms with a factor \(\gamma\).

Momentum relation in both frames

The forces act only in the transverse y-direction. Therefore, the change in momentum in the y-direction must be the same in both frames.

In the original frame:

\begin{align} \Delta p_{y} = F\, \Delta t. \end{align}

In the frame of the test particle:

\begin{align} \Delta p'_{y} = F'\, \Delta t'. \end{align}

Since time runs slower for a moving particle:

\begin{align} \Delta t' = \frac{\Delta t}{\gamma}. \end{align}

Substitute this into the momentum equation:

\begin{align} \Delta p'_{y} = F' \Delta t' = \gamma F \cdot \frac{\Delta t}{\gamma} = F \Delta t = \Delta p_{y}. \end{align}

Thus:

\begin{align} \boxed{ \Delta p'_{y} = \Delta p_{y} } \end{align}

This confirms that the transverse momentum is invariant under Lorentz transformation.

And once again we see that the magnetic field in one frame is nothing more than an electric field in another frame — one of the most beautiful results of special relativity.

Relation between forces in both frames

From time dilation it follows:

\begin{align} \Delta t = \frac{\Delta t'}{\sqrt{1 - \frac{v^{2}}{c^{2}}}}. \end{align}

The momentum change in both frames is:

\begin{align} \Delta p_{y} = F\, \Delta t, \qquad \Delta p'_{y} = F'\, \Delta t'. \end{align}

Since transverse momentum is invariant:

\begin{align} \Delta p_{y} = \Delta p'_{y}, \end{align}
thus:

\begin{align} F\, \Delta t = F'\, \Delta t' = F'\, \Delta t \sqrt{1 - \frac{v^{2}}{c^{2}}}. \end{align}

It follows that:

\begin{align} F' = \frac{F}{\sqrt{1 - \frac{v^{2}}{c^{2}}}}. \label{eq:R266} \end{align}

Using the results from (\ref{eq:R250}) and (\ref{eq:R254}), we obtain:

\begin{align} F = q v B = \sqrt{1 - \frac{v^{2}}{c^{2}}}\, F' = \sqrt{1 - \frac{v^{2}}{c^{2}}}\, q E. \end{align}

Or:

\begin{align} \boxed{ q v B = \sqrt{1 - \frac{v^{2}}{c^{2}}}\, q E } \end{align}

This is exactly the relativistic relation between electric and magnetic forces.

Appendix 9.11.3 — Conclusion

We have found that we obtain the same physical result, regardless of whether we analyze the motion of a particle along a current-carrying wire in:

  • a coordinate system at rest with respect to the wire, or
  • a system at rest with respect to the particle.

In the first case, the force was entirely magnetic. In the second case, the force was entirely electric.

Since both descriptions lead to exactly the same momentum change, electric and magnetic fields must be manifestations of one and the same underlying relativistic field.

This demonstrates that:

\begin{align} \boxed{ \text{Magnetism is a relativistic effect of electricity.} } \end{align}

This is one of the most beautiful and profound insights of special relativity, and forms the basis for the tensor formulation of the electromagnetic field.