Einstein’s General Theory of Relativity

Derivations, Applications and Reflections – by Albert Prins

Part IV – Experiments and Verifications


4 Experiments Confirming Einstein’s Theory

In this chapter we discuss a series of experiments that empirically support Einstein’s general theory of relativity. A central tool in the analysis of these experiments is the Schwarzschild solution of the Einstein field equations.

The following experiments are discussed:

Together, these experiments form powerful evidence for the validity of the general theory of relativity. In each case, the Schwarzschild metric provides a mathematical framework that accurately explains the observed phenomena.

4.1 Experiment 1 – The Hafele & Keating Experiment with the Schwarzschild Equation

Derivation based on: A Hafele & Keating-like thought experiment, by Paul B. Andersen, 16 October 2008 (Andersen, 2008).

The famous Hafele–Keating experiment tested quantitative predictions of relativity, particularly time dilation due to both motion (special relativity) and gravity (general relativity).

In this experiment, two airplanes were equipped with cesium clocks and flown simultaneously around the Earth in opposite directions. A third cesium clock remained at a fixed location on Earth (in Washington). The results showed that the clocks on board experienced different time dilation effects depending on their direction of motion and their position relative to Earth.

The clock in the eastward‑flying aircraft moved with the rotation of the Earth. As a result, this clock had a higher speed relative to the non‑rotating center of the Earth than the ground clock. This led to stronger time dilation: the clock lagged behind.

Conversely, the westward‑flying aircraft moved against the rotation of the Earth, resulting in a lower speed relative to the Earth’s center, and thus weaker time dilation: this clock ran ahead. This difference in elapsed time demonstrates that the passage of time depends on the motion of the observer — an effect predicted by Einstein in 1905 in his original paper on special relativity.

vector_4_1
All three clocks move eastward. Even though the airplane flying west moves westward relative to the air, the air itself moves eastward due to Earth’s rotation.
Source: (Crowell, 11 March 2018)

Purpose and Setup

Theoretical Framework: Schwarzschild Metric

The Schwarzschild metric describes spacetime outside a spherically symmetric massive body such as the Earth:

The Schwarzschild metric is:

\[ c^2 d\tau^2=\left(1-\frac{2GM}{ c^2r}\right) dt^2-\left(1-\frac{2GM}{ c^2r} \right)^{-1} dr^2-r^2 d\theta^2-r^2 \sin^2\theta \, d\phi^2 \tag{1} \]

Here:

The Schwarzschild metric uses a universal (spherical) coordinate system with its origin at Earth’s center of mass. The Earth rotates within this coordinate system. Changes in the angles θ and describe motion across the Earth’s surface.

Small changes in time and space are denoted by dt, dr, and d∅. Note that dt is the time change for a hypothetical observer far from gravitational influences; it is not a directly measured time but a theoretical coordinate time.

The actual time measured by a clock at a given location is , the proper time.

We will use the Schwarzschild metric to derive an approximate formula describing the time dilation of the clocks, based on their position and motion. We will then also provide the full (exact) solution. Although the latter is more complex, it can be handled using computer programs such as Excel and yields an accurate result.

4.1.1 Approximate Formula for Time Dilation

We approximate the situation in which the clocks move in circular orbits around the Earth: either at sea level or at some altitude above the Earth’s surface. Because the orbits are circular, dr = 0. Moreover, we assume the motion takes place in the equatorial plane, so θ = π/2 is constant and thus dθ = 0.

With this, the Schwarzschild metric simplifies to:

\[ c^2 d\tau^2=\left(1 - \frac{2GM}{c^2 r}\right) c^2 dt^2-r^2 d\phi^2 \tag{2} \]

Dividing by \(c^2\) gives:

\[ d\tau^2=\left(1 - \frac{2GM}{c^2 r}\right) dt^2-\frac{r^2}{c^2} \left(\frac{d\phi}{dt} \right)^2 dt^2 \tag{3} \]

Using the definition of orbital speed \(v=r \frac{d\phi}{dt}\), we obtain:

\[ d\tau=\sqrt{1 - \frac{2GM}{c^2 r} - \frac{v^2}{c^2}}\, dt \tag{4} \]

Because the terms \(\frac{2GM}{c^2r}\) and \(\frac{v^2}{c^2}\) are very small compared to 1, we apply a first‑order Taylor approximation:

\[ d\tau\approx\left(1-\frac{GM}{c^2 r}-\frac{v^2}{2c^2}\right)dt \tag{5} \]

Since \(r\) and \(v\) are constant, we can integrate directly:

\[ \tau=\left(1-\frac{GM}{c^2 r}-\frac{v^2}{2c^2}\right) t+\tau_0 \tag{6} \]

We now compare two clocks. Clock 1 is located on the Earth’s surface, with radius \(r_1\), the distance from Earth’s center, and speed \(v_1\), due to Earth’s rotation. For this clock:

\[ d\tau_1=\left(1-\frac{GM}{c^2 r_1}-\frac{v_1^2}{2c^2}\right)dt \tag{7} \]

For clock 2, for example in an airplane at altitude \(h\), with radius \(r_2\) and speed \(v_2\), we have:

\[ d\tau_2=\left(1-\frac{GM}{c^2 r_2}-\frac{v_2^2}{2c^2}\right)dt \tag{8} \]

The ratio between the two time intervals becomes:

\[ d\tau_2=\frac{1 - \frac{GM}{c^2 r_2} - \frac{v_2^2}{2c^2}}{1 - \frac{GM}{c^2 r_1} - \frac{v_1^2}{2c^2}}\, d\tau_1 \]

Using the approximation: \[ (1-\varepsilon)^{-1} \approx 1+\varepsilon, \] we obtain:

\[ d\tau_2 \approx \left( 1 - \frac{GM}{c^2 r_2} - \frac{v_2^2}{2c^2} \right) \left( 1 + \frac{GM}{c^2 r_1} + \frac{v_1^2}{2c^2} \right) d\tau_1 \]

\[ d\tau_2 \approx \left[ 1 + \frac{GM}{c^2 r_1} + \frac{v_1^2}{2c^2} - \frac{GM}{c^2 r_2}\left( 1 + \frac{GM}{c^2 r_1} + \frac{v_1^2}{2c^2} \right) - \frac{v_2^2}{2c^2}\left( 1 + \frac{GM}{c^2 r_1} + \frac{v_1^2}{2c^2} \right) \right] d\tau_1 \]

Since the terms \(\frac{GM}{c^2 r_1}, \frac{v_1^2}{2c^2}, \frac{GM}{c^2 r_2}, \frac{v_2^2}{2c^2}\) are very small, their products can be neglected. This yields:

\[ d\tau_2 \approx \left[ 1 + \frac{GM}{c^2}\left(\frac{1}{r_1}-\frac{1}{r_2}\right) + \frac{v_1^2 - v_2^2}{2c^2} \right] d\tau_1 \tag{9} \]

If we assume that both clocks start at \(\tau_1 = \tau_2 = 0\), then the integration is straightforward:

\[ \tau_2 \approx \left[ 1 + \frac{GM}{c^2}\left(\frac{1}{r_1}-\frac{1}{r_2}\right) + \frac{v_1^2 - v_2^2}{2c^2} \right] \tau_1 \tag{10} \]

Comparison Between Clocks

For a clock on Earth’s surface \((r_1, v_1)\) and a clock in an airplane \((r_2, v_2)\):

\[ \tau_2 - \tau_1 \approx \left[ \frac{GM}{c^2}\left(\frac{1}{r_1}-\frac{1}{r_2}\right) + \frac{v_1^2 - v_2^2}{2c^2} \right]\tau_1 \tag{11} \]

Suppose clock 1 is located on Earth’s surface with radius \(R\), and clock 2 is in an airplane at altitude \(h\). Then \(r_2 = R + h\). Since \(h \ll R\), we may approximate:

\[ \frac{1}{R} - \frac{1}{R+h} \approx \frac{h}{R^2} \quad\Rightarrow\quad \frac{GM}{c^2}\left(\frac{1}{R}-\frac{1}{R+h}\right) \approx \frac{gh}{c^2} \]

Thus:

\[ \tau_2 - \tau_1 \approx \left[ \frac{GM}{c^2}\left(\frac{1}{R}-\frac{1}{R+h}\right] + \frac{v_1^2 - v_2^2}{2c^2} \right)\tau_1 \tag{12} \]

If we assume that \(h/R \ll 1\) and \(g = GM/R^2\), then:

\[ \tau_2 - \tau_1 = \left( \frac{gh}{c^2} - \frac{v_2^2 - v_1^2}{2c^2} \right)\tau_1 \tag{13} \]

Since \(v_1 = v_{\text{earth}}\) (Earth’s rotational speed) and \(v_2 = v_{\text{plane}} + v_{\text{earth}}\), we obtain:

\[ v_1^2 - v_2^2 = v_{\text{earth}}^2 - (v_{\text{plane}} + v_{\text{earth}})^2 = -v_{\text{plane}}^2 - 2 v_{\text{earth}} v_{\text{plane}} \]

Substituting into (13) gives:

\[ \tau_{\text{plane}} - \tau_{\text{earth}} = \left( \frac{gh}{c^2} - \frac{v_{\text{plane}}(v_{\text{plane}} + 2v_{\text{earth}})}{2c^2} \right)\tau_{\text{earth}} \tag{14} \]

This equation is fully derived from the Schwarzschild metric with a few approximations. It matches the approximation used in the original Hafele–Keating experiment.

Remark 1. If the airplane’s speed is interpreted as ground speed, then at altitude \(h\) we may approximate:

\[ v_2=\frac{R + h}{R}\left(v_{\text{plane}} + v_{\text{earth}}\right) \]

In that case, the above formula must be adjusted accordingly.

Remark 2. A more precise treatment of \(v_1\) and \(v_2\) follows in the next chapter, where the velocities are derived more explicitly based on the chosen coordinate system.

4.1.2 Derivation of v1 and v2 in Equation (13)

The velocity \(v_1\) in equation (3.1.1.13) is the velocity of a stationary point on the equator of Earth’s surface. It is expressed as:

\[ v_1 = r_1 \frac{d\phi}{dt} \]

where dt is the coordinate time in the “universal” reference frame. However, measurements on Earth’s surface are made with respect to local proper time \(\tau\), so a conversion is required.

Using the chain rule:

\[ v_{1,t}= r_1 \frac{d\phi}{dt}= r_1 \frac{d\phi}{d\tau} \frac{d\tau}{dt}= v_{1,\tau} \frac{d\tau}{dt}\tag{14a} \]

We now use the Schwarzschild metric (in the equatorial plane with \(dr = 0\)) to determine \(\frac{d\tau}{dt}\):

\[ c^2 d\tau^2=\left(1 - \frac{2GM}{c^2 r_1}\right)c^2 dt^2- r_1^2 \left(\frac{d\phi}{d\tau} \right)^2 d\tau^2 \]

Rearranging:

\[ \left[1 + \frac{r_1^2}{c^2}\left(\frac{d\phi}{d\tau}\right)^2 \right] d\tau^2=\left(1 - \frac{2GM}{c^2 r_1}\right) dt^2 \]

We define:

\[ \sigma^2 = 1 - \frac{2GM}{c^2 r} \]

Then:

\[ \left(1 + \frac{v_{1,\tau}^2}{c^2}\right) d\tau^2= \sigma_1^2 dt^2 \] \[ \left(\frac{d\tau}{dt}\right)^2=\frac{\sigma_1^2}{1 + \frac{v_{1,\tau}^2}{c^2}}\tag{14b} \]

Substituting into equation (14a):

\[ v_{1,t}^2=v_{1,\tau}^2\left(\frac{d\tau}{dt}\right)^2=v_{1,\tau}^2\frac{\sigma_1^2}{1 + \frac{v_{1,\tau}^2}{c^2}} \]

This expression shows that \(\frac{d\tau}{dt}\), the conversion between coordinate time and proper time, depends on Earth’s rotational speed \(v_{1,\tau}\), measured in local proper time.

Now consider an airplane flying eastward. The total speed at Earth level (measured in proper time) is:

\[ v_{1\tau \text{_plane}}=v_{\text{plane_}\tau}+v_{1\tau \text{_earth}}=r_1 \frac{d\phi}{d\tau} \]

where:

Angular Velocity in the Universal Frame

The corresponding angular velocity in the universal frame is:

\[ r_1 \frac{d\phi}{dt} = r_1 \frac{d\phi}{d\tau}\frac{d\tau}{dt} = \left( v_{\text{plane_}\tau}+v_{1\tau \text{_earth}} \right) \frac{d\tau}{dt} \]

\[ \frac{d\tau}{dt} = \frac{\sigma_1}{\sqrt{\left(1 + \frac{v_{1\tau\text{_earth}}^{2}}{c^{2}}\right)}} \]

\[ \frac{d\phi}{dt} =\left( v_{\text{plane_}\tau}+v_{1\tau \text{_earth}} \right) \frac{\sigma_1}{r_1\sqrt{\left(1 + \frac{v_{1\tau\text{_earth}}^{2}}{c^{2}}\right)}} \]

Here we have computed the rotational speed (angular velocity) in the universal frame. This is valid at any level, i.e., any distance from the center. But the actual speed is determined by r times this angular velocity.

Speed at the Airplane’s Altitude

\( v_{\text{plane},\tau} \) is the measured airplane speed at ground level relative to proper time, the only available time at that level. \( v_{\text{earth},\tau} \) is the rotational speed of a stationary point on Earth relative to the universal frame, but measured in proper time at Earth level.

The airplane’s speed in the universal frame at altitude \( r_2 \) is:

\[ v_{2t}=r_2 \frac{d\phi}{dt}=\frac{r_2}{r_1}\frac{\sigma_1}{\sqrt{1 + \frac{v_{1\tau \text{_earth}}^2}{c^2}}}\left( v_{\text{plane_}\tau}+v_{1\tau \text{_earth}} \right) \]

We split this speed into an “Earth rotation” and an “airplane” component:

\[ v_{2t} = v_{2t\text{_earth}} + v_{2t\text{_plane}} \]

With:

\[ v_{2t\text{_earth}} = \frac{r_2}{r_1}\, \frac{\sigma_1 v_{1\tau\text{_earth}}}{\sqrt{1 + \frac{v_{1\tau\text{_earth}}^2}{c^2}}} \]

And:

\[ v_{2t\text{_plane}} = v_{2t} - v_{2t\text{_earth}} = \frac{r_2}{r_1}\, \frac{\sigma_1 v_{\text{plane_}\tau}}{\sqrt{1 + \frac{v_{1\tau\text{_earth}}^2}{c^2}}} \]

Summary of the Result:

The conversion of speed at Earth level to the universal frame:

\[ v_{1t\text{_earth}} = v_{1\tau\text{_earth}} \, \frac{\sigma_{\text{earth}}}{1 + \sqrt{\frac{v_{1\tau\text{_earth}})^2}{c^2}}} \tag{15} \]

The speed \( v_2 \) of the airplane in the universal frame is:

\[ v_{2t} = \frac{r_2}{r_1}\, \frac{ \sigma_{\text{earth}}\left(v_{\text{plane_}\tau} + v_{\text{earth_}\tau}\right) }{ \sqrt{1 + \frac{v_{1\tau\text{_earth}}^2}{c^2}} } \tag{16} \]

Substitution into equation (3.1.1._13):

\[ \tau_2 - \tau_1 = \left( \frac{gh}{c^2} - \frac{v_2^2 - v_1^2}{2c^2} \right) \tau_1 \tag{13} \]

becomes:

\[ \tau_2 - \tau_1 = \left( \frac{gh}{c^2} - \frac{ \sigma_{\text{earth}}^2 }{ 1 + \frac{(v_{1\tau\text{_earth}})^2}{c^2} } \cdot \frac{1}{2c^2} \left[ \left( \frac{R+h}{R} \right)^2 \left( v_{\tau\text{_plane}} + v_{\tau\text{_earth}} \right)^2 - v_{1\tau\text{_earth}}^2\right]\right) \;\tau_1 \tag{17} \]

This equation describes the time dilation between a clock on Earth’s surface and a clock aboard an airplane, accounting for both gravitational and velocity‑dependent effects, all based on locally measurable quantities.

4.1.3 The Exact Derivation

Instead of an approximation, we now perform an exact derivation, fully based on the Schwarzschild metric. We begin with equation (4):

\[ d\tau = \sqrt{\left(1 - \frac{2GM}{c^2 r} - \frac{v^2}{c^2}\right)} \, dt \tag{4} \]

Since \( r \) and \( v \) are constant, the integration is straightforward:

\[ \tau = \sqrt{\left(1 - \frac{2GM}{c^2 r} - \frac{v^2}{c^2}\right)} \, t + \tau_0 \tag{6a} \]

The goal is to compare the proper time of different clocks. As a reference we take the clock on Earth’s surface. Other clocks are located in airplanes, at higher altitude and with different speeds. Even the clock on Earth has a non‑zero speed due to Earth’s rotation.

For the clock on Earth’s surface (radius \( r_1 \), speed \( v_1 \)):

\[ d\tau_1 = \sqrt{\left(1 - \frac{2GM}{c^2 r_1} - \frac{v_1^2}{c^2}\right)} \, dt \tag{7a} \]

For the clock in the airplane (radius \( r_2 \), speed \( v_2 \)):

\[ d\tau_2 = \sqrt{\left(1 - \frac{2GM}{c^2 r_2} - \frac{v_2^2}{c^2}\right)} \, dt \tag{8a} \]

To find the ratio between the two proper times, we divide these expressions:

\[ \frac{d\tau_2}{d\tau_1} =\sqrt{ \frac{ 1 - \frac{2GM}{c^2 r_2} - \frac{v_2^2}{c^2} }{ {1 - \frac{2GM}{c^2 r_1} - \frac{v_1^2}{c^2}} }} \tag{9a} \]

With equal initial times \( \tau_2(0) = \tau_1(0) = 0 \), the solution is:

\[ \tau_2 =\left(\sqrt{ \frac{ 1 - \frac{2GM}{c^2 r_2} - \frac{v_2^2}{c^2} }{ 1 - \frac{2GM}{c^2 r_1} - \frac{v_1^2}{c^2} }}\right) \tau_1 \tag{10a} \]

Time Difference Between Two Clocks

\[ \tau_2 - \tau_1 = \left(\sqrt{ \frac{ 1 - \frac{2GM}{c^2 r_2} - \frac{v_2^2}{c^2} }{ 1 - \frac{2GM}{c^2 r_1} - \frac{v_1^2}{c^2} }} - 1 \right) \tau_1 \tag{11a} \]

Assume \( r_1 = R \), the radius of the Earth, and \( r_2 = R + h \), the altitude of the aircraft, then this becomes:

\[ \tau_2 - \tau_1 = \left(\sqrt{ \frac{ 1 - \frac{2GM}{c^2 (R+h)} - \frac{v_2^2}{c^2} }{ 1 - \frac{2GM}{c^2 R} - \frac{v_1^2}{c^2} }} - 1 \right) \tau_1 \tag{12a} \]

Velocities and gravitational effects

In this expression:

Both velocities were previously derived in equations 14b and 15b of section 4.1.5.

\[ \tau_{\text{plane}} - \tau_{\text{earth}} = \left(\sqrt{ \frac{ 1 - \frac{2GM}{c^2 (R+h)} - \frac{v_2^2}{c^2} }{ 1 - \frac{2GM}{c^2 R} - \frac{v_1^2}{c^2} }} - 1 \right) \tau_{\text{earth}} \tag{14b} \]

Using the Schwarzschild radius \( R_s = \frac{2GM}{c^2} \), we rewrite this as:

\[ \tau_{\text{plane}} - \tau_{\text{earth}} = \left(\sqrt{ \frac{ 1 - \frac{R_s}{R+h} - \frac{v_2^2}{c^2} }{ 1 - \frac{R_s}{R} - \frac{v_1^2}{c^2} }} - 1 \right) \tau_{\text{earth}} \tag{15b} \]

Conclusion

This equation is an exact expression, derived directly from the Schwarzschild metric. It shows how the difference in proper time between a clock on Earth and a clock in an aircraft is influenced by:

Calculations based on the performed experiments:

vector_4_1_3_1

Conclusion

The approximations are correct within an accuracy of 0.4%.

Practical Application

Results and Interpretation

Experimental Outcome

Summary

4.1.4 The velocity of a stationary point on the equator at Earth’s surface

To compute the velocity of a stationary point on the equator, we must first determine the Earth’s rotation period: the sidereal day.

Sidereal day versus solar day

A normal day or solar day (24 hours) is the time between two consecutive noons. This time is based on the Sun’s apparent motion, not on Earth’s actual rotation.

Because the Earth orbits the Sun, it makes one extra rotation per year relative to the fixed stars. In one year (365.25 solar days), the Earth therefore rotates 366.25 times relative to the stars.

From this follows the duration of one sidereal day:

\[ T_{\text{sidereal}}=\frac{365.25}{366.25}\cdot24\cdot3600=86164.1\ \text{s} \]

Converted:

\[ \frac{86164.1}{3600}=23.93447\ \text{hours} \]

That is: 23 hours, 56 minutes and 4 seconds.

vector_4_1_4

Velocity at the equator

With the Earth’s radius:

\[ R = 6371\ \text{km}=6.371 \times 10^{6}\ \text{m}, \]

we compute the equatorial circumference:

\[ \text{Circumference}=2\pi R=2\pi \times 6.371 \times 10^{6}\approx4.003 \times 10^{7}\ \text{m}. \]

The velocity of a stationary point on the equator (relative to a non‑rotating reference frame) is:

\[ v_{\text{earth}}=\frac{2\pi R}{T_{\text{sidereal}}}=\frac{4.003\times 10^{7}}{86164.1}=464.58\ \text{m/s}. \]

For comparison: if we incorrectly used a 24‑hour day:

\[ v=\frac{2\pi R}{86400}=463.3\ \text{m/s}. \]

Summary

4.1.5 Correction to derivation based on Paul Anderson

In Anderson’s original derivation, the velocity of the aircraft is inserted relative to the Earth’s surface. However, in his formula 3.1.1.3 this velocity is expressed with respect to coordinate time \(dt\), while clocks in motion measure proper time \(d\tau\). This requires a correction: the velocity of the object must be expressed relative to its own clock, i.e. via \(d\tau\).

Starting point: The full Schwarzschild relation

We take as starting point equation 2 from section 4.1.1, without approximation:

\[ c^{2}\,d\tau^{2} = \left(1 - \frac{2GM}{c^{2}r}\right)c^{2}\,dt^{2} - r^{2}\,d\phi^{2} \tag{2} \]

Dividing by \(c^{2}\) gives:

\[ d\tau^{2} = \left(1 - \frac{2GM}{c^{2}r}\right)dt^{2} - \frac{r^{2}}{c^{2}}\left(\frac{d\phi}{d\tau}\right)^{2}d\tau^{2} \tag{3b} \]

We rewrite this as:

\[ d\tau^{2} \left(1 + \frac{r^{2}}{c^{2}}\left(\frac{d\phi}{d\tau}\right)^{2}\right)= \left(1 - \frac{2GM}{c^{2}r}\right)dt^{2} \tag{3c} \]

And therefore:

\[ d\tau = \sqrt{ \frac{1 - \dfrac{2GM}{c^{2}r}} {1 + \dfrac{r^{2}}{c^{2}}\left( \dfrac{d\phi}{d\tau}\right)^{2}} }\,dt \tag{4b} \]

With:

\[ v_{\tau} = r\,\frac{d\phi}{d\tau} \tag{4c} \]

Thus:

\[ d\tau_{1} = \sqrt{ \frac{1 - \dfrac{2GM}{c^{2}r_{1}}}{1 + \dfrac{v_{1}^{2}}{c^{2}}}}\,dt \tag{7b} \] \[ d\tau_{2} =\sqrt{ \frac{1 - \dfrac{2GM}{c^{2}r_{2}}} {1 + \dfrac{v_{2}^{2}}{c^{2}}}}\,dt \tag{8b} \]

We then write:

\[ d\tau_{2}=\sqrt{\frac{1 - \dfrac{2GM}{c^{2}r_{2}}}{1 + \dfrac{v_{2}^{2}}{c^{2}}}}\sqrt{ \frac{1 + \dfrac{v_{1}^{2}}{c^{2}}}{1 - \dfrac{2GM}{c^{2}r_{1}}}}\,d\tau_{1} \tag{9b} \]

From this follows the relation between both proper times:

\[ \tau_{2}=\sqrt{\frac{1 - \dfrac{2GM}{c^{2}r_{2}}}{1 + \dfrac{v_{2}^{2}}{c^{2}}}}\sqrt{ \frac{1 + \dfrac{v_{1}^{2}}{c^{2}}}{1 - \dfrac{2GM}{c^{2}r_{1}}}}\,\tau_{1} \tag{10b} \]

If we take \(\tau_{1} = \tau_{\text{earth}}\) (clock at sea level) and \(\tau_{2} = \tau_{\text{plane}}\), this becomes:

\[ \tau_{\text{plane}} - \tau_{\text{earth}} =\left[\sqrt{\frac{1 - \dfrac{2GM}{c^{2}r_{2}}}{1 + \dfrac{v_{2}^{2}}{c^{2}}}}\sqrt{\frac{1 + \dfrac{v_{1}^{2}}{c^{2}}}{1 - \dfrac{2GM}{c^{2}r_{1}}}}- 1\right]\tau_{\text{earth}} \tag{11b} \]

Here \(r_{1} = R\), the radius of the Earth. The distance of the clock in an aircraft is then \(R + h\). We obtain:

\[ \tau_{\text{plane}} - \tau_{\text{earth}}=\left[\sqrt{\frac{1 - \dfrac{2GM}{c^{2}(R + h)}} {1 + \dfrac{v_{\text{earth}}^{2}}{c^{2}}}}\sqrt{\frac{1 + \dfrac{v_{2}^{2}}{c^{2}}}{1 - \dfrac{2GM}{c^{2}R}}}- 1\right]\tau_{\text{earth}} \tag{14b} \]

Or with Schwarzschild radius \(R_{s} = \dfrac{2GM}{c^{2}}\):

\[ \tau_{\text{plane}} - \tau_{\text{earth}}=\left[\sqrt{\frac{1 - \dfrac{R_{s}}{R + h}} {1 + \dfrac{v_{\text{earth}}^{2}}{c^{2}}}}\sqrt{\frac{1 + \dfrac{v_{2}^{2}}{c^{2}}}{1 - \dfrac{R_{s}}{R}}}- 1\right]\tau_{\text{earth}} \tag{15b} \]

Relative velocity at flight altitude

The ground speed of the aircraft (relative to the Earth’s surface) must be converted into a coordinate‑independent velocity at flight altitude:

\[ v_{2}=\left(v_{\text{earth}} + v_{\text{plane (relative to earth point)}}\right)\cdot \frac{R + h}{R} \]

Up to this point the formula is exact, without approximations.

After first‑order Taylor approximations of equation (14b), as done earlier, the result becomes:

\[ \tau_{\text{plane}} - \tau_{\text{earth}}=\left(\left[1 - \frac{GM}{c^{2}(R + h)}\right]\left[1 + \frac{GM}{c^{2}R}\right]\left[1 + \frac{v_{\text{earth}}^{2}}{2c^{2}}\right]\left[1 - \frac{v_{2}^{2}}{2c^{2}}\right]- 1\right)\tau_{\text{earth}} \tag{16} \]

This can be rewritten as:

\[ \tau_{\text{plane}} - \tau_{\text{earth}}=\left(\left[1 + \frac{GM}{c^{2}}\left(\frac{1}{R} - \frac{1}{R + h}\right)\right]\left[1 + \frac{v_{\text{earth}}^{2} - v_{2}^{2}}{2c^{2}} \right]- 1\right)\tau_{\text{earth}} \tag{17} \] \[ \tau_{\text{plane}} - \tau_{\text{earth}}=\left(\left[1 + \frac{GM}{c^{2}}\frac{h}{R^{2}}\right] \left[1 + \frac{v_{\text{earth}}^{2} - v_{2}^{2}}{2c^{2}}\right]- 1\right)\tau_{\text{earth}} \tag{18} \] \[ \tau_{\text{plane}} - \tau_{\text{earth}}=\left(\frac{GM}{c^{2}}\frac{h}{R^{2}}+ \frac{v_{\text{earth}}^{2} - v_{2}^{2}}{2c^{2}}\right)\tau_{\text{earth}} \tag{19} \]

With \(g = \dfrac{GM}{R^{2}}\) it follows:

\[ \tau_{\text{plane}} - \tau_{\text{earth}}=\left(\frac{gh}{c^{2}}- \frac{v_{2}^{2} - v_{\text{earth}}^{2}}{2c^{2}}\right)\tau_{\text{earth}} \tag{20} \]

Remark on velocities and reference frames

The aircraft’s velocity is given as ground speed. It is not immediately clear whether this is measured relative to the Earth clock or the aircraft clock. Let us assume the Earth clock is meant. In that case we must convert to the level of the aircraft, meaning we must consider the clock at that altitude. We do this via the time \(t\) in the universal frame.

If we consider \(\dfrac{d\phi_{\text{earth}}}{dt}\), this is the rotational speed of the Earth in the universal frame. We obtain the Earth’s speed at sea level by multiplying \(\dfrac{d\phi_{\text{earth}}}{dt}\) by \(R\), the distance from the center. The speed of the Earth as seen from aircraft altitude is \((R + h)\dfrac{d\phi_{\text{earth}}}{dt}\). For the aircraft the same applies: at sea level the relative aircraft speed is \(R\dfrac{d\phi_{\text{plane}}}{dt}\) and at aircraft altitude \((R + h)\dfrac{d\phi_{\text{plane}}}{dt}\).

Now we must find \(\dfrac{d\phi_{\text{earth}}}{dt}\) and \(\dfrac{d\phi_{\text{plane}}}{dt}\). We use from section 4.1.5 equation (4c):

\[ v_{\tau} = r\,\frac{d\phi}{d\tau}= r\,\frac{d\phi}{dt}\,\frac{dt}{d\tau}\;\Rightarrow\; \frac{d\phi}{dt}= \frac{v_{\tau}}{r}\,\frac{d\tau}{dt} \]

We then use from section 4.1.5 equation (3.1.5.4b):

\[ \frac{d\tau}{dt}=\sqrt{\frac{1 - \dfrac{2GM}{c^{2}r}}{1 + \dfrac{v_{\tau}^{2}}{c^{2}}}} \]

Thus:

\[ \frac{d\phi}{dt} =\frac{v_{\tau}}{r}\frac{d\tau}{dt}=\frac{v_{\tau}}{r}\sqrt{\frac{1 - \dfrac{2GM}{c^{2}r}}{1 + \dfrac{v_{\tau}^{2}}{c^{2}}}} \]

All components on the right-hand side are known.

At sea level:

\[ \frac{d\phi_{\text{earth}}}{dt}=\frac{v_{\text{earth}}}{R}\sqrt{\frac{1 - \dfrac{2GM}{c^{2}R}}{1 + \dfrac{v_{\text{earth}}^{2}}{c^{2}}}} \]

And for the aircraft analogously:

\[ \frac{d\phi_{\text{plane}}}{dt}=\frac{v_{\text{plane}}}{R}\sqrt{\frac{1 - \dfrac{2GM}{c^{2}R}} {1 + \dfrac{v_{\text{earth}}^{2}}{c^{2}}}} \]

Now at aircraft altitude:

\[ v_{2}=v_{2\tau\text{_earth}} + v_{2\tau\text{_plane}}=(R + h)\left(\frac{d\phi_{ \text{earth}}}{dt}+ \frac{d\phi_{\text{plane}}}{dt}\right) \]

Thus:

\[ v_{2} = \frac{R+h}{R} \sqrt{ \frac{1 - \dfrac{2GM}{c^{2}R}} {1 + \dfrac{v_{\text{earth}}^{2}}{c^{2}}} } \left( v_{\text{earth}} + v_{\text{plane}} \right) \]

Up to this point \(v_{2}\) was exact. With a first‑order Taylor approximation:

\[ v_{2}=\frac{R + h}{R}\sqrt{\left( 1 - \frac{GM}{c^{2}R}\right)\left(1 - \frac{v_{\text{earth}}^{2}}{2c^{2}} \right)}\left(v_{\text{earth}} + v_{\text{plane}}\right) \]

Thus, the relevant formula is:

\[ v_{2}\approx\frac{R + h}{R}\left(1 - \frac{GM}{c^{2}R}- \frac{v_{\text{earth}}^{2}}{2c^{2}}\right) \left( v_{\text{earth}} + v_{\text{plane}}\right) \]

Result after Taylor approximation

Applied to equation (14b), this leads after linearisation to:

\[ \tau_{\text{plane}} - \tau_{\text{earth}}=\left(\frac{gh}{c^{2}}- \frac{v_{2}^{2} - v_{\text{earth}}^{2}}{2c^{2}}\right)\tau_{\text{earth}} \]

Conclusion

This revised approach corrects the inconsistency in the original derivation: velocities must be related to proper time, not coordinate time. After correction and Taylor approximation, the numerical deviation from the approximation in the previous chapter is less than \(0.4\%\) — well within the desired accuracy.

vector_4_1_5

4.1.6 Considerations regarding the Hafele–Keating experiment and the Schwarzschild metric

We begin with the general Schwarzschild equation:

\[ ds^2 = c^2 d\tau^2 = \left(1 - \frac{2GM}{c^2 r}\right)c^2 dt^2 - \left(1 - \frac{2GM}{c^2 r}\right)^{-1} dr^2 - r^2 d\theta^2 - r^2 \sin^2\theta\, d\phi^2 \tag{1} \]

As used earlier, we define:

\[ \sigma =\sqrt{ 1 - \frac{2GM}{c^2 r}} \]

With this notation, equation (1) becomes:

\[ ds^2 = c^2 d\tau^2 = \sigma^2 c^2 dt^2 - \sigma^{-2} dr^2 - r^2 d\theta^2 - r^2 \sin^2\theta\, d\phi^2 \tag{1a} \]

In the Hafele–Keating experiment, the time of the clock at the United States Naval Observatory (USNO) and the velocity of an aircraft are measured. The question is: what do the time and velocity represent in the context of the Schwarzschild metric?

There is a stationary clock at sea level on the equator, and two aircraft moving in the equatorial plane — one eastward, the other westward. Both aircraft follow a circular path with constant speed relative to the Earth’s surface, but in opposite directions.

Since the experiment takes place in the equatorial plane, we assume that \(\theta = \pi/2\) is constant, and that \(r\) is also constant due to the circular orbit. The Schwarzschild metric then simplifies to:

\[ c^2 d\tau^2 = \left(1 - \frac{2GM}{c^2 r}\right)c^2 dt^2 - r^2 d\phi^2 \tag{2} \]

The coordinates \(t, r, \theta, \phi\) in the Schwarzschild metric can be interpreted as belonging to a universal (inertial) reference frame in which the Earth rotates. The clocks on the Earth’s surface and in the aircraft each reside in their own local inertial frame; their measured time is expressed as the proper time \(\tau\).

The universal coordinate time \(t\) is not directly measurable, but is a theoretical quantity. From equation (2) it follows:

\[ dt^2 = \frac{d\tau^2 + \frac{r^2}{c^2} d\phi^2}{1 - \frac{2GM}{c^2 r}} = \sigma^{-2}\left(d\tau^2 + \frac{r^2}{c^2} d\phi^2\right) = \sigma^{-2}\left(1 + \frac{r^2}{c^2}\left(\frac{d\phi}{d\tau}\right)^2\right)d\tau^2 \tag{4} \]

Under the assumption \(t = 0\) when \(\tau = 0\), this leads to:

\[ t =\sigma^{-1}\sqrt{1 + \frac{r^2}{c^2}\left(\frac{d\phi}{d\tau}\right)^2}\,\tau= \sigma^{-1}\sqrt{1 + \frac{v^2}{c^2}}\,\tau \tag{4a} \]

where \(v = r\, d\phi/d\tau\) is the velocity with respect to the universal frame.

For velocities much smaller than \(c\), we may apply a first‑order Taylor approximation:

\[ t = \sigma^{-1}\sqrt{1 + \frac{v^2}{c^2}}\,\tau = \frac{1}{\sigma\sqrt{1 - \frac{v^2}{c^2}}}\,\tau = \frac{\gamma}{\sigma}\, \tau \tag{4b} \]

with \[ \gamma =\frac{1}{ \sqrt{1 - \frac{v^2}{c^2}}} \] the Lorentz factor.

This equation expresses how the proper time \(\tau\) of a moving clock relates to the coordinate time \(t\) in the Schwarzschild frame.

4.2 Experiment 2 – Motion of Particles in Schwarzschild Geometry

The derivations in this chapter are based on:

We derive equations for the motion of particles in Schwarzschild geometry, as the basis for:

We use the Schwarzschild metric as our starting point. Due to symmetry in both the time coordinate \( t \) and the angular coordinate \( \phi \) (no metric component depends on these coordinates), Noether’s theorem applies: every continuous symmetry corresponds to a conservation law. This yields conservation of energy and conservation of angular momentum.

Schwarzschild Metric

The metric reads:

\[ ds^2 = \sigma^2 c^2 dt^2 - \frac{dr^2}{\sigma^2} -\frac{r^2}{R^2_p} dR^2_p\,\theta^2 - \frac{r^2}{R^2_p} \sin^2\theta\, dR^2_p\,\phi^2 \]

To obtain the correct dimensions (all coordinates in meters), we set Rp = 1 m, making the coefficients dimensionless. This leads to the more familiar form:

\[ ds^2 = \sigma^2 c^2 dt^2 - \frac{dr^2}{\sigma^2} - r^2 d\theta^2 - r^2 \sin^2\theta\, d\phi^2 \tag{1a} \]

with:

\[ \sigma =\sqrt{ 1 - \frac{2GM}{c^2 r}} =\sqrt{ 1 - \frac{R_s}{r}}, \qquad R_s = \frac{2GM}{c^2} \]

Metric Coefficients

\[ g_{00} = \sigma^2,\quad g_{11} = -\frac{1}{\sigma^2},\quad g_{22} = -r^2,\quad g_{33} = -r^2 \sin^2\theta \]

Contravariant components:

\[ g^{00} = \frac{1}{\sigma^2},\quad g^{11} = -\sigma^2,\quad g^{22} = -\frac{1}{r^2},\quad g^{33} = -\frac{1}{r^2 \sin^2\theta} \]

Furthermore: \[ \frac{d\sigma}{dr} = \frac{R_s}{2 r^2 \sigma} \]

Derivatives of Metric Components

For the Schwarzschild metric with \(\sigma = 1 - \dfrac{R_s}{r}\) we find the following derivatives:

\[ \frac{\partial g_{00}}{\partial r} = \frac{R_s}{r^2}, \qquad \frac{\partial g_{11}}{\partial r} = \frac{R_s}{r^2 \sigma^4}, \qquad \frac{\partial g_{22}}{\partial r} = -2r, \]

\[ \frac{\partial g_{33}}{\partial r} = -2r \sin^2\theta, \qquad \frac{\partial g_{33}}{\partial \theta} = -2 r^2 \sin\theta \cos\theta. \]

Christoffel Symbols

The Christoffel symbols are defined as:

\[ \Gamma^\rho_{\mu\nu} = \frac{1}{2} g^{\rho\alpha} \left( \frac{\partial g_{\nu\alpha}}{\partial x^\mu} + \frac{\partial g_{\mu\alpha}}{\partial x^\nu} - \frac{\partial g_{\mu\nu}}{\partial x^\alpha} \right) \]

Some relevant non‑zero symbols are:

\[ \Gamma^{0}_{10} = \Gamma^{0}_{01} = \frac{1}{2} g^{00} \left(\frac{\partial g_{00}}{\partial r}\right) = \frac{R_s}{2 r^2 \sigma^2}, \]

\[ \Gamma^{1}_{00} = \frac{1}{2} g^{11} \left(-\frac{\partial g_{00}}{\partial r}\right) = \frac{R_s}{2 r^2}\sigma^2, \qquad \Gamma^{1}_{11} = \frac{1}{2} g^{11} \left(\frac{\partial g_{11}}{\partial r}\right) = -\frac{R_s}{2 r^2 \sigma^2}, \]

\[ \Gamma^{1}_{22} =\frac{1}{2} g^{11} \left(\frac{-\partial g_{22}}{\partial r}\right) = -r \sigma^2, \qquad \Gamma^{1}_{33} =\frac{1}{2} g^{11} \left(-\frac{\partial g_{33}}{\partial r}\right) = -r \sigma^2 \sin^2\theta, \]

\[ \Gamma^{2}_{12} = \Gamma^{2}_{21} =\frac{1}{2} g^{22} \left(\frac{\partial g_{22}}{\partial r}\right) = \frac{1}{r}, \qquad \Gamma^{2}_{33} =\frac{1}{2} g^{22} \left(-\frac{\partial g_{33}}{\partial r}\right) = -\sin\theta\cos\theta, \]

\[ \Gamma^{3}_{13} = \Gamma^{3}_{31} =\frac{1}{2} g^{33} \left(\frac{\partial g_{33}}{\partial r}\right) = \frac{1}{r}, \qquad \Gamma^{3}_{23} = \Gamma^{3}_{32} =\frac{1}{2} g^{33} \left(\frac{\partial g_{33}}{\partial r}\right) = \cot\theta. \]

All remaining Christoffel symbols are zero.

Geodesic Equations

We consider a geodesic worldline, which describes the natural trajectory of a particle in the absence of non‑gravitational forces. The general geodesic equation is:

\[ \frac{d^2 x^\alpha}{d\lambda^2} + \Gamma^\alpha_{\mu\nu} \frac{dx^\mu}{d\lambda} \frac{dx^\nu}{d\lambda} = 0 \]

Geodesic Equations per Coordinate

We work out the four coordinates, where \( \lambda \) is the affine parameter (but here may be taken equal to the proper time \( \tau \)):

For \( t \):

\[ \frac{d^2 t}{d\lambda^2} + \Gamma^{t}_{\mu\nu} \frac{dx^\mu}{d\lambda} \frac{dx^\nu}{d\lambda} = \frac{d^2 t}{d\lambda^2} + 2\,\Gamma^{0}_{10} \frac{dt}{d\lambda} \frac{dr}{d\lambda} = \frac{d^2 t}{d\lambda^2} + 2\,\frac{R_s}{2 r^2 \sigma^2} \frac{dt}{d\lambda} \frac{dr}{d\lambda} = 0 \]

For \( r \):

\[ \frac{d^2 r}{d\lambda^2} + \Gamma^{1}_{00}\left(\frac{dt}{d\lambda}\right)^2 + \Gamma^{1}_{11}\left(\frac{dr}{d\lambda}\right)^2 + \Gamma^{1}_{22}\left(\frac{d\theta}{d\lambda}\right)^2 + \Gamma^{1}_{33}\left(\frac{d\phi}{d\lambda}\right)^2 \]

\[ =\frac{d^2 r}{d\lambda^2} + \sigma^2 \frac{R_s}{2 r^2} \left(\frac{dt}{d\lambda}\right)^2 - \frac{R_s}{2 r^2 \sigma^2} \left(\frac{dr}{d\lambda}\right)^2 - r\sigma^2 \left(\frac{d\theta}{d\lambda}\right)^2 - r\sigma^2 \sin^2\theta \left(\frac{d\phi}{d\lambda}\right)^2 = 0 \]

For \( \theta \):

\[ \frac{d^2\theta}{d\lambda^2} + 2\,\Gamma^{2}_{12} \frac{dr}{d\lambda} \frac{d\theta}{d\lambda} + \Gamma^{2}_{33} \left(\frac{d\phi}{d\lambda}\right)^2 \]

\[ =\frac{d^2\theta}{d\lambda^2} + \frac{1}{r} \frac{dr}{d\lambda} \frac{d\theta}{d\lambda} - \cos\theta\sin\theta \left(\frac{d\phi}{d\lambda}\right)^2 = 0 \]

For \( \phi \):

\[ \frac{d^2\phi}{d\lambda^2} + 2\,\Gamma^{3}_{13} \frac{dr}{d\lambda} \frac{d\phi}{d\lambda} + 2\,\Gamma^{3}_{23} \frac{d\theta}{d\lambda} \frac{d\phi}{d\lambda} \]

\[ =\frac{d^2\phi}{d\lambda^2} + \frac{1}{r} \frac{dr}{d\lambda} \frac{d\phi}{d\lambda} + 2\cos\theta\sin\theta \frac{d\theta}{d\lambda} \frac{d\phi}{d\lambda} = 0 \]

Summarizing, these four component equations yield:

\[ \frac{d^2 t}{d\lambda^2} + 2\frac{R_s}{2 r^2 \sigma^2} \frac{dt}{d\lambda} \frac{dr}{d\lambda} = 0 \tag{1} \]

\[ \frac{d^2 r}{d\lambda^2} + \sigma^2 \frac{R_s}{2 r^2} \left(\frac{dt}{d\lambda}\right)^2 - \frac{R_s}{2 r^2 \sigma^2} \left(\frac{dr}{d\lambda}\right)^2 - r\sigma^2 \left(\frac{d\theta}{d\lambda}\right)^2 - r\sigma^2 \sin^2\theta \left(\frac{d\phi}{d\lambda}\right)^2 = 0 \tag{2} \]

\[ \frac{d^2\theta}{d\lambda^2} + \frac{1}{r} \frac{dr}{d\lambda} \frac{d\theta}{d\lambda} - \cos\theta\sin\theta \left(\frac{d\phi}{d\lambda}\right)^2 = 0 \tag{3} \]

\[ \frac{d^2\phi}{d\lambda^2} + \frac{1}{r} \frac{dr}{d\lambda} \frac{d\phi}{d\lambda} + 2\cos\theta\sin\theta \frac{d\theta}{d\lambda} \frac{d\phi}{d\lambda} = 0 \tag{4} \]


Elegant Approach According to Asaf Pe’er

We first follow the elegant approach of Asaf Pe’er from his article “Schwarzschild Solution and Black Holes” (Pe’er, 2014), and then present a simpler derivation.

According to Asaf Pe’er:

“At first sight, there seems to be little hope of solving this set of four coupled equations in a simple way. Fortunately, our task is greatly simplified by the high degree of symmetry of the Schwarzschild metric.”

Schwarzschild spacetime has four Killing fields: three due to spherical symmetry, and one due to time translation. Each Killing field leads, via Noether’s theorem, to a constant of motion for a freely falling particle.

If \( K_\mu \) is a Killing field, then:

\[ K_\mu \frac{dx^\mu}{d\lambda} = \text{constant} \tag{5} \]

There is also another constant of motion that follows from metric compatibility. Along a geodesic worldline:

\[ ds^2 = g_{\mu\nu} dx^\mu dx^\nu, \qquad \left(\frac{ds}{d\lambda}\right)^2 = c^2 \left(\frac{d\tau}{d\lambda}\right)^2 = c^2 \varepsilon = g_{\mu\nu} \frac{dx^\mu}{d\lambda} \frac{dx^\nu}{d\lambda} \tag{6} \]

Here:

Use of Symmetries and Conservation Laws

Instead of solving the four coupled geodesic equations directly, we use the symmetries that lead to conservation laws via Killing fields.

In flat spacetime, symmetries (via Noether) lead to familiar conserved quantities:

For the Schwarzschild metric, the same applies:

Thus the conserved quantities are:

For photons, these are the actual energy and angular momentum. (For more on angular momentum, see Appendix 10.)

Note that equation (11) is the general‑relativistic analogue of Kepler’s second law: equal areas are swept out in equal times.


Alternative Derivation

Although Asaf Pe’er notes that solving the full set of coupled geodesic equations appears difficult, several of these equations turn out to be relatively straightforward to solve. We demonstrate this using equations (3.2.1) and (3.2.4).

Step 1 — Using Equation (3.2.1)

\[ \frac{d^2 t}{d\lambda^2} + 2\,\frac{R_s}{2 r^2 \sigma^2} \frac{dt}{d\lambda} \frac{dr}{d\lambda} = 0 \tag{3.2.1} \]

where: \[ \sigma^2 = 1 - \frac{2GM}{c^2 r} = 1 - \frac{R_s}{r}. \]

We multiply both sides by \( \sigma^2 \):

\[ \sigma^2 \frac{d^2 t}{d\lambda^2} + \frac{R_s}{r^2} \frac{dt}{d\lambda} \frac{dr}{d\lambda} = 0. \]

Since \( \sigma^2 = 1 - \frac{R_s}{r} \), we rewrite this as:

\[ \left(1 - \frac{R_s}{r}\right)\frac{d^2 t}{d\lambda^2} + \frac{R_s}{r^2} \frac{dt}{d\lambda} \frac{dr}{d\lambda} = 0. \]

We rewrite this:

\[ \frac{d^2 t}{d\lambda^2} + \frac{R_s}{r^2} \frac{dt}{d\lambda} \frac{dr}{d\lambda} - \frac{R_s}{r} \frac{d^2 t}{d\lambda^2} = 0 \]

Or:

\[ \frac{d}{d\lambda} \left( \frac{dt}{d\lambda} - \frac{R_s}{r}\frac{dt}{d\lambda} \right) = 0 \]

Thus: \[ \frac{d}{d\lambda} \left[ \frac{dt}{d\lambda}\left(1 - \frac{R_s}{r}\right) \right] = 0 \]

This shows that the expression \[ \frac{dt}{d\lambda}\left(1 - \frac{R_s}{r}\right) \] is constant along the worldline.

We recognize this as the conserved quantity related to the total energy per unit mass of the particle. Multiplying by \( c \) gives:

\[ \frac{c\,dt}{d\lambda}\left(1 - \frac{R_s}{r}\right) = \text{constant} = \frac{E}{c}\, (\text{total energy}) \tag{9} \]

Step 2 — Using equation (3.2.4)

We now continue with equation (3.2.4). To simplify the derivation, we assume that the particle moves in the equatorial plane, so that: \[ \theta = \frac{\pi}{2}. \]

Then equation (3.2.4) becomes:

\[ \frac{d^2 \Phi}{d\lambda^2} + \frac{2}{r} \frac{dr}{d\lambda} \frac{d\Phi}{d\lambda} + 2\frac{\cos\theta}{\sin\theta}\, \frac{d\theta}{d\lambda} \frac{d\Phi}{d\lambda} = 0. \]

Because in the equatorial plane: \[ \theta = \frac{\pi}{2} \quad\Rightarrow\quad \cos\theta = 0,\;\; \sin\theta = 1, \] the last term vanishes and we obtain:

\[ \frac{d^2 \Phi}{d\lambda^2} + \frac{2}{r} \frac{dr}{d\lambda} \frac{d\Phi}{d\lambda} = 0. \]

This leads to:

\[ \frac{1}{r^2}\frac{d}{d\lambda}\left(r^2 \frac{d\phi}{d\lambda}\right)=0 \]

Which means that:

\[ r^2\frac{d\phi}{d\lambda} \]

is also constant along the geodesic worldline. We recognize this constant as the angular momentum per unit mass:

\[ r^2\frac{d\phi}{d\lambda}= \text{constant} = L \;(\text{angular momentum}) \tag{11} \]

Summary of the conserved quantities

Due to the symmetries, there are two conserved quantities:

4.2.1 The Gravitational Potential

With the previously derived conservation laws, we can now further analyze the motion of particles in the Schwarzschild metric. We begin by writing out equation (6), using the conserved quantities from equations (10) and (11):

\[ \left( 1 - \frac{2GM}{c^{2} r} \right) c^{2} \left( \frac{dt}{d\lambda} \right)^{2} - \left( 1 - \frac{2GM}{c^{2} r} \right)^{-1} \left( \frac{dr}{d\lambda} \right)^{2} - r^{2} \left( \frac{d\phi}{d\lambda} \right)^{2} = c^{2} \varepsilon \tag{12} \]

Substituting the conserved quantities E and L gives:

\[ \left( 1 - \frac{2GM}{c^{2} r} \right) c^{2} \left( \frac{dt}{d\lambda} \right)^{2} - \left( 1 - \frac{2GM}{c^{2} r} \right)^{-1} \left( \frac{dr}{d\lambda} \right)^{2} - \frac{L^{2}}{r^{2}} = c^{2} \varepsilon \]

We multiply this equation by \( 1 - \frac{2GM}{c^{2} r} \) and use \( \frac{E}{c} = c \frac{dt}{d\lambda}\left( 1 - \frac{2GM}{c^{2} r} \right) \) and \( L = r^{2} \frac{d\phi}{d\lambda} \) to rewrite:

\[ \left( 1 - \frac{2GM}{c^{2} r} \right)^{2} c^{2} \left( \frac{dt}{d\lambda} \right)^{2} - \left( \frac{dr}{d\lambda} \right)^{2} - \left( 1 - \frac{2GM}{c^{2} r} \right) \frac{L^{2}}{r^{2}} + c^{2} \varepsilon = 0 \]

Substituting the expression for E yields:

\[ \frac{E^{2}}{c^{2}} - \left( \frac{dr}{d\lambda} \right)^{2} - \left( 1 - \frac{2GM}{c^{2} r} \right) \frac{L^{2}}{r^{2}} + c^{2} \varepsilon = 0 \tag{13} \]

We have thus reduced the four coupled geodesic equations to a single differential equation for \( r(\lambda) \). This is a major simplification.

We rewrite equation (13) in the form:

\[ \frac{1}{2} \left( \frac{dr}{d\lambda} \right)^{2} + V(r) = \frac{1}{2} \frac{E^{2}}{c^{2}} \tag{14} \]

with the effective potential \( V(r) \) defined as:

\[ V(r) = \frac{1}{2} c^{2} \varepsilon - \frac{\varepsilon GM}{r} + \frac{L^{2}}{2 r^{2}} - \frac{GM L^{2}}{c^{2} r^{3}} \tag{15} \]

Equation (14) is formally identical to the classical equation for the motion of a particle (of unit mass) in a one‑dimensional potential \( V(r) \), where the total “energy” is \( \frac{1}{2} \frac{E^{2}}{c^{2}} \). Of course, the true energy is \( E \), but this form makes the equation analogous to classical mechanics.

If we analyze the potential \( V(r) \) in equation (15), we see that it differs from the Newtonian potential in only one place: the last term. This term, proportional to \( 1/r^{3} \), represents a purely relativistic correction and becomes important especially at small \( r \).

The terms can be interpreted as follows:

Note: despite the formal similarity to classical mechanics, this does not describe the motion of a particle freely moving in one dimension. In reality, the particle follows an orbit around a massive object. The relevant quantities are not only \( r(\lambda) \), but also \( t(\lambda) \) and \( \phi(\lambda) \), which together describe the full spacetime trajectory.

vector_4_2_1
Figure 1 – Trajectories of particles in a gravitational potential.

4.2.2 Intermezzo on Energy in Schwarzschild Geometry

In this intermezzo we analyze the form of the energy as derived in equation (3.2.9) of section 4.2. This energy is a conserved quantity in Schwarzschild geometry.

We begin with the relation:

\[ \left( 1 - \frac{2GM}{c^{2} r} \right) \frac{dt}{d\lambda} = \frac{E}{m c^{2}} = \sigma^{2} \frac{dt}{d\lambda} \tag{9} \]

From which follows:

\[ E = \sigma^{2} m c^{2} \frac{dt}{d\lambda} \]

The Schwarzschild metric is:

\[ ds^{2} = c^{2} d\tau^{2} = \sigma^{2} c^{2} dt^{2} - \frac{dr^{2}}{\sigma^{2}} - r^{2} d\theta^{2} - r^{2} \sin^{2}\theta \, d\phi^{2} \]

We use an affine parameter \( \lambda \) with \( d\tau = d\lambda \) and restrict ourselves to the equatorial plane \( \theta = \frac{\pi}{2} \).

\[ \sigma^{2} c^{2} \left( \frac{dt}{d\lambda} \right)^{2} - \sigma^{-2} \left( \frac{dr}{d\lambda} \right)^{2} - r^{2} \left( \frac{d\phi}{d\lambda} \right)^{2} = c^{2} \varepsilon \]

Rewriting in terms of velocities with respect to coordinate time \( t \), we obtain:

\[ \sigma^{2} c^{2} \left(\frac{dt}{d\lambda}\right)^{2} \;-\; \sigma^{-2} \left(\frac{dr}{dt}\right)^{2} \left(\frac{dt}{d\lambda}\right)^{2} \;-\; r^{2} \left(\frac{d\Phi}{dt}\right)^{2} \left(\frac{dt}{d\lambda}\right)^{2} = c^{2}\,\varepsilon \]

For massive particles \( \varepsilon = 1 \):

\[ \sigma^{2} \left( \frac{dt}{d\lambda} \right)^{2} \left( 1 - \frac{\sigma^{-2} \left( \frac{dr}{dt} \right)^{2} + r^{2} \left( \frac{d\phi}{dt} \right)^{2}}{\sigma^{2} c^{2}} \right) = 1 \tag{9a} \] \[ \left( \frac{dt}{d\lambda} \right)^{2}\left( 1 - \frac{v^2}{\sigma^2 c^2}\right) =\frac{1}{\sigma^2} \]

from which follows:

\[ \frac{dt}{d\lambda} = \frac{1}{\sigma} \frac{1}{\sqrt{1 - \frac{v^{2}}{\sigma^{2} c^{2}}}} \]

Where \( v^{2} = \sigma^{-2} \left( \frac{dr}{dt} \right)^{2} + r^{2} \left( \frac{d\phi}{dt} \right)^{2} \) is the total velocity. Combined with the energy (equation 9), this yields:

\[ E = \sigma^{2} m c^{2} \frac{dt}{d\lambda} = \frac{\sigma m c^{2}} {\sqrt{1 - \frac{v^{2}}{\sigma^{2} c^{2}}}} = \gamma_\sigma \sigma m c^{2} \]

This conserved energy consists of:

In the non‑relativistic limit \( v \ll c \), and using a first‑order Taylor expansion of the square root, we find the “kinetic” energy:

\[ E_{\text{kin}} \approx \sigma m c^{2} \left( 1+\frac{v^{2}}{2\sigma^{2}c^2}-1 \right)= \frac{m v^{2}}{2 \sigma} \]

4.2.2.1 Alternative Approach via the Metric

Begin with the Schwarzschild equation:

\[ ds^2 = c^2 d\tau^2 = \sigma^2 c^2 dt^2 - \sigma^{-2} dr^2 - r^2 d\theta^2 - r^2 \sin^2\theta\, d\phi^2 \]

For the equatorial plane \(\theta = \pi/2\) and with an affine parameter \(\lambda\), we have:

\[ \sigma^2 c^2 \left(\frac{dt}{d\lambda}\right)^2 - \sigma^{-2} \left(\frac{dr}{d\lambda}\right)^2 - r^2 \left(\frac{d\phi}{d\lambda}\right)^2 = c^2 \varepsilon \]

Multiply by \(\sigma^2\):

\[ \sigma^4 c^2 \left(\frac{dt}{d\lambda}\right)^2 - \left(\frac{dr}{d\lambda}\right)^2 - \sigma^2 r^2 \left(\frac{d\phi}{d\lambda}\right)^2 = \sigma^2 c^2 \varepsilon \]

Use again:

\[ E = \sigma^2 m c^2 \frac{dt}{d\lambda} \quad \Rightarrow \quad \frac{E}{m c} = \sigma^2 c \frac{dt}{d\lambda} \]

Then it follows:

\[ \left(\frac{E}{m c}\right)^2 = \left(\frac{dr}{d\lambda}\right)^2 + \sigma^2 r^2 \left(\frac{d\phi}{d\lambda}\right)^2 + \sigma^2 c^2 \varepsilon \]

Define:

\[ r^2 \frac{d\phi}{d\lambda} = \frac{L}{m} \quad \Rightarrow \quad r^2 \left(\frac{d\phi}{d\lambda}\right)^2 = \frac{L^2}{r^2 m^2} = \frac{(m v_t r)^2}{r^2 m^2} = v_t^2 \]

Now take \(\lambda = \tau\) and \(\varepsilon = 1\):

\[ \left(\frac{E}{m c}\right)^2 = \left(\frac{dr}{d\tau}\right)^2 + \sigma^2 v_t^2 + \sigma^2 c^2 = v_r^2 + \sigma^2 v_t^2 + \sigma^2 c^2 \]

\[ \frac{E^2}{c^2} = m^2 v_r^2 + m^2 \sigma^2 v_t^2 + m^2 \sigma^2 c^2 \]

where:

The terms are interpreted as:

The kinetic energy becomes:

\[ E_{\text{kin}} = m c\sqrt{v_r^2 + \sigma^2 v_t^2} \]

4.2.2.2 Third Approach: via a Relativistic Energy–Momentum Relation

Start again with the norm of the 4‑velocity:

\[ \sigma^2 c^2 \left(\frac{dt}{d\lambda}\right)^2 - \sigma^{-2} \left(\frac{dr}{d\lambda}\right)^2 - r^2 \left(\frac{d\phi}{d\lambda}\right)^2 = c^2 \varepsilon \]

This leads to:

\[ \left(\frac{E}{\sigma c}\right)^2 - \sigma^{-2} \left(\frac{dr}{d\lambda}\right)^2 - r^2 \left(\frac{d\phi}{d\lambda}\right)^2 = c^2 \varepsilon \]

\[ \frac{E^2}{\sigma^2 c^2} - p^2 = c^2 \varepsilon \quad \Rightarrow \quad E^2 = \sigma^2 c^4 \varepsilon + \sigma^2 c^2 p^2 \]

Here \(p\) is the total spatial momentum per unit mass. In rest, \(E=\sigma m c^2\), and for a photon \(E=\sigma p c\). In general:

\[ E^2 = \sigma^2 m^2 c^4 \varepsilon + \sigma^2 m^2 c^2 U^2 \]

where:

\[ U^2 =\left(\frac{dx^\mu}{d\tau}\right)^2 = \sigma^{-2} \left(\frac{dr}{d\tau}\right)^2 + r^2 \left(\frac{d\phi}{d\tau}\right)^2 \]

is the squared norm of the spatial velocity. For massive particles (\(\varepsilon=1\)) and unit mass this becomes:

\[ E^2 = \sigma^2 c^4 + \sigma^2 c^2 U^2 \]

4.2.3 Summary

4.3 Experiment 3 – Deflection of Light

4.3.1 Historical and Theoretical Background

The deflection of light by gravity was the first experimental test of general relativity. In classical Newtonian gravity, light—being massless—moves in straight lines unaffected by gravity. According to general relativity, however, light follows the curvature of spacetime caused by mass.

Thus a light ray deviates from a straight line when passing near a massive object such as the Sun. This effect can be observed when viewing the light of a star that appears visually close to the Sun.

When the star’s light grazes the Sun, it is bent, so the star appears at a different position in the sky than its true location. Half a year later, when the star is on the opposite side of the sky, its light passes far from the Sun and its position is observed correctly.

To observe this effect, a solar eclipse is required, because otherwise sunlight overwhelms the starlight. In 1919, this effect was first measured by Arthur Eddington during a total solar eclipse. His observations confirmed Einstein’s prediction and marked a major breakthrough in the acceptance of general relativity.

4.3.2 Derivation of the Deflection Angle

We consider a light ray (photon) approaching from infinity and passing near the Sun. The motion of the photon in Schwarzschild spacetime is described by the effective energy equation derived in Section 4.2.1.

For a photon, \(\varepsilon = 0\), so:

\[ \frac{1}{2}\left(\frac{dr}{d\lambda}\right)^2 + V(r) = \frac{1}{2}\frac{E^2}{c^2} \tag{14} \]

with:

\[ V(r) = \frac{1}{2}c^2\varepsilon - \varepsilon \frac{GM}{r} + \frac{L^2}{2r^2} - \frac{GM L^2}{c^2 r^3} \tag{15} \]

where \(L\) is the angular momentum and \(E\) the energy of the photon.

For \(\varepsilon = 0\):

\[ \frac{1}{2}\left(\frac{dr}{d\lambda}\right)^2 + \frac{L^2}{2r^2} - \frac{GM L^2}{c^2 r^3} = \frac{1}{2}\frac{E^2}{c^2} \]

Divide by \(L^2\) and multiply by 2:

\[ \frac{1}{L^2}\left(\frac{dr}{d\lambda}\right)^2 + \frac{1}{r^2} - \frac{2GM}{c^2 r^3} = \frac{E^2}{c^2 L^2} \]

Or equivalently:

\[ \frac{1}{L^2}\left(\frac{dr}{d\lambda}\right)^2 + \frac{1}{r^2} \left(1 - \frac{2GM}{c^2 r}\right) = \frac{E^2}{c^2 L^2} \]

Isolating \((dr/d\lambda)^2\) gives:

\[ \left(\frac{dr}{d\lambda}\right)^2 = L^2 \left( \frac{E^2}{c^2 L^2} - \frac{1}{r^2} \left(1 - \frac{2GM}{c^2 r}\right) \right) \tag{16} \]

4.3.3 Impact Parameter and Angular Momentum

The impact parameter \(b\) is the distance between the centerline of the massive object (the Sun) and the asymptotic direction of the light ray at infinity.

vector_4_3_3
Figure 2. Definition of the impact parameter b. The moving particle approaches the mass M from a large distance with momentum p. A test particle with parallel velocity dives radially toward M. The distance b between their initially parallel paths at “infinity” is the impact parameter.

The angular momentum of the photon is:

\[ L = p\,b \tag{17} \]

For a photon, \(E = pc\), so:

\[ b = \frac{L}{E/c} \tag{18} \]

Additional clarification of (17) and (18):

vector_4_3_3_1

The angular momentum is \(L = p \sin\phi \cdot r = p r \sin\phi\).

The energy in general is \(E^2 = p^2 c^2 + m^2 c^4\); for a photon \(m=0\), so \(E = pc\). Thus:

\[ \frac{L}{E/c}=\frac{pb}{pc/c}=b \]

Hence:

\[ \frac{1}{b^2} = \frac{E^2}{c^2 L^2} \tag{18a} \]

4.3.4 Deriving the Path: The Photon Orbit Equation

From the conserved quantity:

\[ r^2 \frac{d\phi}{d\lambda} = L \]

Thus:

\[ \frac{d\phi}{d\lambda} = \frac{d\phi}{dr}\frac{dr}{d\lambda} = \frac{L}{r^2} \quad \Rightarrow \quad \frac{d\phi}{dr} = \frac{L}{r^2} \left(\frac{dr}{d\lambda}\right)^{-1} \]

Combined with (16):

\[ \frac{d\phi}{dr} = \pm \frac{L}{r^2} \frac{1}{L} \left[ \frac{E^2}{c^2L^2} - \frac{1}{r^2} \left(1 - \frac{2GM}{c^2r}\right) \right]^{-1/2} \]

Using (18a):

\[ \frac{d\phi}{dr} = \pm \frac{1}{r^2} \left( \frac{1}{b^2} - \frac{1}{r^2} \left(1 - \frac{2GM}{c^2 r}\right) \right)^{-1/2} \tag{19} \]

This leads to:

\[ \left(\frac{1}{r^2}\frac{dr}{d\phi}\right)^2 = \frac{1}{b^2} - \frac{1}{r^2} \left(1 - \frac{2GM}{c^2 r}\right) \tag{20} \]

4.3.5 Integrating the Path

The deflection angle is obtained by computing the angular change \(\Delta \phi\) along the photon’s trajectory, from infinity to the perihelion \(r = R\), and back. From (19) we obtain (see Figure 3):

\[ \Delta \phi = 2 \int_{r_1}^{\infty} \frac{dr}{r^2 \sqrt{ \frac{1}{b^2} - \frac{1}{r^2} \left( 1 - \frac{2GM}{c^2 r} \right) }} \tag{21} \]

vector_4_3_5_1
Fig. 3 – Deflection of light by angle \(\delta \varphi_{\text{def}}\)

At the turning point \(r = R\) we have \[ \frac{dr}{d\phi} = 0 \] so that from equation (20) it follows:

\[ \frac{1}{b^2} = \frac{1}{R^2} \left( 1 - \frac{2GM}{c^2 R} \right) \tag{22} \]

Substitute this into equation (20):

\[ \left( \frac{1}{r^2} \frac{dr}{d\phi} \right)^2 = \frac{1}{R^2} \left( 1 - \frac{2GM}{c^2 R} \right) - \frac{1}{r^2} \left( 1 - \frac{2GM}{c^2 r} \right) \tag{23} \]

4.3.6 Variable substitution

Perform the following substitution:

\[ u = \frac{R}{r} \]

Then:

\[ \frac{du}{d\phi} = \frac{du}{dr} \frac{dr}{d\phi} = - \frac{R}{r^2} \frac{dr}{d\phi} \] \[ \left( \frac{du}{d\phi} \right)^2 = \frac{R^2}{r^4} \left( \frac{dr}{d\phi} \right)^2 \]

Here \(u\) varies between \(1\) (at \(r = R\)) and \(0\) (at \(r = \infty\)). Equation (23) then becomes:

\[ \left( \frac{du}{d\phi} \right)^2 = \left( 1 - \frac{2GM}{c^2 R} \right) - u^2 \left( 1 - \frac{2GM}{c^2 R} u \right) \]

Or:

\[ \left( \frac{du}{d\phi} \right)^2 = 1 - u^2 - \frac{2GM}{c^2 R} \left( 1 - u^3 \right) \tag{24} \]

From this it follows:

\[ d\phi = \left[ 1 - u^2 - \frac{2GM}{c^2 R} \left( 1 - u^3 \right) \right]^{-1/2} du \] \[ =\frac{\left(1-u^2\right)^{-1/2}}{\left[1-\frac{2GM}{c^2R}\left(1-u^3\right) \left(1-u^2\right)^{-1}\right]^{1/2}}\,du \tag{25} \]

This integral is difficult to solve in closed form. To simplify it, we use the substitution:

\[ u = \cos \alpha, \qquad 0 < \alpha < \frac{\pi}{2} \quad thus \quad 0 < u < 1 \]

Then:

\[ d\phi = - \left[ 1 - \frac{2GM}{c^2 R} \frac{1 - \cos^3 \alpha}{\sin^2 \alpha} \right]^{-1/2} d\alpha \tag{26} \]

By noting that:

\[ \frac{1 - \cos^3 \alpha}{\sin^2 \alpha} = \frac{(1 - \cos \alpha)(1 + \cos \alpha + \cos^2 \alpha)} {(1 - \cos \alpha)(1 + \cos \alpha)} = \frac{1 + \cos \alpha + \cos^2 \alpha}{1 + \cos \alpha} \] \[ =\frac{1 + \cos \alpha \left(1 + \cos \alpha\right)}{1 + \cos \alpha} =\cos \alpha+\frac{1}{1 + \cos \alpha} \]

we finally obtain:

\[ d\phi = - \left[ 1 - \frac{2GM}{c^2 R}\left(\cos\alpha+\frac{1}{1+\cos\alpha}\right) \right]^{-1/2} d\alpha \tag{27} \]

With:

\[ \cos \alpha = \frac{R}{r} \]

Up to this point no approximation has been applied. This full derivation is suitable for computing the light deflection exactly, although in practice a first‑order approximation is usually sufficient to determine the deflection angle at the solar limb.

4.3.6.1 Approximations and integration

The parameter \[ \frac{2GM}{c^2 R} \approx 4.24 \times 10^{-6} \] is very small at the surface of the Sun.

We apply the Taylor approximation:

\[ \frac{1}{\sqrt{1 - k}} \approx 1 + \frac{1}{2} k \]

Applied to equation (27) this yields:

\[ d\phi = - \left[ 1 - \frac{2GM}{c^2 R}\left(\cos\alpha+\frac{1}{1+\cos\alpha}\right) \right]^{-1/2} d\alpha \]

Or:

\[ d\phi = -\frac{1}{\sqrt{\left[ 1 - \frac{2GM}{c^2 R}\left(\cos\alpha+\frac{1}{1+\cos\alpha}\right) \right]}}d\alpha \]

After approximation:

\[ d\phi \approx - \left[ 1 + \frac{GM}{c^2 R} \left(\cos \alpha+\frac{1}{1 + \cos \alpha}\right) \right] d\alpha \tag{28} \]

We can now compute the total change in azimuth along the entire photon trajectory, from \(\alpha=0\) to \(\alpha=\frac{\pi}{2}\), and then double it:

The total angular change becomes:

\[ \Delta \phi = 2 \int_{0}^{\pi/2} \left[ 1 + \frac{GM}{c^2 R} \left(\cos \alpha+\frac{1}{1 + \cos \alpha}\right) \right] d\alpha \tag{29} \]

To evaluate this integral, we examine the second term separately. We consider the integral:

\[ \int \frac{1}{1+\cos\alpha}\, d\alpha \]

We use the trigonometric identity:

\[ 1+\cos\alpha = 2\cos^2\frac{\alpha}{2}. \] Thus: \[ \frac{1}{1+\cos\alpha} = \frac{1}{2\cos^2\frac{\alpha}{2}}. \]

Note that:

\[ \frac{1}{2\cos^2\frac{\alpha}{2}} = \frac{d}{d\alpha}\left(\tan\frac{\alpha}{2}\right). \] Thus: \[ \int \frac{1}{1+\cos\alpha}\, d\alpha = \tan\frac{\alpha}{2}. \]

Now we can evaluate the full integral (29):

\[ \Delta \phi = 2 \left[ \alpha + \frac{GM}{c^2 R} \left( \sin \alpha + \tan \frac{\alpha}{2} \right) \right]_{0}^{\pi/2} \tag{30} \]

We substitute:

\[ \begin{aligned} \alpha = \frac{\pi}{2}: &\qquad \sin\frac{\pi}{2} = 1,\qquad \tan\frac{\pi}{4} = 1,\\[6pt] \alpha = 0: &\qquad \sin 0 = 0,\qquad \tan 0 = 0. \end{aligned} \] Thus: \[ \Delta\phi = 2\left( \frac{\pi}{2} + \frac{GM}{c^{2}R}(1 + 1) \right) = \pi + \frac{4GM}{c^{2}R}. \] \[ \Delta \phi = \pi + \frac{4GM}{c^2 R} \tag{31} \]

Note: the integral should run from \(r=\infty\) to \(R\), so \(u\) goes from 0 to 1, and thus \(\alpha\) from \(\frac{\pi}{2}\) to 0. By reversing the limits to \(0\) to \(\frac{\pi}{2}\), the sign changes and the minus sign disappears.

vector_4_3_6_1

The first term, \(\pi\), is the total angular change of a photon in flat spacetime — a straight path without deflection. The second term is the extra deflection due to spacetime curvature. The actual deflection angle is therefore:

\[ \delta \phi_{\mathrm{def}} = \Delta \phi - \pi \approx \frac{4GM}{c^2 R} \tag{32} \]

Numerical value

With:

\[ \begin{aligned} G &= 6.674 \cdot 10^{-11}\ \mathrm{N\,m^{2}/kg^{2}},\\ M_{s} &= 1.989 \cdot 10^{30}\ \mathrm{kg},\\ c &= 3.00 \cdot 10^{8}\ \mathrm{m/s},\\ R_{s} &= 6.963 \cdot 10^{8}\ \mathrm{m}, \end{aligned} \] we find: \[ \delta\phi_{\mathrm{def}} = \frac{4GM_{s}}{c^{2}R_{s}} \approx 8.5 \cdot 10^{-6}\ \text{radians}. \]

To convert this to arcseconds we use:

\[ 1\ \text{rad} \approx 206.265''. \] Thus: \[ \delta\phi_{\mathrm{def}} \approx 8.5 \cdot 10^{-6} \times 206.265'' \approx 1.75''. \]

4.3.7 Conclusion

This deflection of 1.75 arcseconds was first observed by Arthur Eddington during the 1919 solar eclipse. The result spectacularly confirmed Einstein’s prediction and marked a milestone in the experimental verification of general relativity.

This effect is also observed outside our solar system and is known as “gravitational lensing”.

4.3.8 Physical interpretation

At the turning point \( r = R \) we have \[ \frac{dr}{d\phi} = 0 \] so that from equation (20) it follows:

\[ \frac{1}{b^2} = \frac{1}{R^2} \left( 1 - \frac{2GM}{c^2 R} \right) \tag{22} \]

Insert this into equation (20):

\[ \left( \frac{1}{r^2} \frac{dr}{d\phi} \right)^2 = \frac{1}{R^2} \left( 1 - \frac{2GM}{c^2 R} \right) - \frac{1}{r^2} \left( 1 - \frac{2GM}{c^2 r} \right) \tag{23} \]

4.3.6 Variable substitution

Perform the following substitution:

\[ u = \frac{R}{r} \]

Then:

\[ \frac{du}{d\phi} = \frac{du}{dr} \frac{dr}{d\phi} = - \frac{R}{r^2} \frac{dr}{d\phi} \] \[ \left( \frac{du}{d\phi} \right)^2 = \frac{R^2}{r^4} \left( \frac{dr}{d\phi} \right)^2 \]

Here \(u\) varies between \(1\) (at \(r = R\)) and \(0\) (at \(r = \infty\)). Equation (23) becomes:

\[ \left( \frac{du}{d\phi} \right)^2 = \left( 1 - \frac{2GM}{c^2 R} \right) - u^2 \left( 1 - \frac{2GM}{c^2 R} u \right) \]

Or:

\[ \left( \frac{du}{d\phi} \right)^2 = 1 - u^2 - \frac{2GM}{c^2 R} \left( 1 - u^3 \right) \tag{24} \]

From this it follows:

\[ d\phi = \left[ 1 - u^2 - \frac{2GM}{c^2 R} \left( 1 - u^3 \right) \right]^{-1/2} du \] \[ = \frac{(1-u^2)^{-1/2}} {\left[ 1 - \frac{2GM}{c^2 R}(1-u^3)(1-u^2)^{-1} \right]^{1/2}} \, du \tag{25} \]

This integral is difficult to solve in closed form. To simplify it, we use the substitution:

\[ u = \cos \alpha, \qquad 0 < \alpha < \frac{\pi}{2} \quad\Rightarrow\quad 0 < u < 1 \]

Then:

\[ d\phi = - \left[ 1 - \frac{2GM}{c^2 R} \frac{1 - \cos^3 \alpha}{\sin^2 \alpha} \right]^{-1/2} d\alpha \tag{26} \]

By noting that:

\[ \frac{1 - \cos^3 \alpha}{\sin^2 \alpha} = \frac{(1 - \cos \alpha)(1 + \cos \alpha + \cos^2 \alpha)} {(1 - \cos \alpha)(1 + \cos \alpha)} = \frac{1 + \cos \alpha + \cos^2 \alpha}{1 + \cos \alpha} \] \[ = \cos \alpha + \frac{1}{1 + \cos \alpha} \]

we finally obtain:

\[ d\phi = - \left[ 1 - \frac{2GM}{c^2 R} \left( \cos\alpha + \frac{1}{1+\cos\alpha} \right) \right]^{-1/2} d\alpha \tag{27} \]

With:

\[ \cos \alpha = \frac{R}{r} \]

Up to this point no approximation has been applied. This full derivation is suitable for computing the light deflection exactly, although in practice a first‑order approximation is usually sufficient to determine the deflection angle near the solar limb.

4.3.6.1 Approximations and integration

The parameter \[ \frac{2GM}{c^2 R} \approx 4.24 \times 10^{-6} \] is extremely small at the solar surface.

We apply the Taylor approximation:

\[ \frac{1}{\sqrt{1 - k}} \approx 1 + \frac{1}{2} k \]

Applied to equation (27) this yields:

\[ d\phi = - \left[ 1 - \frac{2GM}{c^2 R} \left( \cos\alpha + \frac{1}{1+\cos\alpha} \right) \right]^{-1/2} d\alpha \]

Or:

\[ d\phi = - \frac{1}{ \sqrt{ 1 - \frac{2GM}{c^2 R} \left( \cos\alpha + \frac{1}{1+\cos\alpha} \right) }} d\alpha \]

After approximation:

\[ d\phi \approx - \left[ 1 + \frac{GM}{c^2 R} \left( \cos \alpha + \frac{1}{1 + \cos \alpha} \right) \right] d\alpha \tag{28} \]

We can now compute the total azimuthal change along the full photon trajectory, from \(\alpha = 0\) to \(\alpha = \pi/2\), and double it:

The total angle change becomes:

\[ \Delta \phi = 2 \int_{0}^{\pi/2} \left[ 1 + \frac{GM}{c^2 R} \left( \cos \alpha + \frac{1}{1 + \cos \alpha} \right) \right] d\alpha \tag{29} \]

To evaluate this integral, consider the second term separately:

\[ \int \frac{1}{1+\cos\alpha}\, d\alpha \]

Using the trigonometric identity:

\[ 1+\cos\alpha = 2\cos^2\frac{\alpha}{2} \] \[ \frac{1}{1+\cos\alpha} = \frac{1}{2\cos^2(\alpha/2)} = \frac{d}{d\alpha}\left(\tan\frac{\alpha}{2}\right) \]

Thus:

\[ \int \frac{1}{1+\cos\alpha}\, d\alpha = \tan\frac{\alpha}{2} \]

Now we can evaluate the full integral (29):

\[ \Delta \phi = 2 \left[ \alpha + \frac{GM}{c^2 R} \left( \sin \alpha + \tan \frac{\alpha}{2} \right) \right]_{0}^{\pi/2} \tag{30} \]

Substituting:

\[ \alpha = \frac{\pi}{2}: \quad \sin\frac{\pi}{2} = 1,\quad \tan\frac{\pi}{4} = 1, \] \[ \alpha = 0: \quad \sin 0 = 0,\quad \tan 0 = 0. \]

Thus:

\[ \Delta\phi = 2\left( \frac{\pi}{2} + \frac{GM}{c^{2}R}(1 + 1) \right) = \pi + \frac{4GM}{c^{2}R}. \] \[ \Delta \phi = \pi + \frac{4GM}{c^2 R} \tag{31} \]

Note: the integral should run from \(r=\infty\) to \(R\), so \(u\) goes from 0 to 1, and thus \(\alpha\) from \(\pi/2\) to 0. Reversing the limits removes the minus sign.

The first term, \(\pi\), is the total angular change of a photon in flat spacetime — a straight line with no deflection. The second term is the extra deflection due to spacetime curvature. Thus the physical deflection angle is:

\[ \delta \phi_{\mathrm{def}} = \Delta \phi - \pi \approx \frac{4GM}{c^2 R} \tag{32} \]

Numerical value

With:

\[ \begin{aligned} G &= 6.674 \cdot 10^{-11}\ \mathrm{N\,m^{2}/kg^{2}},\\ M_{s} &= 1.989 \cdot 10^{30}\ \mathrm{kg},\\ c &= 3.00 \cdot 10^{8}\ \mathrm{m/s},\\ R_{s} &= 6.963 \cdot 10^{8}\ \mathrm{m}, \end{aligned} \]

we find:

\[ \delta\phi_{\mathrm{def}} = \frac{4GM_{s}}{c^{2}R_{s}} \approx 8.5 \cdot 10^{-6}\ \text{radians}. \]

Converting to arcseconds:

\[ 1\ \text{rad} \approx 206.265'' \] \[ \delta\phi_{\mathrm{def}} \approx 1.75''. \]

4.3.7 Conclusion

This deflection of 1.75 arcseconds was first observed by Arthur Eddington during the 1919 solar eclipse. The result spectacularly confirmed Einstein’s prediction and marked a milestone in the experimental verification of general relativity.

This effect is also observed outside our solar system and is known as “gravitational lensing”.

4.3.8 Physical interpretation

4.3.9 Key insights

4.4 Experiment 4 – Precession of the Perihelia (Mercury)

Based on the article by Owen Biesel (Biesel, 2008).

4.4.1 Introduction

4.4.2 Theoretical framework: Schwarzschild metric

In general relativity, we treat a planet such as Mercury as a test particle moving along a geodesic path through curved spacetime.

The Schwarzschild metric describes this spacetime around a spherically symmetric mass (such as the Sun):

\[ ds^{2} = c^{2} d\tau^{2} = \sigma^{2} c^{2} dt^{2} - \frac{dr^{2}}{\sigma^{2}} - r^{2} d\theta^{2} - r^{2} \sin^{2}\theta \, d\phi^{2} \tag{33} \]

with:

\[ \sigma = \sqrt{1 - \frac{2GM}{c^{2} r}} = \sqrt{1 - \frac{R_{s}}{r}}, \qquad R_{s} = \frac{2GM}{c^{2}} \]

For a planet moving in the equatorial plane \(\left( \theta = \frac{\pi}{2}\right)\) this simplifies to:

\[ 1 = \left(1 - \frac{R_{s}}{r}\right) \left(\frac{dt}{d\tau}\right)^{2} - \frac{1}{c^{2}} \left(1 - \frac{R_{s}}{r}\right)^{-1} \left(\frac{dr}{d\tau}\right)^{2} - \frac{1}{c^{2}} r^{2} \left(\frac{d\phi}{d\tau}\right)^{2} \]

4.4.3 Derivation of the precession via the Lagrangian approach (see Appendix 12)

Although we previously derived the expressions for the energy E (equation_3.2_9) and the angular momentum L (equation_3.2_11), we repeat the derivation here starting from the Lagrangian. We parameterize the orbit as:

\[ x^{\mu}(\tau) = \bigl( t(\tau), r(\tau), \theta(\tau), \phi(\tau) \bigr) \]

with \(\tau\) the proper time. In the equatorial plane \(\theta =\frac{\pi}{2}\) the Lagrangian becomes:

\[ \mathcal{L} = \left(1 - \frac{R_{s}}{r}\right) \left(\frac{dt}{d\tau}\right)^{2} - \frac{1}{c^{2}} \left(1 - \frac{R_{s}}{r}\right)^{-1} \left(\frac{dr}{d\tau}\right)^{2} - \frac{1}{c^{2}} r^{2} \left(\frac{d\phi}{d\tau}\right)^{2} \tag{34} \]

The Euler–Lagrange equations for ∅ and t give:

\[ \frac{d\phi}{d\tau} = \dot{\phi}, \qquad \frac{dt}{d\tau} = \dot{t}. \]

The Lagrangian then becomes:

\[ \mathcal{L} = \left(1 - \frac{R_s}{r}\right)\dot{t}^{\,2} - \frac{1}{c^{2}}\left(1 - \frac{R_s}{r}\right)^{-1}\dot{r}^{\,2} - \frac{1}{c^{2}}\, r^{2}\dot{\phi}^{\,2}. \]

4.4.4 Euler–Lagrange Operation

For \(\phi\):

\[ \frac{d}{d\tau}\left( \frac{\partial \mathcal{L}}{\partial \dot{\phi}} \right) = \frac{\partial \mathcal{L}}{\partial \phi} = 0, \]

and for \( t \):

\[ \frac{d}{d\tau}\left( \frac{\partial \mathcal{L}}{\partial \dot{t}} \right) = \frac{\partial \mathcal{L}}{\partial t} = 0. \]

Thus the Euler–Lagrange equations for \( \phi \) and \( t \) yield:

\[ 0 = \frac{d}{d\tau}\left(2 \frac{1}{c^{2}} r^{2}\frac{d\phi}{d\tau} \right) \quad\Rightarrow\quad r^{2}\frac{d\phi}{d\tau} = \text{constant} = L, \]

\[ 0 = \frac{d}{d\tau}\left[ 2\left(1 - \frac{R_s}{r}\right)\frac{dt}{d\tau} \right] \quad\Rightarrow\quad \left(1 - \frac{R_s}{r}\right)\frac{dt}{d\tau} = \text{constant} = \frac{E}{c^{2}}. \]

Hence \(L\) (angular momentum per unit mass) and \(E\) (energy per unit mass) are constants of motion.

We now rewrite the original normalization condition.

The normalization condition is:

\[ 1 = \left(1 - \frac{R_s}{r}\right)\left(\frac{dt}{d\tau}\right)^{2} - \frac{1}{c^{2}}\left(1 - \frac{R_s}{r}\right)^{-1}\left(\frac{dr}{d\tau}\right)^{2} - \frac{1}{c^{2}} r^{2}\left(\frac{d\phi}{d\tau}\right)^{2}. \]

We substitute the constants \(E\) and \(L\):

\[ 1 = \frac{\frac{E^{2}}{c^{4}}}{1-\frac{R_s}{r}} - \frac{1}{c^{2}}\frac{\left(\frac{dr}{d\tau}\right)^{2}}{1 - \frac{R_s}{r}} - \frac{L^{2}}{c^{2} r^{2}}. \]

\[ 1 - \frac{R_s}{r} = \frac{E^{2}}{c^{4}} - \frac{1}{c^{2}}\left(\frac{dr}{d\tau}\right)^{2} - \frac{L^{2}}{c^{2}r^{2}} + \frac{R_s L^{2}}{c^{2} r^{3}}. \]

Thus:

\[ \left(\frac{dr}{d\tau}\right)^{2} = c^{2}\left(\frac{E^{2}}{c^{4}} - 1\right) + c^{2}\frac{R_s}{r} - \frac{L^{2}}{r^{2}} + \frac{R_s L^{2}}{r^{3}}. \]

Using:

\[ \frac{dr}{d\tau} = \frac{dr}{d\phi}\frac{d\phi}{d\tau} = \frac{dr}{d\phi}\frac{L}{r^{2}} \quad\Rightarrow\quad \left(\frac{dr}{d\phi}\right)^{2} = \frac{r^{4}}{L^{2}}\,\left(\frac{dr}{d\tau}\right)^{2}, \]

we obtain:

\[ \left(\frac{dr}{d\phi}\right)^{2} = \frac{r^{4}}{L^{2}} \left[ c^{2}\left(\frac{E^{2}}{c^{4}} - 1\right) + c^{2}\frac{R_s}{r} - \frac{L^{2}}{r^{2}} + \frac{R_s L^{2}}{r^{3}} \right]. \]

This can be written as:

\[ \left(\frac{dr}{d\phi}\right)^{2} = \text{Newtonian terms} \;+\; \text{relativistic correction}. \]

The extra term \( \frac{R_s L^{2}}{r^{3}} \) in the effective potential is what causes the perihelion precession.

After simplification:

\[ \left(\frac{dr}{d\phi}\right)^{2} = \frac{c^{2}}{L^{2}}\left(\frac{E^{2}}{c^{4}} - 1\right) r^{4} + \frac{c^{2}R_s}{L^2} r^{3} - r^{2} + R_s r. \]

From this we obtain:

\[ d\phi = \frac{dr}{ \sqrt{ \frac{c^{2}}{L^{2}}\left(\frac{E^{2}}{c^{4}} - 1\right) r^{4} + \frac{c^{2}R_s}{L^2} r^{3} - r^{2} + R_s r }}. \]

4.4.5 Precession of the orbit

For a closed orbit the radial motion must be bounded, i.e. \(dr/d\phi = 0\) at two points: the perihelion \(P\) and the aphelion \(A\). But for a non‑closed orbit (precession), the angular shift between perihelion \(P\) and aphelion \(A\) is:

\[ \phi_{A} - \phi_{P} = \int_{P}^{A} \frac{dr}{ \sqrt{ \frac{c^{2}}{L^{2}} \left(\frac{E^{2}}{c^{4}} - 1\right) r^{4} + \frac{c^{2} R_{s}}{L^2} r^{3} - r^{2} + R_{s} r } } \tag{35} \]

To express \(E\) and \(L\) in terms of \(A\), \(P\), and \(R_s\), we impose \(\frac{dr}{d\phi}=0\) at \(r=A\) and \(r=P\). This yields the following equations:

\[ c^{2}\left(\frac{E^{2}}{c^{4}} - 1\right)A^{4} + L^{2}(- A^{2}+ R_{s}A) = -c^{2}R_{s}A^{3} \tag{36} \] \[ c^{2}\left(\frac{E^{2}}{c^{4}} - 1\right)P^{4} + L^{2}(- P^{2}+ R_{s}P) = -c^{2}R_{s}P^{3} \tag{37} \]

By suitable combinations and subtractions of these equations, we can express \(\frac{E^{2}}{c^{4}} - 1\) and \(L^{2}\) entirely in terms of \(A\), \(P\), and \(R_s\). (Details in the original derivation.)

Multiply (36) by \((-P^{2}+R_{s}P)\):

\[ c^{2}\left(\frac{E^{2}}{c^{4}} - 1\right)A^{4}(-P^{2}+R_{s}P) + L^{2}(-A^{2}+R_{s}A)(-P^{2}+R_{s}P) = -c^{2}R_{s}A^{3}(-P^{2}+R_{s}P) \]

Multiply (37) by \((-A^{2}+R_{s}A)\):

\[ c^{2}\left(\frac{E^{2}}{c^{4}} - 1\right)P^{4}(-A^{2}+R_{s}A) + L^{2}(-P^{2}+R_{s}P)(-A^{2}+R_{s}A) = -c^{2}R_{s}P^{3}(-A^{2}+R_{s}A) \]

Subtract the two equations:

\[ c^{2}\left(\frac{E^{2}}{c^{4}} - 1\right) \left[ A^{4}(-P^{2}+R_{s}P) - P^{4}(-A^{2}+R_{s}A) \right] = -c^{2}R_{s}A^{3}(-P^{2}+R_{s}P) + c^{2}R_{s}P^{3}(-A^{2}+R_{s}A) \]

Thus:

\[ c^{2}\left(\frac{E^{2}}{c^{4}} - 1\right) = \frac{ -c^{2}R_{s}A^{3}(-P^{2}+R_{s}P) + c^{2}R_{s}P^{3}(-A^{2}+R_{s}A) }{ A^{4}(-P^{2}+R_{s}P) - P^{4}(-A^{2}+R_{s}A) } \]

Therefore:

\[ \frac{E^{2}}{c^{4}} - 1 = \frac{ -R_{s}A^{3}(-P^{2}+R_{s}P) + R_{s}P^{3}(-A^{2}+R_{s}A) }{ [A^{4}(-P^{2}+R_{s}P) - P^{4}(-A^{2}+R_{s}A)] } \] \[ \frac{E^{2}}{c^{4}} - 1 = \frac{ -R_{s}[A^{3}(-P^{2}+R_{s}P) - P^{3}(-A^{2}+R_{s}A) }{ [A^{4}(-P^{2}+R_{s}P) - P^{4}(-A^{2}+R_{s}A)] } \] \[ \frac{E^{2}}{c^{4}} - 1 = \frac{ -R_{s}[A^{3}P(-P+R_{s}) - P^{3}A(-A+R_{s}) }{ [A^{4}P(-P+R_{s}) - P^{4}A(-A+R_{s})] } \] \[ \frac{E^{2}}{c^{4}} - 1 = \frac{ -R_{s}AP[A^{2}(-P+R_{s}) - P^{2}(-A+R_{s}) }{ AP[A^{3}(-P+R_{s}) - P^{3}(-A+R_{s})] } \] \[ \frac{E^{2}}{c^{4}} - 1 = \frac{ -R_{s}[A^{2}(-P+R_{s}) - P^{2}(-A+R_{s}) }{ [A^{3}(-P+R_{s}) - P^{3}(-A+R_{s})] } \] \[ \frac{E^{2}}{c^{4}} - 1 = \frac{ -R_{s}[-PA^{2}+R_{s}A^{2}+AP^2-R_sP^2] }{ [-PA^{3}+R_sA^3+AP^3-R_sP^3] } \] \[ \frac{E^{2}}{c^{4}} - 1 = \frac{ -R_{s}[-AP(A-P)+R_s(A^2-P^2)] }{ [-AP(A^2-P^2)+R_s(A^3-P^3)] } \] \[ \frac{E^{2}}{c^{4}} - 1 = \frac{ -R_{s}(A-P)[-AP+R_s(A+P)] }{ (A-P)[-AP(A+P)+R_s\frac{A^3-P^3}{A-P}] } \]

Intermezzo to compute \(\frac{A^3-P^3}{A-P}\):

\[ (A^2-P^2)(A+P)=A^3-AP^2+A^2P-P^3 \] \[ A^3-P^3=(A^2-P^2)(A+P)-AP(A-P) \] \[ A^3-P^3=(A-P)(A+P)(A+P)-AP(A-P) \] \[ \Rightarrow\quad \frac{A^3-P^3}{A-P}=(A+P)^2-AP \]

Now substitute:

\[ \frac{E^{2}}{c^{4}} - 1 = \frac{ -R_{s}(A-P)[-AP+R_s(A+P)] }{ (A-P)[-AP(A+P)+R_s(A+P)^2-R_sAP] } \] \[ \frac{E^{2}}{c^{4}} - 1 = \frac{ -R_{s}[-AP+R_s(A+P)] }{ [-AP(A+P+R_s)+R_s(A+P)^2] } \] \[ \frac{E^{2}}{c^{4}} - 1 = \frac{ R_{s}[-AP+R_s(A+P)] }{ [AP(A+P+R_s)-R_s(A+P)^2] } \tag{36a} \]

Now we can find \(L^2/c^2\) by applying the same method to equations (36) and (37):

\[ c^{2}\left(\frac{E^{2}}{c^{4}} - 1\right)A^{4} + L^{2}(- A^{2}+ R_{s}A) = -c^{2}R_{s}A^{3} \tag{36} \] \[ c^{2}\left(\frac{E^{2}}{c^{4}} - 1\right)P^{4} + L^{2}(- P^{2}+ R_{s}P) = -c^{2}R_{s}P^{3} \tag{37} \]

Multiply (36) by \(P^4\):

\[ c^{2}\left(\frac{E^{2}}{c^{4}} - 1\right)A^{4}P^4 + L^{2}(- A^{2}+ R_{s}A)P^4 = -c^{2}R_{s}A^{3}P^4 \]

Multiply (37) by \(A^4\):

\[ c^{2}\left(\frac{E^{2}}{c^{4}} - 1\right)A^4P^{4} + L^{2}(- P^{2}+ R_{s}P)A^4 = -c^{2}R_{s}A^4P^{3} \]

Subtract:

\[ L^{2}[(-A^2+R_sA)P^4-(-P^2+R_sP)A^4] = -c^2R_sA^3P^4+c^2R_sA^4P^3 \] \[ L^2=\frac{c^2R_sA^3P^3(-P+A)}{(-A^2+R_sA)P^4-(-P^2+R_sP)A^4} \] \[ L^2=\frac{c^2R_sA^3P^3(-P+A)}{(-A+R_s)AP^4-(-P+R_s)PA^4} \] \[ L^2=\frac{c^2R_sA^3P^3(-P+A)}{AP[(-A+R_s)P^3-(-P+R_s)A^3]} \] \[ L^2=\frac{c^2R_sA^2P^2(-P+A)}{[(-A+R_s)P^3-(-P+R_s)A^3]} \] \[ L^2=\frac{c^2R_sA^2P^2(-P+A)}{A^3P-AP^3-(A^3-P^3)R_s} \] \[ L^2=\frac{c^2R_sA^2P^2(-P+A)}{AP(A^2-P^2)-(A^3-P^3)R_s} \] \[ L^2=\frac{c^2R_sA^2P^2}{AP(A+P)-R_s(A+P)^2+APR_s} \] \[ L^2=\frac{c^2R_sA^2P^2}{AP(A+P+R_s)-R_s(A+P)^2} \] \[ \frac{L^2}{c^2}=\frac{R_sA^2P^2}{AP(A+P+R_s)-R_s(A+P)^2} \]

Finally we obtain equation (36a) above and the expression for \(L^2/c^2\):

\[ \frac{E^2}{c^4}-1=\frac{-APR_s+(A+P)R_s^2}{AP(A+P+R_s)-R_s(A+P)^2} \] \[ \frac{L^2}{c^2}=\frac{A^2P^2R_s}{AP(A+P+R_s)-R_s(A+P)^2} \]

We now introduce the variable:

\[ D=\frac{AP}{A+P} \]

to further simplify the expressions. This has the dimension of length. Then the expressions for \(E^2-1\) and \(L^2\) become:

We previously found:

\[ \frac{E^{2}}{c^{4}} - 1 =\frac{ -\frac{R_{s}}{AP} + \frac{R_{s}^{2}}{D\,AP} }{ \frac{1}{D} + \frac{R_{s}}{AP} - \frac{R_{s}}{D^{2}} } \] \[ \frac{L^{2}}{c^{2}} = \frac{R_{s}}{ \frac{1}{D} + \frac{R_{s}}{AP} - \frac{R_{s}}{D^{2}} } \]

Thus:

\[ \frac{\frac{L^{2}}{c^{2}}}{ \frac{E^{2}}{c^{4}} - 1 } = \frac{R_{s}}{ -\frac{R_{s}}{AP} + \frac{R_{s}^{2}}{D\,AP} } = \frac{AP}{- 1 + R_{s}/D} \] \[ \frac{ \frac{L^{2}}{c^{2}AP} }{ 1 - \frac{E^{2}}{c^{4}} } = \frac{1}{1 - R_{s}/D} \tag{38} \]

We want an expression for \(\varepsilon\), the third non‑zero root of:

\[ \frac{E^{2}/c^{4} - 1}{L^{2}/c^{2}} r^{4} + \frac{R_{s}}{L^2/c^2} r^{3} - r^{2} + R_{s} r = 0 \] \[ \frac{E^2/c^4-1}{L^2/c^2}\left[r^4+\frac{R_s}{E^2/c^4-1}r^3- \frac{L^2/c^2}{E^2/c^4-1}r^2+\frac{L^2/c^2}{E^2/c^4-1}R_sr \right]=0 \]

This gives the three non‑zero roots: \(A\), \(P\), and \(\varepsilon\).

The full expression becomes:

\[ \frac{E^2/c^4-1}{L^2/c^2} (r-A)(r-P)(r-\varepsilon) r \]

Expanding the four factors:

\[ \frac{E^2/c^4-1}{L^2/c^2}\left[ r^{4} - (A+P+\varepsilon)\, r^{3} + \{AP + \varepsilon(A + P)\}\, r^{2} - \varepsilon AP\, r \right] \]

We know that the sum of the three non‑zero roots equals \(\dfrac{R_{s}}{E^{2}/c^{4} - 1}\) (the coefficient of \(r^{3}\) in the standard polynomial form). Thus:

\[ -(A+P+\varepsilon) = \frac{R_{s}}{E^{2}/c^{4} - 1} \]

This allows us to relate the roots \(A\), \(P\), and \(\varepsilon\) in terms of \(R_s\), the Schwarzschild radius, and the energy and angular momentum terms.

From the above we know that:

\[ \frac{E^2}{c^4}-1=\frac{R_s[-AP+(A+P)R_s]}{AP(A+P+R_s)-R_s(A+P)^2} \]

So we substitute this into the previous relation:

\[ A + P + \varepsilon = \frac{R_{s}}{E^{2}/c^{4} - 1} \] \[ A + P + \varepsilon = R_s\frac{-AP(A+P+R_s)+R_s(A+P)^2}{R_s[-AP+(A+P)R_s]} = \frac{-AP(A+P+R_s)+R_s(A+P)^2}{-AP+(A+P)R_s} \] \[ \varepsilon = \frac{-AP(A+P+R_s)+R_s(A+P)^2}{-AP+(A+P)R_s} - (A+P) \] \[ = \frac{-AP(A+P+R_s)+R_s(A+P)^2+AP(A+P)-(A+P)^2R_s}{-AP+(A+P)R_s} \] \[ \varepsilon = \frac{-AP(A+P+R_s)+AP(A+P)}{-AP+(A+P)R_s} = \frac{-APR_s}{-AP+(A+P)R_s} = \frac{R_s}{1-\frac{(A+P)R_s}{AP}} = \frac{R_s}{1-\frac{R_s}{D}} \]

Which gives:

\[ \varepsilon = R_{s}\left(1 - \frac{R_{s}}{D}\right) \tag{39} \]

We can now approximate (35) by writing

\[ \frac{E^2/c^4-1}{L^2/c^2}r^4 + \frac{R_s}{L^2/c^2}r^3 - r^2 + R_s r = \frac{E^2/c^4-1}{L^2/c^2} (r-A)(r-P)(r-\varepsilon)\, r \] \[ = \frac{1-E^2/c^4}{L^2/c^2} (A-r)(r-P)(r-\varepsilon)\, r \]

Using equation (35) we obtain:

\[ \phi_{A} - \phi_{P} = \sqrt{\frac{L^{2}/c^{2}}{1 - E^{2}/c^{4}}} \int_{P}^{A} \frac{1}{\sqrt{r(A-r)(r-P)(r-\varepsilon)}}\,dr \] \[ = \sqrt{\frac{L^{2}/c^{2}}{1 - E^{2}/c^{4}}} \int_{P}^{A} \frac{1}{\sqrt{r^{2}(A-r)(r-P)\left(1-\frac{\varepsilon}{r}\right)}}\,dr \] \[ = \sqrt{\frac{L^{2}/c^{2}}{1 - E^{2}/c^{4}}} \int_{P}^{A} \frac{1}{r\sqrt{(A-r)(r-P)}} \left(1-\frac{\varepsilon}{r}\right)^{-1/2}\,dr \]

Now we use the Taylor expansion

\[ \left(1-\frac{\varepsilon}{r}\right)^{-1/2} \approx 1 + \frac{\varepsilon}{2r}, \]

with an error term bounded by:

\[ |\varepsilon| \leq \frac{3}{8}\left(1-\frac{\varepsilon}{r}\right)^{-5/2} \left(\frac{\varepsilon}{r}\right)^{2} \leq \frac{3}{8}\left(1-\frac{\varepsilon}{A}\right)^{-5/2} \left(\frac{\varepsilon}{P}\right)^{2} \]

which yields:

\[ \phi_{A} - \phi_{P} = \sqrt{\frac{L^{2}/c^{2}}{1 - E^{2}/c^{4}}} \int_{P}^{A} \left[ \frac{1}{r\sqrt{(A-r)(r-P)}} + \frac{\varepsilon/2}{r^{2}\sqrt{(A-r)(r-P)}} \right]dr \tag{40} \]

Note:
In Owen Biesel’s article “The Precession of Mercury’s Perihelion” (25 January 2008), p. 8, the numerator of the left-hand integral in (40) contains \(1+\varepsilon\). We believe it should be simply \(1\), and have corrected the formula accordingly.

The first integral of (40) (see sections 4.4.6.1 and 4.4.6.3) in closed form:

\[ \int_{P}^{A} \frac{1}{r\sqrt{(A-r)(r-P)}}\,dr \] \[ = \frac{1}{\sqrt{AP}} \arctan\left[ \frac{(A-r)(r-P)+r^{2}-AP} {2\sqrt{(A-r)(r-P)AP}} \right]_{P}^{A} \] \[ \to \frac{1}{\sqrt{AP}} \left[ \arctan(+\infty) - \arctan(-\infty) \right] = \frac{1}{\sqrt{AP}} \left( \frac{\pi}{2} + \frac{\pi}{2} \right) = \frac{\pi}{\sqrt{AP}} \]

The second integral of (40) (see 4.4.6.2) is more involved, but can be evaluated in closed form:

\[ \int_{P}^{A} \frac{\varepsilon/2}{r^{2}\sqrt{(A-r)(r-P)}}\,dr = \frac{\pi\,\varepsilon/2}{2\sqrt{AP}} \frac{A+P}{AP} = \frac{1}{\sqrt{AP}} \frac{\pi\varepsilon}{4D} \]

Recognizing that:

\[ \frac{L^{2}}{c^{2}} \frac{AP}{1 - E^{2}/c^{4}} = \frac{1}{1 - R_{s}/D} \]

and:

\[ \varepsilon = \frac{R_{s}}{1 - R_{s}/D} \]

(see (38) and (39) above), we find:

\[ \phi_{A} - \phi_{P} = \frac{1}{\sqrt{AP}} \pi \sqrt{\frac{L^{2}/c^{2}}{1 - E^{2}/c^{4}}} + \frac{1}{\sqrt{AP}} \frac{\pi\varepsilon}{4D} \sqrt{\frac{L^{2}/c^{2}}{1 - E^{2}/c^{4}}} \] \[ = \pi \sqrt{\frac{L^{2}/c^{2}}{AP(1 - E^{2}/c^{4})}} + \frac{\pi\varepsilon}{4D} \sqrt{\frac{L^{2}/c^{2}}{AP(1 - E^{2}/c^{4})}} \] \[ = \pi\sqrt{\frac{1}{1 - R_{s}/D}} + \frac{\pi\varepsilon}{4D} \sqrt{\frac{1}{1 - R_{s}/D}} \] \[ = \frac{\pi}{\sqrt{1 - R_{s}/D}} \left( 1 + \frac{\varepsilon}{4D} \right) \] \[ = \frac{\pi}{\sqrt{1 - R_{s}/D}} \left( 1 + \frac{1}{4D}\frac{R_{s}}{1 - R_{s}/D} \right) \] \[ = \frac{\pi}{\sqrt{1 - R_{s}/D}} \left( 1 + \frac{1}{4}\frac{R_{s}/D}{1 - R_{s}/D} \right) \]

With the observed values \(A\text{(aphelion)} = 69.8 \cdot 10^{6}\,\text{km}\), and \(P\text{(perihelion)} = 46.0 \cdot 10^{6}\,\text{km}\), we obtain:

\[ D = 27.7 \cdot 10^{6}\,\text{km}, \qquad R_{s} = \frac{2GM}{c^{2}} = 2.95\,\text{km} \]

And we can approximate the term as:

\[ \frac{\pi}{\sqrt{1 - R_{s}/D}} \left( 1 + \frac{1}{4}\frac{R_{s}/D}{1 - R_{s}/D} \right) \approx \pi + 2.512 \cdot 10^{-7} \]

This yields a reliable estimate of \(\phi_{A} - \phi_{P}\) (half an orbit, in radians).

This gives:

\[ \Delta \phi = 2.512 \cdot 10^{-7} \text{ radians (half orbit)} \]

And:

\[ \Delta \phi = 5.024 \cdot 10^{-7} \text{ radians (full orbit)} \]

The orbital period of Mercury is 87.969 days, so Mercury completes 415.2 revolutions per century. Since there are \( 360 \cdot 60 \cdot 60/2\pi \) arcseconds per radian, we find that the perihelion of Mercury shifts by:

\[ \Delta \phi = 5.024 \cdot 10^{-7} \cdot \frac{360 \cdot 60 \cdot 60}{2\pi} \cdot \text{415.2} = \text{43.027 arcseconds per century} \] \[ \Delta \phi = \mathbf{43.027\ arcseconds\ per\ century.} \]

Note:

According to Asaf Pe’er, for a small deflection angle, the result (see equation 6 in section 4.7) is:

\[ \delta \phi_{\text{prec}} = \frac{6\pi G M_{\text{sun}}}{c^{2} a (1 - \varepsilon^{2})} \tag{41} \]

For Mercury:

\[ \Delta \phi = \frac{6\pi G M}{a(1 - e^{2})c^{2}} = 5.02 \times 10^{-7}\ \text{rad per orbit} \]

To compute the precession per century:

\[ \Delta \phi = 5.02 \times 10^{-7} \times 100 \times \text{365.2588} \times \frac{360 \times 60 \times 60}{2\pi} \] \[ \Delta \phi = \mathbf{43''} \text{ arcseconds per century.} \]

This leads to the same result.

This gives us the exact relation for the precession angle of Mercury’s orbit, as described in the result of 43.027 arcseconds per century.

4.4.6 Conclusion

The deviation of Mercury’s orbit due to general relativity is determined by the additional curvature terms in equation (35). The actual precession per orbit can be computed from the deviation of the integral \(\Delta \phi\) relative to \(2\pi\). This theoretical prediction matches the observed deviation of approximately 43 arcseconds per century, an effect that cannot be explained by Newtonian mechanics.

4.4.6.1 Verification of the first integral

Verification of the integrand:

\[ \frac{d}{dr} \left[ \frac{1}{\sqrt{AP}} \arctan \left( \frac{(A - r)(r - P)+r^2-AP} {2\sqrt{(A-r)(r-P)AP}} \right) \right] \stackrel{?}{=} \frac{1}{r\sqrt{(A - r)(r - P)}} \]

We know that:

\[ \frac{d}{dx} \arctan x = \frac{1}{1 + x^{2}} \]

Therefore:

\[ \frac{1}{\sqrt{AP}} \frac{d}{dr} \left[ \arctan\left( \frac{(A-r)(r-P)+r^{2}-AP}{2\sqrt{(A-r)(r-P)AP}} \right) \right] \]

\[ = \frac{1}{\sqrt{AP}} \frac{1}{ 1+\left( \frac{(A-r)(r-P)+r^{2}-AP}{2\sqrt{(A-r)(r-P)AP}} \right)^{2} } \frac{d}{dr} \left[ \frac{(A-r)(r-P)+r^{2}-AP}{2\sqrt{(A-r)(r-P)AP}} \right] \]

\[ = \frac{1}{\sqrt{AP}} \frac{4(A-r)(r-P)AP}{ 4(A-r)(r-P)AP + \left[(A-r)(r-P)+r^{2}-AP\right]^{2} } \frac{d}{dr} \left[ \frac{(A-r)(r-P)+r^{2}-AP}{2\sqrt{(A-r)(r-P)AP}} \right] \]

\[ = \frac{1}{\sqrt{AP}} \frac{4(A-r)(r-P)AP}{ 4(A-r)(r-P)AP + \left[(A-r)(r-P)+r^{2}-AP\right]^{2} }\times \] \[ \times \left[ \frac{-(r-P)+(A-r)+2r}{2\sqrt{(A-r)(r-P)AP}} - \frac{ AP\left[(A-r)(r-P)+r^{2}-AP\right]\left[-(r-P)+(A-r)\right] }{ 4\left[(A-r)(r-P)AP\right]^{3/2} } \right] \]

\[ = \frac{1}{\sqrt{AP}} \frac{4(A-r)(r-P)AP}{ 4(A-r)(r-P)AP + \left[(A-r)(r-P)+r^{2}-AP\right]^{2} } \times \] \[ \times \left[ \frac{A+P}{2\sqrt{(A-r)(r-P)AP}} - \frac{ AP\left[(A-r)(r-P)+r^{2}-AP\right](Ar-2AP+rP) }{ 4\left[(A-r)(r-P)AP\right]^{3/2} } \right] \]

\[ = \frac{1}{\sqrt{AP}} \frac{4(A-r)(r-P)AP}{ 4(A-r)(r-P)AP + \left[(A-r)(r-P)+r^{2}-AP\right]^{2} } \frac{1}{\sqrt{(A-r)(r-P)AP}} \times \] \[ \times \left[ \frac{2(A+P)}{4} - \frac{ AP(Ar-2AP+rP)(P+A-2r) }{ 4(A-r)(r-P)AP } \right] \]

\[ = \frac{1}{\sqrt{AP}} \frac{ 2(A+P)(A-r)(r-P)AP - AP(Ar-2AP+rP)(P+A-2r) }{ 4(A-r)(r-P)AP + \left[(A-r)(r-P)+r^{2}-AP\right]^{2} } \times \] \[ \times \frac{1}{\sqrt{(A-r)(r-P)AP}} \]

\[ = \frac{1}{AP\sqrt{(A-r)(r-P)}} \times \] \[ \times \frac{ 2(A+P)(A-r)(r-P)AP - AP(Ar-2AP+rP)(P+A-2r) }{ 4(A-r)(r-P)AP + \left[(A-r)(r-P)+r^{2}-AP\right]^{2} } \]

\[ = \frac{1}{\sqrt{(A-r)(r-P)}} \times \] \[ \times \frac{ 2(A+P)(A-r)(r-P) - (Ar-2AP+rP)(P+A-2r) }{ 4(A-r)(r-P)AP + \left[(A-r)(r-P)+r^{2}-AP\right]^{2} } \]

\[ = \frac{1}{\sqrt{(A-r)(r-P)}} \times \] \[ \times \frac{ (2A^{2}-2Ar+2AP-2Pr)(r-P) - (6APr-2AP^{2}+P^{2}r+A^{2}r-2A^{2}P-2Ar^{2}-2Pr^{2}) }{ 4A^{2}Pr - 4APr^{2} - 4A^{2}P^{2} + 4AP^{2}r + (Ar-2AP+Pr)^{2} } \]

\[ = \frac{1}{\sqrt{(A-r)(r-P)}} \times \] \[ \times \frac{ A^{2}r - 2APr + P^{2}r }{ 4A^{2}Pr - 4APr^{2} - 4A^{2}P^{2} + 4AP^{2}r + (Ar-2AP+Pr)^{2} } \]

\[ = \frac{1}{\sqrt{(A-r)(r-P)}} \times \] \[ \times \frac{ r(A^{2}-2AP+P^{2}) }{ 4A^{2}Pr - 4APr^{2} - 4A^{2}P^{2} + 4AP^{2}r + A^{2}r^{2} + 4A^{2}P^{2} + P^{2}r^{2} - 4A^{2}Pr + 2APr^{2} - 4AP^{2}r } \]

\[ = \frac{1}{\sqrt{(A-r)(r-P)}} \frac{ r(A-P)^{2} }{ r^{2}(A-P)^{2} } = \frac{1}{r\sqrt{(A-r)(r-P)}} \]

This confirms that the integration of the integrand is correct!

4.4.6.2 Evaluation of the second integral

We have derived the expression for the second integral:

The general form is:

\[ \int \frac{1}{x^{2}} \sqrt{a x^{2} + b x + c} \, dx = - \frac{\sqrt{a x^{2} + b x + c}}{c x} - \frac{b}{2c} \int \frac{dx}{\sqrt{a x^{2} + b x + c}} \]

(See also the next section for the evaluation of the integral on the right-hand side.)

\[ \int \frac{1}{x^{2}} \sqrt{a x^{2} + b x + c} \, dx = - \frac{\sqrt{a x^{2} + b x + c}}{c x} - \frac{b}{2c\sqrt{-c}}\arcsin{\frac{bx+2c}{|x|\sqrt{b^{2} -4ac}}}\, ,\quad (c<0) \]

Now with \[ a = -1, \quad b = A + P, \quad c = -AP \] we obtain:

\[ \phi_A - \phi_P =\int_P^A \frac{\varepsilon/2}{r^2\sqrt{(A - r)(r - P)}} \, dr = \int_P^A \frac{\varepsilon/2}{r^2\sqrt{(-r^2+(A+P)r-AP}} \, dr \] \[ = -\varepsilon/2 \left[ \frac{\sqrt{-r^2+(A+P)r-AP}}{-APr} \right]_{P}^{A} +\varepsilon/2\frac{A+P}{2AP\sqrt{AP}} \left[ \arcsin \left( \frac{(A+P)r - 2AP} {|r| \sqrt{(A+P)^2 - 4AP}} \right) \right]_P^A \] \[ = 0 + \frac{\varepsilon}{2} \frac{A+P}{2AP\sqrt{AP}} \left[ \arcsin \left( \frac{(A+P)A - 2AP} {|A|\sqrt{(A+P)^2 - 4AP}} \right) - \arcsin \left( \frac{(A+P)P - 2AP} {|P| \sqrt{(A+P)^2 - 4AP}} \right) \right] \] \[ = \frac{\varepsilon}{2} \frac{A+P}{2AP\sqrt{AP}} \left[ \arcsin \left( \frac{(A-P)A}{|A|(A-P)} \right) - \arcsin \left( \frac{(P-A)P}{|P|(A-P)} \right) \right] \] \[ = \frac{\varepsilon}{2} \frac{A+P}{2AP\sqrt{AP}} \left[ \arcsin(1) - \arcsin(-1) \right] \] \[ = \frac{\varepsilon}{2} \frac{A+P}{2AP\sqrt{AP}} \left( \frac{\pi}{2} - \left(-\frac{\pi}{2}\right) \right) \] \[ = \frac{\varepsilon}{2} \frac{\pi(A+P)}{2AP\sqrt{AP}} = \frac{\pi \varepsilon}{4D\sqrt{AP}} \]

Thus:

\[ \phi_{A} - \phi_{P} = \frac{\pi \varepsilon}{4D\sqrt{AP}} \]

This is in exact agreement with the earlier calculations.

4.4.6.3 Alternative solution for integral 1

According to standard integral formulas, we have:

\[ \int \frac{dx}{x \sqrt{a x^{2} + b x + c}} = \frac{1}{\sqrt{-c}} \arcsin \left( \frac{b x + 2c}{\sqrt{b^{2} - 4ac}} \right)+C, \qquad (c < 0) \]

Hence:

\[ \phi_A - \phi_P =\int_P^A \frac{1}{r\sqrt{(A - r)(r - P)}} \, dr = \int_P^A \frac{1}{r\sqrt{(-r^2+(A+P)r-AP}} \, dr \] \[ \frac{1}{\sqrt{AP}}\arcsin\left[\frac{(A+P)r-2AP}{|r|\sqrt{(A+P)^2-4AP}}\right]_P^A \] \[ \frac{1}{\sqrt{AP}} \left[ \arcsin\frac{(A+P)A-2AP}{|A|\sqrt{(A+P)^2-4AP}} - \arcsin\frac{(A+P)P-2AP}{|P|\sqrt{(A+P)^2-4AP}} \right] \] \[ \frac{1}{\sqrt{AP}} \left[ \arcsin\frac{(A-P)A}{|A|(A-P)} - \arcsin\frac{(P-A)P}{|P|(A-P)} \right] \] \[ \frac{1}{\sqrt{AP}} \left[ \arcsin(1) - \arcsin(-1) \right] \] \[ \frac{1}{\sqrt{AP}} \left[ \frac{\pi}{2} - \left( \frac{-\pi}{2} \right) \right] =\frac{\pi}{\sqrt{AP}} \]

Thus:

\[ \phi_{A} - \phi_{P} = \frac{\pi}{\sqrt{AP}} \]

4.4.6.4 Detailed calculation of the orbital period

From:

\[ L = r^{2} \frac{d\phi}{d\tau} \quad \Rightarrow \quad d\tau = \frac{r^{2}}{L} d\phi \]

it follows for the period:

\[ T = \int d\tau = \int_{0}^{2\pi} \frac{r^{2}}{L} d\phi \]

Using equation (40):

\[ d\phi = \sqrt{\frac{L^{2}/c^2}{1-E^2/c^4}} \left[ \frac{1}{r\sqrt{(A - r)(r - P)}} + \frac{\varepsilon/2}{r^2\sqrt{(A - r)(r - P)}} \right] dr \tag{40} \] \[ d\tau = \frac{r^2}{L} \sqrt{\frac{L^{2}/c^2}{1-E^2/c^4}} \left[ \frac{1}{r\sqrt{(A - r)(r - P)}} + \frac{\varepsilon/2}{r^2\sqrt{(A - r)(r - P)}} \right] dr \] \[ d\tau = \frac{1}{L} \sqrt{\frac{L^{2}/c^2}{1-E^2/c^4}} \left[ \frac{r}{\sqrt{(A - r)(r - P)}} + \frac{\varepsilon/2}{\sqrt{(A - r)(r - P)}} \right] dr \] \[ \Delta T =\int d\tau = \frac{2}{L} \sqrt{\frac{L^{2}/c^2}{1-E^2/c^4}} \int_P^A \left[ \frac{r}{\sqrt{(A - r)(r - P)}} + \frac{\varepsilon/2}{\sqrt{(A - r)(r - P)}} \right] dr \]

First, we evaluate the left integral:

\[ \int_P^A \frac{r}{\sqrt{(A - r)(r - P)}}dr = \int_P^A \frac{r}{\sqrt{-r^2+(A+P)r-AP}} dr \tag{41} \]

According to the table of integrals (Wikipedia):

\[ \int \frac{x}{\sqrt{ax^2+bx+c}}dx=\frac{\sqrt{ax^2+bx+c}}{a}- \frac{b}{2a}\int \frac{1}{\sqrt{ax^2+bx+c}}dx \tag{42} \]

and:

\[ \int \frac{1}{\sqrt{ax^2+bx+c}}dx=\frac{1}{\sqrt{-a}}\arcsin\frac{-2ax-b}{\sqrt{b^2-4ac}}+C, \quad (a<0) \]

To rewrite the left integral into this form:

\[ \int_P^A \frac{r}{\sqrt{-r^{2} + (A + P)r-AP}} \, dr= \] \[ =\left[ \frac{\sqrt{-r^2+(A+P)r-AP}}{-1}\right]_P^A -\frac{A+P}{-2} \int_P^A \frac{1}{\sqrt{-r^{2} + (A + P)r-AP}} \, dr= \] \[ =-\sqrt{A^{2} + (A + P)A-AP}+\sqrt{-P^2+(A+P)P-AP}+\frac{A+P}{2} \int_P^A \frac{1}{\sqrt{-r^{2} + (A + P)r-AP}} \, dr= \] \[ =-0+0+ \frac{A+P}{-2} \int_P^A \frac{1}{\sqrt{-r^{2} + (A + P)r-AP}} \, dr \]

Now only the integral:

\[ \int_P^A \frac{1}{\sqrt{-r^{2} + (A + P)r-AP}} \, dr =\left[\arcsin \frac{2r-(A+P)}{\sqrt{(A+P)^2-4AP}}+C \right]_P^A= \] \[ =\arcsin \frac{2A-(A+P)}{\sqrt{(A+P)^2-4AP}}+C-\arcsin \frac{2P-(A+P)}{\sqrt{(A+P)^2-4AP}}+C= \] \[ =\arcsin \frac{A-P}{A-P}-\arcsin \frac{-A+P}{A-P} \] \[ \frac {\pi}{2}+\frac{\pi}{2}=\pi \]

Thus, the left integral yields:

\[ \frac{(A+P)\pi}{2} \]

The right integral yields:

\[ \pi\frac{\varepsilon}{2} \]

The sum is:

\[ \frac{\pi}{2}(A+P+\varepsilon) \]

Hence, the total integral for one full revolution is:

\[ \Delta T = \frac{2}{L} \sqrt{\frac{L^{2}/c^{2}}{1 - E^{2}/c^{4}}} \frac{\pi}{2}(A+P+\varepsilon) \]

With:

\[ \varepsilon = \frac{R_{s}}{1 - R_{s}/D} \] \[ \Delta T = \frac{2}{L} \sqrt{\frac{L^{2}/c^{2}}{1 - E^{2}/c^{4}}} \frac{\pi}{2} \left( A+P + \frac{R_{s}}{1 - R_{s}/D} \right) \] \[ \Delta T = 2\pi \frac{A+P}{2L} \sqrt{\frac{L^{2}/c^{2}}{1 - E^{2}/c^{4}}} \left( 1 + \frac{R_{s}}{(A+P)(1 - \frac{R_s}{D})} \right) \] \[ \Delta T = 2\pi \frac{A+P}{2L} \sqrt{\frac{AP}{1 - R_s/D}} \left( 1 + \frac{R_{s}}{(A+P)(1 - \frac{R_s}{D})} \right) \]

For Mercury:

The time for one revolution is:

\[ \Delta T = 7\,598\,744\,\mathrm{s} \Rightarrow \frac{7\,598\,744}{24\cdot 3600}= 87.95\,\text{days} \]

Derived in chapter Schwarzschild Approximation 4.8.2, equation 2d, the instantaneous orbital speed of Mercury as a function of \(\phi\) is:

\[ v=\left[\frac{GM_{\text{sun}}}{a(1-e^2)}(1+2e\cos[\phi(1-\epsilon)]+e^2)\right]^2 \tag{42a} \]

4.4.7 Physical meaning

4.4.8 Core insight

4.5 Experiment 5 – Shapiro Time Delay

Introduction and Physical Idea

The Shapiro time delay is the effect in which a light signal (or radar pulse) that travels near a massive object (such as the Sun) takes longer to arrive than expected based on a straight-line path in flat spacetime. This is a direct consequence of the curvature of spacetime caused by mass, as predicted by general relativity.

History:
The effect was predicted in 1964 by Irwin Shapiro and has since been confirmed in many experiments, including radar reflections sent to Venus and Mercury and measuring the return time.

In the Shapiro experiment, radar signals were sent from Earth to a planet located on the opposite side of the Sun. These signals reflected back to Earth. According to general relativity, the signal that grazes the Sun will be bent by the Sun’s gravitational field — or more precisely, the Sun’s mass distorts spacetime such that the signal follows a “straight curved” path.

vector_4_5_5
Figure 1: Radar reflection of photons from Earth to a planet and back. The left image shows the actual path (exaggerated). The right image shows the Euclidean form.
(From Tests of General Relativity: A Review by Estelle Asmodelle (Asmodelle, 2017))

To define the Shapiro delay, we assume that Earth and the planet are stationary, while the total round‑trip time of the radar signal is Δt in coordinate time. The value of t must be expressed in terms of r along the entire path, where r₀ is the closest approach to the Sun.

4.5.1 Derivation Based on the Schwarzschild Metric

For the calculation of the Shapiro delay, the Schwarzschild metric is used:

\[ ds^{2} = c^{2} d\tau^{2} = \sigma^{2} c^{2} dt^{2} - \frac{dr^{2}}{\sigma^{2}} - r^{2} d\theta^{2} - r^{2}\sin^{2}\theta \, d\phi^{2} \]

where

\[ \sigma =\sqrt{ 1 - \frac{2GM_{\text{sun}}}{c^{2} r}} = \sqrt{1 - \frac{R_{s}}{r}} \]

and

\[ R_{s} = \frac{2GM_{\text{sun}}}{c^{2}} \]

is the Schwarzschild radius of the Sun.

We choose the reference frame such that it corresponds to the equatorial plane (\(\theta = \pi/2\)).

Then:

\[ c^2 d\tau^2 = \sigma^2 c^2 dt^2 - \frac{dr^2}{\sigma^2} - r^2 d\phi^2 \]

For photons or radar echoes, \(d\tau = 0\), so:

\[ \sigma^2 c^2 dt^2 = \frac{dr^2}{\sigma^2} + r^2 d\phi^2 \]

Differentiating with respect to the affine parameter \(\lambda\):

\[ \sigma^2 c^2 \left(\frac{dt}{d\lambda}\right)^2 = \frac{1}{\sigma^2} \left(\frac{dr}{d\lambda}\right)^2 + r^2 \left(\frac{d\phi}{d\lambda}\right)^2 \]

As derived in formula 11 of Section 4.2, the angular momentum is:

\[ L = r^2 \frac{d\phi}{d\lambda} \]

and

\[ \sigma^2 c^2 \left(\frac{dt}{d\lambda}\right)^2 = \frac{1}{\sigma^2} \left(\frac{dr}{d\lambda}\right)^2 + \frac{L^2}{r^2} \]

Multiply by \(\sigma^2\):

\[ \sigma^4 c^2 \left(\frac{dt}{d\lambda}\right)^2 = \left(\frac{dr}{d\lambda}\right)^2 + \frac{L^2}{r^2} \sigma^2 \]

Define:

\[ k^2 = \sigma^4 \left(\frac{dt}{d\lambda}\right)^2 \]

(Note: this is also \(k = E/c^2\), as seen in formula 9a of Section 4.2.)

Thus:

\[ \left(\frac{dr}{d\lambda}\right)^2 + \frac{L^2}{r^2} \sigma^2 = k^2 c^2 \]

The energy equation for a photon path in Schwarzschild geometry is therefore:

\[ \left(\frac{dr}{d\lambda}\right)^2 + \frac{L^2}{r^2} \left(1 - \frac{R_s}{r}\right) = k^2 c^2 \tag{42b} \]

As previously derived:

\[ k^2 = \sigma^4 \left(\frac{dt}{d\lambda}\right)^2 \Rightarrow \left(\frac{dt}{d\lambda}\right)^2 = \frac{k^2}{\sigma^4} \]

Using:

\[ \left(\frac{dt}{d\lambda}\right)^2 = \frac{k^2}{(1 - R_s/r)^2} \]

Now:

\[ \left(\frac{dr}{d\lambda}\right)^2 = \left(\frac{dr}{dt} \frac{dt}{d\lambda}\right)^2 = \frac{k^2}{(1 - R_s/r)^2} \left(\frac{dr}{dt}\right)^2 \tag{42c} \]

Rewrite (42b):

\[ \frac{k^2}{(1 - R_s/r)^2} \left(\frac{dr}{dt}\right)^2 + \frac{L^2}{r^2}(1 - R_s/r) = k^2 c^2 \]

Divide by \((1 - R_s/r)\):

\[ \frac{k^2}{(1 - R_s/r)^3} \left(\frac{dr}{dt}\right)^2 + \frac{L^2}{r^2} - \frac{k^2 c^2}{1 - R_s/r} = 0 \]

Then divide by \(k^2\):

\[ \frac{1}{(1 - R_s/r)^3} \left(\frac{dr}{dt}\right)^2 + \frac{L^2}{k^2 r^2} - \frac{c^2}{1 - R_s/r} = 0 \tag{43} \]

Consider now the path of a photon from Earth to another planet (e.g., Venus, with \(r_p = r_v\)), as shown in Figure 2. The photon path is clearly bent by the Sun’s gravitational field. Let \(r_0\) be the coordinate distance of closest approach; then:

\[ \left(\frac{dr}{dt}\right)_{r_0} = 0 \]

From (43) we find the relation between constants:

\[ \frac{L^2}{k^2 r^2} = \frac{c^2}{1 - R_s/r} \]

Rewriting (43):

\[ \left(\frac{dr}{dt}\right)^2 = (1 - R_s/r)^3 \left( \frac{c^2}{1 - R_s/r} - \frac{L^2 r_0^2}{k^2 r_0^2 r^2} \right) \]

\[ = (1 - R_s/r)^2 \left( c^2 - \frac{r_0^2 c^2 (1 - R_s/r)}{r^2 (1 - R_s/r_0)} \right) \]

\[ = c^2 (1 - R_s/r)^2 \left( 1 - \frac{r_0^2 (1 - R_s/r)}{r^2 (1 - R_s/r_0)} \right) \]

\[ \Rightarrow \frac{dr}{dt} = c (1 - R_s/r) \left[ 1 - \frac{r_0^2 (1 - R_s/r)}{r^2 (1 - R_s/r_0)} \right]^{1/2} \]

vector_4_5_1_1
Figure 2: Photon path from Earth to Venus, bent by the Sun.

This can be integrated to determine the time required to travel between \(r_0\) and \(r\):

\[ t(r,r_0) = \int_{r_0}^{r} \frac{1}{c(1 - R_s/r) \left[ 1 - \frac{r_0^2 (1 - R_s/r)}{r^2 (1 - R_s/r_0)} \right]^{1/2}} \, dr \]

4.5.1.1 First‑Order Approximation

Because \(R_s \ll r_0\), we may take the first‑order Taylor expansion of:

\[ \frac{1 - R_s/r}{1 - R_s/r_0} \approx \left(1 - \frac{R_s}{r}\right) \left(1 + \frac{R_s}{r_0}\right) = 1 - \frac{R_s}{r} + \frac{R_s}{r_0} - \frac{R_s^2}{r r_0} \]

Thus the integrand can be expanded to first order in \(R_s/r\):

\[ t(r,r_0) = \int_{r_0}^{r} \frac{1}{c(1 - R_s/r) \left[ 1 - \frac{r_0^2}{r^2} \left( 1 - \frac{R_s}{r} + \frac{R_s}{r_0} - \frac{R_s^2}{r r_0} \right) \right]^{1/2}} \, dr \]

Multiply numerator and denominator by \(r\):

\[ t(r,r_0) = \int_{r_0}^{r} \frac{r}{c(1 - R_s/r) \left[ r^2 - r_0^2 \left( 1 - \frac{R_s}{r} + \frac{R_s}{r_0} - \frac{R_s^2}{r r_0} \right) \right]^{1/2}} \, dr \]

\[ t(r,r_0) = \int_{r_0}^{r} \frac{r}{c(1 - R_s/r) \left[ r^2 - r_0^2 - R_s r_0 + \frac{R_s r_0^2}{r} + \frac{R_s^2 r_0}{r} \right]^{1/2}} \, dr \]

\[ t(r,r_0) = \int_{r_0}^{r} \frac{r}{c \sqrt{r^2 - r_0^2} (1 - R_s/r) \left[ 1 - \frac{R_s r_0 (1 - r_0/r - R_s/r)}{r^2 - r_0^2} \right]^{1/2}} \, dr \]

\[ t(r,r_0) = \int_{r_0}^{r} \frac{r}{c \sqrt{r^2 - r_0^2} \left[ \left(1 - \frac{2R_s}{r} + \frac{R_s^2}{r^2}\right) \left( 1 - \frac{R_s r_0 (1 - r_0/r - R_s/r)}{r^2 - r_0^2} \right) \right]^{1/2}} \, dr \]

First expand the right‑hand factor:

\[ \left(1 - \frac{2R_s}{r} + \frac{R_s^2}{r^2}\right) \left( 1 - \frac{R_s r_0 (1 - r_0/r - R_s/r)}{r^2 - r_0^2} \right) \]

\[ 1 - \frac{2R_s}{r} + \frac{R_s^2}{r^2} - \frac{R_s r_0 (1 - r_0/r - R_s/r)}{r^2 - r_0^2} + \frac{2R_s^2 r_0 (1 - r_0/r - R_s/r)}{r(r^2 - r_0^2)} - \frac{R_s^3 r_0 (1 - r_0/r - R_s/r)}{r^2(r^2 - r_0^2)} \]

Neglecting the smallest terms:

\[ \approx 1 - \frac{2R_s}{r} - \frac{R_s r_0 (1 - r_0/r)}{r^2 - r_0^2} \]

\[ \approx 1 - \frac{2R_s}{r} - \frac{R_s r_0 (r - r_0)}{r(r + r_0)(r - r_0)} \]

\[ \approx 1 - \frac{2R_s}{r} - \frac{R_s r_0}{r(r + r_0)} \]

Insert this into the denominator:

\[ t(r,r_0) = \int_{r_0}^{r} \frac{r}{c \sqrt{r^2 - r_0^2} \left[ 1 - \frac{2R_s}{r} - \frac{R_s r_0}{r(r + r_0)} \right]^{1/2}} \, dr \]

Using a first‑order Taylor expansion again:

\[ t(r,r_0) = \int_{r_0}^{r} \frac{r}{c \sqrt{r^2 - r_0^2}} \left[ 1 + \frac{R_s}{r} + \frac{R_s r_0}{2r(r + r_0)} \right] \, dr \]

This can be reduced to (see verification below):

\[ t(r,r_0) = \frac{\sqrt{r^{2} - r_{0}^{2}}}{c} + \frac{R_{s}}{c} \ln\!\left( \frac{r + \sqrt{r^{2} - r_{0}^{2}}}{r_0} \right) + \frac{R_{s}}{2c} \left( \frac{r - r_{0}}{r + r_{0}} \right)^{1/2} \]


Verification of the above formula

We can verify the above formula by taking its derivative; this must be equal to the integrand:

\[ \frac{dt(r,r_0)}{dr} = \frac{r}{c(r^2-r_0^2)^{1/2}} + \frac{R_s}{c} \frac{\left( \frac{1}{r_0}+\frac{r}{r_0(r^2-r_0^2)^{1/2}}\right)}{\frac{r+(r^2-r_0^2)^{1/2}}{r_0}} + \frac{R_s}{4c} \frac{\left(\frac{1}{r+r_0}-\frac{r-r_0}{(r+r_0)^{2}}\right)} {\left( \frac{r-r_0}{r+r_0}\right)^{1/2}} \] \[ \frac{dt(r,r_0)}{dr} = \frac{r}{c(r^2-r_0^2)^{1/2}} + \frac{R_s}{c} \frac{\left( 1+\frac{r}{(r^2-r_0^2)^{1/2}}\right)}{r+(r^2-r_0^2)^{1/2}} + \frac{R_s}{4c} \frac{\left(\frac{r+r_0-r+r_0}{(r+r_0)^{2}}\right)} {\left( \frac{r-r_0}{r+r_0}\right)^{1/2}} \] \[ \frac{dt(r,r_0)}{dr} = \frac{r}{c(r^2-r_0^2)^{1/2}} + \frac{R_s}{c} \frac{\left[ r+(r^2-r_0^2)^{1/2}\right]} {\left[r+(r^2-r_0^2)^{1/2}\right](r^2-r_0^2)^{1/2}} + \frac{R_s}{4c} \frac{\left(r+r_0-r+r_0\right)} {\left( \frac{r-r_0}{r+r_0}\right)^{1/2}(r+r_0)^{2}} \] \[ \frac{dt(r,r_0)}{dr} = \frac{r}{c(r^2-r_0^2)^{1/2}} + \frac{R_s}{c} \frac{1}{(r^2-r_0^2)^{1/2}} + \frac{R_s}{2c} \frac{r_0} {\left( \frac{r-r_0}{r+r_0}\right)^{1/2}(r+r_0)^{2}} \] \[ \frac{dt(r,r_0)}{dr} = \frac{r}{c(r^2-r_0^2)^{1/2}} + \frac{R_s}{c} \frac{1}{(r^2-r_0^2)^{1/2}} + \frac{R_s}{2c} \frac{r_0} {(r-r_0)^{1/2}(r+r_0)} \] \[ \frac{dt(r,r_0)}{dr} = \frac{r}{c(r^2-r_0^2)^{1/2}} \left[ 1+ \frac{R_s}{r}+\frac{R_s r_0}{2r(r+r_0)} \right] \]

Thus, the formula is correct!


So we have:

\[ t(r,r_{0}) = \frac{\sqrt{r^{2}-r_{0}^{2}}}{c} + \frac{R_{s}}{c} \ln\!\left( \frac{r + \sqrt{r^{2}-r_{0}^{2}}}{r_0} \right) + \frac{R_{s}}{2c} \left(\frac{r - r_{0}}{r + r_{0}}\right)^{1/2} \]

The first term on the right-hand side is exactly what we would expect if the light were to travel in a straight line. The second and third terms give the extra coordinate time required for the photon to travel along the curved path to the point \(r\).

As shown in Figure 2, if we send a radar beam to Venus and back, then the extra coordinate time relative to a straight line is:

\[ \Delta t = 2\left[ t(r_{E},r_{0}) + t(r_{V},r_{0}) - \frac{\sqrt{r_{E}^{2}-r_{0}^{2}}}{c} - \frac{\sqrt{r_{V}^{2}-r_{0}^{2}}}{c} \right] \]

As mentioned earlier, the first two terms inside these brackets represent the relativistic travel time from Earth to Venus, and the two terms on the right represent the time if the path were simply a straight line. The factor 2 is included because the photon must go to Venus and then return to Earth.

Since \(r_{E} \gg r_{0}\) and \(r_{V} \gg r_{0}\), we have:

\[ t(r_{E},r_{0}) - \frac{\sqrt{r_{E}^{2}-r_{0}^{2}}}{c} \approx \frac{R_{s}}{c} \ln\!\left( \frac{r_{E} + r_{E}}{r_{0}} \right) + \frac{R_{s}}{2c} = \frac{R_{s}}{c} \ln\!\left( \frac{2r_{E}}{r_{0}} \right) + \frac{R_{s}}{2c} \] \[ t(r_{V},r_{0}) - \frac{\sqrt{r_{V}^{2}-r_{0}^{2}}}{c} \approx \frac{R_{s}}{c} \ln\!\left( \frac{r_{V} + r_{V}}{r_{0}} \right) + \frac{R_{s}}{2c} = \frac{R_{s}}{c} \ln\!\left( \frac{2r_{V}}{r_{0}} \right) + \frac{R_{s}}{2c} \]

Summing these contributions gives:

\[ \frac{R_{s}}{c} \ln\!\left(\frac{2r_{E}}{r_{0}}\right) + \frac{R_{s}}{c} \ln\!\left(\frac{2r_{V}}{r_{0}}\right) + \frac{R_{s}}{c} = \frac{2GM}{c^{3}} \left[ \ln\!\left( \frac{4 r_{E} r_{V}}{r_{0}^{2}} \right) + 1 \right] \]

Thus, for a round trip to Venus and back, the extra coordinate time delay is:

\[ \Delta t \approx \frac{4GM_{\text{sun}}}{c^{3}} \left[ \ln\!\left( \frac{4 r_{E} r_{V}}{r_{0}^{2}} \right) + 1 \right] \]

This also shows that the time delay increases as the impact parameter \(r_{0}\) (the distance to the gravitational center) becomes smaller.

Numerical Values

The Shapiro delay is therefore small, but a clearly measurable effect.

4.5.1.2 Proper Time on Earth versus Coordinate Time

Clocks on Earth do not measure coordinate time, due to the rotation of the Earth around its own axis and the effect of the Earth's orbital motion around the Sun.
Because of the Earth's rotation, the corresponding proper time of the signal is given by: \[ \Delta \tau = \sqrt{1 - \frac{2 G M_E}{c^2 r_E}} \, \Delta t \]

The effect is therefore: \[ \Delta t - \Delta \tau = \Delta t \sqrt{1 - \frac{2 G M_E}{c^2 r_E}} \, \Delta t \] This gives: \[ \Rightarrow 6.98 \times 10^{-10} \Delta t \,\, \text{for} \,\, 252 \, \mu s \Rightarrow 1.76 \times 10^{-13} \text{ s} = 0.176\, \text{ps} \] \[ p = 10^{-12} \]

Since \( r_E \gg GM/c^2 \), and therefore \(0.176\, \text{ps} \ll 252\, \mu s\), this effect is negligible.

The effect of the Earth's orbital motion around the Sun causes a delay of 15 nanoseconds per second, as mentioned in section (4.6).
For the additional time delay \( \Delta t \approx 252\, \mu s \) from Venus, the Earth's orbital motion causes a small effect of: \(252 \times 10^{-6} \times 15 \times 10^{-9} = 3.78 \times 10^{-12}\, \text{s} = 3.78\, \text{ps}\), which can also be ignored.

4.5.2 Physical Interpretation

4.5.3 Practical Importance

4.5.4 Core Insight

The Shapiro time delay is one of the four classical experiments confirming general relativity. The effect is small but measurable and fully explained by the Schwarzschild metric.

4.6 Time Relation Between an Observer on Earth and the Center of the Sun

When considering the deflection of light or the orbits of planets around the Sun, a reference frame centered at the Sun is used, while we observe the phenomena from Earth, which rotates relative to the Sun. This section examines the time relation between an observer on Earth and the Sun’s center, including the required correction factors.

The starting point is the Schwarzschild metric, which describes spacetime around a spherically symmetric massive object:

\[ ds^2 = \sigma^2 c^2 dt^2 - \frac{dr^2}{\sigma^2} - r^2 d\theta^2 - r^2 \sin^2\theta \, d\phi^2 \]

where:

\[ \sigma = \sqrt{1 - \frac{2 G M_{\text{sun}}}{c^2 r}}, \qquad R_s = \frac{2 G M_{\text{sun}}}{c^2} \]

The coordinates \(\theta\) and \(\phi\) are the usual spherical coordinates. We restrict ourselves to the equatorial plane of the Sun, so \(\theta = \pi/2\) and the radius \(r\) is constant.

4.6.1 Simplifying the Schwarzschild Metric

For an observer on Earth, we assume the Earth moves in a circular orbit around the Sun. The observer’s time is the proper time \(d\tau\), while \(dt\) is the coordinate time in the Sun’s reference frame.

The Schwarzschild metric reduces to:

\[ ds^2 = c^2 d\tau^2 = \sigma^2 c^2 dt^2 - r^2 d\phi^2 \]

This can be rewritten as:

\[ d\tau^2 = \sigma^2 dt^2 - \frac{r^2}{c^2} \left( \frac{d\phi}{dt} \right)^2 dt^2 \]

Substituting \(\sigma\) gives:

\[ d\tau^2 = \left( 1 - \frac{R_s}{r} - \frac{r^2}{c^2} \left( \frac{d\phi}{dt} \right)^2 \right) dt^2 \]

4.6.2 Time Dilation Due to Gravity and Earth’s Orbital Motion

For an observer on Earth:

\[ d\tau^2 = \left( 1 - \frac{R_s}{r} - \frac{v^2}{c^2} \right) dt^2 \] \[ d\tau = \sqrt{ 1 - \frac{R_s}{r} - \frac{v^2}{c^2} } \, dt \]

where \(R_s\) is the Sun’s Schwarzschild radius, \(v\) is Earth’s orbital speed, and \(r\) is the Earth–Sun distance.

Numerical values:

Expanding the expression for \(d\tau\) to first order using a Taylor series:

\[ d\tau \approx \left( 1 - \frac{R_s}{2r} - \frac{v^2}{2c^2} \right) dt \]

Substituting the numerical values:

\[ d\tau = \left( 1 - 9.9 \times 10^{-10} - 5.0 \times 10^{-9} \right) dt \] \[ d\tau \approx \left( 1 - 1.5 \times 10^{-8} \right) dt \] \[ \Delta t - \Delta \tau = 1.5 \times 10^{-8} \, \Delta t \]

This corresponds to a time dilation of approximately 15 nanoseconds per second for an observer on Earth relative to Sun-centered coordinate time.

4.6.3 Correction Factor for Earth’s Gravity

Earth’s own gravity also contributes:

\[ d\tau = \sqrt{ 1 - \frac{2 G M_E}{c^2 r_E} } \, dt \]

With:

This gives:

\[ d\tau = \sqrt{1 - 1.3908 \times 10^{-9}}\, dt \approx \left( 1 - 0.6954 \times 10^{-9} \right) dt \]

For an observer at the equator, Earth’s rotation also matters. The angular velocity is given by the sidereal rotation period \(86162.4\) seconds:

\[ \frac{d\phi}{dt} = \frac{2\pi}{86162.4} = 7.2923 \times 10^{-5} \,\text{rad/s} \]

The corrected time relation becomes:

\[ d\tau=\sqrt{\left(1-\frac{R_E}{r^e}-\frac{v_E^2}{c^2}\right)}\,dt \]

Substituting the numerical values:

\[ d\tau = \sqrt{ 1 - 1.3908 \times 10^{-9} - 2.4059 \times 10^{-12} } dt \] \[ d\tau \approx \left( 1 - 0.6966 \times 10^{-9} \right) dt \]

Where:

4.6.4 Conclusion

The time relation between an observer on Earth and the center of the Sun is determined by three effects: the Sun’s gravity, Earth’s orbital velocity, and Earth’s own gravity. Together they produce a small but measurable time dilation.

4.6.5 Physical Meaning

4.7 Alternative Derivation of the Orbital Equation

According to Kepler’s first law, all planetary orbits around the Sun are elliptical. As shown in section 4.4, general relativity predicts a relativistic correction to the ellipse, explaining the perihelion precession of Mercury.

We therefore present an alternative derivation of the orbital equation for a massive particle in Schwarzschild geometry, yielding a solution closer to the classical ellipse formula.

This is: \[ r(\phi) = a\,\frac{1 - e^{2}}{1 + e \cos(\phi - \theta)} \]

This is compared with the relativistic result (see equation 3.5.5a) at the end of this section: \[ r = a\,\frac{1 - e^{2}}{1 + e \cos(\phi - \epsilon \phi)} \]

Here we see that \( \theta \) is not constant but a function of \( \phi \), changing by a factor \( \epsilon \).

From General Relativity: An Introduction for Physics by M. P. Hobson, G. Efstathiou, and A. N. Lasenby, p. 230 (Hobson, 2006).

We restrict ourselves to the equatorial plane \( \theta = \pi/2 \), so that the Schwarzschild metric for a massive particle reduces to: \[ c^{2} d\tau^{2} = c^{2}\left(1 - \frac{2GM}{c^{2} r}\right) dt^{2} - \left(1 - \frac{2GM}{c^{2} r}\right)^{-1} dr^{2} - r^{2} d\phi^{2} \]

The metric equation then becomes: \[ c^{2}\left(1 - \frac{2GM}{c^{2} r}\right) \left(\frac{dt}{d\tau}\right)^{2} - \left(1 - \frac{2GM}{c^{2} r}\right)^{-1} \left(\frac{dr}{d\tau}\right)^{2} - r^{2}\left(\frac{d\phi}{d\tau}\right)^{2} = c^{2} \]

After multiplying by \( 1 - \frac{2GM}{c^{2} r} \), we obtain: \[ c^{2}\left(1 - \frac{2GM}{c^{2} r}\right)^{2} \left(\frac{dt}{d\tau}\right)^{2} - \left(\frac{dr}{d\tau}\right)^{2} - r^{2}\left(\frac{d\phi}{d\tau}\right)^{2} \left(1 - \frac{2GM}{c^{2} r}\right) = c^{2}\left(1 - \frac{2GM}{c^{2} r}\right) \]

Rearranging: \[ \left(\frac{dr}{d\tau}\right)^{2} + r^{2}\left(\frac{d\phi}{d\tau}\right)^{2} \left(1 - \frac{2GM}{c^{2} r}\right) - c^{2}\left(1 - \frac{2GM}{c^{2} r}\right)^{2} \left(\frac{dt}{d\tau}\right)^{2} = c^{2}\left(\frac{2GM}{c^{2} r} - 1\right) \]

We now substitute the conserved quantities:

\[ \left(1 - \frac{2GM}{c^{2} r}\right)\frac{dt}{d\tau} = \frac{E}{c^{2}}, \qquad r^{2}\frac{d\phi}{d\tau} = L \]

We then obtain:

\[ \left(\frac{dr}{d\tau}\right)^{2} + \frac{L^{2}}{r^{2}}\left(1 - \frac{2GM}{c^{2} r}\right) - \frac{E^{2}}{c^{2}} = c^{2}\left(\frac{2GM}{c^{2} r} - 1\right) = \frac{2GM}{r} - c^{2} \]

\[ \left(\frac{dr}{d\tau}\right)^{2} + \frac{L^{2}}{r^{2}}\left(1 - \frac{2GM}{c^{2} r}\right) - \frac{2GM}{r} = c^{2}\left(\frac{E^{2}}{c^{4}} - 1\right) \]

\[ \left(\frac{dr}{d\tau}\right)^{2} + \frac{L^{2}}{r^{2}} = c^{2}\left(\frac{E^{2}}{c^{4}} - 1\right) + \frac{2GM}{r} + \frac{2GM L^{2}}{c^{2} r^{3}} \]

Now: \[ \frac{dr}{d\tau} = \frac{dr}{d\phi}\frac{d\phi}{d\tau} = \frac{L}{r^{2}}\frac{dr}{d\phi} \]

This substituted into the previous equation gives:

\[ \left(\frac{L}{r^{2}}\frac{dr}{d\phi}\right)^{2} + \frac{L}{r^{2}} = c^{2}\left(\frac{E^{2}}{c^{4}} - 1\right) + 2GM r + \frac{2GM L^{2}}{c^{2} r^{3}} \]

Dividing by \(L^{2}\):

\[ \left(\frac{1}{r^{2}}\frac{dr}{d\phi}\right)^{2} + \frac{1}{r^{2}} = \frac{c^{2}}{L^{2}}\left(\frac{E^{2}}{c^{4}} - 1\right) + \frac{2GM}{L^{2} r} + \frac{2GM}{c^{2} r^{3}} \]

Now substitute \(u = 1/r\): \[ \frac{du}{d\phi} = \frac{du}{dr}\frac{dr}{d\phi} = -\frac{1}{r^{2}}\frac{dr}{d\phi} \quad\Rightarrow\quad \frac{1}{r^{2}}\frac{dr}{d\phi} = -\frac{du}{d\phi} \]

Thus the equation becomes:

\[ \left(\frac{du}{d\phi}\right)^{2} + u^{2} = \frac{c^{2}}{L^{2}}\left(\frac{E^{2}}{c^{4}} - 1\right) + \frac{2GM}{L^{2}}u + \frac{2GM}{c^{2}}u^{3} \]

We differentiate this equation with respect to \( \phi \) to obtain: \[ 2\frac{du}{d\phi}\frac{d^{2}u}{d\phi^{2}} + 2u\frac{du}{d\phi} = \frac{2GM}{L^{2}}\frac{du}{d\phi} + 6\frac{GM}{c^{2}}u^{2}\frac{du}{d\phi} \]

Dividing by \(2\,\frac{du}{d\phi}\) (assuming \(\frac{du}{d\phi} \neq 0\)):

\[ \frac{d^{2}u}{d\phi^{2}} + u = \frac{GM}{L^{2}} + \frac{3GM}{c^{2}}u^{2} \tag{44} \]

If we temporarily ignore the last term, we obtain the Newtonian equation, whose solution is: \[ u = \frac{GM}{L^{2}}\left(1 + e\cos\phi\right) \quad\text{or}\quad r = \frac{L^{2}}{GM}\,\frac{1}{1 + e\cos\phi} \tag{45} \]

This describes an ellipse, where the parameter \(e\) represents the eccentricity of the orbit. The distance to the closest point (perihelion) is: \[ r_{1} = a(1 - e) \] and the farthest point (aphelion) is: \[ r_{2} = a(1 + e). \]

Derived from (45) and substituting again with \( r = 1/u \): \[ r = \frac{L^{2}}{GM}\,\frac{1}{1 + e\cos\phi} \quad\Rightarrow\quad r_{\max} = \frac{L^{2}}{GM}\,\frac{1}{1 - e}, \qquad r_{\min} = \frac{L^{2}}{GM}\,\frac{1}{1 + e}. \]

The semi-major axis \(a\) is then: \[ a = \frac{r_{\max} + r_{\min}}{2} = \frac{L^{2}}{2GM} \left( \frac{1}{1 - e} + \frac{1}{1 + e} \right) = \frac{L^{2}}{2GM} \left( \frac{1 + e + 1 - e}{1 - e^{2}} \right) \]

Thus the equation of motion requires that the semi-major axis is: \[ a = \frac{L^{2}}{GM}\,\frac{1}{1 - e^{2}} \tag{46} \]

Thus: \[ r_{\max} = \frac{L^{2}}{GM}\,\frac{1}{1 - e} = a(1 + e), \qquad r_{\min} = \frac{L^{2}}{GM}\,\frac{1}{1 + e} = a(1 - e). \]

Vector diagram
The elliptical orbit of a planet around the sun; e is the eccentricity of the orbit.

Now, to include the third term (from equation 44), the solution becomes: \[ u = \frac{GM}{L^{2}}\left(1 + e\cos\phi\right) + \Delta u \tag{47} \]

\[ \frac{du}{d\phi} = -\frac{GM}{L^{2}}\,e\sin\phi + \frac{d\Delta u}{d\phi}, \qquad \frac{d^{2}u}{d\phi^{2}} = -\frac{GM}{L^{2}}\,e\cos\phi + \frac{d^{2}\Delta u}{d\phi^{2}} \]

Substituting this into equation (44): \[ \frac{d^{2}u}{d\phi^{2}} + u = \frac{GM}{L^{2}} + \frac{3GM}{c^{2}}u^{2} \tag{44} \]

\[ \frac{d^{2}u}{d\phi^{2}} + u = \frac{GM}{L^{2}}\left(1 + e\cos\phi - e\cos\phi\right) + \frac{d^{2}\Delta u}{d\phi^{2}} + \Delta u = \frac{GM}{L^{2}} + \frac{d^{2}\Delta u}{d\phi^{2}} + \Delta u \]

\[ \frac{d^{2}\Delta u}{d\phi^{2}} + \Delta u = -\frac{GM}{L^{2}} + \frac{d^{2}u}{d\phi^{2}} + u = -\frac{GM}{L^{2}} + \frac{GM}{L^{2}} + \frac{3GM}{c^{2}}u^{2} = \frac{3GM}{c^{2}}u^{2} \]

A particular solution is: \[ \Delta u = \frac{3(GM)^3}{c^{2}L^{4}} \left[ 1+ e^{2}\left(\frac{1}{2} - \frac{1}{6}\cos(2\phi)\right) + e\phi\sin\phi \right] \tag{48} \]

Now, to include the third term (from equation 44), the solution becomes:

\[ u = \frac{GM}{L^{2}}\left(1 + e\cos\phi\right) + \Delta u \tag{47} \]

\[ \frac{du}{d\phi} = -\frac{GM}{L^{2}}\, e\sin\phi + \frac{d\Delta u}{d\phi}, \qquad \frac{d^{2}u}{d\phi^{2}} = -\frac{GM}{L^{2}}\, e\cos\phi + \frac{d^{2}\Delta u}{d\phi^{2}} \]

We substitute this into formula (44):

\[ \frac{d^{2}u}{d\phi^{2}} + u = \frac{GM}{L^{2}} + \frac{3GM}{c^{2}}u^{2} \tag{44} \]

\[ \frac{d^{2}u}{d\phi^{2}} + u = \frac{GM}{L^{2}}\left(1 + e\cos\phi - e\cos\phi\right) + \frac{d^{2}\Delta u}{d\phi^{2}} + \Delta u = \frac{GM}{L^{2}} + \frac{d^{2}\Delta u}{d\phi^{2}} + \Delta u \]

\[ \frac{d^{2}\Delta u}{d\phi^{2}} + \Delta u = -\frac{GM}{L^{2}} + \frac{d^{2}u}{d\phi^{2}} + u = -\frac{GM}{L^{2}} + \frac{GM}{L^{2}} + \frac{3GM}{c^{2}}u^{2} = \frac{3GM}{c^{2}}u^{2} \]

\[ \frac{d^{2}\Delta u}{d\phi^{2}} + \Delta u = \frac{3GM}{c^{2}} \left[ \left(\frac{GM}{L^{2}}\right)^{2} + \left(\frac{GM}{L^{2}} e\cos\phi\right)^{2} + (\Delta u)^{2} + 2\left(\frac{GM}{L^{2}}\right)^{2} e\cos\phi + 2\frac{GM}{L^{2}}\Delta u + 2\frac{GM}{L^{2}} e\cos\phi\,\Delta u \right] \]

We find that, to first order in \(\Delta u\):

\[ \frac{d^{2}\Delta u}{d\phi^{2}} + \Delta u = \frac{3(GM)^{3}}{c^{2}L^{4}} \left(1 + e^{2}\cos^{2}\phi + 2e\cos\phi\right) \]

A particular solution of the equation is:

\[ \Delta u = \frac{3(GM)^{3}}{c^{2}L^{4}} \left[ 1 + e^{2}\left(\frac{1}{2} - \frac{1}{6}\cos 2\phi\right) + e\phi\sin\phi \right] \tag{48} \]


This can be verified by direct differentiation of (48):

\[ \frac{d\Delta u}{d\phi} = \frac{3(GM)^{3}}{c^{2}L^{4}} \left[ \frac{1}{3}e^{2}\sin 2\phi + e\sin\phi + e\phi\cos\phi \right] \]

\[ \frac{d^{2}\Delta u}{d\phi^{2}} = \frac{3(GM)^{3}}{c^{2}L^{4}} \left[ \frac{2}{3}e^{2}\cos 2\phi + e\cos\phi + e\cos\phi - e\phi\sin\phi \right] \]

Substituting into (48):

\[ \frac{d^{2}\Delta u}{d\phi^{2}} + \Delta u = \frac{3(GM)^{3}}{c^{2}L^{4}} \left[ \frac{2}{3}e^{2}\cos 2\phi + 2e\cos\phi - e\phi\sin\phi + 1 + e^{2}\left(\frac{1}{2} - \frac{1}{6}\cos 2\phi\right) + e\phi\sin\phi \right] \]

\[ \frac{d^{2}\Delta u}{d\phi^{2}} + \Delta u = \frac{3(GM)^{3}}{c^{2}L^{4}} \left(1 + \tfrac{1}{2}e^{2} + \tfrac{1}{2}e^{2}\cos 2\phi + 2e\cos\phi\right) \]

So, equation (44) is correct.


Now we substitute \(\Delta u\) into equation (3):

\[ u = \frac{GM}{L^{2}}\left(1 + e\cos\phi\right) + \Delta u \]

\[ u = \frac{GM}{L^{2}}\left(1 + e\cos\phi\right) + \frac{3(GM)^{3}}{c^{2}L^{4}} \left[ 1 + e^{2}\left(\frac{1}{2} - \frac{1}{6}\cos 2\phi\right) + e\phi\sin\phi \right] \]

Since the constant \(\frac{3(GM)^{3}}{c^{2}L^{4}}\) is very small, the last three terms can be neglected.

However, the term \[ e\,\frac{3(GM)^{3}}{c^{2}L^{4}}\,\phi\sin\phi \] must be retained because it grows with \(\phi\).

Thus: \[ u = \frac{GM}{L^{2}} \left[ 1 + e\left(\cos\phi + \frac{3(GM)^{2}}{c^{2}L^{2}}\,\phi\sin\phi\right) \right] +\frac{3(GM)^{3}}{c^{2}L^{4}} \left[ 1+e^2\left(\frac{1}{2}- \frac{1}{6}\cos{2\phi} \right) \right] \]

So, our approximate solution is: \[ u = \frac{GM}{L^{2}} \left[ 1 + e\left(\cos\phi + \frac{3(GM)^{2}}{c^{2}L^{2}}\,\phi\sin\phi\right) \right] \] Using the relationship: \[ \cos\left[\phi\left(1 - \frac{3(GM)^{2}}{c^{2}L^{2}}\right)\right] =\cos\left(\phi - \frac{3(GM)^{2}\phi}{c^{2}L^{2}}\right)= \] \[ \cos{\phi}\cos{\frac{3(GM)^{2}}{c^{2}L^{2}}\phi}+\sin{\phi}\sin{\frac{3(GM)^{2}}{c^{2}L^{2}}\phi} \] \[ \approx \cos\phi + \frac{3(GM)^{2}}{c^{2}L^{2}}\phi\sin\phi \quad \text{for }\,\,\frac{3(GM)^{2}}{c^{2}L^{2}}\ll 1 \] We can now write: \[ u\approx \frac{GM}{L^{2}} \left\{ 1 + e\cos\left[\phi\left(1- \frac{3(GM)^{2}}{c^{2}L^{2}}\right)\right]\right\} =\frac{GM}{L^{2}} \left\{ 1 + e\cos\left[\phi(1 - \epsilon)\right] \right\} \] we obtain:

\[ u \approx \frac{GM}{L^{2}} \left[ 1 + e\cos\left(\phi(1 - \epsilon)\right) \right], \qquad \epsilon = \frac{3(GM)^{2}}{c^{2}L^{2}} \]

For \(r = 1/u\), we get:

\[ r = \frac{L^{2}}{GM\left(1 + e\cos[\phi(1 - \epsilon)]\right)} \tag{5} \]

This expression shows that the orbit is periodic, but with a period: \[ \frac{2\pi}{1 - \epsilon}. \] Thus the orbit does not close perfectly and instead precesses.

After each complete orbit, the ellipse is rotated by: \[ \Delta\phi = 2\pi\epsilon. \]

After each complete orbit, the ellipse is slightly rotated around the focal point, by an angle:

\[ \Delta\phi = \frac{2\pi}{1-\epsilon} - 2\pi = \frac{2\pi\epsilon}{1-\epsilon} \approx 2\pi\epsilon = \frac{6\pi (GM)^2}{c^{2}L^{2}}. \]

We replace \(L\) using equation (2): \[ a = \frac{L^{2}}{GM}\,\frac{1}{1 - e^{2}} \tag{2} \]

By substituting this expression into equation (5): \[ r = \frac{L^{2}}{GM\left\{1 + e\cos\left[\phi(1-\epsilon)\right]\right\}} \tag{5} \]

we obtain the following for the trajectory: \[ r = \frac{a(1 - e^{2})}{1 + e\cos\left[\phi(1-\epsilon)\right]} \tag{5a} \]

With: \[ \epsilon = \frac{3(GM)^2}{c^{2}L^{2}} \qquad\text{or}\qquad \epsilon = \frac{3(GM)^2}{c^{2}GM\,a(1 - e^{2})} = \frac{3GM}{c^{2}a(1 - e^{2})}. \]


Derived from Kepler’s third law: \[ T^{2} = \frac{4\pi^{2}a^{3}}{G(M+m)} \approx \frac{4\pi^{2}a^{3}}{GM} \quad\Rightarrow\quad T = 2\pi\,a\,\sqrt{\frac{a}{GM}}. \]

For the velocity \(v\): \[ v = \frac{L}{r\cos\alpha} = \frac{\sqrt{aGM(1 - e^{2})}}{a(1 - e^{2})} \,\frac{1 + e\cos\left[\phi(1-\epsilon)\right]}{\cos\alpha} \]

Thus: \[ v = \sqrt{\frac{GM}{a(1 - e^{2})}} \,\frac{1 + e\cos\left[\phi(1-\epsilon)\right]}{\cos\alpha} \]

where: \[ \epsilon = \frac{3(GM)^2}{c^{2}L^{2}} = \frac{3(GM)^2}{c^{2}aGM(1 - e^{2})} = \frac{3GM}{c^{2}a(1 - e^{2})}. \]


Because: \[ L^{2} = aGM(1 - e^{2}), \]

substitution gives: \[ \Delta\phi = \frac{6\pi (GM)^2}{c^{2}aGM(1 - e^{2})}. \]

Ultimately, we obtain the following for the precession angle: \[ \Delta\phi = \frac{6\pi GM}{a(1 - e^{2})c^{2}} \tag{6} \]

vector_4_7_2
Precession of an elliptical orbit (greatly exaggerated)

We apply equation (6) to the orbit of Mercury, with the following parameters:

We find: \[ T = \sqrt{\frac{4\pi^{2} a^{3}}{GM}} = 87.95\ \text{days} \] \[ \Delta\phi = \frac{6\pi GM}{a(1 - e^{2})c^{2}} = 5.02 \times 10^{-7}\ \text{rad per cycle} \]

To calculate the precession per century: \[ \Delta\phi = 5.02 \times 10^{-7} \times \left(100 \times \frac{365.25}{88}\right) \times \left(\frac{360 \times 60 \times 60}{2\pi}\right) \] \[ \Delta\phi = 43''\ \text{arcseconds per century}. \]

In reality, the measured precession is: \[ 5599''.7 \pm 0.4''\ \text{per century}. \]

The vast majority of this is caused by gravitational influences from other planets. But after correcting for these disturbances, a residual deviation remains that agrees remarkably well with the prediction of general relativity.

For other celestial bodies, we find similar results (in arcseconds per century):

Object Observed residual precession Predicted residual precession
Mercury 43.1 ± 0.5 43.03
Venus 8 ± 5 8.6
Earth 5 ± 1 3.8
Icarus 10 ± 1 10.3

The results therefore match the predictions of general relativity extremely well. Einstein included this calculation for Mercury in his 1915 paper on general relativity. In doing so, he immediately solved one of the major outstanding problems in classical celestial mechanics—an impressive first test of his new, sophisticated theory.

4.8 Calculation of a Projectile Trajectory

As an exercise, we are interested in calculating the trajectory of a projectile using the rules of General Relativity, in contrast to the classical (Newtonian) approach.

For the General Relativistic approach, we assume that the projectile’s path is forced into an elliptical shape by the mass of the Earth. For the calculation we use the Schwarzschild equation. But first, we begin with the Newtonian approximation.

4.8.1 Newtonian Approximation

We consider a projectile fired at an angle, with a horizontal distance D between the launch point and the target, and a maximum height h. The gravitational acceleration is g, and the initial velocity of the projectile has components vx0 (horizontal) and vy0 (vertical).

vector_4_8_1

a) Time and Velocity Components

The time required for the projectile to cover the distance D with constant horizontal velocity vx0 is:

\[ v_{x0} = \frac{D}{T} \quad \Rightarrow \quad T = \frac{D}{v_{x0}} \]

To cover the distance D, the projectile also needs an upward velocity, otherwise it would hit the ground too early. This requires an initial vertical velocity component vy0. This velocity is determined by the horizontal distance D and the time T. Thus, T is also the time it takes to rise from the ground and fall back down.

Because the motion is symmetric, the time to reach the highest point is:

\[ T_{\text{up}} = \frac{T}{2} \]

At this point the vertical velocity is zero. From the equation of motion:

\[ v_y = v_{y0} - g t = 0 \quad \text{at} \quad t = \frac{T}{2} \]

This gives:

\[ v_{y0} = g t = g \frac{T}{2} = g \frac{D}{2 v_{x0}} \]

b) Height and Relation with Time

The maximum height is reached at time T/2, so:

\[ h = v_{y0} \frac{T}{2} - \frac{1}{2} g \left( \frac{T}{2} \right)^2 \] \[ h = \frac{g D}{2 v_{x0}} \cdot \frac{D}{2 v_{x0}} - \frac{1}{2} g \left( \frac{D}{2 v_{x0}} \right)^2 = \frac{g D^2}{8 v_{x0}^2} \]

Inverted, this gives:

\[ v_{x0} = D \sqrt{\frac{g}{8h}} \]

When the projectile falls back from the highest point h, it takes T/2 to reach the ground.

c) Total Velocity and Trajectory Parameters

To reach the highest point:

\[ v_{y0} = g \frac{T}{2} = g \sqrt{\frac{2h}{g}} = \sqrt{2 h g} \]

The total initial velocity is:

\[ v_0^2 = v_{x0}^2 + v_{y0}^2 = \frac{g D^2}{8 h} + 2 h g = g \frac{D^2 + 16 h^2}{8 h} \] \[ \Rightarrow v_0 = \sqrt{ g \frac{D^2 + 16 h^2}{8 h} } \]

Furthermore:

\[ v_{x0} = v_0 \cos \alpha_0 \]

The launch angle α0 follows from:

\[ \tan \alpha_0 = \frac{v_{y0}}{v_{x0}} = \frac{\sqrt{2 h g}}{D \sqrt{g/(8h)}} = \frac{4 h}{D} \] \[ \tan \alpha_0 = \frac{4 h}{D} \]

d) Trajectory Equation

The y-position as a function of time:

\[ y(t) = v_{y0} t - \frac{1}{2} g t^2 = g \frac{T}{2} t - \frac{1}{2} g t^2 = \frac{1}{2} g t (T - t) \] \[ y(t) = \frac{1}{2} g t \left( \frac{D}{v_{x0}} - t \right) \]

In terms of x:

\[ x = v_{x0} t \quad \Rightarrow \quad t = \frac{x}{v_{x0}} \] \[ y(x) = \frac{1}{2} g \frac{x}{v_{x0}} \left( \frac{D}{v_{x0}} - \frac{x}{v_{x0}} \right) = \frac{1}{2} \frac{g}{v_{x0}^2} x (D - x) \]

Thus, the trajectory is a parabola:

\[ y(x) = \frac{1}{2} \frac{g}{v_{x0}^2} x (D - x) \]

This is therefore a function of the required distance D when the initial horizontal velocity component \(v_{x0}\) is known.

e) Example Calculation

D (m) vx0 (m/s) T (s) h (m) v0 (m/s)
10524.9311.06
105000.020.000493500
10052049399
1005024.9351

(these values were computed using g = 9.87 m/s²)

f) Next Step

Now that we have fully developed the Newtonian approximation, we can compare it with the calculation based on the Schwarzschild geometry within General Relativity. This comparison follows in the next section.

4.8.2 Schwarzschild Approximation

For this approach, we consider the projectile trajectory as part of an ellipse, with the center of the Earth located at one of the foci. We use the results from the Schwarzschild derivation in Section 4.7, Alternative Derivation of the Orbital Equation.

For the orbital trajectory, the semi-major axis is:

\[ a = \frac{L^2}{G M} \frac{1}{1 - e^2} \tag{2} \]

The parameter e is the eccentricity of the projectile’s orbit. The perihelion is:

\[ r_1 = a (1 - e) \]

and the aphelion:

\[ r_2 = a (1 + e) \] \[ \Rightarrow e = \frac{r_2 - r_1}{r_2 + r_1} \]

For a circular orbit, \(e = 0\) and \(r = r_1 = r_2 = a\).

For an ellipse with the Earth at the left focus:

\[ r(\phi) = \frac{a (1 - e^2)}{1 - e \cos\!\left[\phi(1-\epsilon)\right]} \tag{2a} \]

Angle Between Velocity and Radial Vector

We now determine the angle \(α\) between the velocity \(v\) (tangential to the ellipse) and the perpendicular component \(v_{per}\) in order to compute the angular momentum. In this experiment, \(v\) is the total velocity along the ellipse, while \(v_{per}\) is the component perpendicular to \(r(\phi)\).

\[ \tan \alpha = \frac{1}{r} \frac{dr}{d\phi} \] \[ \tan \alpha = \frac{\{1-e\cos[\phi(1-\epsilon)]\}}{a(1-\epsilon^2)}\, \frac{\{a(1-e^2)(1-\epsilon)(e\sin[\phi(1-\epsilon)])\}} {\{1-e\cos[\phi(1-\epsilon)]\}^2} \] \[ \tan \alpha = \frac{dr}{r d\phi} = \frac{e(1-\epsilon)\sin[\phi(1-\epsilon)]} {1 - e \cos[\phi(1-\epsilon)]} \] \[ \alpha = \arctan\left( \frac{e(1-\epsilon)\sin[\phi(1-\epsilon)]} {1 - e \cos[\phi(1-\epsilon)]} \right) \]

With:

\[ \cos \alpha = \frac{1}{\sqrt{1 + \tan \alpha}} \]

We obtain:

\[ \cos{\alpha} = \left[ 1 + \left( \frac{e(1-\epsilon)\sin[\phi(1-\epsilon)]} {1 - e \cos[\phi(1-\epsilon)]} \right)^2 \right]^{-1/2} \] \[ \cos{\alpha} = \left[ \frac{ 1 - 2 e \cos[\phi(1-\epsilon)] + e^2 \cos^2[\phi(1-\epsilon)] + \{e(1-\epsilon)\sin[\phi(1-\epsilon)]\}^2 } {\{1 - e \cos[\phi(1-\epsilon)]\}^2} \right]^{-1/2} \]

Because of the negative square root, we invert the expression:

\[ \cos{\alpha} = \frac{1 - e \cos[\phi(1-\epsilon)]} {\left[ 1 - 2 e \cos[\phi(1-\epsilon)] + e^2 \cos^2[\phi(1-\epsilon)] + (1-2\epsilon+\epsilon^2)e^2\sin^2[\phi(1-\epsilon)] \right]^{1/2}} \] \[ \cos{\alpha} = \frac{1 - e \cos[\phi(1-\epsilon)]} {\left[ 1 - 2 e \cos[\phi(1-\epsilon)] + e^2 \cos^2[\phi(1-\epsilon)] + e^2\sin^2[\phi(1-\epsilon)] - \epsilon(2-\epsilon)e^2\sin^2[\phi(1-\epsilon)] \right]^{1/2}} \] \[ \cos{\alpha} = \frac{1 - e \cos[\phi(1-\epsilon)]} {\left[ 1 - 2 e \cos[\phi(1-\epsilon)] + e^2(1-\epsilon(2-\epsilon)\sin^2[\phi(1-\epsilon)]) \right]^{1/2}} \tag{2b} \]

Angular Momentum

The angular momentum L is constant along the ellipse and is given by:

\[ L = v_{per}\cdot r = v \cdot \cos \alpha \cdot r \]

At the launch point on the Earth’s surface:

\[ L = v_{x0}\cdot R_{\text{earth}} \]

According to the Schwarzschild solution:

\[ L = \sqrt{a G M (1 - e^2)} \]

Velocity Along the Ellipse

\[ v = \frac{L}{r \cos \alpha} = \frac{\sqrt{aGM(1 - e^2)}}{a(1-e^2)\cos{\alpha}} (1-e\cos[\phi(1-\epsilon)]) \]

This simplifies to:

\[ v = \sqrt{\frac{G M}{a (1 - e^2)}}\, \frac{1 - e \cos[\phi(1-\epsilon)]} {\cos \alpha} \tag{2c} \]

Substituting \(\cos{\alpha}\) from (2b) into (2c):

\[ v = \left(\frac{G M}{a (1 - e^2)}\right)^{1/2} \frac{1-e\cos[\phi(1-\epsilon)]}{1-e\cos[\phi(1-\epsilon)]} \left( 1 - 2 e \cos[\phi(1-\epsilon)] + e^2\{1-\epsilon(2-\epsilon)\sin^2[\phi(1-\epsilon)]\} \right)^{1/2} \]

The instantaneous velocity as a function of \(\phi\) is:

\[ v = \sqrt{ \left(\frac{G M}{a (1 - e^2)}\right) \left( 1 - 2 e \cos[\phi(1-\epsilon)] + e^2\{1-\epsilon(2-\epsilon)\sin^2[\phi(1-\epsilon)]\} \right) } \tag{2d} \]

Relativistic Correction Parameter

From the previous chapter:

\[ \epsilon = \frac{3 (G M)^2}{c^2 L^2} = \frac{3 (G M)^2}{c^2 aGM(1-e^2)} = \frac{3 G M}{c^2 a (1 - e^2)} \]

Here:

\[ \epsilon = \frac{3 (G M)^2}{c^2 L^2} = \frac{3 (G M)^2}{c^2 (v_{x0}R_{earth})^2} = \frac{3c^2}{v_{x0}^2} \left( \frac{G M}{c^2 R_{earth}} \right)^2 \tag{2e} \]

Geometric Relations

vector_4_8_2a

To zoom in a bit further:

vector_4_8_2b

\[ \phi \cdot R = \frac{D}{2} \quad \Rightarrow \quad \phi = \frac{D}{2R} \] \[ v_{\text{per}} = v_{xA} = v \cos(\alpha) \quad\text{and}\quad v_{yA} = v \sin(\alpha) \]

From (2a): \[ a(1 - e^{2}) = r\{1 - e\cos[\phi(1 - \epsilon)]\} \]

From (2d):

\[ v = \sqrt{ \left(\frac{GM}{a(1 - e^{2})}\right) \left( 1 - 2e\cos[\phi(1-\epsilon)] + e^{2}\{1 - \epsilon(2-\epsilon)\sin^{2}[\phi(1-\epsilon)]\} \right) } \] \[ v^{2} = \left(\frac{GM}{a(1 - e^{2})}\right) \left( 1 - 2e\cos[\phi(1-\epsilon)] + e^{2}\{1 - \epsilon(2-\epsilon)\sin^{2}[\phi(1-\epsilon)]\} \right) \] \[ v^{2} = GM\, \frac{ 1 - 2e\cos[\phi(1-\epsilon)] + e^{2}\{1 - \epsilon(2-\epsilon)\sin^{2}[\phi(1-\epsilon)]\} }{ r\{1 - e\cos[\phi(1-\epsilon)]\} } \] \[ \frac{v^{2} r}{GM}\{1 - e\cos[\phi(1-\epsilon)]\} = 1 - 2e\cos[\phi(1-\epsilon)] + e^{2}\{1 - \epsilon(2-\epsilon)\sin^{2}[\phi(1-\epsilon)]\} \] \[ e^{2}\{1 - \epsilon(2-\epsilon)\sin^{2}[\phi(1-\epsilon)]\} - e\cos[\phi(1-\epsilon)]\left(2 - \frac{v^{2}r}{GM}\right) + \left(1 - \frac{v^{2}r}{GM}\right) = 0 \]

\[ e = \frac{ \cos[\phi(1-\epsilon)]\left(2 - \frac{v^{2}r}{GM}\right) }{ 2\{1 - \epsilon(2-\epsilon)\sin^{2}[\phi(1-\epsilon)]\} } \pm \]

\[ \pm \frac{ \sqrt{ \left[\cos[\phi(1-\epsilon)]\left(2 - \frac{v^{2}r}{GM}\right)\right]^{2} - 4\left(1 - \frac{v^{2}r}{GM}\right) \{1 - \epsilon(2-\epsilon)\sin^{2}[\phi(1-\epsilon)]\} } }{ 2\{1 - \epsilon(2-\epsilon)\sin^{2}[\phi(1-\epsilon)]\} } \]

At the starting point where the trajectory intersects the Earth’s surface, we have \( r = R \) (with \(R\) the Earth’s radius) and \( \phi = D/(2R) \).

From (2a):

\[ r = \frac{a(1 - \epsilon^{2})}{1 - e\cos[\phi(1-\epsilon)]} \] \[ a(1 - \epsilon^{2}) = R\{1 - e\cos[\tfrac{D}{2R}(1-\epsilon)]\} \] \[ a = \frac{ R\{1 - e\cos[\tfrac{D}{2R}(1-\epsilon)]\} }{ 1 - \epsilon^{2} } \tag{3} \]

\[ e = \frac{ \cos[\phi(1-\epsilon)]\left(2 - \frac{v^{2}r}{GM}\right) }{ 2\{1 - \epsilon(2-\epsilon)\sin^{2}[\phi(1-\epsilon)]\} } \pm \]

\[ \pm \frac{ \sqrt{ \left[\cos[\phi(1-\epsilon)]\left(2 - \frac{v^{2}r}{GM}\right)\right]^{2} - 4\left(1 - \frac{v^{2}r}{GM}\right) \{1 - \epsilon(2-\epsilon)\sin^{2}[\phi(1-\epsilon)]\} } }{ 2\{1 - \epsilon(2-\epsilon)\sin^{2}[\phi(1-\epsilon)]\} } \]

Or, using equations (2), (2e), and (3):

\[ R\{1 - e\cos[\tfrac{D}{2R}(1-\epsilon)]\} = a(1 - e^{2}) = \frac{L^{2}}{GM} \]

\[ e = \frac{1 - \frac{L^{2}}{RGM}} {\cos[\tfrac{D}{2R}(1-\epsilon)]} = \frac{ 1 - \frac{L^{2}}{RGM} }{ \cos\!\left[ \frac{D}{2R} \left( 1 - \frac{3c^{2}}{v_{x0}^{2}} \left(\frac{GM}{c^{2}R_{\text{earth}}}\right)^{2} \right) \right] } = \frac{ 1 - \frac{(v_{x0}R)^{2}}{RGM} }{ \cos\!\left[ \frac{D}{2R} \left( 1 - \frac{3c^{2}}{v_{x0}^{2}} \left(\frac{GM}{c^{2}R_{\text{earth}}}\right)^{2} \right) \right] } \]

The given velocity at the point \( r = R \) is \( v \). Thus, for a given velocity, there are two possible solutions for \( e \).

Here, \( h \) is the maximum height of the projectile: \[ h = a(1 + e) - R \]

Together with (3): \[ h = \frac{ R\{1 - e\cos[\tfrac{D}{2R}(1-\epsilon)]\} }{ 1 - e^{2} }(1 + e) - R = R\left\{ \frac{ 1 - e\cos[\tfrac{D}{2R}(1-\epsilon)] }{ 1 - e } - 1 \right\} \] \[ h = R\left\{ \frac{ 1 - e\cos[\tfrac{D}{2R}(1-\epsilon)] - 1 + e }{ 1 - e } \right\} = R\frac{ e\left(1 - e\cos[\tfrac{D}{2R}(1-\epsilon)]\right) }{ 1 - e } \]

Here, \(D\) is the horizontal distance, \(v\) the launch speed, and \(R\) the Earth’s radius. As shown above: \[ \phi = \frac{D}{2R}. \]

Or, more pragmatically, in our projectile example with \(v_{x0}\) and \(D\) as inputs:

\[ h = a(1 + e) - R = a\frac{1 - e^{2}}{1 - e} - R = \frac{L^{2}}{GM}\frac{1}{1 - e} - R \]

\[ = \frac{v_{x0}^{2}R^{2}}{GM}\frac{1}{1 - e} - R \]

Where: \[ e = \frac{ 1 - \frac{v_{x0}^{2}R}{GM} }{ \cos\!\left[ \frac{D}{2R} \left( 1 - \frac{3c^{2}}{v_{x0}^{2}} \left(\frac{GM}{c^{2}R}\right)^{2} \right) \right] } \]

Derivation of the circumference of an ellipse

\[ \frac{x^{2}}{a^{2}} + \frac{y^{2}}{b^{2}} = 1, \qquad x = a\cos\beta,\quad y = b\sin\beta \]

\[ \text{Circumference} = 4a \int_{0}^{\pi/2} \sqrt{ \left(\frac{dx}{d\beta}\right)^{2} + \left(\frac{dy}{d\beta}\right)^{2} } \, d\beta \]

\[ = 4a \int_{0}^{\pi/2} \sqrt{ a^{2}\sin^{2}\beta + b^{2}\cos^{2}\beta } \, d\beta \]

\[ = 4a \int_{0}^{\pi/2} \sqrt{ a^{2}(1 - \cos^{2}\beta) + b^{2}\cos^{2}\beta } \, d\beta \]

\[ = 4a \int_{0}^{\pi/2} \sqrt{ a^{2} - (a^{2} - b^{2})\cos^{2}\beta } \, d\beta \]

\[ \text{Circumference} = 4a \int_{0}^{\pi/2} \sqrt{ 1 - e^{2}\cos^{2}\beta } \, d\beta \]

There is no simple closed-form expression for the circumference of an ellipse. A well‑known approximation is Ramanujan’s formula:

\[ \text{Circumference} \approx \pi a\left[ 3(1 + \sqrt{1 - e^{2}}) - \sqrt{10\sqrt{1 - e^{2}} + 3(2 - e^{2})} \right] \]

Summary of formulas used

The starting points for this derivation are the projectile’s tangential velocity along the Earth’s surface (\(v_{x0} = v_{\text{per}}\), perpendicular to \(r\)) and the required distance \(D\). Thus, at the launch point we know the position and momentum of the projectile and can compute the full trajectory.

With these formulas, the full relativistic trajectory and maximum height can be computed from the launch speed and required distance.

Detailed calculation results

Detailed results for the example above. The starting parameters are the projectile’s tangential velocity (perpendicular to \(r\)) and the required horizontal distance.

Newton Schwarzschild
Parameter 5 m/s 500 m/s 500 m/s 1000 m/s 5 m/s 500 m/s 500 m/s 1000 m/s
Distance (m) 101020002000 101020002000
vr0 (m/s) 9.870.1019.739.87 9.760.1019.669.71
Speed (m/s) 115005001000 115005001000
\(\epsilon\) ---- 5.25×10-3 5×10-7 5.25×10-7 1×10-7
e (eccentricity) 1.0000.9960.9960.984
a (m) 3.18×106 3.18×106 3.18×106 3.20×106
h (m) 4.934.93×10-4 19.734.93 4.884.91×10-4 19.664.85
α (rad) 1.100.0000.040.010 1.100.0000.040.010
α (deg) 63.130.01132.260.565 62.880.01132.250.556
Φ (rad) 7.87×10-7 7.87×10-7 1.57×10-4 1.57×10-4
L (angular momentum) 3.18×107 3.18×109 3.18×109 6.36×109 3.18×107 3.18×109 3.18×109 6.36×109
cos(α) 0.4520 1.0000 0.9992 1.0000 0.4558 1.0000 0.9992 1.0000
cos(α + Φ) 0.4558 1.000 0.9992 1.000
Circumference (km) - - - - 12662 12894 12894 13346

3) Analysis of the Results

Height Differences

In the classical case, the maximum height of the projectile is \( h \approx 4.93\,\text{m} \) at low speeds. In the Schwarzschild approach this height is slightly lower (for example \( h \approx 4.88\,\text{m} \)), indicating a stronger effective gravitational field.

Eccentricity

An eccentricity of exactly \( e = 1 \) corresponds to the classical parabolic trajectory. The Schwarzschild approach shows that the trajectories become slightly elliptical with \( e < 1 \). For a horizontal velocity of 500 m/s we find \( e \approx 0.996 \), while at 5 m/s \( e \approx 1 \), consistent with an almost parabolic path.

Direction Angle

The deviation in direction angle \( \Phi \) is extremely small at low speeds, but measurable at higher energies. For a projectile of 500 m/s over 2 km, the deviation is \( \Phi \approx 1.57 \times 10^{-4}\,\text{rad} \), corresponding to a precession of the ellipse axis.

Angular Momentum and Orbit Circumference

The angular momentum \( L \) increases with the initial velocity. The corresponding (approximate) circumference of the elliptical orbit also increases, reflecting the longer path travelled by a high‑energy projectile.

4.8.3 Conclusion