General Relativity

Special relativity explains physics in flat spacetime. General relativity asks the next question: what happens when spacetime itself is curved? Einstein’s answer is that gravity is not an ordinary force pulling on objects across empty space. Instead, matter and energy curve spacetime, and free-falling objects follow the straightest paths in that geometry.

The Equivalence Principle

Einstein’s key stepping stone was the equivalence principle. Imagine a small laboratory sealed inside an elevator. If the cabin accelerates upward through empty space, dropped objects fall toward the floor. Inside a small enough lab, the local physics can mimic standing still in a uniform gravitational field.

As with special relativity, Maxwell’s theory of light as electromagnetic waves was taken as an axiom. These waves are massless and should not feel any gravitational effects according to Newton. However, light in the accelerating elevator appears to follow a parabolic trajectory. Taking the equivalence principle seriously implies a gravitational effect on light. This insight led to a further reshaping of our understanding of space and time.

0

Geodesics

The equivalence principle gives a clean way of connecting accelerating reference frames to gravity. However, it is a local statement, holding strictly true in a sufficiently small region of spacetime. (The gravitational effects of Earth are not equivalent to a giant accelerating elevator.) The curvature of spacetime becomes apparent when you compare nearby free-fall trajectories. If those paths move closer together or farther apart, the spacetime is not flat.

To make that idea more tangible, consider the motion of ants along the surface of an apple. The ants are always trying to walk “straight ahead”, and the corresponding trajectories are known as geodesics. The apple’s geometry causes nearby ants to converge (positive curvature) or diverge (negative curvature).

This is an analogy, not the full Einstein equations. The surface of an apple is an example of extrinsic curvature. It’s a 2D surface that bends/curves in a third spatial dimension. Spacetime is a 4D manifold with intrinsic curvature. Not only is it mind-bending to consider time as a geometric dimension, but there’s actually no need for a fifth embedding dimension. The math gets tricky, so let’s instead build intuition for curvature by watching how nearby paths fail to stay perfectly parallel.

Apple Geodesics

(Locally) shortest paths on a curved surface.

From Interval to Metric

The apple picture helps build intuition, but the actual geometry of spacetime is encoded in the metric. On the first relativity page, the flat-spacetime interval in three spatial dimensions would be written as

ds2=c2dt2dx2dy2dz2.ds^2 = c^2dt^2 - dx^2 - dy^2 - dz^2.

This plays a role similar to the Euclidean distance formula from ordinary geometry. In flat 3D space, you might write

d2=dx2+dy2+dz2,d\ell^2 = dx^2 + dy^2 + dz^2,

which tells you how small coordinate changes combine into an actual distance. The spacetime interval does the same kind of bookkeeping, except that time and space enter with different signs.

General relativity keeps the idea of an invariant interval, but it allows the coefficients to vary from place to place:

ds2=gμν(x)dxμdxν.ds^2 = g_{\mu \nu}(x)\,dx^{\mu}dx^{\nu}.

The collection of coefficients gμν(x)g_{\mu \nu}(x) is the metric (the notation above implies summation over indices μ\mu and ν\nu, so 16 coefficients for a 4D spacetime). In flat spacetime those coefficients are constant. Near matter and energy they change with location, and in the full theory Einstein’s equations determine them from the local mass-energy distribution.

It’s worth seeing one important example written out fully. The Schwarzschild metric describes spacetime near a spherical mass MM (with no charge or angular momentum). Written in spherical coordinates (r,θ,ϕ)(r,\theta,\phi), the spacetime interval according to a distant observer is

ds2=(12GMrc2)c2dt2dr212GMrc2r2dθ2r2sin2θdϕ2.ds^2 = \left(1 - \frac{2GM}{rc^2}\right)c^2dt^2 - \frac{dr^2}{1 - \frac{2GM}{rc^2}} - r^2d\theta^2 - r^2\sin^2\theta\,d\phi^2.

Note that the coefficient of c2dt2c^2dt^2 is no longer just 11; it depends on the distance rr from the gravitating body.

With a metric like this, we can now measure tiny intervals in the geometry of spacetime. This ultimately allows us to determine the geodesics, i.e. the worldlines of freely falling objects. At this point, we are fully in the land of differential geometry which requires the use of calculus and tensors. This is a rich subject and worth learning/reading about at some point in your life.

At this point, it’s probably better to focus on some of the interesting consequences of GR. Below, you can read about gravitational time dilation and the deflection of light.

Geodesic Equation

How the metric turns the idea of 'straight ahead' into an equation.

For a freely falling massive object, the geodesic is the worldline that makes the proper time locally extremal:

δdτ=0.\delta \int d\tau = 0.

Writing that condition as a differential equation gives the geodesic equation

d2xμdτ2+Γαβμdxαdτdxβdτ=0,\frac{d^2 x^\mu}{d\tau^2} + \Gamma^\mu_{\alpha\beta}\frac{dx^\alpha}{d\tau}\frac{dx^\beta}{d\tau} = 0,

where the Christoffel symbols Γαβμ\Gamma^\mu_{\alpha\beta} are built from the metric. As a simple example, in flat spacetime, this reduces to

d2xμdτ2=0.\frac{d^2 x^\mu}{d\tau^2} = 0.

Integrating once gives

dxμdτ=Aμ,\frac{dx^\mu}{d\tau} = A^\mu,

where AμA^\mu is constant, and integrating again gives

xμ(τ)=Aμτ+Bμ.x^\mu(\tau) = A^\mu \tau + B^\mu.

Each coordinate changes linearly with τ\tau, which is just uniform straight-line motion.

A Simple Clock Calculation

Take a clock that is held at a fixed radius outside the mass, so dr=dθ=dϕ=0dr = d\theta = d\phi = 0. Along the clock’s worldline we can write ds2=c2dτ2ds^2 = c^2d\tau^2, where τ\tau is the proper time measured by that clock. The metric then becomes

c2dτ2=(12GMrc2)c2dt2,c^2d\tau^2 = \left(1 - \frac{2GM}{rc^2}\right)c^2dt^2,

so

dτ=12GMrc2dt.d\tau = \sqrt{1 - \frac{2GM}{rc^2}}\,dt.

Here tt can be interpreted as the time kept by a clock very far from the mass, where the gravitational field is negligible. Since the square root is less than 11, a clock deeper in the gravitational field accumulates less proper time.

At Earth’s surface,

2GMRc2=2(6.67×1011)(5.97×1024)(6.37×106)(3.0×108)21.39×109.\frac{2GM}{Rc^2} = \frac{2(6.67 \times 10^{-11})(5.97 \times 10^{24})}{(6.37 \times 10^6)(3.0 \times 10^8)^2} \approx 1.39 \times 10^{-9}.

That gives

dτdt=11.39×10916.95×1010.\frac{d\tau}{dt} = \sqrt{1 - 1.39 \times 10^{-9}} \approx 1 - 6.95 \times 10^{-10}.

Over one day, t=86400st = 86400\,\text{s}, so

τ(16.95×1010)(86400s)86399.999940s.\tau \approx (1 - 6.95 \times 10^{-10})(86400\,\text{s}) \approx 86399.999940\,\text{s}.

That is about 6.0×105s=60μs6.0 \times 10^{-5}\,\text{s} = 60\,\mu\text{s} less than a clock very far from Earth. The effect is tiny here, but it is real: gravity (the warping of spacetime by Earth) changes the metric, and the metric changes how clocks accumulate time.

Weak-Field Light Deflection

A simple estimate of light bending near a gravitating body.

In the weak-field limit, a light ray passing a spherical mass is deflected by

Δϕ4GMbc2,\Delta \phi \approx \frac{4GM}{bc^2},

where MM is the mass of the gravitating body and bb is the closest-approach distance of the light ray. The scaling makes good physical sense: larger masses bend light more, while larger fly-by distances bend it less. The factor of c2c^2 in the denominator also shows why the effect is usually tiny unless the gravitating body is very massive or very compact.

The accelerating-elevator picture tells you why there should be any bending at all. In an upward-accelerating cabin, a horizontal light beam appears to deflect as the floor rises while the beam crosses the room. By the equivalence principle, light near a gravitating body should also be deflected.

A quick elevator-style estimate goes like this. If the cabin has width LL, the light-crossing time is

tLc.t \approx \frac{L}{c}.

During that time, the floor rises by

y12gt2=12gL2c2.y \approx \frac{1}{2}gt^2 = \frac{1}{2}g\frac{L^2}{c^2}.

The beam therefore picks up a small tilt. Using the slope of the parabolic path, the exit angle is approximately

θdydxgLc2.\theta \approx \frac{dy}{dx} \approx \frac{gL}{c^2}.

To turn this into a rough gravitational estimate for a light ray passing a mass MM, take the local field strength near closest approach bb to be

gGMb2,g \approx \frac{GM}{b^2},

and take the interaction distance to be of order

L2b.L \approx 2b.

Then

θgLc2GMb22bc2=2GMbc2.\theta \approx \frac{gL}{c^2} \approx \frac{GM}{b^2}\frac{2b}{c^2} = \frac{2GM}{bc^2}.

That elevator argument gets the right qualitative idea and roughly the right scale, but it is not the whole story. A simple elevator-style estimate gives only about

Δϕ2GMbc2,\Delta \phi \approx \frac{2GM}{bc^2},

while the full Schwarzschild geometry gives

Δϕ4GMbc2.\Delta \phi \approx \frac{4GM}{bc^2}.

The extra factor of 22 comes from the fact that general relativity curves not just time, but space as well.

Problem Solving

Elevator Kinematics

Problem

An elevator accelerates upward at 9.8m/s29.8\,\text{m/s}^2 in deep space. A ball is released from rest relative to the cabin when it is 1.2m1.2\,\text{m} above the floor. How long does the ball take to reach the floor according to someone inside the elevator?

Given

  • effective downward acceleration a=9.8m/s2a = 9.8\,\text{m/s}^2
  • drop distance y=1.2my = 1.2\,\text{m}
  • initial speed relative to cabin is zero

Use

In the accelerating cabin, the motion matches the constant-acceleration relation

y=12at2.y = \frac{1}{2}at^2.

Solve

1.2=12(9.8)t2t2=2.49.80.245.1.2 = \frac{1}{2}(9.8)t^2 \qquad \Rightarrow \qquad t^2 = \frac{2.4}{9.8} \approx 0.245.

So

t0.49s.t \approx 0.49\,\text{s}.

Check

The calculation is exactly what you would do for a small cabin resting in a uniform gravitational field. That is the local content of the equivalence principle.

Converging Geodesics

Problem

Two shuttles, moving parallel to each other, are in radial free fall toward the surface of a planet. They begin 3.0km3.0\,\text{km} apart and later are only 2.9km2.9\,\text{km} apart, even though neither fires a rocket. What does that change in separation tell you?

Given

  • initial separation: 3.0km3.0\,\text{km}
  • later separation: 2.9km2.9\,\text{km}
  • both shuttles remain in free-fall

Use

Free-fall paths represent geodesics. Decreasing spatial separation implies the geodesics are converging (the full spacetime geodesics are a bit trickier to consider).

Solve

The separation changes by

Δd=2.9km3.0km=0.1km.\Delta d = 2.9\,\text{km} - 3.0\,\text{km} = -0.1\,\text{km}.

This inward focusing is evidence that spacetime has positive curvature in this region.

Check

In flat spacetime, initially parallel free-fall paths would not naturally focus together this way. A more Newtonian description would point out that this tidal effect could be explained as both shuttles moving toward the center of the planet.

Deflection of Light

Problem

A ray of starlight just grazes the surface of the Sun on its way to Earth. Estimate the angle by which the light is deflected by the Sun’s gravity.

Given

  • G=6.67×1011Nm2/kg2G = 6.67 \times 10^{-11}\,\text{N}\,\text{m}^2/\text{kg}^2
  • M=1.99×1030kgM_{\odot} = 1.99 \times 10^{30}\,\text{kg}
  • R=6.96×108mR_{\odot} = 6.96 \times 10^8\,\text{m}
  • c=3.0×108m/sc = 3.0 \times 10^8\,\text{m/s}

Use

For a light ray passing a spherical mass in the weak-field limit,

Δϕ4GMbc2,\Delta \phi \approx \frac{4GM}{bc^2},

where bb is the closest approach distance. For light grazing the Sun, bRb \approx R_{\odot}.

Solve

Δϕ4(6.67×1011)(1.99×1030)(6.96×108)(3.0×108)28.5×106rad.\Delta \phi \approx \frac{4(6.67 \times 10^{-11})(1.99 \times 10^{30})}{(6.96 \times 10^8)(3.0 \times 10^8)^2} \approx 8.5 \times 10^{-6}\,\text{rad}.

Converting to arcseconds using 1rad2062651\,\text{rad} \approx 206265 arcsec,

Δϕ(8.5×106)(206265)1.75 arcsec.\Delta \phi \approx (8.5 \times 10^{-6})(206265) \approx 1.75 \text{ arcsec}.

Check

The deflection is tiny, but still measurable. This is why stars viewed near the edge of the Sun appear slightly shifted on the sky, one of the classic early tests of general relativity.

Gravitational Time Dilation

Problem

In a simplified Schwarzschild black hole model, a landing team spends time at a radius r=1.00000001rsr = 1.00000001\,r_s, where rs=2GM/c2r_s = 2GM/c^2, while the main ship waits very far away. If 10.010.0 years pass on the ship, how much proper time passes for the landing team?

Given

  • r=1.00000001rsr = 1.00000001\,r_s
  • distant-ship time Δt=10.0yr\Delta t = 10.0\,\text{yr}

Use

For a clock held at fixed radius outside a nonrotating black hole,

Δτ=1rsrΔt.\Delta \tau = \sqrt{1 - \frac{r_s}{r}}\,\Delta t.

Solve

First compute the time-dilation factor:

ΔτΔt=1rs1.00000001rs=111.000000011.0×1081.0×104.\frac{\Delta \tau}{\Delta t} = \sqrt{1 - \frac{r_s}{1.00000001\,r_s}} = \sqrt{1 - \frac{1}{1.00000001}} \approx \sqrt{1.0 \times 10^{-8}} \approx 1.0 \times 10^{-4}.

So

Δτ(1.0×104)(10.0yr)=1.0×103yr.\Delta \tau \approx (1.0 \times 10^{-4})(10.0\,\text{yr}) = 1.0 \times 10^{-3}\,\text{yr}.

Converting to more familiar units,

1.0×103yr0.365day8.8h.1.0 \times 10^{-3}\,\text{yr} \approx 0.365\,\text{day} \approx 8.8\,\text{h}.

Check

This is the same kind of effect dramatized in Interstellar: the crew deeper in the gravitational well experiences far less elapsed time than the crew farther away. This example uses a highly simplified nonrotating-black-hole model, but it captures the basic idea that strong gravity can create enormous time differences.

General Relativity Checkpoint

Question 1 of 3

Why does the equivalence principle matter for general relativity?

Choose an answer to get instant feedback.