# Kepler's Laws of Planetary Motion/First Law

## Physical Law

**Kepler's first law of planetary motion** is one of the three physical laws of celestial mechanics deduced by Johannes Kepler:

## Proof

Consider a planet $p$ of mass $m$ moving around the sun in the plane under the influence of the force $\mathbf F$ imparted by the gravitational field which the two bodies give rise to.

Let the position of $p$ at time $t$ be given in polar coordinates as $\left\langle{r, \theta}\right\rangle$.

Let $\mathbf F$ be expressed as:

- $\mathbf F = F_r \mathbf u_r + F_\theta \mathbf u_\theta$

where:

- $\mathbf u_r$ is the unit vector in the direction of the radial coordinate of $p$
- $\mathbf u_\theta$ is the unit vector in the direction of the angular coordinate of $p$
- $F_r$ and $F_\theta$ are the magnitudes of the components of $\mathbf F$ in the directions of $\mathbf u_r$ and $\mathbf u_\theta$ respectively.

Let $\mathbf r$ be the radius vector from the origin to $p$.

From Motion of Particle in Polar Coordinates, the second order ordinary differential equations governing the motion of $m$ under the force $\mathbf F$ are:

\((1):\quad\) | \(\displaystyle F_\theta\) | \(=\) | \(\displaystyle m \left({r \dfrac {\mathrm d^2 \theta} {\mathrm d t^2} + 2 \dfrac {\mathrm d r} {\mathrm d t} \dfrac {\mathrm d \theta} {\mathrm d t} }\right)\) | ||||||||||

\((2):\quad\) | \(\displaystyle F_r\) | \(=\) | \(\displaystyle m \left({\dfrac {\mathrm d^2 r} {\mathrm d t^2} - r \left({\dfrac {\mathrm d \theta} {\mathrm d t} }\right)^2}\right)\) |

By Newton's Law of Universal Gravitation, $\mathbf F$ is a central force of value:

- $\mathbf F = -G \dfrac {M m} {r^3} \mathbf r$

where:

- $G$ is the gravitational constant
- $M$ and $m$ are the masses of the bodies
- the minus sign indicates that the central force is in the opposite direction to that of the radius vector $\mathbf r$.

Hence we have:

- $F_\theta = 0$
- $F_r = -G \dfrac {M m} {r^2} \dfrac {\left\vert{\mathbf r}\right\vert} r = -G \dfrac {M m} {r^2}$

As $G$ and $M$ are both constants, so we can express this as:

- $F_r = -\dfrac {k m} {r^2}$

So from $(2)$:

\(\displaystyle -\dfrac {k m} {r^2}\) | \(=\) | \(\displaystyle m \left({\dfrac {\mathrm d^2 r} {\mathrm d t^2} - r \left({\dfrac {\mathrm d \theta} {\mathrm d t} }\right)^2}\right)\) | |||||||||||

\((3):\quad\) | \(\displaystyle \leadsto \ \ \) | \(\displaystyle \dfrac {\mathrm d^2 r} {\mathrm d t^2} - r \left({\dfrac {\mathrm d \theta} {\mathrm d t} }\right)^2\) | \(=\) | \(\displaystyle -\dfrac k {r^2}\) |

Let $z = \dfrac 1 r$.

Then:

\(\displaystyle \dfrac {\mathrm d r} {\mathrm d t}\) | \(=\) | \(\displaystyle \dfrac {\mathrm d} {\mathrm d t} \dfrac 1 z\) | |||||||||||

\(\displaystyle \) | \(=\) | \(\displaystyle - \dfrac 1 {z^2} \dfrac {\mathrm d z} {\mathrm d t}\) | Power Rule for Derivatives | ||||||||||

\(\displaystyle \) | \(=\) | \(\displaystyle - \dfrac 1 {z^2} \dfrac {\mathrm d z} {\mathrm d \theta} \dfrac {\mathrm d \theta} {\mathrm d t}\) | Chain Rule | ||||||||||

\(\displaystyle \) | \(=\) | \(\displaystyle - \dfrac 1 {z^2} \dfrac {\mathrm d z} {\mathrm d \theta} \dfrac h {r^2}\) | Derivative of Angular Component under Central Force | ||||||||||

\(\displaystyle \) | \(=\) | \(\displaystyle - h \dfrac {\mathrm d z} {\mathrm d \theta}\) | as $r = \dfrac 1 z$ |

Then:

\(\displaystyle \dfrac {\mathrm d^2 r} {\mathrm d t^2}\) | \(=\) | \(\displaystyle -h \dfrac {\mathrm d} {\mathrm d t} \left({\dfrac {\mathrm d z} {\mathrm d \theta} }\right)\) | |||||||||||

\(\displaystyle \) | \(=\) | \(\displaystyle -h \dfrac {\mathrm d} {\mathrm d \theta} \left({\dfrac {\mathrm d z} {\mathrm d \theta} }\right) \dfrac {\mathrm d \theta} {\mathrm d t}\) | Chain Rule | ||||||||||

\(\displaystyle \) | \(=\) | \(\displaystyle -h \dfrac {\mathrm d^2 z} {\mathrm d \theta^2} \dfrac {\mathrm d \theta} {\mathrm d t}\) | Chain Rule | ||||||||||

\(\displaystyle \) | \(=\) | \(\displaystyle -h \dfrac {\mathrm d^2 z} {\mathrm d \theta^2} \dfrac h {r^2}\) | Derivative of Angular Component under Central Force | ||||||||||

\((4):\quad\) | \(\displaystyle \) | \(=\) | \(\displaystyle -h^2 z^2 \dfrac {\mathrm d^2 z} {\mathrm d \theta^2}\) | as $r = \dfrac 1 z$ |

Substituting from $(4)$ into $(3)$:

\(\displaystyle -h^2 z^2 \dfrac {\mathrm d^2 z} {\mathrm d \theta^2} - r \left({\dfrac {\mathrm d \theta} {\mathrm d t} }\right)^2\) | \(=\) | \(\displaystyle -\dfrac k {r^2}\) | |||||||||||

\(\displaystyle \leadsto \ \ \) | \(\displaystyle -h^2 z^2 \dfrac {\mathrm d^2 z} {\mathrm d \theta^2} - r \left({ \dfrac h {r^2} }\right)^2\) | \(=\) | \(\displaystyle -\dfrac k {r^2}\) | Derivative of Angular Component under Central Force | |||||||||

\(\displaystyle \leadsto \ \ \) | \(\displaystyle -h^2 z^2 \dfrac {\mathrm d^2 z} {\mathrm d \theta^2} - \frac 1 z \left({ \dfrac h {z^2} }\right)^2\) | \(=\) | \(\displaystyle -k z^2\) | substituting $r = \dfrac 1 z$ | |||||||||

\((5):\quad\) | \(\displaystyle \leadsto \ \ \) | \(\displaystyle \dfrac {\mathrm d^2 z} {\mathrm d \theta^2} + z\) | \(=\) | \(\displaystyle \dfrac k {h^2}\) | simplifying |

From Second Order ODE: $y'' + y = K$, $(5)$ has the general solution:

- $(6): \quad z = A \sin \theta + B \cos \theta + \dfrac k {h^2}$

Without loss of generality, we shift the polar axis so as to make $r$ a minimum when $\theta = 0$.

That is, when $\theta = 0$, $p$ is at its closest point to the origin.

This means that $z = \dfrac 1 z$ is a maximum at this point.

Thus at $\theta = 0$:

From Derivative at Maximum or Minimum:

- $\dfrac {\mathrm d z} {\mathrm d \theta} = 0$

From Second Derivative of Strictly Concave Real Function is Strictly Negative:

- $\dfrac {\mathrm d^2 z} {\mathrm d \theta^2} < 0$

Thus:

- $A = 0$
- $B > 0$

Substituting these and $z = \dfrac 1 r$ back into $(6)$:

- $r = \dfrac 1 {B \cos \theta + \dfrac k {h^2} }$

\(\displaystyle r\) | \(=\) | \(\displaystyle \dfrac 1 {B \cos \theta + k / h^2}\) | |||||||||||

\(\displaystyle \) | \(=\) | \(\displaystyle \dfrac {h^2 / k} {1 + \left({B h^2 / k}\right) \cos \theta}\) |

Setting $B h^2 / k = e$:

- $r = \dfrac {h^2 / k} {1 + e \cos \theta}$

where $e \in \R_{>0}$ and is constant.

This can be expressed as:

- $(7): \quad r = \dfrac {p e} {1 + e \cos \theta}$

by setting $p = \dfrac 1 B$.

From Equation of Conic Section in Polar Form, $(7)$ is the equation of a conic section with one focus at the origin.

As the planets remain in the solar system it follows that their orbits are stable and elliptical.

$\blacksquare$

## Also see

## Source of Name

This entry was named for Johannes Kepler.

## Historical Note

Kepler derived his three laws of planetary motion in the early $1600$s from a concentrated study over the course of $20$ years of the colossal wealth of observational data which had been made previously by Tycho Brahe of the behavior of the planets of the solar system, and in particular Mars.

The first two of these results he published in his gigantic work *Astronomia Nova*.

The third appears some ten years later in his *Harmonices Mundi* of $1619$.

It was Isaac Newton who managed to interpret these three laws and so work out what is now known as Newton's Law of Universal Gravitation, from which Kepler's laws can straightforwardly be derived.

## Sources

- 1937: Eric Temple Bell:
*Men of Mathematics*... (previous) ... (next): Chapter $\text{VI}$: On the Seashore - 1972: George F. Simmons:
*Differential Equations*... (previous) ... (next): $\S 3.21$: Newton's Law of Gravitation - 1992: George F. Simmons:
*Calculus Gems*... (previous) ... (next): Chapter $\text {A}.10$: Kepler ($1571$ – $1630$) - 1992: George F. Simmons:
*Calculus Gems*... (previous) ... (next): Chapter $\text {B}.25$: Kepler's Laws and Newton's Law of Gravitation - 2008: Ian Stewart:
*Taming the Infinite*... (previous) ... (next): Chapter $8$: The System of the World