Why geodesics should be … geodesics?

Seriously, why can I come up with a silly question like this? It is probably your first thought when seeing this weird title. But be patient, it is not a joke and I am really serious because I want to convince you that geodesics are actually what they should be.

As we may have heard about geodesics, they are locally minimizing curves on a curved space, or more precisely a manifold. For example, the straight lines are geodesics on flat Euclidean spaces, or the arcs of equators are geodesics on the sphere, etc. However, a mechanical point of view is that geodesics in nature are trajectories of a particle traveling in a free space, that is the potential is zero. In General Relativity, we have the important equivalence principle stating that a motion is independent of coordinates of the spacetime. What it means? It means that every motion can be regarded as the motion in “free space” by the change of the coordinate systems. A further consequence is that trajectories of motions in spacetime should be geodesics. Now I am going to convince you of the second point of geodesics above with the following arguments. The next part would be full of mathematics. Consider skipping if you want.

I will start by recalling the definition of geodesics in Riemannian geometry context. Let (M,g) be a Riemannian manifold and \nabla be the Levi-Civita connection. The connection plays a similar role as the directional derivatives on the curved spaces. Once we have a connection, we can define the notion called parallel transport on a curve. Given a curve \gamma(t) on the manifold M, a vector field V(t) along this curve is parallel transported if it satisfies

\nabla_{\dot{\gamma}(t)} V(t) =0.

What does this equation say? It says that V remains “unchanged” following the tangent directions of \gamma.

A curve \gamma on M is said to be a geodesic if the tangent vector field to \gamma is parallel transported along itself, i.e.

\nabla_{\dot{\gamma}(t)} \dot{\gamma}(t) =0.

The LHS is analogous to the “second derivative” of \gamma and hence this equation somehow describes the free motion. In particular, when M= \mathbb R^n, this equation becomes

\ddot{\gamma}(t) =0.

It is indeed the equation describing the free motion in classical mechanics.

Still not convincing? Don’t worry, I think the next one would be more convincing.

First of all, we need to answer the following questions. Which elements prescribe a motion? When talking about the motion of a particle, what we care about is the position and the momentum of that particle at a certain time. Therefore, we need at least two kinds of coordinates in order to describe a motion, the first one is the position coordinates, and the second one is momentum coordinates. We want to make sense mathematically what it means. That is where the symplectic geometry comes in.

In classical mechanics, the position of a mechanical system can be specified by a point of a smooth manifold M which is called the configuration space. the momentum at position q \in M is specified by a vector p in the cotangent space T_q^* M. Then the cotangent bundle T^* M which is called the phase space encodes the data of position-momentum of systems. In other words, the trajectory of a motion is in nature the projection of some curve in the phase space onto the configuration space. Another important element that decides a mechanical system is the Hamiltonian. Hamilton or in other words the total energy of the system is a smooth function on cotangent bundle T^* M. The Hamiltonian can always be written as

Hamiltonian = Kinetic energy + Potential energy.

Let’s assume that M can be equipped with a Riemannian metric g so that it becomes a Riemannian manifold. Define the following Hamiltonian H: T^*M \rightarrow \mathbb R by H(q,p) := \frac{1}{2} g_q(p,p). When g is the Euclidean metric, this Hamiltonian is precisely the Hamiltonian for free motion.

The cotangent bundle can be equipped by the canonical symplectic form \omega as follows. Firstly, we define a 1-form on T^*M that is called the tautological form \alpha by

\alpha_{(q,p)} (v) :=p( T\pi_{(q,p)}(v)), for (q,p) \in T^*M, v \in T(T^*M),

where \pi: T^*M \rightarrow M is the natural projection. Then we define \omega: = d \alpha. It can be readily verified that \omega is a symplectic form. Furthermore, in a local coordinate (x, \xi) of the cotangent bundle T^*M, \omega can be expressed as

\omega = \sum_{i} d\xi_i \wedge dx_i.

Now we define the following vector field X on T^*M by

\omega(X, \cdot) = dH(\cdot).

One thing that might surprise you is that geodesics on configuration space M are actually the projection of integral curves of X onto M. However, it still remains to show that these two definitions of geodesics are consistent. I will come back to do that in subsequent posts.

Leave a comment