I am on my way to general relativity, but I am struggling with the covariant derivative.
At this point I am trying to ignore the spacetime character of the world i.e. I am trying to understand what a covariant derivative means in an intrinsically curved space, without taking into account, that time is also effected by the curvature. I hope that it is possible to understand things in this simplified way, so that in the next step I can deal with time. If that isn't possible, so be it, but if you people see a way to explain things without taking the curvature of time into account, it would mean a lot to me, because it seems just less complicated this way.
My main problem with the covariant derivative arrises, when derivatives of basis vectors appear. Let's take the covariant derivative of a vector field $\vec{v}$ in direction of coordinate $x^i$:
$$\nabla_{\vec{e}_i}\vec{v}~=~\frac{\partial}{\partial x^i}\vec{v}~=~\frac{\partial}{\partial x^i}\left(v^j\vec{e}_j\right)~=~\frac{\partial v^j}{\partial x^i}\vec{e}_j+v^j\frac{\partial \vec{e}_j}{\partial x^i}$$
Now, what is to be understood by $\frac{\partial \vec{e}_j}{\partial x^i}$? I cannot really imagine, what a change of direction in curved space looks like, because I think it is necessary to have a straight line in order to define a change in direction as differing from that straight line...
elaboration:
Maybe I have to elaborate a little, to make clear what my problem is. In order to so, I have to give some credit to eigenchris from youtube, whos video series on tensor calculus I watched on my mission to understand the covariant derivative and whose sketches I use to formulate my question.
I already struggled with this question when thinking about it in flat space or on a curved 2-dimensional surface in flat 3-dimensional space. In those two cases I was able to understand, what it means, but unfortunately my reasoning does not work anymore, when space itself is curved and there is no higher dimensional flat space to help me. To make clear, what my problem is, I think it is necessary to go through my reasoning of those two cases which I think I understood:
two dimensional flat space:
In this video the covariant derivative in flat space was explained as just taking the ordinary derivative, but doing it properly (i.e. taking into account, that the derivatives of the basis vectors are not necessarily zero). For example in Cartesian and polar coordinates:
$$\frac{\partial \vec{e}_x}{\partial x}~=~\frac{\partial \vec{e}_x}{\partial y}~=~0~~~~~\text{but}~~~~~\frac{\partial \vec{e}_\theta}{\partial \theta}~=~-r\vec{e}_r~,~\frac{\partial \vec{e}_\theta}{\partial r}~=~\frac{1}{r}\vec{e}_\theta$$
Here the basis vectors are not normalized, so $\vec{e}_\theta = \partial \vec{R}/\partial \theta$, etc. This derivative can then be computed, by expanding $\vec{e}_\theta$ in Cartesian coordinates and using, that the Cartesian basis vectors are constant, which leads to the results on the right. So to show, that $\vec{e}_\theta$ is not constant, it was necessary to know, that $\vec{e}_x$ and $\vec{e}_y$ are constant.
At first this did seem weird to me. Why can I objectively say, that $\vec{e}_x$ is constant, but $\vec{e}_\theta$ is not? I can expand $\vec{e}_x$ in polar coordinates and suddenly it does not look constant at all. Now the solution of this is propably obvious: As soon, as I don't think about the vectors in purely abstract terms, it is clear, that $\vec{e}_\theta$ does physically change its direction, while $\vec{e}_x$ does not.
I could print out a large version of the coordinate systems in the picture above and put it on the floor of my room, with the origin being in the center. Now when I start walking in $\vec{e}_x$-direction and keep walking in a straight line, it does not really matter, from what point in my room I start walking. I can start from some point A and after some time I will arrive at say the football stadium. The next day I can start from a different point right next to A and I will still arrive at the football stadium. The two straight lines, that mark my ways on the two days are parallels. The distance between them does not change, so in the end I will arrive at points that are still right next to each other. That is not the case, if I follow the directions of $\vec{e}_\theta$ at two different points close to each other. In this case, starting from point A and going straight into the direction at which $\vec{e}_\theta$ points, might still take me to the football stadium, but starting from a point right next to A and following the dirction of $\vec{e}_\theta$ from there, might bring me to the cathedral. Basically I am saying: I can see the real difference in the change of $\vec{e}_\theta$, beacause I can attach a straight line, and see where it leads me.
The only problem is: How do I know, if I am following a straight line, while walking away from my room? In flat space and with Newtonian physics this is easy and there are many ways:
1) I can just trust my eyes: I keep the stadium in the center of my field of view. Because I know, that the light coming from the stadium moves on a straight line, I know, that I myself am moving on a straight line, when I always see the stadium right in front of me.
2) I could use Newton’s first law of motion: If I just accelerate one time at the beginning and there are no forces acting on me (neglecting friction, wind and so on) I can be sure, that I will not change direction and therefore move on a straight line.
3) I could take a string and attach one end to my room and the other one to the stadium. When the string ist streched, I know that the line is straigth, because a straight line is the shortest path between two points.
To sum up: When I want to know if a vector field is constant or not and I have been given the vector field in non-Cartesian coordinates, I have to take into account, that my basis vectors may be changing their direction, depending on their position in space. I can understand this, because I can grasp what changing direction means. And I can understand what changing direction means, because I can define straight lines.
A two dimensional curved surface in three dimensional flat space
The next step is to formulate derivative for people living on a curved surface, e.g. the earth. How would a constant vector field for somone living on the surface look like?
Looking from space, we see, that the two vectors on the left hand side in the picture above point in the same direction (e.g. some fixed star). But for someone on the surface those two vectors are very different because the one at the north pole points forward along the surface, but the one at the equator just points out of the surface. Walking down from the north pole to the equator, the vector field on the right hand side does look way more constant, than the one on the left. If the vector field is a kind of force, say the wind, it would have the same effect on the person every step of the way, namely providing some momentum by tailwind.
The covariant derivative takes this into account by subtracting the component normal to the surface from the rate of change of the vector field:
$$\nabla_{\vec{e}_i}\vec{v}=\frac{\partial \vec{v}}{\partial x^i}-\vec{n}~=~\left[\frac{\partial v^k}{\partial x^i}+v^j\Gamma^k_{ij}\right]\vec{e}_k$$
Where the $\Gamma^k_{ij}$ are the Christoffel-symbols which give the rate of Change of basis-vectors tangent to the surface:
$$\frac{\partial \vec{e}_j}{\partial u^i}~=~\Gamma^k_{ij}\vec{e}_k+L_{ij}\hat{n}$$
This does make sense to me. I can understand the rate of change of the basis vectors $\frac{\partial \vec{e}_j}{\partial u^i}$, because it happens in three dimensional falt space and I can use all the reasoning from flat space.
Intrinsically curved space
Now if I don't have any extrinsic dimension from which I can look at the curved surface, my reasoning does not work anymore. I cannot understand, what $\frac{\partial \vec{e}_j}{\partial x^i}$ would mean in curved space.
How would I know in curced space whether I am approaching my target on a straight line (without any change of direction on the way)? I cannot trust my eyes, because light itself travels on curved lines. I can't use Newtons laws, because in general relativity there is no force acting on the moon, but it still goes around the earth rather than travelling on a straight line away from it. I could find the shortest path, I think, but the length of a path depends on the speed with which one travels and even if there is one invariant shortest path, why would it make sense to call this one straight and define a change of direction as not following that path?
I would not know what it means, just to keep on walking in one direction in curved space. But if I cannot say what it means, not to change direction, than I can't understand what it means when the basis vectors do change direction.
Any help?
EDIT:
I have learned, that $\frac{\partial\vec{e}_j}{\partial x^i}$ is the rate of change of a basis vector, where a basis vector is defined to be constant, if it keeps being tangent to the same geodesic.
My problem now is, that I don't understand, where that definition comes into play. I think this must happen at some point, while finding the Christoffel symbols. It is:
$$\frac{\partial\vec{e}_j}{\partial x^i} \equiv \Gamma^k_{ij}\vec{e}_k$$
I am familiar with the following derivation of the Christoffel symbols $\Gamma^k_{ij}$:
$$\frac{\partial g_{ij}}{\partial u^k}~=~\frac{\partial}{\partial u^k}\left(\vec{e}_i\cdot\vec{e}_j\right)$$
$$~~~~~~~~~~~~~~~~~~~~~~=~\frac{\partial\vec{e}_i}{\partial u^k}\cdot \vec{e}_j+\vec{e}_i\cdot\frac{\vec{e}_j}{\partial u ^k}$$
$$~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~=~\Gamma^l_{ik}\left(\vec{e}_l\cdot\vec{e}_j\right)+\Gamma^l_{jk}\left(\vec{e}_i\cdot\vec{e}_l\right)$$
$$~~~~~~~~~~~~~=~\Gamma^l_{ik}g_{lj}+\Gamma^l_{jk}g_{il}$$
Now using the symmetry of the metric tensor and the Christoffel symbols in the lower indices one can show:
$$\Gamma^k_{ij}~=~\frac{1}{2}g^{kl}\left(\frac{\partial g_{li}}{\partial u^j }+\frac{\partial g_{jl}}{\partial u^i}-\frac{\partial g_{ij}}{\partial u^l }\right)$$
But I don't think that any physically relevant stuff is happening there. I rather feel like the choice, that $\frac{\partial\vec{e}_j}{\partial x^i}$ is the rate of change in contrast to a geodesic has to implemented in one of the first two steps of calculating the derivative of the metric tensor. But I don't see how.
The transition from flat to curved space is precisely the point at which I struggle.
– Benito McLanbeck May 25 '20 at 11:40