First the derivations:
Equation 18:
The connection between total time derivative $\frac{d}{dt}$ and partial time derivative $\frac{\partial}{\partial t}$ of a scalar function is, using the chain rule,
$$ \frac{d}{dt} = \frac{\partial}{\partial t} + \frac{\partial x}{\partial t} \frac{\partial}{\partial x} + \frac{\partial y}{\partial t} \frac{\partial}{\partial y} + \frac{\partial z}{\partial t} \frac{\partial}{\partial z} $$
Notice that this is equal to
$$ \frac{d}{dt} = \frac{\partial}{\partial t} + \mathbf{v} \cdot \nabla $$
Hence $q \frac{d \phi}{dt} = q \frac{\partial \phi}{\partial t} + q \mathbf{v} \cdot \nabla \phi$, where the last term is the one appearing in Eq. 16, but with a minus sign.
From this we get
$$ \frac{d}{dt} \left( \frac{1}{2} m v^2 \right) = -q \mathbf{v} \cdot \nabla \phi - q \mathbf{v} \cdot \frac{\partial \mathbf{A}}{\partial t}$$
$$ \frac{d}{dt} \left( \frac{1}{2} m v^2 \right) = q \frac{\partial \phi}{\partial t} - q \frac{d \phi}{dt} - q \mathbf{v} \cdot \frac{\partial \mathbf{A}}{\partial t} $$
$$ \frac{d}{dt} \left( \frac{1}{2} m v^2 \right) + q \frac{d \phi}{dt} = q \frac{\partial \phi}{\partial t} - q \mathbf{v} \cdot \frac{\partial \mathbf{A}}{\partial t}$$
$$ \frac{d}{dt} \left(\frac{1}{2} m v^2 + q \phi \right) = q \left(\frac{\partial \phi}{\partial t} - \mathbf{v} \cdot \frac{\partial \mathbf{A}}{\partial t} \right)$$
which is what I think Eq. 18 is supposed to be. Here it is also perfectly clear that if $\phi, \mathbf{A}$ are not explicitly time dependent, then their partial time derivatives are 0, and the whole RHS is zero.
Equation 19:
Here we need the total time derivative of a vector function $\mathbf{A}$. By looking at the components of the vector, we should realize that
$$\frac{d \mathbf{A}}{dt} = \frac{\partial \mathbf{A}}{\partial t} + \left( \mathbf{v} \cdot \nabla \right) \mathbf{A} $$
with the interpretation that e.g.
$$ \frac{d A_x}{dt} = \frac{\partial A_x}{\partial t} + \frac{\partial x}{\partial t} \frac{\partial A_x}{\partial x} + \frac{\partial y}{\partial t} \frac{\partial A_x}{\partial y} + \frac{\partial z}{\partial t} \frac{\partial A_x}{\partial z} $$
We have a useful vector calculus identity link:
$$ \mathbf{v} \times (\nabla \times \mathbf{A}) = \nabla_\mathbf{A} (\mathbf{v} \cdot \mathbf{A}) - (\mathbf{v} \cdot \nabla) \mathbf{A} $$
where the notation $\nabla_\mathbf{A}$ means only the variation due to $\mathbf{A}$ is taken into account in that term.
Starting from Newton's second law, $\frac{d}{dt}(m \mathbf{v} ) = q(\mathbf{E} + \mathbf{v} \times \mathbf{B})$ and inserting the potentials, we get
$$ \frac{d}{dt}(m \mathbf{v} ) = q \left(- \nabla \phi - \frac{\partial \mathbf{A}}{\partial t} + \mathbf{v} \times (\nabla \times \mathbf{A}) \right) $$
Using the identity gives
$$ \frac{d}{dt}(m \mathbf{v} ) = q \left(- \nabla \phi - \frac{\partial \mathbf{A}}{\partial t} + \nabla_\mathbf{A} (\mathbf{v} \cdot \mathbf{A}) - (\mathbf{v} \cdot \nabla) \mathbf{A} \right) $$
$$ \frac{d}{dt}(m \mathbf{v} ) + q \left( \frac{\partial \mathbf{A}}{\partial t} + (\mathbf{v} \cdot \nabla) \mathbf{A} \right) = q \left(- \nabla \phi + \nabla_\mathbf{A} (\mathbf{v} \cdot \mathbf{A}) \right) $$
On the LHS we identify the total derivative of the vector potential, so finally
$$ \frac{d}{dt} \left( m \mathbf{v} + q \mathbf{A} \right) = - q \nabla \left(\phi - \mathbf{v} \cdot \mathbf{A} \right) $$
where the gradient on the RHS is understood in accordance to the clarification about the identity we used.
Now the meaning:
The question of the difference between $p=mv$ and $p=mv + qA$ has been asked before on this site, e.g. here. In short, "momentum" is a type of quantity that is often of great importance in physics, just as "energy" is, and there exist several different momenta, just as there exist different important forms of energy.
In the classical theories of Lagrangian and Hamiltonian mechanics, the fundamental variables are so-called generalized coordinates $q_i$ and their time derivatives $\dot{q}_i$. They can be, and are often, real-space coordinates, but they can also be angles or other ways of describing the configurations of a system. For each generalized coordinate, there exists a corresponding quantity called the conjugate momentum, which is given by
$$p_i = \frac{\partial L}{\partial \dot{q}_i} $$
where $L$ is the Lagrangian, usually the kinetic minus the potential energy. In the case of a charged particle in an EM field, the conjugate momentum to position turns out to be exactly $mv + qA$.
EDIT TO ADDRESS COMMENTS:
a) Eq. 18 does not tell us that $\mathbf{A}$ is momentum per unit charge, Eq. 19 does. Eq. 18 is a conservation law; it tells you that when your charged particle is moving around, the conserved quantity is not just the kinetic energy, but $1/2 m v^2 + q \phi$. Hence we can think of $q \phi$ as potential energy $E_p$, and then
$$ \frac{E_p}{q} = \phi $$
is interpreted as potential energy per charge. Likewise for Eq. 19, it establishes a mathematical form similar to Newton's second law when written on the form
$$ \frac{d p}{dt} = - \nabla U $$
Comparing Eq. 19 to this form, we see that the "total momentum" is now $m \mathbf{v} + q \mathbf{A}$, so kinetic momentum $p_k$ + electromagnetic momentum $p_{EM}$. Hence
$$ \frac{p_{EM}}{q} = \frac{q \mathbf{A}}{q} = \mathbf{A} $$
is electromagnetic momentum per unit charge.
b) I'll explain the chain rule for scalar functions, called fields. Remember that fields $\phi$ depend on position in space, and possibly explicitly on time too:
$$ \phi = \phi(t, x(t), y(t), z(t)) $$
The reason that I'm letting the spatial coordinates depend on $t$, is that the particle we are describing the motion of, will of course move around, and the only value of $\phi$ relevant to the particle is the field value at the particle's position.
So how can the field at the position of the particle change with time? Of course it can change explicitly with time in a given fixed point in space, so that is what $\frac{\partial}{\partial t}$ gives you. But if the particle moves a small amount in the $x$ direction over a small time, then the change is
$$ \frac{\partial x}{\partial t} \frac{\partial \phi}{\partial x} $$
by the ordinary chain rule. And since the three spatial directions are not explicitly dependent on each other (instead they all depend on time $t$), we get contributions like this for all three directions.
c) The general rule for the gradient of the dot product of two vector fields is
$$ \nabla(\mathbf{v} \cdot \mathbf{A}) = \mathbf{v} \times (\nabla \times \mathbf{A}) + \mathbf{A} \times (\nabla \times \mathbf{v}) + (\mathbf{v} \cdot \nabla) \mathbf{A} + (\mathbf{A} \cdot \nabla) \mathbf{v} $$
Of course it is symmetric in the two vector fields. The point here is that in our physical situation, it is only $\mathbf{A}$ that is a proper vector field (one vector value for each point in space). The quantity $\mathbf{v}$ is not a field. There is only one value of $\mathbf{v}$ for each point in time, and it is not associated to anything in space, apart from maybe the position of the particle whose motion we are describing. In that sense, we don't consider the "spatial variation" in $\mathbf{v}$, so we drop the two terms where $\nabla$ is acting on $\mathbf{v}$, and keep the two terms where $\nabla$ acts on $\mathbf{A}$. The notation $\nabla_\mathbf{A}$ signifies just that.