So I am studying classical mechanics through the MIT 8.223 notes, and encountered the derivation of the Euler Lagrange equation. There is a part I don't quite understand, which resides in the actual meaning of the $\delta$ symbol here. We define the action $S[q(t)]$ as the integral from $t_1$ to $t_2$ of $L(q,\dot q,t)$:
$$S[q(t)] = \int_{t_1}^{t_2}L(q,\dot q,t) dt.$$ We also define a new slightly perturbed function $q(t) + \delta q(t)$ and the variation of the action $\delta S$ as the difference between the action evaluated at the perturbed and initial functions, respectively (the lagrangian function is the same for both)
$$\delta S = S[q+\delta q]-S[q] = \int_{t_1}^{t_2}L(q + \delta q,\dot q + \delta \dot q,t) dt - \int_{t_1}^{t_2}L(q,\dot q,t) dt. $$ It is then said that:
$$ \delta S = \delta \int_{t_1}^{t_2}L(q,\dot q,t) dt = \int_{t_1}^{t_2} \delta L(q,\dot q,t) dt. $$
Then, by using the chain rule: $$\int_{t_1}^{t_2} \delta L(q,\dot q,t) dt = \int_{t_1}^{t_2} \frac{\partial L}{\partial q} \delta q + \frac{\partial L}{\partial \dot q} \delta \dot q dt.$$ The derivation goes on, but this is enough to answer my question. I get everything until the definition of $ \delta S$, here $\delta$ just acts on two places, to define $\delta q$ , which is a slight perturbation to the original function (but still a function of $t$, we can even take derivatives of it) and to define $\delta S$, which has a straightforward definition given above, it is just the difference of the functional at the perturbed and original functions.
The thing I don't get is the use of $\delta$ afterwards, it is brought into the integral as if it were a new kind of derivative and it even acts on $L$. However, this use of $\delta$ hasn't been defined. So what is this "operator" exactly and why can it act both to define the perturbations on the action and the generalized coordinate and to operate on functions?
Another shorter question: why is $\delta S = 0$? I know it might seem weird, but to me it seems like it should be greater than zero, if we were looking for a minimum, since we said that it is the difference between the action evaluated at the different perturbed and non perturbed functions, and the action at the original function is a minimum, thus the action at any other function is greater than that value. Shouldn't that make it greater than zero?