What I think is tripping you up here is the use partial derivative notation in calculus of variations. It's generally a lot easier, particularly when doing calculations in GR, to use $\delta$-operator notation instead. (The $\delta$-operator, by definition, does obey the product rule: $\delta(fg) = f \delta g + g \delta f$.) Nonetheless, I've written up the basics of what's going on here in your notation; I do have to make a bit of a fudge at the end (see if you can spot it!), but rest assured that writing everything out using $\delta$-operators makes everything a bit more rigorous.
So let's take the functional derivative of the product $F (g, \partial g) G(g, \partial g)$:
$$
\delta \left[ F (g, \partial g) G(g, \partial g) \right] = \left( \frac{\partial F}{\partial g^{ij}} \delta g^{ij} + \frac{\partial F}{\partial (\partial_k g^{ij})} \delta( \partial_k g^{ij}) \right) G( g^{ij}, \partial_k g^{ij}) + \left( \frac{\partial G}{\partial g^{ij}} \delta g^{ij} + \frac{\partial G}{\partial (\partial_k g^{ij})} \delta( \partial_k g^{ij}) \right) F( g^{ij}, \partial_k g^{ij})
$$
The first terms in each set of brackets (proportional to $\partial F/\partial g^{ij}$ & $\partial G/\partial g^{ij}$) obviously obey the product rule when taken together, so let's focus on the others:
$$
\left(\frac{\partial F}{\partial (\partial_k g^{ij})} \delta( \partial_k g^{ij}) \right) G + \left( \frac{\partial G}{\partial (\partial_k g^{ij})} \delta( \partial_k g^{ij}) \right) F
\\ = \partial_k \left[ \left( \frac{\partial F}{\partial (\partial_k g^{ij})} G + \frac{\partial G}{\partial (\partial_k g^{ij})} F \right) \delta g^{ij} \right] - \partial_k \left[ \frac{\partial F}{\partial (\partial_k g^{ij})} G + \frac{\partial G}{\partial (\partial_k g^{ij})} F \right] \delta g^{ij}
$$
and so (discarding the total derivative) we have
$$
\frac{\delta \left[ F (g, \partial g) G(g, \partial g) \right]}{\delta g^{ij}} = G \frac{\partial F}{\partial g^{ij}} + F\frac{\partial G}{\partial g^{ij}} - \partial_k \left[ \frac{\partial F}{\partial (\partial_k g^{ij})} G + \frac{\partial G}{\partial (\partial_k g^{ij})} F \right]. \qquad {(1)}
$$
This last term will not, as you note in general be equal to
$$
- G \partial_k \left[ \frac{\partial F}{\partial (\partial_k g^{ij})} \right] - F \partial_k \left[\frac{\partial G}{\partial (\partial_k g^{ij})} \right]
$$
as one would expect from the product rule.
However, in the case of the Einstein-Hilbert action, we have $F = \sqrt{-g}$ and $G = R$. Since $\nabla_k F = 0$ (remember, we should really be using covariant derivatives above) and since $\partial F/\partial (\partial_k g^{ij}) = 0$, the second term in (1) becomes
$$
- \partial_k \left[ \frac{\partial F}{\partial (\partial_k g^{ij})} G + \frac{\partial G}{\partial (\partial_k g^{ij})} F \right] = - F \partial_k \left[ \frac{\partial G}{\partial (\partial_k g^{ij})} \right] = - G \partial_k [0] - F \partial_k \left[ \frac{\partial G}{\partial (\partial_k g^{ij})} \right],
$$
which is what you would expect from the product rule. Similar logic extends to the case where $G$ (but not $F$) depends on higher derivatives of the fields.