As Matt says, the improved Kato inequality I know about involves the trace of the Hessian squared and not the determinant. The oldest proof I'm aware of for this inequality is in Stein's book, "Singular Integral Operators and Differentiability Properties of Functions", where he uses it to get improved estimates for functions in a Hardy space. But I believe that he does not call it a "Kato inequality".
This inequality was later put to extremely good use by Uhlenbeck, Taubes, Yau, and others and plays a critical role in many major theorems on minimal hypersurfaces, self-dual Yang-Mils fields, and special Riemannian metrics, especially where the PDE involved is "critical" or "scale-invariant". You can, for example, find a rather messy proof of the inequality (Stein's is cleaner) in the paper of Schoen-Simon-Yau, "Curvature estimates for minimal hypersurfaces".
Anyway, here's my take on it. The improved Kato inequality is a pointwise tensor space inequality that holds in many different situations. The desired inequality is, given a section $T$ of a tensor bundle
$$
|\nabla |\nabla T|| \le c |\nabla^2T|.
$$
With $c = 1$, this follows easily from the identity
$$
|\nabla |\nabla T|| = \frac{|\nabla T \cdot \nabla^2T|}{|\nabla T|}
$$
and the Cauchy-Schwarz inequality
$$
|\langle v, t\rangle| \le c|v||t|
$$
with $c = 1$, where $t \in V\otimes W$, $v \in V$, and $V$ and $W$ are vector spaces with inner products, and $\langle\cdot,\cdot\rangle$ is the inner product in $V$.
So here's the key point: If you happen to know that $t$ always lies in a known subspace of $V\otimes W$, then it is sometimes possible that the last inequality holds with a constant less than 1. To identify the optimal constant, it is easiest to differentiate
$$
\frac{|\langle v, t\rangle|^2}{|v|^2|t|^2}
$$
with respect to $t$ and solve for the critical points of $t$ restricted to the subspace.
In the specific example cited in the question, $W = V$, so $V\otimes W$ is the space of all matrices but $t$ is restricted to the space of traceless matrices. The same argument gives for example a more general improved Kato inequality for divergence free tensor fields. This was used, for example, in the work of Taubes and Uhlenbeck on self-dual Yang-Mills.
Any ideas?
– Miranda Oct 20 '12 at 22:50