So there are a bunch of "pictures" (that's the technical term!) of quantum mechanics, agreeing in the broad perspective that:
- There is some vector space $\{|\phi\rangle\}$ over the field $\mathbb C$ and its canonical dual space $\{\langle\phi|\},$ such that the dual operation $\mathcal D$ maps $$\mathcal D\Big(a |\alpha\rangle + b |\beta\rangle\Big) = \langle \alpha|a^* + \langle\beta|b^*$$ and there is an inverse mapping the other way and so on; we usually write this dualizing operation with a superscript $\dagger$ so that $\big(c~|\alpha\rangle\langle\beta|\big)^\dagger = c^* |\beta\rangle\langle\alpha|.$
- Observable quantities are represented by Hermitian operators $\hat O^\dagger = \hat O,$ or in other words you have expressions like $$\mathcal D\Big(\hat O |\phi\rangle\Big)~|\psi\rangle = \langle\phi| \hat O|\psi\rangle,$$or what mathematicians will sometimes write $\langle \hat O\phi,~\psi\rangle = \langle \phi,~\hat O\psi\rangle.$ The point is that they are their own conjugate transpose, in the sense that they play nice with this dualizing operation.
- The central prediction of QM is: "you observe the eigenvalues of the Hermitian operators, but we only predict the averages of these eigenvalues over many measurements. The average always takes the form $\langle O \rangle = \langle \psi|\hat O|\psi\rangle,$ where $|\psi\rangle$ is a vector we regard as the state of the system."
In one of these pictures in particular, the Schrödinger picture, all of the operators $\hat p$ and $\hat x$ and so on are generally formally independent of time, and the state $|\psi\rangle$ changes explicitly with time according to the Schrödinger equation, $$i \hbar |\partial_t \Psi(t)\rangle = \hat H |\Psi(t)\rangle,$$ where $\hat H$ is an observable for the total energy in the system. Of course we could still define time-dependent observables like $\hat O = \hat x ~ \cos(\omega t) + \hat p/\hbar ~ \sin(\omega t)$ if we wanted, and then we would have something that we'd call maybe $d \hat O\over d t,$ but the basic point is that the theory is made out of basic things which are not fundamentally time-dependent, and you can do the time dependence if you want to. So $\hat p = -i\hbar \partial_x$ as an operator, it does not change over time.
One nevertheless gets that the actual observable change in the average value is given by the formula you gave, which includes the possibility of explicit time dependence. Explicit time dependence is unusual in the Schrödinger picture, but we can handle it by saying $$\frac{d\langle A\rangle}{dt} = \frac{i}{\hbar} \langle [H, A]\rangle + \langle \frac{dA}{dt} \rangle,$$ where all of the stuff inside brackets is fundamentally some operator expression first and foremost, so $\langle dA/dt\rangle$ means, "first figure out what operator $d\hat A/dt$ is, then its average appears above."
And then, you have all of the other pictures. It turns out that we can think about solving the equation $i\hbar |\partial_t \psi(t)\rangle = \hat h |\psi(t)\rangle$ for an arbitrary Hermitian $\hat h$, and we get that $\psi(t) = \hat u(t) |\psi(0)\rangle$ for some "unitary operator" $\hat u(t)$, meaning that $\hat u \hat u^\dagger = \hat u^\dagger \hat u = 1.$ One particular one of these, $\hat U(t)$, corresponds to the case where $\hat h = \hat H.$
We can insert these into the expectation value given by the Schrödinger picture to do a sort of quantum coordinate transform,$$\langle A \rangle = \langle \psi_0|\hat U^\dagger \hat u ~ \hat u^\dagger \hat A \hat u ~ \hat u^\dagger \hat U |\psi_0\rangle.$$
The point is that now instead of $|\psi\rangle =\hat U |\psi_0\rangle$ we think about $|\psi'\rangle = \hat u^\dagger \hat U |\psi_0\rangle$ which we can derive evolves according to $$i\hbar |\partial_t \psi'\rangle = (\hat H' - \hat h') |\psi'\rangle.$$ Above you can also see primes on the operators; see now it is also more typical for operators to have explicit time dependence, since we are also replacing $\hat A$ with $\hat A' = \hat u^\dagger \hat A \hat u$ and finding $$i \hbar \frac{d\hat A'}{dt} = -\hat u^\dagger \hat h \hat A \hat u + \hat u^\dagger \hat A \hat h \hat u = [\hat A',~\hat h'].$$
In the most extreme form of this, the Heisenberg picture, we choose $\hat h = \hat H$ so that the state does not evolve at all and remains at $|\psi_0\rangle$ in perpetuity. Instead all of the operators evolve in time. This was the basis for the original "matrix mechanics" form of quantum mechanics before Schrödinger discovered his wave equation.
It is also very common to have "interaction pictures" where we divide $\hat H$ into a nice easy noninteracting part $\hat H_0$ plus whatever complications exist in the interactions $\hat H_I.$ Then we choose $\hat h = \hat H_0$ which usually just throws some $e^{i \omega t}$ terms on all of the operators we're analyzing, and then we can make various approximations for the remaining dynamics now that the easy part is "out of the way."