Imagine a generic pure state$^\dagger$, which is a superposition of some basis states
\begin{equation}
|\Psi\rangle = \sum_{k=1}^N c_k | k \rangle
\end{equation}
To be concrete, we can take the basis states $|k\rangle$ to be the computational basis with $N$ qubits. For example, for $N=2$, $|k\rangle$ would be a member of the set $\{|00\rangle, |01\rangle, |10\rangle, |11\rangle\}$.
Now, let's write the complex coefficients as a real amplitude and phase: $c_k = a_k e^{i \varphi_k}$. So
\begin{equation}
|\Psi\rangle = \sum_{k=1}^N a_k e^{i \varphi_k} |k\rangle
\end{equation}
Now consider the probability for the state to be $|m\rangle$ after collapse (here $|m\rangle$ is an eigenstate of some observable which isn't diagonal in the computational basis)
\begin{eqnarray}
P_m = |\langle m | \Psi \rangle|^2 &=& \sum_{k=1}^N \sum_{k'=1}^N \langle m | k \rangle \langle k' | m \rangle a_k a_{k'} e^{i(\varphi_k - \varphi_{k'})} \\
&=& \sum_{k=1}^N a_k^2 \langle m | k\rangle + \sum_{k=1}^N \sum_{k'=1, k'\neq k}^N a_k a_{k'} e^{i (\varphi_k - \varphi_{k'})} \langle m | k' \rangle \langle k | m\rangle
\end{eqnarray}
If the phases $\varphi_k$ are uncorrelated, then you would expect the second term above to tend to cancel, particularly if $N$ is large. This is essentially because you are adding complex numbers with roughly the same amplitude but with random phases. In the limit where the cancellation is perfect, then the probability becomes
\begin{equation}
P_m = \sum_{k=1}^N a_k^2 \langle m | k\rangle
\end{equation}
This cancellation is the key idea of decoherence. The probabilities for different basis states simply add, as they would in normal probability theory.
On the other hand, a coherent state is one where the phases are correlated, For example, if $\varphi_k=\varphi_{k'}$ for all $k, k'$, then there will not be any cancellation in the second term at all. The interference term, which only appears in quantum mechanics and not ordinary probability theory, is crucial for correctly calculating the probabilities in this case. The computational advantage of quantum computing comes from arranging these correlations in a smart way.
Therefore I would not say that coherence is not equivalent to the superposition principle. I would say that coherence and decoherence are descriptions of different types of superposition, depending on the phase coherence of the expansion coefficients.
$^\dagger$ Of course you can also formulate this for mixed states in terms of density matrices, but I personally find it useful to understand the pure state case first.