Most of the question has already been answered in the comments. I'll summarise and expand on some of the points.
The OP asks why we have a requirement that the 2-norm of vectors be preserved under time evolution. The answer, already given in the comments, is as follows. Let $U(t)$ be the time evolution operator, and $\lvert \psi(t) \rangle$ the state of the system at time $t$. Then $\lvert \psi(t) \rangle = U(t)\lvert \psi(0) \rangle$ where $\lvert \psi(0) \rangle$ is the initial state. We insist that the wavefunction is normalised, which in the usual interpretation means that probabilities "add up to 1". So if $\langle \psi(0)\lvert \psi(0) \rangle=1$. We therefore require that $U(t)$ is unitary to ensure that the wavefunction is then normalised at all times.
If time evolution wasn't unitary, then you may have $\langle \psi(t)\lvert \psi(t) \rangle \neq 1$. If it's greater than one, then it's nonsense (as long as you believe that the wavefunction can be interpreted as a probability amplitude). If it's less than one, then your system is "leaking" information. This is also nonsense in the usual interpretation. Some people use this to model dissipative effects by using an imaginary potential, which translates to non hermitian hamiltonian and therefore (if you believe Schrodinger) non unitary evolution.
As the question you linked points out correctly, we need operators corresponding to observables to be normal. This is because by the spectral theorem, this means that they are diagonalisable by an orthogonal set, which is something we need if measurements are to make sense. Notice however that we can build an hermitian Hamiltonian from non-normal operators. The typical example is the harmonic oscillator, for which $H=a^\dagger a$, but $a$ is not normal. If we require the eigenvalues of a normal operator to be real, then the operator is hermitian. Why do we do this? It's what we are used to. Position, momentum and energy eigenvalues have a simple interpretation if they are real. What if they were complex? This happens when you do scattering, for which the wavefunction is not normalisable, and you may get complex momentum and energies, which in that context may be interpreted as decay times (see these notes, page 312 onwards). However it's not clear how to interpret them in general.
We can summarise the logical chain:
$$\textrm{wavefunctions are probabilty amplitudes} \implies \textrm{time evolution should be unitary}$$
$$\textrm{observables give real measurements} \implies \textrm{corresponding operators should be hermitian}$$
$$\textrm{Schrodinger equation holds} \Leftrightarrow \textrm{time evolution is given by } e^{-iHt}$$
So to answer your question in the comments, if you assume 1) but not 2) you still get unitary time evolution, however you do not necessarily know its mathematical expression.
Notice however that there are cases in QM when we want to interpret things differently. The most common example is scattering states, whose wavefunctions are not interpreted as probability amplitudes.