I am new to quantum entanglement and I want to have a simple example that can relate entangled states with probability. Therefore, I try to use the 2 dices system to examine whether am I really understand quantum entanglement. Suppose I have 2 dices, call dice A and dice B. They range from 1 to 6. Classically, they are independent since whatever I roll dice A, the result of dice A would not affect the result on dice B. However, if I impose the condition that the sum of result of dice A and dice B must obey ($i + j = 3$), where $i$ and $j$ are the result of the dice A and B respectively. I can construct an entangled state as follow: \begin{equation} |\Psi \rangle = |i+j = 6\rangle = \sum^{6}_{i} \sum^{6}_{j} C_{ij} |i\rangle_{A} \otimes |j\rangle_{B} ~~,~~ |i \rangle_{A} \in \text{Dice A} , ~~,~~ |j \rangle_{B} \in \text{Dice B} \end{equation} Where the state $| \Psi \rangle = |i+ j =3\rangle$ describes the possible combinations of $|i\rangle \otimes |j\rangle$ such that it satisfies the condition $i+j =3$. In such case, we can easily guess the possible outcomes which satisfies $i+j = 3$ without knowing the form of $C_{ij}$: \begin{equation} |\Psi \rangle \propto |1\rangle_{A} \otimes |2 \rangle_{B} + |2\rangle_{A} \otimes |1 \rangle_{B} \end{equation} Instead of thinking 1 possible outcome where $i + j =3 $, I can also take $i+j =2 $ and $i+j =4$ into account. Therefore, the state that describes all the possible outcomes above will be like this: \begin{equation} \begin{split} | \Psi \rangle = |i+j=m; m=2,3,4 \rangle &\propto \ |1\rangle_{A} \otimes |1 \rangle_{B} \\ &+ |1\rangle_{A} \otimes |2 \rangle_{B} + |2\rangle_{A} \otimes |2 \rangle_{B} \\ &+ |1\rangle_{A} \otimes |3 \rangle_{B} + |2\rangle_{A} \otimes |2 \rangle_{B} + |3\rangle_{A} \otimes |1 \rangle_{B} \end{split} \end{equation} Where the first line on R.H.S corresponds to the $|i+j = 2 \rangle$ , and the second line on R.H.S corresponds to $|i+j = 3 \rangle$, and so forth. Then, I want to ask a question regarding the normalization factor of the above states. Currently I have not fixed the normalisation factor of these states. The reason is that I am not sure which normalisation factor that I should use. For instance, let's us focus the case of $|i+j = 3 \rangle$: \begin{equation} |i+j =3 \rangle = \alpha |1\rangle_{A} \otimes |2 \rangle_{B} + \beta |2\rangle_{A} \otimes |1 \rangle_{B} ~~,~~ |\alpha|^{2} + |\beta|^{2} = 1 \end{equation} There are infinitely many choices of $\alpha$ and $\beta$ to satisfy the normalisation. The most convenient choice is $\alpha = \beta = \frac{1}{\sqrt{2}}$, but I am not sure why this will work if I use this normalisation to calculate the classical probability of such system. Suppose I have the following normalisation for $|i+j =3 \rangle$ and $|i+j=m; m=2,3,4 \rangle $: \begin{align} |i+j =3 \rangle &= \frac{1}{\sqrt{2}} \Big( |1\rangle_{A} \otimes |2 \rangle_{B} + \beta |2\rangle_{A} \otimes |1 \rangle_{B} \Big) \\ |i+j=m; m=2,3,4 \rangle &= \frac{1}{\sqrt{6}} \Big(\ |1\rangle_{A} \otimes |1 \rangle_{B} + |1\rangle_{A} \otimes |2 \rangle_{B} + |2\rangle_{A} \otimes |2 \rangle_{B} + |1\rangle_{A} \otimes |3 \rangle_{B} \\ & \qquad \qquad + |2\rangle_{A} \otimes |2 \rangle_{B} + |3\rangle_{A}\otimes |1 \rangle_{B} \Big) \end{align} Classically, we know that there are total 6 states contribute to $|i+j=m; m=2,3,4 \rangle$. If we want to find the probability of the system sitting in $|i+j=3\rangle$, we can directly get the probability $P(m=3) = \frac{2}{6}$ since there are to micorstates in $|i+j=m; m=2,3,4 \rangle$ contribute to $|i+j=3\rangle$. Besides, we can also use quantum mechanics to solve the problem: \begin{equation} P(m=3) = | \langle i +j =3 | i+j=m, m=2,3,4 \rangle|^{2} = ( \frac{1}{\sqrt{2}} \frac{1}{\sqrt{6}} \times 2 )^{2} = \frac{2}{6} \end{equation}
To summarise above , I want to know whether my thought is correct, whether I can use this two dices system to illustrate quantum entanglement? Secondly, I want to ask why the normalisation of $|i+j =2\rangle$ and $|i+j=m, m=2,3,4 \rangle$ are $\frac{1}{\sqrt{2}}$ and $\frac{1}{\sqrt{6}}$ respectively? My thought of the such normalisation factors comes from the assumption of equally probable of microstates in statistical mechanics. If there is any mistake that I made, could anyone help me to point out the mistake and correct me. Thank you.