So, to understand this, you have to keep the big picture in mind. Remember, what's happening is that a plane wave is coming upon a barrier with a non-pointlike slit in it (so, a slit of a certain length).
For a "pointlike" slit (smaller than the wavelength), this is what happens; the slit acts as a source of a spherical wave:

Note that its the wave phenomena that's the "real thing", the rays are just imaginary lines perpendicular to the wavefronts.
For a wider slit, the behavior is more complicated, but, according to the Huygens–Fresnel principle, the situation can be described by treating every point along the slit as an emitter of such spherical waves ("wavelets"), that then interfere (superpose) with each other. The amplitude of the actual (resulting) wave at any point is a superposition of all these wavelets.

So, every point is emitting rays in every direction (within the half plane). Now, the assumption for the explanation you posted is that the screen (where you observe the diffraction pattern) is very far away, meaning the distance to it is much larger than the size of the slit .
That means that the rays originating from every part of the slit that are hitting the same point on the screen are nearly parallel (just how we treat rays of sunlight as nearly parallel). All of the rays in the picture below are meant to be the rays that hit the same point at the other end:

So here, you're just trying to figure out what's happening at a single point on the screen, by summing contributions from all of these rays. Remember, though, it's the value of the field along the ray that is making the contribution (think of each ray as being a slice through its corresponding wavelet; the contribution comes from what the amplitude of the wavelet happens to be at the other end, where it hits the screen).

This image does not depict some alternate setup, this is the same experiment, at the same time. It's just that now you're considering rays that all hit a different point on the screen. Again, you're summing contributions from all of them.
Note that the picture only shows some representative rays, but that there are infinitely many of them; there are rays in-between, not just the ones depicted here. (The contributions from the wavelets are, really, infinitesimal, and the sum is an integral.) All of them, together, add up to zero.
However, I don't understand how this principle is not applied to constructive interference. In the second figure, the top ray and the ray at $\frac{b}{2}$ should interfere constructively since they have a path difference of $\lambda$.
These particular examples were chosen so that all the rays combine to form a minimum. Yes, considering random pairs, some add up constructively, some cancel out, some produce an arbitrary value. However, you're interested in the total sum. Now, rather then working with the actual values, you can make use of the phase-difference in your reasoning. The idea is that you can always pick a pair that interferes destructively (so, zero net contribution), take it out of the total sum, then pick another such pair. So if every pair cancels out, then everything cancels out.
It's the same logic as here:

Note that you don't have to know the actual values if you know that, in this particular arrangement, all the addends 6 positions apart cancel.