From the material that you posted initially in the beginning of this discussion, M14 is the feed-forward device. But for simplicity just consider M14 and M16 as something similar to an inverter. The gates of these two transistors have to move in the same direction to create good negative feedback for the amplifier. If gate of M16 goes down because M11 sinks more current than M15 can source, then the gate of M14 should also go down, because if M16 is turning slightly off, M14 should turn slightly on, so the output can stay more or less unchanged (or follow the input) - sort of a push-pull. If you connect the gate of M14 as in the last schematic, the two gates go in the opposite directions - if M16 tends to turn off, M14 also tends to turn off. You can imagine in the extreme if both turn off hard, the output will be kind of high impedance and can have any value - so, it will not follow the input and will not create virtual short. That's why the gate of M14 has to be connected to the output of the first stage because between the output of 1st stage and gate of M16 you have two inversions.