If you need real time performance, I suggest you go with a DSP instead of an MCU.
TI has some very interesting devices.
But I think that you're jumping far ahead...
Image recognition involves complex mathematics and heavy signal processing.
Deciding on a target device is a task that comes way after you know exactly how and what needs to be done.
Have the algorithm work 100% on Matlab first and only then proceed to the embedded part.