I suppose two stage opamp architecture is a very good cjoice for your specifications,
a differential stage as the first one, followed by a common-source second stage.......
I think the two stage can meet you spec. The topolog is :
the folded-cascode input stage & class AB output stage;
the general differential input stage & common source sigle amplifier output stage;
the P&N parallel differential input stage & class AB output stage;
or the single folded-cascode amplifier;
or the regulate folded-cascode amplifier;