You should state the speed your comparator needs. If you don't care about speed, low offset is easy to achieve, but the result may not fit your application.
You can refer to the book "Analog Integrated Circuit Design" by David Johns and Ken Martin.
As long as speed is not an issue here, the offset can be reduced by using larger-area transistors in the preamplifier. Check the matching properties of the process.
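
To put rough numbers on the area argument, here is a minimal sketch based on the commonly used Pelgrom mismatch model, where the 1-sigma threshold-voltage mismatch of a matched pair scales as A_VT / sqrt(W*L). The A_VT value and the device sizes below are made-up placeholders, not data for any real process, so take the coefficient from your own process matching documentation. The sketch only covers the Vth mismatch of the input pair and ignores load-device and current-factor mismatch.

```python
import math

def sigma_vth_mv(a_vt_mv_um: float, w_um: float, l_um: float) -> float:
    """1-sigma Vth mismatch (mV) of a matched pair, Pelgrom model.

    a_vt_mv_um: matching coefficient A_VT in mV*um (process dependent)
    w_um, l_um: gate width and length in um
    """
    return a_vt_mv_um / math.sqrt(w_um * l_um)

# Hypothetical coefficient, for illustration only.
A_VT = 5.0  # mV*um

# Same length, increasing width: each 4x increase in area halves sigma.
for w, l in [(1.0, 1.0), (4.0, 1.0), (16.0, 1.0)]:
    sigma = sigma_vth_mv(A_VT, w, l)
    print(f"W={w:5.1f} um  L={l:.1f} um  sigma(Vth) = {sigma:.2f} mV")
```

Since the offset only improves with the square root of the area, halving it costs four times the gate area, and the extra gate capacitance is what slows the preamplifier down. That is why this approach works best when speed is not a constraint.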