1) How does the increase of logN in area complexity resulted in regular layout ?
2) Besides, adding those '2' (or two '1's in binary, just imagine putting 1'b1 on top of each other vertically) by itself also requires carry-save adders which may also contribute to the overall latency or delay as well as resulted in irregular layout ?
Most people now (and for many years) just use embedded multipliers. They are now cheap and plentiful.
What are you reasons you are investigating logic multipliers from many years ago?