With the DCT, an 8*8 block is converted into 64 coefficients, but after quantization most of them become zero, so only a few values need to be stored, maybe fewer than 10 (instead of 64).
I believe that 4*4 is too small: with only 16 values per block there is little to discard, so you won't achieve good compression.
16*16 would probably cause the resulting image to not be sharp or smooth enough (bad reconstruction).
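
To make the 8*8 claim concrete, here is a minimal sketch in plain Python that applies a 2-D DCT to one block, quantizes the coefficients, and counts how many are non-zero. The test block (a gentle gradient), the uniform quantization step of 16, and the helper names `dct_2d` and `count_nonzero_after_quantization` are all assumptions chosen for illustration. A real codec such as JPEG uses a per-frequency quantization table instead of a single step, and a block with a lot of fine detail will keep more coefficients than this smooth one.

```python
import math

N = 8  # block size used in this sketch


def dct_2d(block):
    """Orthonormal 2-D DCT-II of a square block given as a list of lists."""
    n = len(block)

    def alpha(k):
        # Normalization factor that makes the transform orthonormal.
        return math.sqrt(1.0 / n) if k == 0 else math.sqrt(2.0 / n)

    out = [[0.0] * n for _ in range(n)]
    for u in range(n):
        for v in range(n):
            s = 0.0
            for x in range(n):
                for y in range(n):
                    s += (block[x][y]
                          * math.cos((2 * x + 1) * u * math.pi / (2 * n))
                          * math.cos((2 * y + 1) * v * math.pi / (2 * n)))
            out[u][v] = alpha(u) * alpha(v) * s
    return out


def count_nonzero_after_quantization(block, step):
    """Quantize the DCT coefficients with one uniform step (an assumption)
    and return how many of them are still non-zero."""
    coeffs = dct_2d(block)
    quantized = [[round(c / step) for c in row] for row in coeffs]
    return sum(1 for row in quantized for q in row if q != 0)


# Hypothetical smooth block: pixel values rise gently in both directions.
smooth_block = [[100 + 2 * x + 3 * y for y in range(N)] for x in range(N)]

kept = count_nonzero_after_quantization(smooth_block, step=16)
print(kept, "non-zero coefficients out of", N * N)
```

Running it on other blocks or with a different `step` shows the trade-off: a larger step zeroes more coefficients (better compression, worse reconstruction), a smaller step keeps more.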