Some architectures perform better when every access is aligned, so putting 32-bit objects on 32-bit boundaries and 64-bit objects on 64-bit boundaries can improve the speed of your application.
Other architectures are completely incapable of making unaligned accesses. On those, an unaligned access typically raises a hardware fault rather than just running slowly, so leaving out the padding can be a real disaster.
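
To make the padding visible, here is a minimal sketch in C11. The `struct sample` type and the helpers `check_layout` and `load_u32` are hypothetical names made up for this illustration. The exact offsets and total size depend on your compiler and ABI, so the code only checks that each member lands on a multiple of its own alignment.

```c
#include <assert.h>  /* assert */
#include <stddef.h>  /* offsetof */
#include <stdint.h>  /* uint8_t, uint32_t, uint64_t */
#include <string.h>  /* memcpy */

/* Hypothetical record. The compiler is free to insert padding after
   `tag` (and after `count`) so that each member is naturally aligned. */
struct sample {
    uint8_t  tag;
    uint32_t count;
    uint64_t total;
};

/* Checks that every member offset is a multiple of that member's
   alignment requirement, which is what the padding is there for. */
void check_layout(void) {
    assert(offsetof(struct sample, count) % _Alignof(uint32_t) == 0);
    assert(offsetof(struct sample, total) % _Alignof(uint64_t) == 0);
    assert(sizeof(struct sample) % _Alignof(struct sample) == 0);
}

/* Reads a 32-bit value from an address that may be unaligned.
   Copying the bytes with memcpy avoids dereferencing a misaligned
   uint32_t pointer, which is undefined behavior in C. */
uint32_t load_u32(const void *p) {
    uint32_t v;
    memcpy(&v, p, sizeof v);
    return v;
}
```

If you have to read a value out of a packed buffer (a network packet or a file format, say), something like `load_u32` is the portable route: the compiler can turn the `memcpy` into a single load where the hardware allows unaligned access and into byte-wise loads where it does not.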