The core argument for x^2 is basically just that it's the only function with linear derivative, so for rational voters who calculate marginal benefit of additional votes purchased at cost x^2 (in terms of likelihood of changing the outcome of the "election"), you get a welfare maximizing outcome.
Definitely a challenge for many mechanism design approaches to intervening in the world. As I understand, the only really solution is to experiment in the real world, see what holds and see what doesn't, and tweak accordingly (e.g. test out different interfaces that communicate how the mechanism works!)
It incentivizes the welfare-maximizing result. I don’t have a short “intuitive” explanation of why handy but the math is covered in section 8.3 of Public Choice III (should be easy to get in college libraries, pdf copies can be found online if needed).
There are a few mathematical reasons. I forget the details, but I remember that a key point is to consider marginal cost: the derivative of a quadratic is linear. I think the Central Limit Theorem is also relevant, as someone else here pointed out. Anyway, if you really want to know, you can read the papers!
Actually it's a good point and IMO for a population that may have a certain distribution of voting credits e.g. shareholders of a stock, adjusting the exponent can make sense to more fairly distribute power.