Where does a log-normal distribution “come from”?
The log-normal distribution shows up all over the place because it arises from a quite general process. That process is the multiplication of many positive random variables. The same way we associate the normal distribution with the addition of many random variables due to the central-limit theorem, we can also think of analogous behavior in the log domain. That is if we have some random variables with some mean and finite variance, then the sum of all of these variables tends to a normal distribution
. Likewise then, the product of many random variables
, is
, and taking the logarithm of this expression leaves
and thus the logarithm of
can be seen to be normally distributed.
Why is it like a power law?
The utility/prevalence of the log-normal distribution rises too because we can see that in some approximations, it maps to a`power-law’ distribution. First writing out the precise form, the log-normal distribution function in terms of the log mean and the log standard deviation
is
taking the log of both sides we have
or
but if is an appreciable amount of the range of
we are investigating, than the second term in the brackets is very small. Defining
and
we then can re-exponentiate
or
.
Thus the probability depends on a power-law of the independent variable.
The distribution is nice because it looks linear on a log-log scale. That is if we have some distribution , and we take the log of both sides and plot the log-transformed variables, say
, we can write this as
we have a linear function in
space.
Notes/caution on power law distributions
I met an sociologist the other day who was studying the social networks involved in HIV epidemiology. I asked him a dumb physicist question, “what do the networks look like? are they power-law distributed?”. He smiled. “If you are used to looking for power-law distributions, then yes.”
His point was really well taken. In this example what I was asking about is the distribution of degree sizes (or number of edges coming out from nodes in a social network, or graph). That is, how many people have 3 contacts?, how many 5?, how many 50? It seems that many human networks (the internet is a good example) have multiple scales (5-50-500 etc), and that actually those distributions look linear on a log-log scale as described above. However, it has become fashionable to point to these relationships and draw out power laws. It turns out this fitting process is actually slippery, and Clauset, Shalizi, and Newman (arxiv) have a really nice albeit snarky paper on how to do this correctly, or not.