Next Page Previous Page Home Tools & Aids Search Handbook
1. Exploratory Data Analysis
1.3. EDA Techniques
1.3.6. Probability Distributions
1.3.6.6. Gallery of Distributions

1.3.6.6.9. Lognormal Distribution

Probability Density Function A variable X is lognormally distributed if Y = LN(X) is normally distributed with "LN" denoting the natural logarithm. The general formula for the probability density function of the lognormal distribution is

f(x) = EXP(-((ln((x-theta)/m))**2/(2*sigma*2))/
 ((x-theta)*sigma*SQRT(2*PI))   x >= theta; sigma, m > 0

where sigma is the shape parameter, theta is the location parameter and m is the scale parameter. The case where = 0 and m = 1 is called the standard lognormal distribution. The case where theta equals zero is called the 2-parameter lognormal distribution.

The equation for the standard lognormal distribution is

f(x) = EXP(-(log(x)**2/(2*sigma**2))/(x*sigma*SQRT(2*PI))
  x >= 0; sigma > 0

Since the general form of probability functions can be expressed in terms of the standard distribution, all subsequent formulas in this section are given for the standard form of the function.

The following is the plot of the lognormal probability density function for four values of sigma.

plot of the lognormal probability density function for
 four values of sigma

There are several common parameterizations of the lognormal distribution. The form given here is from Evans, Hastings, and Peacock.

Cumulative Distribution Function The formula for the cumulative distribution function of the lognormal distribution is

F(x) = PHI(LN(x)/sigma)   x >= 0; sigma > 0

where PHI is the cumulative distribution function of the normal distribution.

The following is the plot of the lognormal cumulative distribution function with the same values of sigma as the pdf plots above.

plot of the lognormal cumulative distribution function

Percent Point Function The formula for the percent point function of the lognormal distribution is

G(p) = EXP(sigma*PHI**(-1)(p))   0 <= p < 1; sigma > 0

where PHI**(-1) is the percent point function of the normal distribution.

The following is the plot of the lognormal percent point function with the same values of sigma as the pdf plots above.

plot of the lognormal percent point function

Hazard Function The formula for the hazard function of the lognormal distribution is

(1/(sigma*x))*phi(LOG(x)/sigma)/PHI(-LOG(x)/sigma)  x > 0; sigma > 0

where phi is the probability density function of the normal distribution and PHI is the cumulative distribution function of the normal distribution.

The following is the plot of the lognormal hazard function with the same values of as the pdf plots above.

plot of the lognormal hazard function

Cumulative Hazard Function The formula for the cumulative hazard function of the lognormal distribution is

H(x) = -LN(1 - PHI(LN(x)/sigma))   x >= 0; sigma > 0

where PHI is the cumulative distribution function of the normal distribution.

The following is the plot of the lognormal cumulative hazard function with the same values of sigma as the pdf plots above.

plot of the lognormal cumulative hazard function

Survival Function The formula for the survival function of the lognormal distribution is

S(x) = 1 - PHI(LN(x)/sigma)   x >= 0; sigma > 0

where PHI is the cumulative distribution function of the normal distribution.

The following is the plot of the lognormal survival function with the same values of sigma as the pdf plots above.

plot of the lognormal survival function

Inverse Survival Function The formula for the inverse survival function of the lognormal distribution is

Z(p) = EXP(sigma*PHI**(-1)(1-p))   0 <= p < 1; sigma > 0

where PHI**(-1) is the percent point function of the normal distribution.

The following is the plot of the lognormal inverse survival function with the same values of sigma as the pdf plots above.

plot of the lognormal inverse survival function

Common Statistics The formulas below are with the location parameter equal to zero and the scale parameter equal to one.

Mean EXP(0.5*sigma**2
Median Scale parameter m (= 1 if scale parameter not specified).
Mode 1/EXP(sigma**2)
Range Zero to positive infinity
Standard Deviation SQRT(EXP(sigma**2)*(EXP(sigma**2)-1))
Skewness (EXP(sigma**2)+2)**SQRT(EXP(sigma**2)-1))
Kurtosis EXP(sigma**2)**4+2*EXP(sigma**2)**3+3*EXP(sigma**2)**2-3
Coefficient of Variation SQRT(EXP(sigma**2) - 1)

Parameter Estimation The maximum likelihood estimates for the scale parameter, m, and the shape parameter, sigma, are
    mhat = EXP(uhat)
and
    sigmahat = SQRT{SUM[i=1 to N][(LOG(X(i))-mu)**2]/N}
where
    Uhat = SUM[i=1 to N][LOG(X(i))]/N
If the location parameter is known, it can be subtracted from the original data points before computing the maximum likelihood estimates of the shape and scale parameters.
Comments The lognormal distribution is used extensively in reliability applications to model failure times. The lognormal and Weibull distributions are probably the most commonly used distributions in reliability applications.
Software Most general purpose statistical software programs support at least some of the probability functions for the lognormal distribution.

Home Tools & Aids Search Handbook Previous Page Next Page