Exponential distribution
From open-encyclopedia.com - the free encyclopedia.
In probability theory and statistics, the exponential distribution is a continuous probability distribution.
| Contents |
Specification of the exponential distribution
Probability density function
The probability density function (pdf) of the Exponential(λ) distribution is
- <math> f(x) = \lambda e^{-\lambda x} <math>
for x ≥ 0 and where λ > 0 is a parameter of the distribution.
Alternatively, the exponential distribution can be parameterized by a scale parameter μ = 1/λ, as follows:
- <math> g(x) = \frac{1}{\mu} e^{-x/\mu} <math>
where μ > 0.
Cumulative distribution function
The cumulative distribution function is given by
- <math> F(x) = 1-e^{-\lambda x} \,\!<math>.
Quantile function
The inverse cumulative distribution function is
- <math>F^{-1}(p) = \frac{-\ln(1-p)}{\lambda}<math>
for 0 ≤ p < 1.
Occurrence
The exponential distribution is used to model Poisson processes, which are situations in which an object initially in state A can change to state B with constant probability per unit time λ. The time at which the state actually changes is described by an exponential random variable with parameter λ. Therefore, the integral from 0 to T over f is the probability that the object is in state B at time T.
The exponential distribution may be viewed as a continuous counterpart of the geometric distribution, which describes the number of Bernoulli trials necessary for a discrete process to change state. In contrast, the exponential distribution describes the time for a continuous process to change state.
Examples of variables that are approximately exponentially distributed are:
- the time until you have your next car accident;
- the time until you get your next phone call (assuming you get called many times a day, or get called by people from many different time zones);
- the distance between mutations on a DNA strand;
- the distance between roadkill;
- the time until a radioactive particle decays;
- the lifetime of an incandescent light bulb;
- the number of dice rolls until you roll 6 11 times in a row.
Properties
Memorylessness
An important property of the exponential distribution is that it is memoryless. This means that if a random variable T is exponentially distributed, its conditional probability obeys
- <math>P(T > s + t\; |\; T > t) = P(T > s) \;\; \hbox{for all}\ s, t \ge 0. <math>
This says that the conditional probability that we need to wait, for example, more than another 10 seconds before the first arrival, given that the first arrival has not yet happened after 30 seconds, is no different from the initial probability that we need to wait more than 10 seconds for the first arrival. This is often misunderstood by students taking courses on probability: the fact that P(T > 40 | T > 30) = P(T > 10) does not mean that the events T > 40 and T > 10 are independent. To summarize: "memorylessness" of the probability distribution of the waiting time T until the first arrival means
- <math>\mathrm{(Right)}\ P(T>40 \mid T>30)=P(T>10).<math>
It does not mean
- <math>\mathrm{(Wrong)}\ P(T>40 \mid T>30)=P(T>40).<math>
(That would be independence. These two events are not independent.)
Quartiles
The quartiles of an Exponential(λ) random variable are as follows:
- first quartile
- ln(4/3)/λ
- median
- ln(2)/λ
- third quartile
- ln(4)/λ
Expectations
An Exponential(λ) random variable has the following properties:
Parameter estimation
Maximum likelihood
The likelihood function of an independent and identically distributed sample x = (x1, ..., xn) is
- <math> L(\lambda) = \prod_{i=1}^n \lambda \, \exp(-\lambda x_i) = \lambda^n \, \exp\!\left(\!-\lambda \sum_{i=1}^n x_i\right)=\lambda^n\exp\left(-\lambda n \overline{x}\right) <math>
where
- <math>\overline{x}={1 \over n}\sum_{i=1}^n x_i<math>
is the sample mean.
The derivative of the likelihood function is
- <math>\frac{\mathrm{d}}{\mathrm{d}\lambda} \ln L(\lambda) = \frac{\mathrm{d}}{\mathrm{d}\lambda} \left( n \ln(\lambda) - \lambda n\overline{x} \right) = {n \over \lambda}-n\overline{x}\ \left\{\begin{matrix} > 0 & \mbox{if}\ 0 < \lambda < 1/\overline{x}, \\ \\ = 0 & \mbox{if}\ \lambda = 1/\overline{x}, \\ \\ < 0 & \mbox{if}\ \lambda > 1/\overline{x}. \end{matrix}\right. <math>
Consequently the maximum likelihood estimate is
- <math>\widehat{\lambda} = \frac1{\overline{x}}<math>.
Bayesian inference
The conjugate prior for the exponential distribution is the gamma distribution (of which the exponential distribution is a special case). The following parameterization of the gamma pdf is useful:
- <math> \mathrm{Gamma}(\lambda \,|\, \alpha, \beta) = \frac{\beta^{\alpha}}{\Gamma(\alpha)} \, \lambda^{\alpha-1} \, \exp(-\lambda\,\beta) <math>
The posterior distribution p can then be expressed in terms of the likelihood function defined above and a gamma prior:
| <math>p(\lambda) \!\!\!\!<math> | <math>\propto L(\lambda) \times \mathrm{Gamma}(\lambda \,|\, \alpha, \beta)<math> |
| <math>= \lambda^n \, \exp(-\lambda\,n\overline{x}) \times \frac{\beta^{\alpha}}{\Gamma(\alpha)} \, \lambda^{\alpha-1} \, \exp(-\lambda\,\beta)<math> | |
| <math>\propto \lambda^{(\alpha+n)-1} \, \exp(-\lambda\,(\beta + n\overline{x}))<math> |
Now the posterior density p has been specified up to a missing normalizing constant. Since it has the form of a gamma pdf, this can easily be filled in, and one obtains:
- <math> p(\lambda) = \mathrm{Gamma}(\lambda \,|\, \alpha + n, \beta + n \overline{x}) <math>
Here the parameter α can be interpreted as the number of prior observations, and β as the sum of the prior observations.
Generating exponential variates
Given a random variate U drawn from the uniform distribution in the interval (0;1], the variate
- <math>T=\frac{-\ln U}{\lambda}<math>
has an exponential distribution with parameter λ.
Applications of the exponential distribution
Reliability theory makes extensive use of the exponential distribution. Because of the memoryless property of this distribution, it is well-suited to model the consant hazard rate portion of the bathtub curve used in reliability theory. It is also very convenient because it is so easy to add failure rates in a reliability model. de:Exponentialverteilung es:Distribución exponencial it:Variabile casuale Esponenziale Negativa sv:Exponentialfördelning