Geometric Distribution
[Figure: plots of the probability mass function and the cumulative distribution function.]
Parameters: $0<p\leq 1$, success probability (real)
Support: $k\in\{1,2,3,\dots\}$
PMF: $(1-p)^{k-1}\,p$
CDF: $1-(1-p)^{k}$
Mean: $\frac{1}{p}$
Median: $\left\lceil\frac{-1}{\log_{2}(1-p)}\right\rceil$ (not unique if $-1/\log_{2}(1-p)$ is an integer)
Mode: $1$
Variance: $\frac{1-p}{p^{2}}$
Skewness: $\frac{2-p}{\sqrt{1-p}}$
Ex. kurtosis: $6+\frac{p^{2}}{1-p}$
Entropy: $\frac{-(1-p)\log_{2}(1-p)-p\log_{2}p}{p}$
MGF: $\frac{pe^{t}}{1-(1-p)e^{t}}$, for $t<-\ln(1-p)$
CF: $\frac{pe^{it}}{1-(1-p)e^{it}}$
There are two similar distributions that share the name "geometric distribution":
The probability distribution of the number X of Bernoulli trials needed to get one success, supported on the set {1, 2, 3, ...}.
The probability distribution of the number Y = X − 1 of failures before the first success, supported on the set {0, 1, 2, 3, ...}.
These two distributions should not be confused with each other. Often the name shifted geometric distribution is adopted for the former. We will use X and Y to distinguish the two.
The shifted geometric distribution describes the number of attempts needed to perform some action until the desired result is obtained. For example:
How many times will I toss a coin until it lands on heads?
How many children will I have until I get a girl?
How many cards will I draw from a pack until I get a joker?
Just like the Bernoulli distribution, the geometric distribution has one controlling parameter: the probability of success in any individual trial.
If a random variable X follows a geometric distribution with parameter p, we write its probability mass function as:
$P(X=i)=p\,(1-p)^{i-1}$
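As a quick illustration, here is a minimal Python sketch of this mass function (the name geometric_pmf is ours, not from any library):

```python
def geometric_pmf(i: int, p: float) -> float:
    """P(X = i): first success on trial i, for the shifted geometric
    distribution supported on {1, 2, 3, ...}."""
    if i < 1 or not 0 < p <= 1:
        raise ValueError("need i >= 1 and 0 < p <= 1")
    return p * (1 - p) ** (i - 1)

# The probabilities sum to 1 (checked here up to a large cutoff):
print(sum(geometric_pmf(i, 0.3) for i in range(1, 200)))  # ~1.0
```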
With a geometric distribution it is also easy to calculate the probability of a "more than k attempts" case: the probability of failing to achieve the wanted result in each of the first k attempts, i.e. $P(X>k)$, is
$(1-p)^{k}$.
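A short sketch of this tail probability, cross-checked against a truncated sum of the PMF (again, the function name is ours):

```python
def geometric_tail(k: int, p: float) -> float:
    """P(X > k): all of the first k trials fail."""
    return (1 - p) ** k

p, k = 0.3, 5
tail_by_sum = sum(p * (1 - p) ** (i - 1) for i in range(k + 1, 300))
print(geometric_tail(k, p), tail_by_sum)  # both ~0.16807
```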
Example: a student comes home from a party in the forest, in which interesting substances were consumed, and tries to find the key to his front door on a keychain with 10 different keys. Being in no state to remember which keys he has already tried, he picks a key uniformly at random on each attempt. What is the probability that the student finds the right key on the 4th attempt?
$P(X=4)=\frac{1}{10}\left(1-\frac{1}{10}\right)^{4-1}=\frac{1}{10}\left(\frac{9}{10}\right)^{3}=0.0729$
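Reproducing this arithmetic in a couple of lines of Python:

```python
p = 1 / 10                     # probability of picking the right key
prob = p * (1 - p) ** (4 - 1)  # first success on the 4th attempt
print(prob)                    # 0.0729 (up to floating-point rounding)
```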
The probability mass function of Y = X − 1, the number of failures before the first success, is:
$f(y)=p(1-p)^{y}$ for $y\in\{0,1,2,\dots\}$
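For a numerical cross-check of the two conventions, here is a sketch using scipy.stats.geom, which implements the trials convention (support {1, 2, ...}); shifting its argument by one gives the failures version:

```python
from scipy.stats import geom

p = 0.3
print(geom.pmf(4, p))      # P(X = 4) = p * (1 - p)**3, trials convention
print(geom.pmf(3 + 1, p))  # P(Y = 3): same value, since Y = X - 1
print(p * (1 - p) ** 3)    # direct formula for comparison
```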
The mean of Y follows from the definition of expectation:
$\operatorname{E}[Y]=\sum_{i}f(y_{i})\,y_{i}=\sum_{y=0}^{\infty}p(1-p)^{y}\,y$
Let q = 1 − p. Then:
$\operatorname{E}[Y]=\sum_{y=0}^{\infty}(1-q)q^{y}\,y$
$\operatorname{E}[Y]=\sum_{y=0}^{\infty}(1-q)\,q\,q^{y-1}\,y$
$\operatorname{E}[Y]=(1-q)\,q\sum_{y=0}^{\infty}q^{y-1}\,y$
$\operatorname{E}[Y]=(1-q)\,q\sum_{y=0}^{\infty}\frac{d}{dq}\,q^{y}$
We can now interchange the derivative and the sum, since the series converges for $|q|<1$:
$\operatorname{E}[Y]=(1-q)\,q\,\frac{d}{dq}\sum_{y=0}^{\infty}q^{y}$
$\operatorname{E}[Y]=(1-q)\,q\,\frac{d}{dq}\,\frac{1}{1-q}$
$\operatorname{E}[Y]=(1-q)\,q\,\frac{1}{(1-q)^{2}}$
$\operatorname{E}[Y]=\frac{q}{1-q}$
$\operatorname{E}[Y]=\frac{1-p}{p}$
Since X = Y + 1, it follows that $\operatorname{E}[X]=\operatorname{E}[Y]+1=\frac{1}{p}$, matching the table above.
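A simulation-based sanity check of this mean (a sketch; note that numpy's geometric sampler returns the trial count X on {1, 2, ...}, so we subtract one to get Y):

```python
import numpy as np

rng = np.random.default_rng(0)
p = 0.25
y = rng.geometric(p, size=1_000_000) - 1  # Y = X - 1, failures before success
print(y.mean(), (1 - p) / p)              # both ~3.0
```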
We derive the variance using the following formula:
$\operatorname{Var}[Y]=\operatorname{E}[Y^{2}]-(\operatorname{E}[Y])^{2}$
We have already calculated E[Y] above, so now we will calculate E[Y²] and then return to this variance formula:
$\operatorname{E}[Y^{2}]=\sum_{i}f(y_{i})\,y_{i}^{2}$
$\operatorname{E}[Y^{2}]=\sum_{y=0}^{\infty}p(1-p)^{y}\,y^{2}$
Let q = 1 − p:
$\operatorname{E}[Y^{2}]=\sum_{y=0}^{\infty}(1-q)q^{y}\,y^{2}$
We now rewrite y² as (y² − y) + y, so that each piece can be handled by the differentiation technique used when deriving the mean.
$\operatorname{E}[Y^{2}]=(1-q)\sum_{y=0}^{\infty}q^{y}\left[(y^{2}-y)+y\right]$
$\operatorname{E}[Y^{2}]=(1-q)\left[\sum_{y=0}^{\infty}q^{y}(y^{2}-y)+\sum_{y=0}^{\infty}q^{y}\,y\right]$
$\operatorname{E}[Y^{2}]=(1-q)\left[q^{2}\sum_{y=0}^{\infty}q^{y-2}\,y(y-1)+q\sum_{y=0}^{\infty}q^{y-1}\,y\right]$
$\operatorname{E}[Y^{2}]=(1-q)\,q\left[q\sum_{y=0}^{\infty}\frac{d^{2}}{dq^{2}}\,q^{y}+\sum_{y=0}^{\infty}\frac{d}{dq}\,q^{y}\right]$
$\operatorname{E}[Y^{2}]=(1-q)\,q\left[q\,\frac{d^{2}}{dq^{2}}\sum_{y=0}^{\infty}q^{y}+\frac{d}{dq}\sum_{y=0}^{\infty}q^{y}\right]$
$\operatorname{E}[Y^{2}]=(1-q)\,q\left[q\,\frac{d^{2}}{dq^{2}}\,\frac{1}{1-q}+\frac{d}{dq}\,\frac{1}{1-q}\right]$
$\operatorname{E}[Y^{2}]=(1-q)\,q\left[q\,\frac{2}{(1-q)^{3}}+\frac{1}{(1-q)^{2}}\right]$
$\operatorname{E}[Y^{2}]=\frac{2q^{2}}{(1-q)^{2}}+\frac{q}{1-q}$
$\operatorname{E}[Y^{2}]=\frac{2q^{2}+q(1-q)}{(1-q)^{2}}$
$\operatorname{E}[Y^{2}]=\frac{q(q+1)}{(1-q)^{2}}$
$\operatorname{E}[Y^{2}]=\frac{(1-p)(2-p)}{p^{2}}$
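A truncated-sum check of this closed form (a rough numerical sketch):

```python
p = 0.25
q = 1 - p
ey2_truncated = sum(p * q ** y * y ** 2 for y in range(2000))
print(ey2_truncated, (1 - p) * (2 - p) / p ** 2)  # both ~21.0
```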
We then return to the variance formula:
$\operatorname{Var}[Y]=\left[\frac{(1-p)(2-p)}{p^{2}}\right]-\left(\frac{1-p}{p}\right)^{2}$
$\operatorname{Var}[Y]=\frac{1-p}{p^{2}}$
Since X = Y + 1 differs from Y only by a constant shift, $\operatorname{Var}[X]=\operatorname{Var}[Y]=\frac{1-p}{p^{2}}$, again matching the table above.
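As with the mean, a quick simulation check of the variance (sketch):

```python
import numpy as np

rng = np.random.default_rng(1)
p = 0.25
x = rng.geometric(p, size=1_000_000)  # X on {1, 2, ...}; Var[X] = Var[Y]
print(x.var(), (1 - p) / p ** 2)      # both ~12.0
```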