“Statistics is not perfect, but it is beautiful and exquisite! ”
Theoretical Derivation
This post focuses on the mathematical proofs and theoretical derivations that are not discussed much in the “Statistics” post. It serves as an appendix to that post.
Gaussian Distribution
Normally, we have the Gaussian distribution $X \sim \mathcal{N}(\mu, \sigma^2)$.
We can now write $(X - \mu) \sim \mathcal{N}(0, \sigma^2)$, whose density is symmetric about 0; hence $x \mapsto (x-\mu) f(x)$ is an odd function and integrates to 0.
From the properties of the PDF and CDF, we have:
$$\int_{-\infty}^{\infty} \frac{1}{\sqrt{2\pi}\sigma} e^{-\frac{(x-\mu)^2}{2\sigma^2}} \, dx = 1, \qquad \int_{-\infty}^{\infty} \frac{x-\mu}{\sqrt{2\pi}\sigma} e^{-\frac{(x-\mu)^2}{2\sigma^2}} \, dx = 0$$

(Figure: Gaussian probability tables and quantiles.)
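As a quick sanity check, both integrals above can be evaluated numerically. This is a minimal sketch; the parameters $\mu = 1.5$ and $\sigma = 2.0$ are arbitrary choices for illustration:

```python
import numpy as np
from scipy.integrate import quad

mu, sigma = 1.5, 2.0  # arbitrary example parameters

# Gaussian PDF
pdf = lambda x: np.exp(-(x - mu) ** 2 / (2 * sigma ** 2)) / (np.sqrt(2 * np.pi) * sigma)

total, _ = quad(pdf, -np.inf, np.inf)                             # should be 1
centered, _ = quad(lambda x: (x - mu) * pdf(x), -np.inf, np.inf)  # should be 0

print(total)     # ~1.0
print(centered)  # ~0.0
```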
Means of Gaussians
For $E[x] = \mu$,
$$\begin{aligned}
E(x) &= \int_{-\infty}^{+\infty} x f(x) \, dx = \int_{-\infty}^{+\infty} \frac{x}{\sqrt{2\pi}\sigma} e^{-\frac{(x-\mu)^2}{2\sigma^2}} \, dx \\
&= \int_{-\infty}^{+\infty} \frac{(x-\mu) + \mu}{\sqrt{2\pi}\sigma} e^{-\frac{(x-\mu)^2}{2\sigma^2}} \, dx \\
&= \underbrace{\int_{-\infty}^{+\infty} \frac{x-\mu}{\sqrt{2\pi}\sigma} e^{-\frac{(x-\mu)^2}{2\sigma^2}} \, dx}_{0} + \mu \underbrace{\int_{-\infty}^{+\infty} \frac{1}{\sqrt{2\pi}\sigma} e^{-\frac{(x-\mu)^2}{2\sigma^2}} \, dx}_{1} \\
&= \mu
\end{aligned}$$
For $E[x^2] = \sigma^2 + \mu^2$,
$$\begin{aligned}
E(x^2) &= \int_{-\infty}^{\infty} x^2 \cdot \frac{1}{\sqrt{2\pi}\sigma} e^{-\frac{(x-\mu)^2}{2\sigma^2}} \, dx \\
&= \int_{-\infty}^{\infty} \frac{(x+\mu)(x-\mu) + \mu^2}{\sqrt{2\pi}\sigma} e^{-\frac{(x-\mu)^2}{2\sigma^2}} \, dx \\
&= \int_{-\infty}^{\infty} \frac{(x+\mu)(x-\mu)}{\sqrt{2\pi}\sigma} e^{-\frac{(x-\mu)^2}{2\sigma^2}} \, dx + \int_{-\infty}^{\infty} \frac{\mu^2}{\sqrt{2\pi}\sigma} e^{-\frac{(x-\mu)^2}{2\sigma^2}} \, dx \\
&= \int_{-\infty}^{\infty} -\frac{\sigma(x+\mu)}{\sqrt{2\pi}} \, d\!\left(e^{-\frac{(x-\mu)^2}{2\sigma^2}}\right) + \mu^2 \\
&= \underbrace{\left(-\frac{\sigma(x+\mu)}{\sqrt{2\pi}} e^{-\frac{(x-\mu)^2}{2\sigma^2}}\right)\Bigg|_{-\infty}^{\infty}}_{0} + \sigma^2 \int_{-\infty}^{\infty} \frac{1}{\sigma\sqrt{2\pi}} e^{-\frac{(x-\mu)^2}{2\sigma^2}} \, dx + \mu^2 \\
&= \sigma^2 + \mu^2
\end{aligned}$$
For $E[x^3] = 3\mu\sigma^2 + \mu^3$,
Rewrite $x^3$ in terms of $(x-\mu)^3$: since $(x-\mu)^3 = x^3 - 3\mu x^2 + 3\mu^2 x - \mu^3$, we have $x^3 = (x-\mu)^3 + 3\mu x^2 - 3\mu^2 x + \mu^3$.
Substituting the formulas for $E[x^2]$ and $E[x]$, we have:
$$\begin{aligned}
E(x^3) &= \int_{-\infty}^{\infty} x^3 \cdot \frac{1}{\sqrt{2\pi}\sigma} e^{-\frac{(x-\mu)^2}{2\sigma^2}} \, dx \\
&= \int_{-\infty}^{\infty} \frac{(x-\mu)^3 + \mu^3 + 3\mu x^2 - 3\mu^2 x}{\sqrt{2\pi}\sigma} e^{-\frac{(x-\mu)^2}{2\sigma^2}} \, dx \\
&= \underbrace{\int_{-\infty}^{\infty} \frac{(x-\mu)^3}{\sqrt{2\pi}\sigma} e^{-\frac{(x-\mu)^2}{2\sigma^2}} \, dx}_{0} + \int_{-\infty}^{\infty} \frac{3(\mu x^2 - \mu^2 x)}{\sqrt{2\pi}\sigma} e^{-\frac{(x-\mu)^2}{2\sigma^2}} \, dx + \mu^3 \\
&= 0 + 3\left(\mu E(x^2) - \mu^2 E(x)\right) + \mu^3 \\
&= 3\mu\sigma^2 + \mu^3
\end{aligned}$$

For $E[x^4] = 3\sigma^4 + 6\mu^2\sigma^2 + \mu^4$,
Similarly, rewrite $x^4$ using the factorization $x^4 - \mu^4 = (x^2+\mu^2)(x+\mu)(x-\mu)$.
So we have,
$$\begin{aligned}
E(x^4) &= \int_{-\infty}^{\infty} x^4 \cdot \frac{1}{\sqrt{2\pi}\sigma} e^{-\frac{(x-\mu)^2}{2\sigma^2}} \, dx \\
&= \int_{-\infty}^{\infty} \frac{(x^2+\mu^2)(x+\mu)(x-\mu)}{\sqrt{2\pi}\sigma} e^{-\frac{(x-\mu)^2}{2\sigma^2}} \, dx + \mu^4 \\
&= \int_{-\infty}^{\infty} \frac{(x^2+\mu^2)(x+\mu)}{2\sqrt{2\pi}\sigma} e^{-\frac{(x-\mu)^2}{2\sigma^2}} \, d\!\left((x-\mu)^2\right) + \mu^4 \\
&= -\int_{-\infty}^{\infty} \frac{\sigma(x^2+\mu^2)(x+\mu)}{\sqrt{2\pi}} \, d\!\left(e^{-\frac{(x-\mu)^2}{2\sigma^2}}\right) + \mu^4 \\
&= \underbrace{\left(-\frac{\sigma(x^2+\mu^2)(x+\mu)}{\sqrt{2\pi}} e^{-\frac{(x-\mu)^2}{2\sigma^2}}\right)\Bigg|_{-\infty}^{\infty}}_{0} + \int_{-\infty}^{\infty} \frac{\sigma}{\sqrt{2\pi}} e^{-\frac{(x-\mu)^2}{2\sigma^2}} \, d\!\left((x^2+\mu^2)(x+\mu)\right) + \mu^4 \\
&= 0 + \sigma^2\left(3E(x^2) + 2\mu E(x) + \mu^2\right) + \mu^4 \\
&= 3\sigma^4 + 6\mu^2\sigma^2 + \mu^4
\end{aligned}$$

For $E[\bar{X}_n] = E[X_1]$,
$$E[\bar{X}_n] = \frac{1}{n}\sum_{i=1}^{n} E[X_i] = E[X_1]$$
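The raw-moment formulas derived above are easy to sanity-check by simulation. This is a minimal sketch; the values of $\mu$ and $\sigma$ are arbitrary:

```python
import numpy as np

rng = np.random.default_rng(0)
mu, sigma = 1.5, 2.0                       # arbitrary example parameters
x = rng.normal(mu, sigma, size=2_000_000)  # large sample from N(mu, sigma^2)

# empirical raw moments vs. the closed forms derived above
print(x.mean(),       mu)                                     # E[x]   = mu
print((x**2).mean(),  sigma**2 + mu**2)                       # E[x^2] = sigma^2 + mu^2
print((x**3).mean(),  3*mu*sigma**2 + mu**3)                  # E[x^3] = 3 mu sigma^2 + mu^3
print((x**4).mean(),  3*sigma**4 + 6*mu**2*sigma**2 + mu**4)  # E[x^4]
```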
For $E[XY]$,
If $X$ and $Y$ are independent, the expectation of the product is the product of the individual expectations: $E(XY) = E(X)E(Y)$.
Variance of Gaussians
For $V[x] = \sigma^2$,
Working through the integral step by step (after centering $x$ at $\mu$), we get:
$$\begin{aligned}
V &= \int_{-\infty}^{+\infty} \frac{1}{\sqrt{2\pi}\sigma} e^{-\frac{(x-\mu)^2}{2\sigma^2}} (x-\mu)^2 \, dx = \int_{-\infty}^{+\infty} \frac{1}{\sqrt{2\pi}\sigma} e^{-\frac{x^2}{2\sigma^2}} x^2 \, dx \\
&= -\frac{\sigma}{\sqrt{2\pi}} \int_{-\infty}^{+\infty} x \, d\!\left(e^{-\frac{x^2}{2\sigma^2}}\right) \\
&= -\frac{\sigma}{\sqrt{2\pi}} \left( \underbrace{x e^{-\frac{x^2}{2\sigma^2}} \Big|_{-\infty}^{+\infty}}_{0} - \int_{-\infty}^{+\infty} e^{-\frac{x^2}{2\sigma^2}} \, dx \right) \\
&= \frac{\sigma}{\sqrt{2\pi}} \int_{-\infty}^{+\infty} e^{-\frac{x^2}{2\sigma^2}} \, dx = \sigma^2
\end{aligned}$$
This can also be obtained directly from the expectation results above:
$$\begin{aligned}
V(x) &= E\left((x - E(x))^2\right) = E\left(x^2 - 2xE(x) + E^2(x)\right) \\
&= E(x^2) - 2E(x)E(x) + E^2(x) = E(x^2) - E^2(x) \\
&= E(x^2) - \mu^2 = \sigma^2
\end{aligned}$$
For $V[x^2] = 2\sigma^4 + 4\sigma^2\mu^2$,
$$V(x^2) = E(x^4) - E(x^2)^2 = 3\sigma^4 + 6\mu^2\sigma^2 + \mu^4 - (\sigma^2 + \mu^2)^2 = 2\sigma^4 + 4\sigma^2\mu^2$$
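Again, a quick numerical sanity check (a sketch with the same arbitrary parameters as before):

```python
import numpy as np

rng = np.random.default_rng(1)
mu, sigma = 1.5, 2.0                       # arbitrary example parameters
x = rng.normal(mu, sigma, size=2_000_000)

print(x.var(),       sigma**2)                       # V[x]   = sigma^2
print((x**2).var(),  2*sigma**4 + 4*sigma**2*mu**2)  # V[x^2] = 2 sigma^4 + 4 sigma^2 mu^2
```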
For $V[\bar{X}_n] = \frac{\sigma^2}{n}$, using independence of the $X_i$:

$$V(\bar{X}_n) = \frac{1}{n^2}\sum_{i=1}^{n} V(X_i) = \frac{\sigma^2}{n}$$
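A small simulation illustrating that the variance of the sample mean shrinks like $1/n$ (a sketch; the parameters and number of replications are arbitrary):

```python
import numpy as np

rng = np.random.default_rng(2)
mu, sigma, n = 1.5, 2.0, 50          # arbitrary example parameters
reps = 200_000                       # number of simulated sample means

xbar = rng.normal(mu, sigma, size=(reps, n)).mean(axis=1)
print(xbar.var(), sigma**2 / n)      # empirical vs. theoretical sigma^2 / n
```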
For $\operatorname{Var}(XY)$, see the derivation under Covariance of Gaussians below.
Covariance
The standard covariance formula between variables $x$ and $y$ is:
$$\operatorname{Cov}(x, y) = E\left[(x - E[x])(y - E[y])\right]$$
Example: for independent $X, Y \sim \operatorname{Uni}(0,1)$, compute $E(|X-Y|^a)$ by integrating over the joint pdf. You should have something like:
$$E(|X-Y|^a) = \iint_{x>y} (x-y)^a \, dx\, dy + \iint_{y>x} (y-x)^a \, dx\, dy = 2\iint_{x>y} (x-y)^a \, dx\, dy$$

Now:
$$\iint_{x>y} (x-y)^a \, dx\, dy = \int_{y=0}^{1} \int_{x=y}^{1} (x-y)^a \, dx\, dy = \frac{1}{(a+1)(a+2)}$$

So:
$$E(|X-Y|^a) = 2\iint_{x>y} (x-y)^a \, dx\, dy = \frac{2}{(a+1)(a+2)}$$
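This closed form can be checked quickly by simulation (a sketch; the exponent $a = 3$ is an arbitrary choice):

```python
import numpy as np

rng = np.random.default_rng(3)
a = 3                                    # arbitrary exponent
x = rng.uniform(size=2_000_000)
y = rng.uniform(size=2_000_000)

print(np.mean(np.abs(x - y) ** a), 2 / ((a + 1) * (a + 2)))  # empirical vs. closed form
```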
Covariance of Gaussians
Recall that the covariance of two random variables $X$ and $Y$, denoted by $\operatorname{Cov}(X, Y)$, is defined as:

$$\operatorname{Cov}(X, Y) = E\left[(X - E[X])(Y - E[Y])\right] = E[XY] - E[X]E[Y]$$

$$\operatorname{Var}[X+Y] = \operatorname{Var}[X] + \operatorname{Var}[Y] + 2\operatorname{Cov}[X, Y]$$
For $\operatorname{Var}(XY)$ with $X$ and $Y$ independent, by the law of total variance:

$$\begin{aligned}
\operatorname{Var}(XY) &= \operatorname{Var}\left[E(XY \mid X)\right] + E\left[\operatorname{Var}(XY \mid X)\right] \\
&= \operatorname{Var}\left[X E(Y \mid X)\right] + E\left[X^2 \operatorname{Var}(Y \mid X)\right] \\
&= \operatorname{Var}\left[X E(Y)\right] + E\left[X^2 \operatorname{Var}(Y)\right] \\
&= E(Y)^2 \operatorname{Var}(X) + \operatorname{Var}(Y) E(X^2)
\end{aligned}$$
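A simulation check of this identity for independent Gaussians (a sketch; the four parameters are arbitrary):

```python
import numpy as np

rng = np.random.default_rng(4)
mx, sx, my, sy = 1.0, 2.0, -0.5, 1.5          # arbitrary example parameters
x = rng.normal(mx, sx, size=2_000_000)
y = rng.normal(my, sy, size=2_000_000)        # independent of x

theory = my**2 * sx**2 + sy**2 * (sx**2 + mx**2)  # E(Y)^2 Var(X) + Var(Y) E(X^2)
print((x * y).var(), theory)
```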
If the covariance between two random variables is 0, are they independent?

False; zero covariance does not imply independence. The criterion for independence is $F(x, y) = F_X(x) F_Y(y)$.
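A standard worked counterexample: let $X \sim \mathcal{N}(0, 1)$ and $Y = X^2$. Then $\operatorname{Cov}(X, Y) = E[X^3] - E[X]E[X^2] = 0 - 0 = 0$, yet $Y$ is a deterministic function of $X$, so the two are clearly not independent.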
Variance of a Transformed Variable (e.g. $\ln X$ or $1/X$)
If $\operatorname{Var}[X]$ is known, how do we calculate $\operatorname{Var}\left[\frac{1}{X}\right]$?

Using the Delta Method:
You can use Taylor series to get an approximation of the low order moments of a transformed random variable. If the distribution is fairly 'tight' around the mean (in a particular sense), the approximation can be pretty good.
$$g(X) = g(\mu) + (X - \mu) g'(\mu) + \frac{(X-\mu)^2}{2} g''(\mu) + \ldots$$

So
$$\begin{aligned}
\operatorname{Var}[g(X)] &= \operatorname{Var}\left[g(\mu) + (X-\mu) g'(\mu) + \frac{(X-\mu)^2}{2} g''(\mu) + \ldots\right] \\
&= \operatorname{Var}\left[(X-\mu) g'(\mu) + \frac{(X-\mu)^2}{2} g''(\mu) + \ldots\right] \\
&= g'(\mu)^2 \operatorname{Var}\left[(X-\mu)\right] + 2 g'(\mu) \operatorname{Cov}\left[(X-\mu), \frac{(X-\mu)^2}{2} g''(\mu) + \ldots\right] + \operatorname{Var}\left[\frac{(X-\mu)^2}{2} g''(\mu) + \ldots\right]
\end{aligned}$$

Often only the first term is taken:

$$\operatorname{Var}[g(X)] \approx g'(\mu)^2 \operatorname{Var}(X)$$
In this case (assuming I didn't make a mistake), with $g(X) = \frac{1}{X}$, $\operatorname{Var}\left[\frac{1}{X}\right] \approx \frac{1}{\mu^4}\operatorname{Var}(X)$.
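A quick simulation of how good this first-order approximation is when the distribution is tight around the mean (a sketch; the parameters are arbitrary, with $\sigma$ small relative to $\mu$ so that $X$ stays away from 0):

```python
import numpy as np

rng = np.random.default_rng(5)
mu, sigma = 10.0, 0.5                      # arbitrary; sigma small relative to mu
x = rng.normal(mu, sigma, size=2_000_000)

print((1 / x).var())          # empirical Var[1/X]
print(sigma**2 / mu**4)       # delta-method approximation Var(X) / mu^4
```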
The Property of the Maximum
Given i.i.d. random variables $X_i \sim \operatorname{Uni}(0, 1)$, we have $M_n = \max(x_1, \cdots, x_n)$.
Then, because the $X_i$ are i.i.d.:
$$P(M_n \le t) = P\left(\bigcap_{i=1}^{n} \{x_i \le t\}\right) = \prod_{i=1}^{n} P(x_i \le t) = F_X(t)^n$$

where $F_X(\cdot)$ is the CDF of the distribution.
$$P\left(n(1 - M_n) \le t\right) = P\left(1 - M_n \le \tfrac{t}{n}\right) = P\left(M_n \ge 1 - \tfrac{t}{n}\right) = 1 - P\left(M_n < 1 - \tfrac{t}{n}\right) = 1 - \left(1 - \tfrac{t}{n}\right)^n \xrightarrow{n \to \infty} 1 - e^{-t}$$

so $n(1 - M_n)$ converges in distribution to an $\operatorname{Exp}(1)$ random variable. Here we used the limit, valid for any constant $c$: $\left(1 + \tfrac{c}{n}\right)^n \xrightarrow{n \to \infty} e^{c}$.
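A simulation sketch of this limiting distribution (the sample size and Monte Carlo settings below are arbitrary):

```python
import numpy as np

rng = np.random.default_rng(6)
n, reps = 500, 20_000                          # arbitrary settings

m = rng.uniform(size=(reps, n)).max(axis=1)    # M_n for each replication
z = n * (1 - m)                                # n(1 - M_n)

# compare the empirical CDF at a few points with 1 - exp(-t)
for t in (0.5, 1.0, 2.0):
    print(t, (z <= t).mean(), 1 - np.exp(-t))
```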
Food for thought:
What is the variance of the maximum $M_n$ of a sample?
Solving the quadratic equation $p^2 - \left(2\bar{X}_n + \frac{1.6448^2}{n}\right) p + \bar{X}_n^2 = 0$ (with the unknown denoted $p$), whose middle coefficient is $b = -\left(2\bar{X}_n + \frac{1.6448^2}{n}\right)$, the larger root is:

$$p = \frac{\left(2\bar{X}_n + \frac{1.6448^2}{n}\right) + \sqrt{\left(2\bar{X}_n + \frac{1.6448^2}{n}\right)^2 - 4\bar{X}_n^2}}{2}$$
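A minimal sketch of evaluating these roots numerically; the values of `barX_n` and `n` below are arbitrary placeholders:

```python
import numpy as np

barX_n, n = 0.3, 100            # arbitrary placeholder values
q = 1.6448                      # the standard normal quantile used above

b = -(2 * barX_n + q**2 / n)    # middle coefficient of p^2 + b*p + barX_n^2 = 0
c = barX_n**2

roots = np.roots([1.0, b, c])   # both roots of the quadratic
print(roots)
print((-b + np.sqrt(b**2 - 4 * c)) / 2)   # the larger root, matching the formula above
```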
Modes of Convergence: Examples
Example of a.s. convergence
Let $U \sim \operatorname{Uni}(0,1)$ and $X_n = U + U^n$ ($U$ raised to the power $n$).
Claim: $X_n \xrightarrow{\text{a.s.}} U$
Proof:
By the law of total probability, for any event $A$ and the partition $S_1 = \{U < 1\}$, $S_2 = \{U = 1\}$,
$$P(A) = P(A \mid S_1)P(S_1) + P(A \mid S_2)P(S_2)$$

Here,

$$\begin{cases} X_n \xrightarrow{n \to \infty} U & \text{if } U \in [0, 1) \\ X_n \xrightarrow{n \to \infty} 2 & \text{if } U = 1 \end{cases}$$

So we have
$$P\left(X_n \xrightarrow{n \to \infty} U\right) = P\left(X_n \to U \mid U < 1\right)P(U < 1) + P\left(X_n \to U \mid U = 1\right)\underbrace{P(U = 1)}_{0} = 1 \cdot 1 + 0 = 1$$

Example 1 of convergence in probability
Let $X_n \sim \operatorname{Ber}\left(\frac{1}{n}\right)$ (independent) and $\epsilon \in (0,1)$; then

$$P\left(|X_n - 0| > \epsilon\right) = P(X_n = 1) = \frac{1}{n} \xrightarrow{n \to \infty} 0$$

so $X_n \xrightarrow{p} 0$.
Example 2 of convergence in probability
Let $X_1, \ldots, X_n \sim \operatorname{Uni}(0,1)$ i.i.d., and let $X_{(1)} = \min_i X_i$.
Claim: $X_{(1)} \xrightarrow{p} 0$.
Proof:
Fix $\varepsilon > 0$,
$$P\left(|X_{(1)} - 0| > \varepsilon\right) = P\left(X_{(1)} > \varepsilon\right) = P\left(X_i > \varepsilon, \ \forall i\right) = \left(P(X_1 > \varepsilon)\right)^n = \left(\int_{\varepsilon}^{1} dx\right)^n = (1 - \varepsilon)^n \xrightarrow{n \to \infty} 0$$

Example of convergence in probability but not a.s.
Let $U \sim \operatorname{Uni}(0,1)$, and define $x_n$ using indicator functions on finer and finer subintervals of $[0, 1]$:
$$\begin{aligned}
x_1 &= U + \mathbb{1}(U \in [0, 1]) = U + 1 \\
x_2 &= U + \mathbb{1}(U \in [0, 1/2]) & x_3 &= U + \mathbb{1}(U \in [1/2, 1]) \\
x_4 &= U + \mathbb{1}(U \in [0, 1/3]) & x_5 &= U + \mathbb{1}(U \in [1/3, 2/3]) & x_6 &= U + \mathbb{1}(U \in [2/3, 1])
\end{aligned}$$

Claim 1: $x_n \xrightarrow{p} U$: Fix $0 < \varepsilon < 1$,
$$P\left(|x_n - U| > \varepsilon\right) = P\left(U \in [a_n, b_n]\right) = b_n - a_n \xrightarrow{n \to \infty} 0$$

where $[a_n, b_n]$ is the subinterval attached to $x_n$, whose length shrinks to 0.

Claim 2: $x_n \not\xrightarrow{\text{a.s.}} U$:
$$P\left(x_n \xrightarrow{n \to \infty} U\right) \ne 1$$

In fact, for every realization of $U$, the sequence $x_n$ has no limit: it equals $U + 1$ infinitely often (whenever $U$ falls in the current subinterval) and equals $U$ infinitely often otherwise.
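A small simulation of this "typewriter" sequence for one fixed realization of $U$ (a sketch; the number of levels enumerated is arbitrary):

```python
import numpy as np

rng = np.random.default_rng(7)
u = rng.uniform()                      # one fixed realization of U

# enumerate the indicator intervals: level k gives [j/k, (j+1)/k], j = 0..k-1
intervals = [(j / k, (j + 1) / k) for k in range(1, 400) for j in range(k)]

x = np.array([u + (a <= u <= b) for a, b in intervals], dtype=float)

jumps = np.flatnonzero(x != u)         # indices n where x_n = u + 1
print(jumps[-5:])                      # jumps keep occurring arbitrarily late: no pointwise limit
print(np.mean(np.abs(x - u) > 0.5))    # yet only a vanishing fraction of indices jump
```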