Discussion 12A

Normal Distribution

XN(μ,σ2)X \sim \mathcal{N}(\mu, \sigma^2)
f(x)=12πσ2exp(xμ)22σ2f(x) = \frac{1}{\sqrt{2 \pi \sigma^2}} \exp{\frac{-(x-\mu)^2}{2 \sigma^2}}
XX has mean μ\mu and variance σ2\sigma^2.

ZN(0,1)Z \sim \mathcal{N}(0,1) has the standard normal distribution.
If X=aZ+bX = a Z + b then XN(b,a2)X \sim \mathcal{N} (b, a^2).
If XN(μX,σX2)X\sim \mathcal{N}(\mu_X, \sigma_X^2) and YN(μY,σY2)Y \sim \mathcal{N} (\mu_Y, \sigma_Y^2) are independent then X+YN(μX+μY,σX2+σY2)X + Y \sim \mathcal{N} (\mu_X + \mu_Y, {\sigma_X^2 + \sigma_Y^2}).

The z-score of a sample point is its position from the mean, in number of standard deviations.

The CDF of the normal distribution is Φ(z)=Pr(Z<z)\Phi(z) = \Pr(Z < z). There is no closed form for the CDF, so we use a table of approximations. The CDF maps from z-scores to probabilities, while the inverse CDF maps from probabilities to z-scores.

Hypothesis Testing

  1. Formulate a null hypothesis H0H_0 which says that a setting is ordinary
  1. Formulate an alternative hypothesis H1H_1 which says that a setting is unusual
  1. Run an experiment and record an observation
  1. Calculate the p-value of the outcome: the probability that the outcome (or a more extraordinary outcome) occurs assuming the null hypothesis H0H_0 is true
  1. If the p-value less than a certain value (often 0.050.05) then the observation is considered statistically significant and we reject H0H_0.