Gentle Intro

Hypothesis
- after Popper Logic
- every hypothesis has to be able to be disproven
null-Hypothesis
- default value (status quo) for a parameter (until proven false)
- like defendant in court … unguilty until proven otherwise
- denoted as $H_{0}$
alternative Hypothesis
- deviation from current knowledge
- must be proven to be valid
- denoted as $H_{a}$

Example Machine

machine must produce with mean diameter of 0.5 inch
- $H_{0} : d = 0.5 in$
- $H_{a} : d \neq = 0.5 in$

Testing

possible outcome of any Test
- reject the null hypothesis
  - finding a non-white swan (significant result)
- fail to reject the null hypothesis
  - even tho I fail to reject, I still do not accept the null-Hypothesis
    - there can still be a black swan out there
  - in reality, large enough sample size might result in “impractical” status
    - no further research to be done
rejecting is positive

Errors

Errors
$standard error = \frac{standard deviation}{n}$
- standard deviation … data - how spread out the data points are
- standard error … meaning - how relevant/meaningful the conclusions are

Ingredients

confidence level
- e.g. 99%
rejection region
- defining when $H_{0}$ is rejected in favor of $H_{a}$
- e.g. when arbitrary experiment result is greater than 5
- rejection region is always outside of confidence interval
test statistic
- depends on problem we have

Interpretation

when result is inside confidence interval
- i.e. outside the rejection region
- we know that we cannot reject $H_{0}$ , but still not accept it
- at the current confidence level

Tests

Population Mean

One-Tailed

upper $H_{a} : Θ > Θ_{0}$ or lower $H_{a} : Θ < Θ_{0}$
Theta $Θ$ (measured) and $Θ_{0}$ (expected) are placeholder for the corresponding values compared
ignored in this course, but not hard to grasp or adjust the formulas

Two-Tailed

two-tailed $H_{a} : Θ \neq = Θ_{0}$
- $H_{0} : μ = μ_{0}$
- $H_{a} : μ \neq = μ_{0}$
then we collect sample data and get $\overset{x}{ˉ}$ and $σ$
- or $s$ sample standard deviation if $σ$ is not known
therefore for large samples: $t_{stat} = \frac{x ˉ - μ _{0}}{\frac{σ}{n}} \sim N (0, 1)$
- for small samples: $t_{stat} = \frac{x ˉ - μ _{0}}{\frac{σ}{n}} \sim t-distribution$
choose significance level $α$
- reminder: $α$ = chance of Type I error
- region within confidence interval → do not reject $H_{0}$
- region outside confidence interval → reject $H_{0}$
- confidence interval can be constructed without data!
  - only distribution type, sample size and $α$ needed
$z$ -critical value (end points of rejection region)
- $z_{c} = q n or m (\frac{1 - α}{2})$ if $n$ is large (> 30) → CLT
- $z_{c} = qt (\frac{1 - α}{2})$ if $n$ is small and population is normally distributed

Population Proportion

follows Large Sample Confidence Intervals
- $\overset{p}{^} = \frac{1}{n} * \sum_{i = 1}^{n} Bernoulli (p)$
$\overset{p}{^} \approx N (p, \frac{p ( 1 - p )}{n})$

$p$ -Values

the probability of obtaining a sample “more extreme” than the one observed in the data set, assuming that $H_{0}$ is true
basically reversing the calculation
- finding $α$ for the given $\overset{x}{ˉ}$ (two-sided CI)
leaving it up to the reader to interpret the result
p-value =
- $2 * P (observed z < Z)$ for $\overset{x}{ˉ} < μ$ → $Z$ will be negative
- $2 * P (observed z > Z)$ for $\overset{x}{ˉ} > μ$ → $Z$ will be positive

🪴 Maixnor WU

Table of Contents

Explorer

Hypothesis Testing

Gentle Intro

Example Machine

Testing

Errors

Ingredients

Interpretation

Tests

Population Mean

One-Tailed

Two-Tailed

Population Proportion

$p$ -Values

Backlinks

🪴 Maixnor WU

Table of Contents

Explorer

Hypothesis Testing

Gentle Intro

Example Machine

Testing

Errors

Ingredients

Interpretation

Tests

Population Mean

One-Tailed

Two-Tailed

Population Proportion

p-Values

Backlinks

$p$ -Values