Population Mean

  • Confidence interval is a formula that tells us how to use the sample data to calculate an interval that estimates the target parameter
  • we always have to add confidence level: , , etc

Large Samples

  • similar to creating the score
    • we can expect the sample mean is good enough because of Central Limit Theorem
  • region is defined by confidence level as a symmetric range
    • tails share
      • i.e. 95% confidence β†’ tails share 5%
        • each tail has 2.5%
      • look at any table to get the results

Alpha Table

20%1.282
15%1.440
10%1.645
5%1.960
1%2.576

Small Samples

  • for small samples the Central Limit Theorem is not holding anymore
    • we need another assumption
  • we can use -statistic distributions
    • -statistic distributions have thiccer tails
      • i.e. more extreme events are more likely
    • degree of freedom defines how exact the -static will match the value
      • normally degree of freedom is almost the sample size
      • for large the and value will be ever more similar
        • therefore we don’t need -statistic with larger sample sizes
  • we assume that the population of the sample is normally distributed
    • therefore we can use for β†’ normally not allowed

Large Sample Confidence Intervals

Theory

  • when asking a binary question (e.g. Is coffee overpriced?) the result is a binomial
  • remember average number of successes from Probability
  • … p-hat is number of success over sample size
    • is unbiased, i.e. the expected value = the probability
    • all estimators in QM2 are unbiased
  • expressable as where Bournoulli(p) is either 1 or 0
    • summing up all yes (1) and no (0) values and divide by sample size
  • after Central Limit Theorem this can be considered the same as sample mean as long as is β€œlarge” enough
    • large is true when
      • and
  • therefore
    • therefore
    • and
    • results in
  • now going for the z-value
    • … standard normal
  • finally for the confidence interval
    • … probability of being in the confidence interval

Example Coffee

  • givens:
  • asked for: 95% Confidence Interval
  • solution:

More Theory - Error

  • is just the half width β†’ careful! sometimes the full width is given/asked for
    • is the distance from the center to the Confidence Interval edge
  • therefore
    • always round up … is the minimum, therefore lower would be outside of Confidence Interval