Idea: have some suspicion as to value of (unknown) population
parameter; use a sample to test if hypothesis is true
ex:
Setup: use two complementary hypotheses:
ex:
Approach: play devil’s advocate:
ex:
What outcomes would suggest that the assumption p >= .40 is false?
Well, if p >= .40, we expect 8 or more to be registered Democratic; thus if our sample has fewer than 8 Democrats, the assuption that 40% or more of the students are registered Democratic would seem incorrect.
However: even if 40% of the students are Democrats, we certainly won't always get exactly 8 Democrats in every sample of 20 students; we'd expect the number to vary somewhat from sample to sample. For example, it wouldn't be all that unlikely that just due to random chance, we'd get a sample of 20 students in which only 7 are registered Democratic; thus this result wouldn't give strong evidence that there must be fewer than 40% Democrats in the student population.
Thus we really only get strong evidence that we should reject the null hypothesis if the number of Democrats in the sample is far less than the expected number of 8.
We'll use as our rejection region X <= 4;
if p >= .40, it is unlikely we’d get a sample with 4 or fewer Democrats
in it.
In fact, we can quantify just how unlikely this is using probabilities:
Suppose p = .40; then the distribution of X will be the binomial distribution
with n = 20, p = .40
Then the probability that X <= 4 is .0510,
from the table of cumulative probabilities for the binomial distribution.
If p > .40, there’s even a smaller chance that X <= 4.
Thus we conclude that if the null hypothesis is true, there would be
only a 5% chance we'd get a sample with 4 or fewer Democrats in it due
just to sampling variation. While this could happen, it's quite
unlikely, and thus it seems more likely that the null hypothesis is false,
and that there are in fact fewer than 40% Democrats in the student population
at large.
ex:
Well, assuming the null hypothesis is true, that p = .40 or greater, we can see from the table for the binomial distribution with n = 20 and p = .40 that
There are 4 possible outcomes of a hypothesis test:
|
|
|
|
|
|
accept H0 |
|
|
Note: