MA401 Notes, Section 8.3

8.4 Significance Testing

Differs from hypothesis testing in only one way: in both, assume hypothesis true; then:

Hypothesis Test: specify a rejection region (for desired significance level) at outset; see if our test statistic falls into this
Significance Test: compute the probability the value of our test statistic will be as extreme as (or worse than) the observed value if the null hypothesis were true; called the P-value. Reject the null hypothesis if the P-value is small enough.

ex:

What's really the difference between a hypothesis test and a significance test? In the hypothesis test, we decide on the criterion for rejecting the null hypothesis before the sample is taken; if the results of the sample don't meet our criteria, we don't reject the hypothesis. In a significance test, we sample first, and look at the result, and decide if this is unlikely enough for us to conclude that the null hypothesis is incorrect.

Which is better? Both are used, but statisticians prefer the approach used in the hypothesis test, as a way to "keep themselves honest."

If we choose a significance level of .01 in designing a hypothesis test, and then compute our rejection region based on this, we will then only reject the null hypothesis if the sample we get is one of the "1% of worst samples" that would occur if the null hypothesis is true. If we get a sample that is close to our rejection region, but not in it, we don't reject!
In the significance test, we look at the probability that a sample would be "as bad as or worse than" the one we got; if this probability is low, we reject the null hypothesis. But since we haven't specified a clear-cut criterion for rejection, we can sometimes "talk ourselves into" rejecting the hypothesis. For example, the P-value for a sample might come out to be .02; thus there's only a 2% chance we'd get a sample this bad if the null hypothesis is true, and we might be inclined to reject the hypothesis. However, this sample would not have fallen into the rejection region for the hypothesis test with significance level .01, since its P-value is a little larger than the significance level. Thus we would not have permitted ourselves to reject, having previously decided on our criterion.

Hypothesis and Significance Tests on the Population Mean

Frequently, our hypothesis will concern the value of the mean of a population.

ex:

section 7.4

₀

₁

Hypothesis Test

To test, we'll take a sample of 50 boxes, and look at the sample mean; if is sufficiently less than 14.0, we'll conclude that the machine is out of adjustment and that the mean fill is indeed less than 14.0 ounces.

Rejection region: how far below 14.0 should be for us to reject H₀?
Consider:

Assuming that the weights of boxes are normally distributed, will also be normally distributed. (Since n is large (n = 50), will be normally distributed even if the weights aren't normally distributed, by the Central Limit Theorem.)
Thus

We want the Z value such that only 5% of Z-values would be below this value just due to random chance; this value is the critical value -z_.05 = -1.645.

Solving for , we find that if H₀ is true, for only 5% of samples will

This gives us our rejection region; reject if our sample yields <= 13.93

Significance Test
To test the hypothesis via a significance test, we would not bother to figure out a rejection region; we'd just take a sample, look at the value of obtained, and compute its P-value to see if it's low enough for us to reject the null hypothesis.

To compute the P-value for the sample above, with = 13.88 ounces, we need to find the probability that we'd get a sample with a mean this low or lower just by chance if the null hypothesis is in fact true. Thus we want to find P( <= 13.88), assuming that the population mean is m = 14.0.
Using the fact that is normally distributed, we use the Z distribution to compute this probability:

Thus there's only a 2% chance we'd get a sample with a mean this low or lower if the null hypothesis were in fact true; since this is quite unlikely, we would reject the null hypothesis.

Previous section Next section