MA401 Notes, Section 7.3

7.3 Distribution of the Sample Mean; Central Limit Theorem

In section 7.1, we looked at the expected value and variance of the sample mean

. Now, we'll look at the type of distribution that

has.

We'll use two important results about normal random variables.

If X is a normal r. v. with mean m, variance s², and c is a constant, then Y = cX is normal, with mean cm, variance c²s².
If X₁ and X₂ are independent normal random variables with means m₁ and m₂ and variances s₁²and s₂², then X₁+X₂ is normally distributed,with mean m₁+m₂, variance s₁²+s₂².

-- the info about means and variances is old hat; what’s new is fact that cX & X₁+X₂ are normal.

To summarize:

a constant times a normal r.v. is normal
the sum of normals is normal.

The proofs of these assertions use properties of moment generating functions, in particular the following:

Distribution of the sample mean

From the above two properties it follows that:

₁

₂

= m

= s

Thus if the original population has a normal distribution, the the sample means from samples of some (fixed) size n will also be normally distributed, with the same mean but a smaller variance (and standard deviation). The density curve for the sample means will thus be bell-shaped, and centered at the same location as the density curve for the population, but will be narrower.

Using the information about the distribution of

ex:

m =

s =

Take samples of size 70, and compute the sample mean for each sample; then the sample means will be normally distributed, with mean m = 7.3 inches and standard deviation s = 1.6/sqrt(70) = .19 inches.

Graphs of densities: the density graph for the sample means from the various samples will be a normal curve, centered at the same location (7.3 inches) as the population density curve, but will be much narrower: the standard deviation of the sample means is just .19, while that for the population is 1.6. Thus the values of the sample mean for various samples will be centered at the population mean, but will tend to vary much less on either side than individuals from the population.

By the normal probability rule, since is normally distributed,

Central Limit Theorem

Let X₁, X₂, ..., X_n be a random sample from a population having any distribution (with mean m, variance s²), not necessarily a normal distribution; then for large n, the sample mean will be approximately normally distributed (with mean m, variance s²/n ).

this is surprising: regardless of distribution of population, distribution of sample mean will be approximately normal for large n!!
thus can use normal calculations with the sample mean even when distribution of population isn’t normal
proof involves looking at moment generating functions

Previous section Next section