MA401 Notes, Section 5.2

5.2 Expectation & Covariance

Def: Let X and Y be discrete random variables with joint density function f(x,y), and let H(X,Y) be any function of X & Y (or either alone). Then the expected value of H is

ex:

Consider the plant example from previous section, where X = number of stems on a plant and Y = # of blooms. Then

=   (0)(.22) + (1)(.12) + (2)(0)
   +   (0)(.09) + (2)(.25) + (4)(.15)
   +   (0)(.01) + (3)(.07) + (6)(.09)

= 1.97
so the product of the number of stems and number of blooms averages 1.97.

=   (1)(.22) + (1)(.12) + (1)(0)
   +   (2)(.09) + (2)(.25) + (2)(.15)
   +   (3)(.01) + (3)(.07) + (3)(.09)

= 1.83
so there are on average 1.83 stems per plant.

E(X+Y) = E(X) + E(Y) from properties of expectation

Note: denote E(X) by m_X, and E(Y) by m_Y.

Q: when is it the case that E(XY) = E(X) E(Y)?

Theorem: If X,Y are independent, then

E(XY) = E(X) E(Y)

Def: The covariance of X and Y is defined to be

What does this measure?
Consider:

X - m_X measures how far X is from the mean for X; it's positive if X is above the mean, negative if X is below the mean
Y - m_Y measures how far Y is from the mean for Y; positive if Y is above the mean, negative if Y is below the mean
(X - m_X)(Y - m_Y)

will be positive if X and Y are both above or both below their means
will be negative if X is above average, Y is below, and vice-versa

E((X - m_X)(Y - m_Y)) is the average value of the product

will be positive if above-average values of X tend to occur with above-average values of Y
will be negative if above-average values of X tend to occur with below-average values of Y

cov(X,Y) thus measures whether X & Y tend to "vary together"

Computational formula for covariance:

cov (X,Y) = E(XY) ? E(X)E(Y).

ex:

cov(X,Y) = E(XY) - E(X)E(Y) = 1.97 - (1.83)(.92) = .2864

note

Note:

If X & Y are independent, then cov(X,Y) = 0

follows because then E(XY) = E(X) E(Y)
makes sense: if X and Y are independent, whether X is above or below average should have no influence on the value of Y; thus X and Y wouldn't tend to vary together

Converse not true; just because cov(X,Y) = 0, doesn't mean X, Y are independent!

Previous section Next section