**Quantitative Methods: Basic Concepts**

**Reading 8. Probability Concepts**

**Learning Outcome Statements**

k. calculate and interpret covariance and correlation;

*CFA Curriculum, 2020, Volume 1*

### Why should I choose AnalystNotes?

AnalystNotes specializes in helping candidates pass. Period.

### Subject 7. Covariance and Correlation

Given two random variables, R

_{i}and R

_{j}, the

**covariance**between the two variables is:

Facts about covariance:

- Covariance of returns is negative if, when the return on one asset is above its expected value, the return on the other asset is below its expected value (an average inverse relationship between returns).
- Covariance of returns is 0 if returns on the assets are unrelated.
- Covariance of returns is positive if, when the return on one asset is above its expected value, the return on the other asset is above its expected value (an average positive relationship between returns).
- The covariance of a random variable with itself (own covariance) is its own variance.

*Example*

Suppose that the future short-term outlook for the economy is favorable with a probability 0.6 and unfavorable with a probability of 0.4. For two stocks, F and G, returns are 0.25 and 0.2, respectively, in favorable conditions, and 0.01 and 0.02, in unfavorable conditions. Calculate cov (R

_{f}, R

_{g}).

We must firstly calculate the expected value of the return on each stock:

E[R

_{f}] = 0.6 x 0.25 + 0.4 x 0.01 = 0.154

E[R

_{g}] = 0.6 x 0.2 + 0.4 x 0.02 = 0.128

cov (R

_{f}, R

_{g}) = E[{R

_{f}- E(R

_{f})} x {R

_{g}- E(R

_{g})}] = 0.6 x [{0.25 - 0.154}x {0.2 - 0.128}] + 0.4 x [{0.01 - 0.154}x {0.02-0.128}] = 0.010368

The fact that the answer is positive indicates that the return on both stocks is above (or below) the expected value at the same time. We know that this is the case because both returns are higher in favorable conditions and lower in unfavorable conditions. Had we obtained a negative answer, logic would have told us that we had made an error somewhere.

The

**correlation**between two random variables, R

_{i}and R

_{j}, is defined as:

Alternative notations are corr(R

_{i}, R

_{j}) and ρ

_{ij}.

Properties of correlation:

- Correlation is a number between -1 and +1.
- A correlation of 0 indicates an absence of any linear (straight-line) relationship between the variables.
- Increasingly positive correlation indicates an increasingly strong positive linear relationship (up to 1, which indicates a perfect linear relationship).
- Increasingly negative correlation indicates an increasingly strong negative linear relationship (down to -1, which indicates a perfect inverse linear relationship).

The correlation between two variables represents the degree to which these variables are related. It is important to keep in mind that correlation does not necessarily mean causation. For example, there is a high positive relationship between the number of fire-fighters sent to a fire and the amount of damage done. Does this mean that the fire fighters cause the damage? Or is it more likely that the bigger the fire, the more fire fighters are sent and the more damage is done? In this example, the variable "size of the fire" is the causal variable, correlating with both the number of fire-fighters sent and the amount of damage done.

The relationship among covariance, standard deviation, and correlation:

Using the figures from the previous example, we first need to calculate the two standard deviation terms:

Var(R

_{f}) =[{R

_{f}- E(R

_{f})}

^{2}] x P(R

_{f}) = 0.6 x [{0.25-0.154}

^{2}] + 0.4 x [{0.01-0.154}

^{2}] = 0.013824. Hence, σ(R

_{f}) = 0.117576

Var(R

_{g}) = [{R

_{g}- E(R

_{g})}

^{2}] x P(R

_{g}) = 0.6 x [{0.2-0.128}

^{2}] + 0.4 x [{0.02-0.128}

^{2}] = 0.007776. Hence, σ(R

_{g}) = 0.088182

Also, we know that cov(R

_{f},R

_{g}) = 0.010368.

So, correlation = ρ(R

_{f}, R

_{g})= cov(R

_{f}, R

_{g}) / σ(R

_{f}) x σ(R

_{g}) = 0.010368 / (0.117576 x 0.088182) = 0.99999.

This indicates an almost perfect positive linear relationship between R

_{f}and R

_{g}.

Calculate covariance given a joint probability function:

We can calculate covariance using the joint probability function of the random variables if that can be estimated. The joint probability function of two random variables, X and Y, denoted P(X, Y), gives the probability of joint occurrences of values X and Y. For example, P(3, 2) is the probability that X equals 3 and Y equals 2.

###
**User Contributed Comments**
5

You need to log in first to add your comment. ###### achu

As an alternate method / aside, Cov(F,G) ALSO equals E(FG) - E(F) E(G). In given example, E(FG) equals .6 * (Prob F&G in favorabl economy = .25*.2) PLUS .4 * (Prob F&G in bad economy= .01*.02) = .03008. Cov(F,G) = .03008 - (.154)(.128) = .010368 as above.

###### aakash1108

Thanks @ achu!

###### jpducros

Questions 5 and 6 in this session are interesting and could help save you some time...

###### somebody

Not understand: why E (Rf)= 0.6 * 0.25 + 0.4 * 0.01

Why not 0.6 * 0.25 + 0.4 * 0.25 + 0.01 * 0.25 + 0.02 * 0.25

###### Olesya_CFA

Somebody, the formula for expected return is probability weighted average of possible outcomes. Probabilities in our case are 0.6 and 0.4. Possible outcomes: for stock F - 0.25 and 0.01 in favorable and unfavorable condition, for stock G - 0.2 and 0.02 accordingly. You have to calculate expected return for the stock F separately from the stock G. That is why your suggestion is irrelevant - you mix the stocks in one formula. It should not happen.