Expected Value#

If we have some random variable \(X\), we might be interested in knowing the “average” value of \(X\). This concept is captured by the expected value (or mean) \(\mathbb{E}[X]\), which is defined as

\[\mathbb{E}[X] = \sum_{x \in X(\Omega)} x \, p(x)\]

for discrete \(X\) and as

\[\mathbb{E}[X] = \int_{-\infty}^\infty x \, p(x) \,\mathrm{d}x\]

for continuous \(X\).

In words, we are taking a weighted sum of the values that \(X\) can take on, where the weights are the probabilities of those respective values. The expected value has a physical interpretation as the “center of mass” of the distribution.
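To make the two definitions concrete, here is a minimal sketch in NumPy. The particular distributions (a fair six-sided die for the discrete case, a standard normal density for the continuous case) are illustrative choices and not part of the running example below:

import numpy as np

# Discrete case: E[X] as a probability-weighted sum
# (illustrative choice: a fair six-sided die)
values = np.arange(1, 7)
probs = np.full(6, 1 / 6)
print(np.sum(values * probs))  # 3.5

# Continuous case: approximate the integral with a Riemann sum
# (illustrative choice: a standard normal density)
xs = np.linspace(-10.0, 10.0, 200_001)
dx = xs[1] - xs[0]
pdf = np.exp(-xs**2 / 2) / np.sqrt(2 * np.pi)
print(np.sum(xs * pdf) * dx)  # approximately 0.0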


In our running example, the random variable \(X\) (number of heads in two fair coin tosses) has the following distribution:

| \(x\) | \(\mathbb{P}(X = x)\) |
| --- | --- |
| 0 | 0.25 |
| 1 | 0.5 |
| 2 | 0.25 |

We compute the expected value as:

\[ \mathbb{E}[X] = \sum_{x=0}^2 x \cdot \mathbb{P}(X = x) = 0 \cdot 0.25 + 1 \cdot 0.5 + 2 \cdot 0.25 = 1.0 \]
  • This means that, on average, we see 1 head in two tosses of a fair coin.

  • The expected value \(\mathbb{E}[X]\) corresponds to the center of mass of the PMF, and aligns with our intuition about symmetry in a fair coin toss experiment; the short check below verifies this numerically.
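A quick numerical check of this computation, using the PMF from the table above:

import numpy as np

values = np.array([0, 1, 2])         # values X can take
probs = np.array([0.25, 0.5, 0.25])  # P(X = x) from the table
E_X = np.sum(values * probs)         # probability-weighted sum
print(E_X)  # 1.0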

Properties of the expected value#

Linearity of expectation#

A very useful property of the expected value is linearity of expectation:

\[\mathbb{E}\left[\sum_{i=1}^n \alpha_i X_i + \beta\right] = \sum_{i=1}^n \alpha_i \mathbb{E}[X_i] + \beta\]

Note that this holds even if the \(X_i\) are not independent!


Let us see an example. Suppose we toss a fair coin twice, and define:

  • \(X_1 = \mathbb{1}\{\text{first toss is heads}\}\)

  • \(X_2 = \mathbb{1}\{\text{second toss is heads}\}\)

Then:

  • \(\mathbb{E}[X_1] = \mathbb{E}[X_2] = 0.5\)

  • Let \(S = X_1 + X_2\) be the total number of heads

By linearity:

\[ \mathbb{E}[S] = \mathbb{E}[X_1 + X_2] = \mathbb{E}[X_1] + \mathbb{E}[X_2] = 0.5 + 0.5 = 1.0 \]
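A short sketch that verifies this on the four-outcome sample space, and also checks the remark above: linearity still holds for a deliberately dependent pair (here \(Y = X_1\)):

import numpy as np

outcomes = ['hh', 'ht', 'th', 'tt']
probs = np.full(4, 0.25)  # fair coin: uniform over outcomes
X1 = np.array([1 if o[0] == 'h' else 0 for o in outcomes])
X2 = np.array([1 if o[1] == 'h' else 0 for o in outcomes])

# E[X1 + X2] computed directly on the sample space ...
print(np.sum((X1 + X2) * probs))                # 1.0
# ... agrees with E[X1] + E[X2]
print(np.sum(X1 * probs) + np.sum(X2 * probs))  # 1.0

# Linearity does not require independence: Y = X1 is perfectly
# dependent on X1, yet E[X1 + Y] = E[X1] + E[Y] still holds
Y = X1
print(np.sum((X1 + Y) * probs))                 # 1.0
print(np.sum(X1 * probs) + np.sum(Y * probs))   # 1.0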

Product rule for expectation#

If the \(X_i\) are independent, then the product rule for expectation also holds:

\[\mathbb{E}\left[\prod_{i=1}^n X_i\right] = \prod_{i=1}^n \mathbb{E}[X_i]\]

Let’s extend the coin toss example to illustrate the product rule for expectation, which is guaranteed to hold when the random variables are independent:

  • Let \(X_1 = \mathbb{1}\{\text{first toss is heads}\}\), \(X_2 = \mathbb{1}\{\text{second toss is heads}\}\)

  • These are independent indicators, each with \(\mathbb{E}[X_1] = \mathbb{E}[X_2] = 0.5\)

Then:

\[ \mathbb{E}[X_1 \cdot X_2] = \mathbb{P}(\text{first toss is H and second toss is H}) = \mathbb{P}(\text{hh}) = 0.25 \]
\[ \mathbb{E}[X_1] \cdot \mathbb{E}[X_2] = 0.5 \cdot 0.5 = 0.25 \]

So the product rule holds here.

import numpy as np

# Using same definitions from earlier
outcomes = ['hh', 'ht', 'th', 'tt']
X1 = np.array([1 if o[0] == 'h' else 0 for o in outcomes])
X2 = np.array([1 if o[1] == 'h' else 0 for o in outcomes])
X1_X2 = X1 * X2
probs = np.full_like(X1, 0.25, dtype=float)  # uniform distribution

E_X1 = np.sum(X1 * probs)
E_X2 = np.sum(X2 * probs)
E_product = np.sum(X1_X2 * probs)
product_of_expectations = E_X1 * E_X2

print(f"E[X₁]           = {E_X1:.2f}")
print(f"E[X₂]           = {E_X2:.2f}")
print(f"E[X₁·X₂]        = {E_product:.2f}")
print(f"E[X₁]·E[X₂]     = {product_of_expectations:.2f}")
E[X₁]           = 0.50
E[X₂]           = 0.50
E[X₁·X₂]        = 0.25
E[X₁]·E[X₂]     = 0.25
  • Since \(X_1\) and \(X_2\) are independent, we observe:

    \[ \mathbb{E}[X_1 \cdot X_2] = \mathbb{E}[X_1] \cdot \mathbb{E}[X_2] \]
  • This would not hold if the tosses were dependent (e.g., if the second toss always matched the first), as the sketch below shows.
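As a sketch of that dependent case, suppose the second toss always copies the first, so the sample space collapses to \(\{\text{hh}, \text{tt}\}\):

import numpy as np

# Dependent tosses: the second toss always matches the first
outcomes = ['hh', 'tt']
probs = np.full(2, 0.5)
X1 = np.array([1, 0])  # indicator of heads on the first toss
X2 = X1                # the second toss copies the first

E_product = np.sum(X1 * X2 * probs)                     # E[X1·X2] = 0.5
product_of_E = np.sum(X1 * probs) * np.sum(X2 * probs)  # 0.5 · 0.5 = 0.25
print(E_product, product_of_E)  # 0.5 vs 0.25: the product rule fails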