Random variables

Random variables#

A random variable is some uncertain quantity with an associated probability distribution over the values it can assume.

Formally, a random variable on a probability space $(Ω, F, P)$ is a function $X : Ω \to R$ .

We denote the range of $X$ by $X (Ω) = {X (ω) : ω \in Ω}$ .

To give a concrete example, suppose $X$ is the number of heads in two tosses of a fair coin.

The sample space is

Ω = {h h, t t, h t, t h}

and $X$ is determined completely by the outcome $ω$ , i.e. $X = X (ω)$ .

For example, the event $X = 1$ is the set of outcomes ${h t, t h}$ .

Below is a visualization of the random variable $X$ mapping the sample space $Ω$ to the real numbers $R$ .

../_images/a6cd5133ece88adc0a3f5950d62edfea3370a1afcad7e8cd49d42faa55f33494.png

It is possible to define multiple random variables on the same probability space.

For example, we can define the random variable $Y$ to be the number of tails in two tosses of a fair coin.

The sample space is still

Ω = {h h, h t, t h, t t}

and $Y$ is determined completely by the outcome $ω$ , i.e. $Y = Y (ω)$ .

Below is a visualization of the joint random variable $(X, Y)$ mapping the sample space $Ω$ to the real numbers $R^{2}$ .

../_images/ba3a31ed9dc906fc3c2baf3d262a8ff28c32215fa14792b36b46d05eead334ab.png

It is common to talk about the values of a random variable without directly referencing its sample space. The two are related by the following definition: the event that the value of $X$ lies in some set $S \subseteq R$ is

X \in S = {ω \in Ω : X (ω) \in S}

Note that special cases of this definition include $X$ being equal to, less than, or greater than some specified value.

For example,

P (X = x) = P ({ω \in Ω : X (ω) = x})

A word on notation: we write $p (X)$ to denote the entire probability distribution of $X$ and $p (x)$ for the evaluation of the function $p$ at a particular value $x \in X (Ω)$ .

If $p$ is parameterized by some parameters $θ$ , we write $p (X; θ)$ or $p (x; θ)$ , unless we are in a Bayesian setting where the parameters are considered a random variable, in which case we condition on the parameters.

The cumulative distribution function#

The cumulative distribution function (c.d.f.) gives the probability that a random variable is at most a certain value:

F (x) = P (X \leq x)

The c.d.f. is a non-decreasing function that is right-continuous.

../_images/54ddf1a4f0e4985cc7ad469c6ca63a23ad9d4c747ec6bfd3336781dd2e5a3d4f.png

The c.d.f. can be used to give the probability that a variable lies within a certain range:

P (a < X \leq b) = F (b) - F (a)

Discrete random variables#

A discrete random variable is a random variable that has a countable range and assumes each value in this range with positive probability.

Discrete random variables are completely specified by their probability mass function (p.m.f.) $p : X (Ω) \to [0, 1]$ which satisfies

\sum_{x \in X (Ω)} p (x) = 1

For a discrete $X$ , the probability of a particular value is given exactly by its p.m.f.:

P (X = x) = p (x)

Going back to our example. Below, we visualize the probability mass function (PMF) of $X$ , which gives the probability of each value of $X$ .

../_images/c25d0eff914c327287eedd7f0097a512ea7bb9b83a19a53ac0dda3cc825fea25.png

Continuous random variables#

A continuous random variable is a random variable that has an uncountable range and assumes each value in this range with probability zero.

Most of the continuous random variables that one would encounter in practice are absolutely continuous random variables, which means that there exists a function $p : R \to [0, \infty)$ that satisfies

F (x) \equiv \int_{- \infty}^{x} p (z) d z

The function $p$ is called a probability density function (abbreviated p.d.f.) and must satisfy

\int_{- \infty}^{\infty} p (x) d x = 1

As an example for a continuous random variable, consider patient body temperature in a hospital.

Let

$Ω$ : all possible physiological states of a patient at a point in time (heart rate, immune response, infection status, etc.)
$F$ : a $σ$ -algebra over measurable subsets of these states
$P$ : probability measure capturing likelihoods over patient states

Define the random variable:

T : Ω \to R

that maps each state $ω \in Ω$ to a measured body temperature, e.g.,

T (ω) = body temperature (in °C) in state ω

Typical human temperatures range roughly between:

T (Ω) \subseteq [34.0, 42.0] \subset R

The average temperature in the United States is around 36.6°C, but there is some spread around this value and the distribution is not symmetric. Below is a visualization of the probability density function of the body temperature distribution (modelled as a gamma distribution).

../_images/a3802d5c6e90ee2428b81d116b60d4a3cf4d1f9f04aa8eb865dd6951adf7b316.png

Below is a visualization of the cumulative distribution function of the body temperature distribution.

../_images/bf79238d445670a490a39d1851ff65e125fb4789b3ecdf36a8a9de217d022dda.png

The values of the probability density function are not themselves probabilities, since they could exceed 1.

However, they do have a couple of reasonable interpretations.

One is as relative probabilities; even though the probability of each particular value being picked is technically zero, some points are still in a sense more likely than others.

../_images/32714cb119836fbb75fa47dea81b424a771849567d4bc4af39e3749227c205e4.png

One can also think of the density as determining the probability that the variable will lie in a small range about a given value. This is because, for small $ϵ > 0$ ,

P (x - ϵ \leq X \leq x + ϵ) = \int_{x - ϵ}^{x + ϵ} p (z) d z \approx 2 ϵ p (x)

using a midpoint approximation to the integral.

../_images/1d886aff2690feb9343f67836ff8bf37a62e02c7f95c3f2d017084308b39a28b.png

True probability ≈ 0.1528, Midpoint approx ≈ 0.1531

Here are some useful identities that follow from the definitions above:

\begin{array}{r} \begin{aligned} P (a \leq X \leq b) & = \int_{a}^{b} p (x) d x \\ p (x) & = F^{'} (x) \end{aligned} \end{array}

../_images/046e8ac0206a7a44a8905b1d2524a922f636c8e5eb70b5e33670bcf2665bf77e.png

Other kinds of random variables#

There are random variables that are neither discrete nor continuous. For example, consider a random variable determined as follows: flip a fair coin, then the value is zero if it comes up heads, otherwise draw a number uniformly at random from $[1, 2]$ . Such a random variable can take on uncountably many values, but only finitely many of these with positive probability.

We will not discuss such random variables because they are rather pathological and require measure theory to analyze.