This note on random variables was prompted by confusing notation in several math textbooks. I'll explain random variables (in measure-theoretic terms) as verbosely as I can, and then prove some results. This article assumes that the reader is familiar with probability triples $(\Omega, \mathcal{F}, P)$ and has a basic idea of what random variables are in non-measure-theoretic terms.

1. Random Variable Prerequisites

We start by defining measurable spaces and measurable functions.

Definition 1.1: A Measurable space $(\Omega, \mathcal{F})$ consists of a set $\Omega$ and a $\sigma$-algebra $\mathcal{F}$ defined on $\Omega$.

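Definition 1.2: A Generated $\sigma$-algebra $\sigma(\mathcal{C})$ is the smallest $\sigma$-algebra containing a specified collection of sets. That is, if $\mathcal{C}$ is a set of subsets of $\Omega$, $\sigma(\mathcal{C})$ is the smallest $\sigma$-algebra such that $\mathcal{C} \subseteq \sigma(\mathcal{C})$.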
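On a finite set, the generated $\sigma$-algebra can be computed by brute force, which may help make Definition 1.2 concrete. The sketch below is only an illustration: the function name `generate_sigma_algebra` and the example set `{1, 2, 3, 4}` are assumptions of mine, and closing under complements and pairwise unions is enough only because the space is finite.

```python
from itertools import combinations

def generate_sigma_algebra(omega, collection):
    """Smallest sigma-algebra on the finite set `omega` containing `collection`."""
    omega = frozenset(omega)
    sets = {frozenset(), omega} | {frozenset(c) for c in collection}
    while True:
        # Candidate new members: all complements and all pairwise unions.
        new = {omega - a for a in sets} | {a | b for a, b in combinations(sets, 2)}
        if new <= sets:          # nothing new, so the family is closed
            return sets
        sets |= new

# Hypothetical example: the sigma-algebra generated by the single set {1}.
sigma = generate_sigma_algebra({1, 2, 3, 4}, [{1}])
for s in sorted(sigma, key=lambda s: (len(s), sorted(s))):
    print(set(s) or "{}")
```

Here the result has four members: the empty set, `{1}`, its complement `{2, 3, 4}`, and the whole set.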

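Definition 1.3: A Measurable Function between two measurable spaces $(\Omega_1, \mathcal{F}_1)$ and $(\Omega_2, \mathcal{F}_2)$ is a function $f: \Omega_1 \to \Omega_2$ such that for every $B \in \mathcal{F}_2$, $\{\omega \in \Omega_1 : f(\omega) \in B\} \in \mathcal{F}_1$.

Since the set $\{\omega \in \Omega_1 : f(\omega) \in B\}$ is used so commonly in the context of measurable functions, it has a special notation:

Definition 1.4: $f^{-1}(B) = \{\omega \in \Omega_1 : f(\omega) \in B\}$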

NOTE: The above notation is confusing, but is unfortunately the norm when dealing with measurable functions. In the context of measurable functions, $f^{-1}$ does not refer to the inverse of $f$ (which would be a function from $\Omega_2$ to $\Omega_1$, and need not exist), but rather to the preimage of a set: $f^{-1}(B)$ collects every element of $\Omega_1$ that $f$ maps into the set $B$ of the $\sigma$-algebra $\mathcal{F}_2$.
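To see the difference between a preimage and an inverse function, here is a small sketch on a finite domain. The helper `preimage` and the squaring map are hypothetical choices for illustration; the squaring map is deliberately not injective, so it has no inverse function, yet every preimage is still well defined.

```python
def preimage(f, domain, target_set):
    """Return f^{-1}(B) = {w in domain : f(w) in B}."""
    return frozenset(w for w in domain if f(w) in target_set)

omega = {-2, -1, 0, 1, 2}        # hypothetical finite domain
square = lambda w: w * w         # not injective: -2 and 2 both map to 4

print(preimage(square, omega, {4}))      # both points that square to 4
print(preimage(square, omega, {3}))      # empty, since nothing squares to 3
print(preimage(square, omega, {0, 1}))   # preimage of a two-element set
```

The preimage of a set can contain several points, one point, or none, which is why it is a statement about sets rather than about an inverse map.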

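Measurable functions can also be defined in terms of the $\sigma$-algebra generated by a function, rather than one generated by a collection of sets:

Definition 1.5: The $\sigma$-algebra generated by a function $f: \Omega_1 \to \Omega_2$, where $(\Omega_2, \mathcal{F}_2)$ is a measurable space, is the collection of all inverse images $\sigma(f) = \{f^{-1}(B) : B \in \mathcal{F}_2\}$.

According to this definition, if $\sigma(f) \subseteq \mathcal{F}_1$, then $f$ is a measurable function from $(\Omega_1, \mathcal{F}_1)$ to $(\Omega_2, \mathcal{F}_2)$.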
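The containment test above is easy to run on finite spaces. In the sketch below, the names `sigma_generated_by` and `is_measurable`, the parity map, and both candidate $\sigma$-algebras on the domain are assumptions made for illustration only.

```python
from itertools import chain, combinations

def powerset(s):
    """All subsets of the finite set `s`, as frozensets."""
    s = list(s)
    subsets = chain.from_iterable(combinations(s, r) for r in range(len(s) + 1))
    return {frozenset(c) for c in subsets}

def sigma_generated_by(f, omega1, sigma2):
    """sigma(f): the collection of preimages f^{-1}(B) for B in sigma2."""
    return {frozenset(w for w in omega1 if f(w) in b) for b in sigma2}

def is_measurable(f, omega1, sigma1, sigma2):
    """f is measurable when sigma(f) is contained in sigma1."""
    return sigma_generated_by(f, omega1, sigma2) <= set(sigma1)

omega1 = {1, 2, 3, 4}
parity = lambda w: w % 2                 # hypothetical map into {0, 1}
sigma2 = powerset({0, 1})

fine = powerset(omega1)                  # every subset is an event
coarse = {frozenset(), frozenset({1}), frozenset({2, 3, 4}), frozenset(omega1)}

print(is_measurable(parity, omega1, fine, sigma2))
print(is_measurable(parity, omega1, coarse, sigma2))
```

With the power set on the domain the parity map is measurable. With the coarse $\sigma$-algebra it is not, because the preimage `{1, 3}` of `{1}` is not one of the four available events.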

2. Random Variables

Random variables are, unfortunately, neither random nor variables. This is the first of many misnomers that we encounter in their study.

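Definition 2.1: A Random Variable $X$ defined on a probability triple $(\Omega, \mathcal{F}, P)$ is a measurable function $X: (\Omega, \mathcal{F}) \to (\mathbb{R}, \mathcal{B})$, where $\mathcal{B}$ is the Borel $\sigma$-algebra on $\mathbb{R}$.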

In its simplest terms, a random variable is simply a function from $\Omega$ to $\mathbb{R}$, obeying some 'nice' rules which allow us to use probability measures with it. These nice rules will come into play a bit later, after we first see how random variables and probability measures go hand in hand.

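Consider a discrete uniform probability space $(\Omega, \mathcal{F}, P)$ on a finite sample space $\Omega$, and let our random variable $X$ map $\Omega$ to $\mathbb{R}$. A graphical depiction of this would look something like this:

[Figure: the map $X$ from $\Omega$ to $\mathbb{R}$]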

Now, suppose we had to calculate the probability that the random variable $X$ would be less than or equal to 2.5. The probability of this event occurring is given by $P(\{\omega \in \Omega : X(\omega) \le 2.5\})$. From the inverse notation we developed in Definition 1.4, we can also write this as $P(X^{-1}((-\infty, 2.5]))$. From the graph, we clearly see that exactly two elements of $\Omega$ are in this set, hence $P(X \le 2.5) = 2/|\Omega|$ under the uniform measure. This is how random variables and probability measures go hand in hand.
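The calculation above can be mirrored in a few lines. The concrete sample space `{1, 2, 3, 4}` and the identity map used here are assumptions chosen for illustration, not a reconstruction of the example in the text; any finite uniform space would work the same way.

```python
from fractions import Fraction

omega = frozenset({1, 2, 3, 4})          # hypothetical sample space

def P(event):
    """Discrete uniform measure: every subset of omega is an event."""
    return Fraction(len(event), len(omega))

def X(w):
    """Hypothetical random variable: the identity map into the reals."""
    return w

# The event {X <= 2.5} is the preimage X^{-1}((-inf, 2.5]).
event = frozenset(w for w in omega if X(w) <= 2.5)
prob = P(event)

print(sorted(event), prob)
```

The probability is computed on the preimage, a subset of the sample space, and never on the interval itself.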

Why, then, do random variables need to be measurable functions? Note that the probability measure $P$ is only defined for sets in $\mathcal{F}$, and if $X$ is not measurable, we cannot find the probability of certain events associated with $X$.

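An example of this is to consider a probability triple $(\Omega, \mathcal{F}, P)$ whose $\sigma$-algebra $\mathcal{F}$ does not contain every subset of $\Omega$, together with the same map $X$ as before. If we try to calculate the probability that $X$ is less than or equal to 2.5 now, we find that $P(X \le 2.5)$ is undefined, as $X^{-1}((-\infty, 2.5]) \notin \mathcal{F}$. Hence, $X$ is not a random variable, as it is not measurable on $(\Omega, \mathcal{F})$. More specifically, $\sigma(X) \not\subseteq \mathcal{F}$, hence $X$ is not measurable.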
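A minimal sketch of this failure, under assumed specifics: the sample space `{1, 2, 3, 4}`, the four-set $\sigma$-algebra, the measure values, and the identity map are all hypothetical. The measure is stored as a dictionary whose keys are exactly the events in $\mathcal{F}$, so asking for the probability of a set outside $\mathcal{F}$ finds nothing.

```python
from fractions import Fraction

omega = frozenset({1, 2, 3, 4})          # hypothetical sample space

# Hypothetical probability measure, defined only on a coarse sigma-algebra.
P = {
    frozenset(): Fraction(0),
    frozenset({1}): Fraction(1, 4),
    frozenset({2, 3, 4}): Fraction(3, 4),
    omega: Fraction(1),
}

def X(w):
    """Hypothetical map into the reals (the identity)."""
    return w

def probability_le(x):
    """P(X <= x), or None when the event {X <= x} is not in the sigma-algebra."""
    event = frozenset(w for w in omega if X(w) <= x)
    return P.get(event)

print(probability_le(1.5))   # the event {1} is in F, so this is defined
print(probability_le(2.5))   # the event {1, 2} is not in F, so this is None
```

The map is unchanged from the measurable case; only the $\sigma$-algebra got smaller, and that alone is enough to make $P(X \le 2.5)$ meaningless.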

3. Results on Random Variables

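Claim 3.1: If $X$ is a random variable on $(\Omega, \mathcal{F}, P)$, then $\sigma(X) \subseteq \mathcal{F}$.

A simple (maybe even obvious) claim; the proof is by definition: every element of $\sigma(X)$ has the form $X^{-1}(B)$ for some $B \in \mathcal{B}$, and $X^{-1}(B) \in \mathcal{F}$ because $X$ is measurable.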

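Claim 3.2: If $\mathbf{1}_A$ is the indicator of some event $A \in \mathcal{F}$, then $\mathbf{1}_A$ is a random variable.

Proof: For all $B \in \mathcal{B}$, we have $\mathbf{1}_A^{-1}(B)$ equal to one of $A$ (if $B$ contains 1 and not 0), $A^c$ (if $B$ contains 0 and not 1), $\emptyset$ (if $B$ contains neither 0 nor 1) or $\Omega$ (if $B$ contains both 0 and 1). All four sets are in $\mathcal{F}$. Hence, $\mathbf{1}_A$ is a random variable.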
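The four cases in the proof can be checked directly on a finite example. The sample space, the event `A`, and the test sets below are hypothetical; finite sets stand in for Borel sets, since only whether a set contains 0 or 1 matters.

```python
def indicator(a):
    """Return the indicator function of the event `a`."""
    return lambda w: 1 if w in a else 0

def preimage(f, domain, b):
    """Return f^{-1}(b) = {w in domain : f(w) in b}."""
    return frozenset(w for w in domain if f(w) in b)

omega = frozenset(range(6))              # hypothetical sample space
A = frozenset({0, 2, 4})                 # hypothetical event
ind_A = indicator(A)

cases = {
    "contains 1, not 0": {1, 7.5},
    "contains 0, not 1": {0, -3},
    "contains neither": {2, 3},
    "contains both": {0, 1},
}
results = {name: preimage(ind_A, omega, b) for name, b in cases.items()}

for name, pre in results.items():
    print(name, "->", sorted(pre))
```

Whatever set is chosen, the preimage is always one of `A`, its complement, the empty set, or the whole sample space.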

The next two claims will be key to proving results about functions of random variables.

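Claim 3.3: If $f: (\Omega_1, \mathcal{F}_1) \to (\Omega_2, \mathcal{F}_2)$ and $g: (\Omega_2, \mathcal{F}_2) \to (\Omega_3, \mathcal{F}_3)$ are two measurable functions, then $g \circ f$ is also a measurable function.

Proof: For all $B \in \mathcal{F}_3$, since $g$ is measurable, $g^{-1}(B) \in \mathcal{F}_2$. Since $f$ is measurable, $f^{-1}(g^{-1}(B)) \in \mathcal{F}_1$. As $(g \circ f)^{-1}(B) = f^{-1}(g^{-1}(B))$, $g \circ f$ is measurable.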
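The proof rests on the identity $(g \circ f)^{-1}(B) = f^{-1}(g^{-1}(B))$, which the following sketch checks on a small finite example. The two maps and the sets involved are arbitrary choices for illustration.

```python
def preimage(f, domain, b):
    """Return f^{-1}(b) = {w in domain : f(w) in b}."""
    return frozenset(w for w in domain if f(w) in b)

omega1 = frozenset(range(-3, 4))         # hypothetical first space
f = lambda w: w * w                      # maps omega1 into {0, 1, 4, 9}
omega2 = frozenset(f(w) for w in omega1)
g = lambda v: v % 3                      # maps omega2 into {0, 1}

B = {1}
composed = preimage(lambda w: g(f(w)), omega1, B)       # (g o f)^{-1}(B)
two_step = preimage(f, omega1, preimage(g, omega2, B))  # f^{-1}(g^{-1}(B))

print(sorted(composed), sorted(two_step))
```

Pulling a set back through the composition in one step gives the same result as pulling it back through `g` and then through `f`.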

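Claim 3.4: $f: (\Omega_1, \mathcal{F}_1) \to (\Omega_2, \sigma(\mathcal{C}))$ is measurable if $f^{-1}(C) \in \mathcal{F}_1$ for every $C \in \mathcal{C}$.

Proof: Note that $f^{-1}\left(\bigcup_i B_i\right) = \bigcup_i f^{-1}(B_i)$, and $f^{-1}(B^c) = \left(f^{-1}(B)\right)^c$. This, along with the fact that $\mathcal{F}_1$ is a $\sigma$-algebra, proves that $\{B \subseteq \Omega_2 : f^{-1}(B) \in \mathcal{F}_1\}$ is a $\sigma$-algebra containing $\mathcal{C}$. Since $\sigma(\mathcal{C})$ is the smallest $\sigma$-algebra containing $\mathcal{C}$, $\sigma(\mathcal{C})$ is a subset of the above $\sigma$-algebra, hence the claim is true.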
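As a sanity check of the claim on finite spaces, the sketch below verifies the preimage condition on the generators only, and then confirms that it holds on the whole generated $\sigma$-algebra. All names, both spaces, and the map are hypothetical, and the brute-force closure works only because the sets are finite.

```python
from itertools import combinations

def generate_sigma_algebra(omega, collection):
    """Smallest sigma-algebra on the finite set `omega` containing `collection`."""
    omega = frozenset(omega)
    sets = {frozenset(), omega} | {frozenset(c) for c in collection}
    while True:
        new = {omega - a for a in sets} | {a | b for a, b in combinations(sets, 2)}
        if new <= sets:
            return sets
        sets |= new

def preimage(f, domain, b):
    """Return f^{-1}(b) = {w in domain : f(w) in b}."""
    return frozenset(w for w in domain if f(w) in b)

omega1 = {1, 2, 3, 4, 5, 6}
omega2 = {0, 1, 2}
f = lambda w: w % 3                                     # hypothetical map

F1 = generate_sigma_algebra(omega1, [{3, 6}, {1, 4}])   # sigma-algebra on the domain
C = [{0}, {1}]                                          # generating collection
sigma_C = generate_sigma_algebra(omega2, C)

generators_ok = all(preimage(f, omega1, c) in F1 for c in C)
everything_ok = all(preimage(f, omega1, b) in F1 for b in sigma_C)
print(generators_ok, everything_ok, len(sigma_C))
```

Only two preimages had to be checked by hand, yet the condition holds for all eight sets of the generated $\sigma$-algebra.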

The above claim ensures that we don't need to check the preimage of every set in a $\sigma$-algebra. Checking it for a generating collection is enough, e.g. for $\mathcal{B}$, it's sufficient to show that the preimages of open sets lie in the domain's $\sigma$-algebra, something that we'll use in the next proof.

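Claim 3.5: Every continuous function $f: \mathbb{R} \to \mathbb{R}$ is measurable.

Proof: From (3.4), it's sufficient to prove that for every open set $U \subseteq \mathbb{R}$, $f^{-1}(U) \in \mathcal{B}$. This follows from the continuity of $f$: $f$ is continuous iff $U$ being open implies that $f^{-1}(U)$ is also open, and every open set belongs to $\mathcal{B}$. Hence, $f$ is measurable.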

The above three claims give us the following very powerful result: every continuous function of a random variable is also a random variable. We can make a stronger claim, after proving the following claim as well:

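Claim 3.6: If $X$ and $Y$ are random variables on $(\Omega, \mathcal{F}, P)$, then $X + Y$ and $XY$ are random variables as well.

Proof: This cute proof comes from Rosenthal. It's sufficient to prove that $X + Y$ is measurable on the collection of sets $\{(-\infty, x) : x \in \mathbb{R}\}$, as the generated $\sigma$-algebra of this collection is $\mathcal{B}$. Hence, consider the set $\{X + Y < x\}$. Since $X + Y < x$ means $X < x - Y$, the density theorem lets us find a rational number $r$ in $(X, x - Y)$ (I've dropped the $\omega$, as it's implicit here). Hence,

$$\{X + Y < x\} = \bigcup_{r \in \mathbb{Q}} \left( \{X < r\} \cap \{Y < x - r\} \right)$$

Since all the sets in the union belong to $\mathcal{F}$, the union is countable, and $\mathcal{F}$ is a $\sigma$-algebra, $X + Y$ is a random variable.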
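The decomposition of $\{X + Y < x\}$ into a union over rationals can be checked numerically on a finite sample space. Everything concrete here is an assumption for illustration: the sample space, the two maps, the threshold, and in particular the finite grid of rationals, which stands in for all of $\mathbb{Q}$ and is fine enough only for this particular example.

```python
from fractions import Fraction

omega = range(6)                         # hypothetical sample space

def X(w):
    return Fraction(w, 2)                # hypothetical random variable

def Y(w):
    return Fraction(w * w, 4)            # hypothetical random variable

x = Fraction(3)

# Left-hand side: the event {X + Y < x}.
lhs = {w for w in omega if X(w) + Y(w) < x}

# Right-hand side: union over rationals r of {X < r} intersected with {Y < x - r}.
grid = [Fraction(k, 10) for k in range(-50, 51)]   # finite stand-in for Q
rhs = set()
for r in grid:
    rhs |= {w for w in omega if X(w) < r} & {w for w in omega if Y(w) < x - r}

print(sorted(lhs), sorted(rhs))
```

No point outside `lhs` can enter the union, because $X < r$ and $Y < x - r$ together force $X + Y < x$. The grid only has to be fine enough to put some $r$ strictly between $X$ and $x - Y$ for each point inside.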

$XY$ is also a random variable, as $XY = \frac{1}{2}\left[(X + Y)^2 - X^2 - Y^2\right]$, and sums and continuous functions of random variables are random variables, from the previous claims.

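We are now free to extend the claim that every continuous function of a random variable is a random variable to piecewise continuity: every piecewise continuous function of a random variable is also a random variable. If $g$ is piecewise continuous with finitely many pieces, then $g(X) = \sum_i g_i(X) \mathbf{1}_{\{X \in I_i\}}$, where the $g_i(X)$ are random variables as each $g_i$ is continuous, and the $I_i$ are disjoint intervals. Each indicator is a random variable by claim (3.2), since $\{X \in I_i\} \in \mathcal{F}$. From claim (3.6), $g(X)$ is a sum of products of random variables, and hence is also a random variable.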
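The decomposition of a piecewise continuous function into continuous pieces times indicators can be illustrated as follows. The three pieces and their intervals are a made-up example; the point is only that the sum of pieces weighted by indicators agrees with the usual case-by-case definition.

```python
# Each entry pairs a continuous piece g_i with the indicator condition of its interval I_i.
pieces = [
    (lambda x: -x,      lambda x: x < 0),        # I_1 = (-inf, 0)
    (lambda x: x * x,   lambda x: 0 <= x < 1),   # I_2 = [0, 1)
    (lambda x: 2.0 - x, lambda x: x >= 1),       # I_3 = [1, inf)
]

def g_direct(x):
    """The piecewise function written case by case."""
    if x < 0:
        return -x
    if x < 1:
        return x * x
    return 2.0 - x

def g_sum(x):
    """The same function as a sum of g_i(x) times the indicator of I_i."""
    return sum(g_i(x) * (1 if in_i(x) else 0) for g_i, in_i in pieces)

samples = [-2, -0.5, 0, 0.5, 1, 3]
print([(x, g_direct(x), g_sum(x)) for x in samples])
```

Because the intervals are disjoint, exactly one indicator is 1 at each point, so the sum picks out the right piece.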

4. References

  1. Rosenthal, Jeffrey S. A First Look at Rigorous Probability Theory. World Scientific, 2006. Open WorldCat, http://public.ebookcentral.proquest.com/choice/publicfullrecord.aspx?p=5227675
  2. Lebanon, Guy, editor. Probability: The Analysis of Data ; Vol. 1. 2012. Available online at http://theanalysisofdata.com/probability/0_2.html
  3. Math StackExchange, Wikipedia, etc etc :)