Sample Spaces, Events, Probability, & Random Variables
The World is a Random Place and Uncertainty Prevails. We would like to study this uncertainty and use our learnings to maximize Shareholder Value™ which is, or ought to be, the Primary Goal of our short lives.
Sample Spaces
In the World, you perform an Experiment which leads to Outcomes . The space of all possible outcomes of your experiment1 is its Sample Space .
Events and Event Spaces
An Event is just a set of outcomes in the Sample Space. That's really it.
But an Event Space is a bit subtle. It is the set of all sets of outcomes in the sample space2. That is, it's the set of all possible Events.
Formally, to say that is an event space,
- must contain and (nothing and the entire kit-n-kaboodle)
- If some event then ("closed under complement")
- If events then ("closed under union")
- If 2 and 3 are true, you can use both to conclude that things are also "closed under intersection":
So with this definition, and within a legit event space, an event can be no outcome3, a single outcome, or a collection of outcomes (including all outcomes) you're interested in. The closure rules mean that you pick an event , it 'generates' its own complement in the event space4. This is nice because you can now use all manner of logical tricks to compute probabilities!
Sizes of Event Spaces
If you define an Event Space as the set of all subsets of a given set, you're just talking about the Powerset of the Sample Space in the discrete case. That is,
Say you toss a coin 100 times. That'll give you 100 outcomes in the Sample Space (i.e. ). But we're talking about the Event Space, which is the set of all sets of outcomes in the sample space. That thing's size is , an unimaginably large number.
When it comes to continuous outcomes, things get pretty complicated in that you're looking at both Sample and Event Spaces that are uncountably infinite in size ("cardinality"). There are a few things Math people do about this that involves Measure Theory to tame these Beasts of Infinity. That's all I gotta say about that for now.
Probability and Probability Measures
Easy-peasy with the definitions of Events and Event Spaces. Let's go in reverse tho. A Probability Measure is a kind of 'rulebook' that looks at an Event Space and assigns numbers ("sizes") to all Events within it4. That is, is a function that maps an event to . This number corresponding to a single event of interest, is the Probability of the event.
The Probability Measure must satisfy certain rules which we assume to be true ("axioms"):
-
If event then
Probability of all events are non-negative real numbers -
and
"Something will happen": probability that at least one of the elementary events (those with just one outcome) will occur is 1. -
If events and are disjoint5 () then
So what determines the rules of this 'rulebook'?
Random Variables
For non-Mathematicians, this is 'simply' a function that takes an outcome and maps it to a number that is in the Reals ().
Let's say you're measuring the heights of 1,237 people. The height is the Random Variable. It is denoted , which is a function that will map the height of Alice (Outcome of 1,237 outcomes) to 143.12cm and the height of Bob (Outcome ) height to 112.01cm. That is,
And so on. Whence the 'random' part of a Random Variable? Because you cannot predict what the mapping to will be before you conduct the experiment ("The World in Random and Uncertainty Prevails"). And if you collect heights again with a different sample, will map to different numbers.
Random variables can be discrete/categorical or continuous.
Discrete - The Probability Mass Function (PMF)
This is denoted with a lowercase and is defined as the probability that
Continuous - The Probability Density Function (PDF)
This is also denoted with a lowercase and is defined as the probability that
Summing Things - The Cumulative Distribution Function (CDF)
This is denoted
TODO: Finish this.
Media
"The Big Picture of Statistics" by Very Normal (a Columbia Biostats grad!)
Footnotes
-
Even those you haven't seen! ↩
-
A lovely combinatorial explosion! ↩
-
Being interested in no outcomes might be weird but consider being interested in "rolling a 16" when you have a six-sided die. where is the sample space. ↩
-
This some -algebra and Measure Theory stuff. Take it to be true. ↩ ↩2
-
We're talking mutually exclusive events, not independent ones. That's totally different. E.g. rolling 3 and 5 on a single die cannot happen at once (Mutually Exclusive, ) but can on two dice (Independent, ). ↩