Distributions
Think of these as a class
in Python or some other OOP language. If you identify that an RV 'fits' one of these, you can use precalculated/derived formulae for things like the Probability Mass Function, Expected Value , and Variance .
A 'distribution' doesn't have to be anything fancy and can just be a small list that tells people what the probabilities are for each value ("realization") of your Random Variable. E.g. your shiny variable can only take on values and you assign each value ("realization") probabilities of . Bam, done, Distribution.
This applies to joint distributions as well! Let's say can take on and can take on . You'd do a nice table of , , , and . Note that I'm not saying anything about independence or anything. Just a table of values (which may reveal stuff about the relation between and ).
There are rules of course. But that's all really. A function is nice because you won't have to labor over this list for a lot of values (and if you're laboring over , you'll be laboring for a long time indeed).
The Normal Distribution ♥️
TODO: Write more about this and the CLT.
Here's a lovely derivation that's worth watching.
Discrete Distributions
Bernoulli
Simplest one. If an experiment with probability succeeds, you get a 1, else 0.
PMF | ||
---|---|---|
, |
Examples: Coin flip results in a heads. Your friend liked Nights in Rodanthe.
Binomial
Extends Bernoulli to independent trials of your experiment with probability of success
PMF | ||
---|---|---|
Examples: Number of heads in 12 coin flips. Bits being corrupted (think Hamming Codes).
Poisson
Approximates Binomial to cases when probablity is very small and the number of trials is very large. It's pretty important and used in queueing theory.
- Think the number of things that happen over a fixed interval of time at a constant average rate.
- Think of events that are rare, independent (of course), and relatively uniformly distributed in time or space: mutations in a stretch of DNA, calls into a call center, insurance claims filed throughout a day, patients arriving at a clinic, rare diseases reported per month, meteors observed in the night sky, bankruptcies observed in a year...
Geometric
Number of independent trials until first success (which has a probability ).
Negative Binomial
This is the number of independent trials you'll need until a fixed number of successes with probability . Kinda extends the idea of the Geometric.