Hypothesis Testing
TODO: Finish this.
You're a lawyer in a courtroom prosecuting someone for insurance fraud.
The Law Says
"Innocent until proven guilty."
You Say
"Guilty."
Judge
"I have a really high bar to rule 'Beyond Reasonable Doubt' that someone's guilty."
Evidence
Emails, texts, tax records, spouse's statements.
What you need to prove
How unusual/unexpected/surprising the evidence would be if the defendant is not innocent.
You never accept the Alternative/Research Hypothesis ! You either reject or fail to reject the Null Hypothesis . All you're doing with your test is measuring how surprised you are under (see the court analogy above).
Why the negation? Why shouldn't you be asking "How expected is the evidence that the defendant is guilty?"
The reason is that the null hypothesis is well-defined.
Do you ever set ... ?
"Which Test?" TLDR (finish this!)
To pick a test, and generally speaking, you'll be asking
- What is the nature of my Data1? Continuous? Categorical?
- How many groups am I dealing with? One, two, or more than two?
Here's a nice little table from this excellent video (by a Columbia alum!)
1 Group | 2 Groups | 2+ Groups | |
---|---|---|---|
Categorical Data | Proportion Test (-test approx.) Test | Proportion Test (-test approx.) Test | Test |
Continuous Data | -test & Variants -test & Variants | -test & Variants -test & Variants | ANOVA (-test, 1-way, 2-way) |
Classic Assumptions Violated2 | Sign Test Signed Rank Test | Wilcoxon–Mann–Whitney Test Paired -test McNemar’s Test | Kruskal–Wallis Test |