Types of Studies
From various class notes. There will be a lot of stuff missing here. Iβll finish this one day lol.
You can zoom and pan this.
Broad Classesβ
Quantitative and Qualitativeβ
This is about what data is collected.
Quantitative focuses on the measurable (statistics involved of course).
Qualitative is about observation (description, explanation) that produces artifacts (e.g. interviews, photos, sketches of spaces, notes) and may use some quantitative data. It can be exploratory research (see and refine questions, understand opportunities.) Can be used for evaluation (why are informatics interventions used or not used?)
Prospective and Retrospectiveβ
This is about the temporality of the data that is collected. Look forward or backward in time. One is cheaper because things have already happened. One gives you more control.
Observational and Experimentalβ
Did you change anything in the world? Just observed it? Observational Study. Did you assign an exposure? Experimental.
Measurement and Demonstrationβ
Measurement Study is where you evaluate the properties of a measurement instrument or method itself: these are the objects of investigation.
You typically use these later in Demonstration Studies where you want to show that an intervention, system, or technology works. The instrument or system is now the tool, and youβre investigating its impact.
Experimental and Quasi-Experimental Studiesβ
Key distinction is randomization. Experimental studies randomly allocate subjects to exposure groups, which (with sufficient N) removes confounding. Quasi-experimental studies assign exposures without randomization β common when randomization is infeasible or unethical.
Observationalβ
No exposure assigned (i.e. no influence on subjects). Descriptive. Analytical: give associations through comparisons. Can infer cause-effect relationships. Less common in interventions research unless youβre studying existing technologies/interventions.
Cross-Sectional Studiesβ
Slice in time. There is no temporality in cross-sectional studies. Itβs a slice in time. Because of this, there cannot be causal assertions. You can only speak of associations.
You can measure prevalence. You cannot measure incidence (incidence is about new cases, which implies temporality). Think of how timelines (e)------(o) get smushed into a spreadsheet β you just get (e)s and (o)s, the lines vanish.
ββββ Single Time Point ββββ
β β
βΌ βΌ
Population βββΊ Sample βββΊ Measure simultaneously:
βββ Exposure status (E+ / Eβ)
βββ Outcome status (O+ / Oβ)
Resulting 2Γ2:
O+ Oβ
ββββββββββ¬βββββββββ
E+ β a β b β
ββββββββββΌβββββββββ€
Eβ β c β d β
ββββββββββ΄βββββββββ
βββββββββββββββββ time = tβ ββββββββββββββββββ€
(no follow-up)
Cohort Studiesβ
Select on exposure, follow for outcome. The key move: enforce temporality by design. These have the most information about time of any observational design.
ββββΊ Outcome+
ββ Exposed βββ€
β ββββΊ Outcomeβ
Population βββΊ Cohort
β ββββΊ Outcome+
β Unexposed ββ€
ββββΊ Outcomeβ
tβ βββββββββββββββββββββββ follow-up ββββββββββββββββββββΊ tβ
(define exposure) (ascertain outcome)
Risk in exposed = a / (a+b)
Risk in unexposed = c / (c+d)
Relative Risk = [a/(a+b)] / [c/(c+d)]
Prospective: Study begins at point of exposure. You have a lot of control. You cannot always assume causality. Kinda costly for rare outcomes (gotta recruit a lot of people!). Loss-to-follow-up is a big problem.
Retrospective: Study begins at point of outcome. No control (lol). Inexpensive (already happened, someone else paid the bill). But youβre limited in terms of sampling and quality (already happened). And you cannot always assume causality. More prone to selection, information, and recall bias.
Ambidirectional (βCallback Studiesβ): Rare. βWeldβ two timelines together β retrospective data combined with prospective follow-up. Data inconsistency is a headache. Inherits the bummers of both designs.
Multiple/Double Cohort: You have several cohorts with different levels of the same exposure. Lots of potential confounding risk here.
Cohort studies can start with cross-sectional studies where you can establish prevalence.
Case-Control Studiesβ
Select on outcome, look back at exposure. The logical inversion of a cohort study. Always retrospective.
βββ Exposed (a) ββββ
Cases (O+) ββββ€ β
βββ Unexposed (c) ββ€
β recall /
β records
βββ Exposed (b) ββββ€
Controls (Oβ) ββββ€ β
βββ Unexposed (d) ββ
βββββββββ look backward in time ββββββββ tβ (sampling)
Odds Ratio = (aΒ·d) / (bΒ·c)
You can study just one outcome. Big challenge is bias: you recruit your cases separately from controls (you can match, for example, to get over this problem). Doesnβt have the temporal ordering that cohort studies do.
Controls approximate what the cases would have looked like had they not developed the disease β itβs a bit counterfactual-y. Cases and controls must come from the same source population (donβt break links). This is why matching matters: cases are straightforward but controls are hard β you donβt want confounding by covariates or time.
When to use: Rare diseases, long latency periods. Think glioblastoma β a cohort study would require following participants for decades.
Metrics: You cannot compute Risk Ratios! Youβre fixing the outcome distribution (setting the case/control ratio), so you donβt know and . You can compute the Odds Ratio. For rare diseases, .
Nested Case-Controlβ
Say you follow 50,000 people over 20 years to see if an exposure caused a disease. Testing all of them would be expensive. What if you only tested people who didnβt get sick?
THE COHORT (already exists, already followed)
ββββββββββββββββββββββββββββββββββββββββββββββββββ
ββββββββββββββββββββββββββββββββββββββββββββββββββ
ββββββββββββββββββββββββββββββββββββββββββββββββββ
50,000 people
β
βΌ
follow them for 20 years
β
βΌ
ββββββββββββββββββββββββββββββββββββββββββββββββββββ
β 500 people develop the disease = CASES β
β 49,500 people don't = potential β
β controls β
ββββββββββββββββββββββββββββββββββββββββββββββββββββ
β
βΌ
For each case, randomly pick a few controls
from people who were still disease-free at
the moment that case got sick.
β
βΌ
ββββββββββββββββββββββββββββββββββββββββββββββββββββ
β 500 cases + ~2,000 controls (4 per case) β
β β
β NOW measure exposure on these 2,500 people. β
β Skip the other 47,500. β
ββββββββββββββββββββββββββββββββββββββββββββββββββββ
Sampling Strategiesβ
The hardest part. The control group should be a snapshot of the baseline population.
| Strategy | When Sampled | Notes |
|---|---|---|
| Case-Base / Case-Cohort | At start | Controls = random sample from source population at beginning. Use for very rare outcomes. Controls may develop the outcome. |
| Cumulative Density / Survivor | At end | Controls = people who survived without outcome. Problem: survivor bias. |
| Incidence Density / Risk Set | Each time a case occurs | For every incident case, randomly sample from risk set at that time. Preserves underlying risk process. Good for short-term or fluctuating exposures. |
Crossover Studiesβ
Cases serve as their own controls. Reduces random error and confounding. Think seasonal allergies β within-person comparison across time periods.
Experimentalβ
Exposure is assigned by the researcher. Randomization is the dividing line between experimental and quasi-experimental.
Randomized Controlled Trial (RCT)β
The gold standard. You assign exposure randomly and follow for outcome. Establishes temporality and (with sufficient N) removes confounding.
Key features:
- Stringent inclusion criteria
- Typically one primary predictor and one primary outcome β keep it simple
- Randomization only works with large N. Small samples may not remove all confounders.
βββββββββββββββββ
β RANDOMIZATION β
βββββββββ€ββββββββ
β
βββββββββββββββββββ΄ββββββββββββββββββ
βΌ βΌ
ββββββββββββββ ββββββββββββββ
β Treatment β β Control β
β (n=N/2) β β (n=N/2) β
βββββββ¬βββββββ βββββββ¬βββββββ
β β
βΌ βΌ
ββββββββββββββ ββββββββββββββ
β Outcome β ββββ compare βββββΊ β Outcome β
ββββββββββββββ ββββββββββββββ
tβ βββ allocation βββ follow-up βββ outcome assessment ββ tβ
Factorial Designβ
Assess the impact of multiple interventions (all permutations) without creating a separate arm for each. More efficient for multi-intervention testing.
Factor B: Behavioral coaching
Bβ B+
ββββββββββββββββββ¬βββββββββββββββββ
Factor A: Aβ β Group 1 β Group 2 β
Drug β Placebo + β Placebo + β
β No coaching β Coaching β
ββββββββββββββββββΌβββββββββββββββββ€
A+ β Group 3 β Group 4 β
β Drug + β Drug + β
β No coaching β Coaching β
ββββββββββββββββββ΄βββββββββββββββββ
Main effect of A = (G3+G4)/2 β (G1+G2)/2
Main effect of B = (G2+G4)/2 β (G1+G3)/2
AΓB interaction = (G4βG3) β (G2βG1)
Crossover Designβ
Each subject receives both treatment and control in sequence (order randomized). Subjects serve as their own controls. Reduces inter-subject variability. Washout period between treatments needed.
βββ RANDOMIZE SEQUENCE βββ
βΌ βΌ
Sequence 1: A βββΊ [washout] βββΊ B
β β
βββ outcomes Yβ ββββββββββ΄ββ outcomes Yβ
Sequence 2: B βββΊ [washout] βββΊ A
β β
βββ outcomes Yβ ββββββββββ΄ββ outcomes Yβ
Period 1 Period 2 Period 3
βββββ Tx βββββ€βββ washout βββ€βββββ Tx βββββ€
(clears carryover)
Within-subject contrast: Yβ β Y_b for each participant
Cluster Randomized Trialβ
Randomize at the level of groups (clinics, schools, communities) rather than individuals. Use when individual randomization is infeasible or risks contamination.
Population of clusters: β― β― β― β― β― β― β― β―
β
βΌ
βββββββββββββββββββββ
β RANDOMIZE CLUSTERSβ
βββββββββββ€ββββββββββ
β
ββββββββββββββββ΄βββββββββββββββ
βΌ βΌ
Treatment clusters Control clusters
βββββββββββββββββ βββββββββββββββββ
β β― Clinic A β β β― Clinic C β
β β pt pt pt β β β pt pt pt β
β β― Clinic B β β β― Clinic D β
β β pt pt pt β β β pt pt pt β
βββββββββββββββββ βββββββββββββββββ
β β
βΌ βΌ
Outcomes in pts βββ compare βββΊ Outcomes in pts
Note: individuals within a cluster are correlated β ICC
Microrandomized Trials (MRT)β
Used a lot in behavioural change interventions. Idea is to not split participant into intervention and control group. Each participant, whenever there is an opportunity for an intervention, is randomized.
Smaller sample sizes (think half) than RCTs. Learning effects mess this up, perhaps. However, in CDS, you are either receive information or not so this effect is not as profound here.
Participant i:
tβ tβ tβ tβ tβ
tβ tβ ...
β β β β β β β
βΌ βΌ βΌ βΌ βΌ βΌ βΌ
[R] [R] [R] [R] [R] [R] [R] β randomize at each
β β β β β β β decision point
A Γ A A Γ A Γ β prompt / no prompt
β β β β β β β
Yβ Yβ Yβ Yβ Yβ
Yβ Yβ β proximal outcome
Estimand: causal excursion effect
= E[Yβ | Aβ=1, Hβ] β E[Yβ | Aβ=0, Hβ]
averaged over decision points and participants.
Sequential, Multiple Assignment, Randomized Trial (SMART)β
Adaptive treatment design: multiple decision points with re-randomization based on response. Useful for building adaptive treatment strategies.
Hereβs a more colorful picture.
βββββββββββββββββ
β RANDOMIZE Rβ β
βββββββββ€ββββββββ
β
βββββββββββββ΄ββββββββββββ
βΌ βΌ
Treatment A Treatment B
β β
evaluate response evaluate response
at week 8 at week 8
β β
ββββββ΄βββββ βββββ΄βββββ
βΌ βΌ βΌ βΌ
Responder Non-resp. Responder Non-resp.
β β β β
Continue ββββββββ Continue ββββββββ
A β Rβ β B β Rβ β
ββββ€ββββ ββββ€ββββ
β β
ββββ΄βββ ββββ΄βββ
βΌ βΌ βΌ βΌ
Aug. Switch Aug. Switch
A to B B to A
Builds dynamic treatment regimes:
"Start A; if non-responder by wk 8, switch to B"
Quasi-Experimentalβ
Exposure is assigned but not randomly. Common when randomization is infeasible or unethical. Ranked roughly from weakest to strongest:
Pre-Post Designβ
Weakest. Single measurement before and after intervention. Major threat: regression to the mean.
ββββββββββββββ ββββββββββββββ
β Pretest β β Posttest β
β Oβ β βββΊ β Oβ β
ββββββββββββββ ββββββββββββββ
β β²
β Intervention X β
ββββββββββββΊβββββββββββββ
Estimate of effect = Oβ β Oβ
β Threats: history, maturation, regression to mean,
testing effects (no counterfactual)
Posttest Onlyβ
Measure only after intervention. No baseline β hard to attribute change to the intervention.
Group βββΊ Intervention X βββΊ Posttest Oβ
tβ βββββββββββ X ββββββββββββββββββββββββββΊ tβ
β No baseline, no comparison group.
Cannot establish change or causation; descriptive only.
Posttest Only with Controlsβ
Add a concurrent control group but still no pre-test. Better than posttest only but you canβt verify group equivalence at baseline.
Group A (self-selected) βββΊ X βββΊ Oβ
β
β compare
βΌ
Group B (self-selected) βββββββββΊ Oβ
β Selection bias is the dominant threat β groups may have
differed before X. No pretest to verify equivalence.
Pre-Post with Concurrent Controlsβ
Very common. Also called Interrupted Time Series. Sample β Treat β Measure β No Treat β Measure β β¦
Treatment grp: Oβ ββββ X βββββΊ Oβ
β β
β β Ξ_T = Oβ β Oβ
β β
Control grp: Oβ βββββββββββΊ Oβ
β
β Ξ_C = Oβ β Oβ
Difference-in-differences:
DiD = Ξ_T β Ξ_C
= (OββOβ) β (OββOβ)
tβ ββββ pretest ββββ X ββββ posttest βββΊ tβ
Controls for: secular trends, history, maturation
Does NOT control for: differential selection on trajectory
Innate characteristics are eliminated as confounders. But: regression to the mean and learning effects remain. Not appropriate for drug efficacy.
Removed Treatment Designβ
Single group acts as its own control. Over equal time periods, you add and remove the intervention. Stronger than pre-post because the pattern of improvement/reversal supports causal inference.
Phase: Baseline Treatment Removal
β β β
Time: tβ βββββΊ tβ βββββββββΊ tβ βββββββββΊ tβ
Group: Oβ ββββββΊ Oβ ββXβββΊ Oβ ββΓβββΊ Oβ
β² β² β²
β β β
pre-Tx during Tx after Tx
withdrawn
Expected pattern if X works:
Oβ Oβ
Oβ ββββΊ β β
β / \ /
/ \ /
/ \ /
βββββββββ βββ
baseline treatment removal
Effect = (Oβ β Oβ) and reversal at (Oβ β Oβ)
Double Pretest Pre-Post Designβ
Measure β wait β Measure again β Introduce intervention β Measure. Measurements must be equally spaced. The double pretest lets you detect pre-existing trends (βsomething funky going onβ) before attributing change to the intervention.
βββββ pretests βββββ
Group: Oβ βββββββββΊ Oβ ββββ X βββββΊ Oβ
β β β
β baseline β interventionβ
β trend β effect β
ββββββββββββββ β
ββββββββββββββββ
tββ βββββββ tβ βββββββββββ X ββββββββββΊ tβ
Uses (Oβ β Oβ) to estimate the underlying trend,
then asks: does (Oβ β Oβ) exceed that trend?
Helps rule out: maturation, regression to the mean,
pre-existing time trends
Biasβ
Think of the contingency table. Bias is systematic/structural distortion between exposure and outcome.
| O+ | O- | |
|---|---|---|
| E+ | a | b |
| E- | c | d |
Two things can go wrong: (a) who gets into the table (selection bias) and (b) how/where they are placed in the table (information bias).
| Bias | What it is | Most prone design |
|---|---|---|
| Selection | Who gets into the study. Loss-to-follow-up is a subtype. | Retrospective; prospective (attrition) |
| Information | How subjects are classified in cells. | Retrospective |
| Recall | Differential accuracy of remembered exposures. | Retrospective, case-control |
| Survivor | Only survivors available for sampling. | Cumulative/survivor sampling in case-control |
| Regression to the mean | Extreme measurements tend toward average on re-test. | Pre-post designs |
Misclassificationβ
When subjects move between cells (e.g. , , or vertically).
- Non-differential: Misclassification is the same across groups. Biases toward the null (slightly better β you underestimate the effect).
- Differential: Misclassification rate differs between groups. Can distort in any direction.
Key Measuresβ
| Measure | Computable in⦠| Notes |
|---|---|---|
| Prevalence | Cross-sectional, case-control | Snapshot of who has what |
| Incidence | Cohort, RCT | Requires temporality |
| Risk Ratio (RR) | Cohort, RCT | |
| Odds Ratio (OR) | Case-control (primary), all others | ; for rare diseases |
| Incidence Rate | Cohort | Cases / person-time |
Validityβ
- Internal Validity: How sound your statistical treatment is. Did your study measure what it set out to measure?
- External Validity: Generalizability. Do findings apply beyond your study population?
Also distinct from test validity, which is about instrumentation/measurement tools.
Simpler Diagramsβ
Using the amazing MonoDraw.
OBSERVATIONAL
-------------
Cross-Sectional
Population βββΊ Measure exposure + outcome once
(single time point)
Cohort
Population βββΊ Exposed ββββββββΊ Follow over time ββββββββΊ Outcome?
βββΊ Unexposed ββββββΊ Follow over time ββββββββΊ Outcome?
Case-Control
Cases = outcome present ββββββ
ββββΊ Look backward for exposure
Controls = outcome absent ββββββ
EXPERIMENTAL
------------
Randomized Controlled Trial (RCT)
Eligible participants βββΊ Randomize
ββββΊ Intervention βββΊ Outcome
ββββΊ Control ββββββββΊ Outcome
Factorial Design
Eligible participants βββΊ Randomize
ββββΊ A only βββββββββΊ Outcome
ββββΊ B only βββββββββΊ Outcome
ββββΊ A + B ββββββββββΊ Outcome
ββββΊ Neither ββββββββΊ Outcome
Crossover Design
Participants βββΊ Randomize
ββββΊ Treatment A ββΊ Washout ββΊ Treatment B ββΊ Outcome
ββββΊ Treatment B ββΊ Washout ββΊ Treatment A ββΊ Outcome
Cluster Randomized Trial
Groups/clusters βββΊ Randomize
ββββΊ Cluster intervention ββΊ Outcome
ββββΊ Cluster control βββββββΊ Outcome
Microrandomized Trial (MRT)
Participant timeline:
T1 ββ R βββΊ Intervention? βββΊ Response
T2 ββ R βββΊ Intervention? βββΊ Response
T3 ββ R βββΊ Intervention? βββΊ Response
T4 ββ R βββΊ Intervention? βββΊ Response
SMART
Participants βββΊ Randomize
ββββΊ Treatment A ββΊ Respond? ββ¬βββΊ Continue A
β ββββΊ Switch/intensify
ββββΊ Treatment B ββΊ Respond? ββ¬βββΊ Continue B
ββββΊ Switch/intensify
QUASI-EXPERIMENTAL
------------------
Pre-Post Design
O1 βββΊ X βββΊ O2
Posttest Only
X βββΊ O
Posttest Only with Controls
X βββΊ O1
No X ββΊ O2
Pre-Post with Concurrent Controls
O1 βββΊ X βββΊ O2
O1 βββΊ No X βββΊ O2
Removed Treatment Design
O1 βββΊ X on βββΊ O2 βββΊ X removed βββΊ O3
Double Pretest Pre-Post Design
O0 βββΊ O1 βββΊ X βββΊ O2