DBMI Course Prerequisites
This is no longer offered.
You’ll be given a small exam to see how much statistics and programming you’ll need to learn to take other informatics courses.
Programming
Basic UNIX Stuff
awkgrepsedfind
Regular Expressions (#)
This is your friend. Play with it, learn from it.
- Quantification -
*?+{..} - Delimiters
- Character Classes
- Backreferences
- Greediness and Laziness
Python
- Conditionals and Ternaries
- Loops:
breakandcontinue - Data Structures (and Big-O for each!)
- List Comprehensions
- Lists:
map,filter, andreduce - Tuples
- Sets
- Hash Maps/Dictionaries
- Functions
- Recursion
- Decorators/Closures
- I/O
- Reading and Writing Files
- De/Serialization of JSON files
- Vectorized Operations and Efficiency Considerations
- Scientific Python
- Draw a line plot
- Draw a histogram
- Dataframes: Sorting, Filtering. Get a Pandas cheatsheet.
- Review a NumPy cheatsheet.
- Dataclasses
Object-Oriented Programming 🤢
- Objects and Classes (and Singletons!)
- Public, Protected, Private, Static methods and attributes
- Inheritance, Polymorphism, Encapsulation
What they are and problems with each.
Relational Databases
This site is your friend. Do everything there and you’ll know most of what you’ll need. Making friends with a DBA is (I’m assuming) outside the scope of this course.
- Language (
SELECT/WHERE/LIKEGROUP BYORDER BYASCDESC) - Various Joins (
INNEROUTERLEFTRIGHT) - Functions (
MINMAXSUMCOUNTCONCATENATE) - Indexes: single and multi-column. Types of Indexes (e.g. inverted B-trees)
- Keys: Primary, Secondary, Foreign
- Schemas, Fields/Columns, lingo
- Fuzziness
- Subqueries
Git
- Know that you’re fundamentally dealing with a content-addressible system.
- 80% of porcelain commands.
- Refs,
HEAD - Rebasing versus Merging
Other
- Intro to the Unified Medical Language System (UMLS)
- File formats and their strengths and weaknesses (JSON, CSV, XML). De/Serialization. Columnar compression.
Linear Algebra
See this page.
- Scalars, Vectors, Matrices, Tensors
- Basis Space and Basis Vectors
- Dot and Cross Products
- Matrix Rank (look at Row Echelon Forms)
- Matrix Transpose and Operations
- Matrix Determinants and what they mean
- Matrix Invertibility
- Matrix Adjoint
- Matrix Conjugate
- Matrix Orthogonality
- Eigenvalues and Eigenvectors
- Cramer’s Rule
Probability
- Counting
- Philosophy/Interpretation
- Axioms
- Sample Spaces
- Sample Events
- Expected Value
- Law of Total Probability
- Conditional Probabilities
- Independent and Dependent Probability
- Chain Rule
- Density Functions (PDFs)
- Mass Functions (PMFs)
- Cumulative Distribution Functions
- Bayesian Analysis
- Bayes Theorem
- Cox’s Theorem
- MAP
- PDFs, CDFs: Support, means, variances of common ones
- Distribution “Arithmetic” (e.g. )
- The Gaussian/Normal Distribution and derivation
- The Multivariate Gaussian Distribution
- Moment Generating Functions and “Fingerprints”
Statistics
Concepts
- Measures of Central Tendency: Mean, Median, Mode
- Variance
- The Central Limit Theorem
- Estimators
- Method of Least Squares
- Maximum Likelihood Estimation
- Maximum A Posteriori Estimation
Randomness
- Random Variables
- Covariance
- Sampling
- Estimation
- Bootstrapping
Hypothesis Testing
Theory
- p-value
- Null hypothesis
- Confidence Intervals
- Credible Intervals
Practice
- -squared Test
- t-Test
- ANOVA
- Pearson Correlation
- Non-parametric Tests
Prediction
- Regression
- Linear
- Multivariate
- Logistic
- Ridge
- LASSO
- Convex Optimization
- Machine Learning