Skip to main content

Programming

Basic UNIX Stuff

  • awk
  • grep
  • sed
  • find

Regular Expressions (#)

Python

  • Conditionals and Ternaries
  • Loops: break and continue
  • Data Structures (and Big-O for each!)
    • List Comprehensions
    • Lists: map, filter, and reduce
    • Tuples
    • Sets
    • Hash Maps/Dictionaries
  • Functions
    • Recursion
    • Decorators/Closures
  • I/O
    • Reading and Writing Files
    • De/Serialization of JSON files
  • Vectorized Operations and Efficiency Considerations
  • Scientific Python
    • Draw a line plot
    • Draw a histogram
    • Dataframes: Sorting, Filtering. Get a Pandas cheatsheet.
    • Review a NumPy cheatsheet.
  • Dataclasses

Object-Oriented Programming 🤢

Relational Databases

This site is your friend.

  • Language (SELECT/WHERE/LIKE GROUP BY ORDER BY ASC DESC)
  • Various Joins (INNER OUTER LEFT RIGHT)
  • Functions (MIN MAX SUM COUNT CONCATENATE)
  • Indexes: single and multi-column. Types of Indexes (e.g. inverted B-trees)
  • Keys: Primary, Secondary, Foreign
  • Schemas, Fields/Columns, lingo
  • Fuzziness
  • Subqueries

Git

  • Know that you're fundamentally dealing with a content-addressible system.
  • 80% of porcelain commands.
  • Refs, HEAD
  • Rebasing versus Merging

Other

  • Intro to the Unified Medical Language System (UMLS)
  • File formats and their strengths and weaknesses (JSON, CSV, XML). De/Serialization. Columnar compression.