This repository is where I consolidate my knowledge of various ML topics.
This repository will include:
- Notes on various topics (note: GitHub renders Markdown differently from my local IDE, PyCharm, which may cause readability issues!)
- Classical (Non-DL) ML/Statistics
- Linear Algebra and Calculus
- Probability and Information Theory
- Statistical Learning Theory
- Statistical Testing and Metrics
- Bayesian Statistics
- Linear Regression & Regularization
- Naive Bayes, Logistic Regression & GLMs
- SVMs
- Decision Trees
- Ensemble Learning, Random Forests and Boosting
- Dimensionality Reduction
- Unsupervised Clustering
- Gaussian Processes
- Causal Inference
- ARIMA
- DL (note that some topics bleed across categories)
- Basics
- Activation Functions
- Initialization
- Optimization and Regularization
- Coding Practices
- CNNs
- RNNs
- Attention & Transformers
- Autoencoders
- Diffusion Models
- Flow-based Models
- Generative Adversarial Networks
- Graph Neural Networks
- Meta-Learning
- Self-Supervised Contrastive Learning
- Computer Vision
- Natural Language Processing
- Reinforcement Learning
- Audio
- Video
- Multimodal
- Post Training
- AI Safety
- Hyperparameter Optimization
- Computational Performance
- Personal Projects
- Misc
- Classical (Non-DL) ML/Statistics
- Code implementations of various algorithms, mostly drawn from online resources/tutorials.
- The first priority is to fulfill my learning goals.
- If time permits, a stretch goal is to refactor the code with a greater emphasis on OOP (e.g., in the style of John's repo).
- Code currently includes:
- From-scratch implementations, including BERT, GPT-2, Llama 2-3.2, DDPM, and RealNVP.
- Post-training experiments, including LoRA fine-tuning and DPO (a minimal LoRA sketch follows this list).
- Data and model parallelism, with and without JAX (+ Flax); a minimal data-parallel sketch also follows this list.
- Experiments with TensorRT-LLM for model serving.
- Interview Preparation
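
As a taste of the post-training code, here is a minimal LoRA-style sketch: a frozen `nn.Linear` wrapped with a trainable low-rank update. The class name, rank, and scaling values are illustrative assumptions, not the repo's actual implementation.

```python
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    """Frozen linear layer plus a trainable low-rank update: y = Wx + (alpha/r) * B(Ax)."""

    def __init__(self, base: nn.Linear, r: int = 8, alpha: float = 16.0):
        super().__init__()
        self.base = base
        for p in self.base.parameters():
            p.requires_grad_(False)  # freeze the pretrained weights
        self.lora_a = nn.Parameter(torch.randn(r, base.in_features) * 0.01)
        self.lora_b = nn.Parameter(torch.zeros(base.out_features, r))  # zero init: no change at step 0
        self.scaling = alpha / r

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.base(x) + self.scaling * (x @ self.lora_a.T @ self.lora_b.T)

# Only the two low-rank factors receive gradients during fine-tuning.
layer = LoRALinear(nn.Linear(64, 64), r=4)
out = layer(torch.randn(2, 64))
```

Zero-initializing `lora_b` means the wrapped layer initially reproduces the base model exactly, which is the standard LoRA trick for a stable start to fine-tuning.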
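
Similarly, a minimal sketch of synchronous data parallelism with `jax.pmap`, assuming a toy linear-regression loss (the repo's actual parallelism code differs): each device gets a shard of the batch, and gradients are averaged with `lax.pmean` before the update.

```python
from functools import partial
import jax
import jax.numpy as jnp

def loss_fn(params, x, y):
    pred = x @ params["w"] + params["b"]
    return jnp.mean((pred - y) ** 2)

@partial(jax.pmap, axis_name="devices")
def train_step(params, x, y):
    grads = jax.grad(loss_fn)(params, x, y)
    grads = jax.lax.pmean(grads, axis_name="devices")  # all-reduce: average grads across devices
    return jax.tree_util.tree_map(lambda p, g: p - 0.01 * g, params, grads)

n_dev = jax.local_device_count()
params = jax.device_put_replicated(
    {"w": jnp.zeros((4, 1)), "b": jnp.zeros((1,))}, jax.local_devices()
)
x = jnp.ones((n_dev, 8, 4))  # leading axis is the device axis
y = jnp.ones((n_dev, 8, 1))
params = train_step(params, x, y)
```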
I will try to be diligent about citing my sources; due to visa-related time constraints, I apologize for any lapses in citation. As of 1/10/25, I have drawn most heavily from the following sources:
- UvA Deep Learning Tutorials by Lippe
- Notes / Code from classes I took at Duke.
- Hands-On Machine Learning by Géron
- Dive into Deep Learning by Zhang, Lipton, Li and Smola
- Build a Large Language Model (From Scratch) by Raschka
- Hugging Face's Diffusion Course
- Deep Learning by Goodfellow, Bengio and Courville
- Lilian Weng's Blog
- Anthropic's Research
- System Design by Hello Interview
- Designing Machine Learning Systems by Huyen
- Machine Learning System Design Interview by Aminian and Xu
- Machine Learning System Design Interview by Pham
- System Design Interview by Xu