Skip to article frontmatterSkip to article content
Site not loading correctly?

This may be due to an incorrect BASE_URL configuration. See the MyST Documentation for reference.

ML - Curriculum

Here’s a comprehensive ML curriculum structured from absolute basics to advanced topics, with clear prerequisites for each section. This is designed for a workshop/series format.

PREREQUISITES (What you MUST Know First)

Before starting any ML models, ensure these are covered:

  1. Basic Python Programming (NumPy, Pandas basics)

  2. High School Math Concepts:

    • Functions, slopes, graphs

    • Basic probability (what is probability?)

    • Basic statistics (mean, median, standard deviation)

  3. Data Thinking:

    • What are features/variables?

    • What are observations/rows?

    • What is a target variable?

CORE ML CURRICULUM (Structured Pathway)

PART 1: FOUNDATIONS (Week 1-2)

Topic 1.1: What is ML? The Big Picture

Topic 1.2: Data Preparation

Topic 1.3: Evaluation Basics

PART 2: SUPERVISED LEARNING (Week 3-6)

REGRESSION MODELS

Topic 2.1: Linear Regression

Topic 2.2: Model Evaluation (Regression)

PART 3: UN-SUPERVISED LEARNING (Week 3-6)

CLASSIFICATION MODELS

Topic 3.1: Logistic Regression

Topic 3.2: Evaluation (Classification)

Topic 3.3: k-Nearest Neighbors (k-NN)

Topic 3.4: Decision Trees

Topic 3.5: Ensemble Methods

PART 4: UNSUPERVISED LEARNING (Week 7-8)

CLUSTERING

Topic 4.1: Introduction to Unsupervised Learning

Topic 4.2: k-Means Clustering

Topic 4.3: Hierarchical Clustering

Topic 4.4: DBSCAN

DIMENSIONALITY REDUCTION

Topic 4.5: PCA (Principal Component Analysis)

PART 5: CORE ML CONCEPTS (Week 9-10)

Topic 5.1: Bias-Variance Tradeoff

Topic 5.2: Cross-Validation

Topic 5.3: Hyperparameter Tuning

PART 6: INTRODUCTION TO ADVANCED TOPICS (Week 11-12)

Topic 6.1: Neural Networks Introduction

Topic 6.2: Natural Language Processing Basics

Topic 6.3: Recommendation Systems

TEACHING METHODOLOGY FOR EACH TOPIC

For every model, follow this pattern:

1. BUSINESS PROBLEM (Why do we care?)
2. HUMAN INTUITION (How would you solve it?)
3. VISUAL EXPLANATION (Graphs/diagrams)
4. SIMPLE MATH (Concepts, not proofs)
5. CODE DEMO (Scikit-learn implementation)
6. INTERPRETATION (What do results mean?)
7. PROS/CONS (When to use/avoid)

HANDS-ON PROJECTS PROGRESSION

  1. Beginner: Titanic Survival Prediction (binary classification)

  2. Intermediate: House Price Prediction (regression + feature engineering)

  3. Advanced: Customer Segmentation + Product Recommendation (clustering + NLP)

ESSENTIAL TOOLS TO TEACH

COMMON PITFALLS TO WARN ABOUT

  1. Data leakage (using test data during training)

  2. Ignoring class imbalance in classification

  3. Not scaling features when needed

  4. Overcomplicating with neural networks too early

  5. Chasing accuracy without business context

SESSION TEMPLATE (2-hour session)

0:00-0:15  Recap previous session + Q&A
0:15-0:30  New concept intuition (whiteboard)
0:30-0:45  Mathematical intuition (simple formulas)
0:45-1:15  Live coding demo
1:15-1:45  Hands-on exercise (they code with guidance)
1:45-2:00  Q&A + Real-world applications discussion

ASSESSMENT CHECKPOINTS

After each part:

This curriculum ensures they understand why before how, and can make informed decisions about which model to use for which problem. Would you like me to elaborate on any specific section or provide code examples for any topic?