🤖

Module

Machine Learning Fundamentals

Progress45%

9 / 20 pages

Lesson 1: What is Machine Learning?

Lesson 2: Linear Regression from Scratch

Lesson 3: Visualizing the Loss Landscape

Lesson 4: Logistic Regression (Classification)

Lesson 5: K-Nearest Neighbors (Distance)

Lesson 6: Evaluation Metrics (From Scratch)

Lesson 7: Unsupervised Learning & K-Means

Lesson 8: Dimensionality Reduction with PCA

Lesson 9: Decision Trees & Splits

Lesson 10: Regularization (L1 & L2)

Lesson 11: K-Fold Cross Validation

Lesson 12: Naive Bayes — Probabilistic Classifier

Lesson 13: Support Vector Machines (SVM)

Lesson 14: Gradient Boosting & AdaBoost

Lesson 15: DBSCAN — Density-Based Clustering

Lesson 16: Gaussian Mixture Models (GMM)

Lesson 17: Ensemble Methods — Combine Multiple Models

Back to Module Overview

Page9/20

Evaluation Metrics (From Scratch) · Page 1 of 1

Why Accuracy is a Trap

Evaluation Metrics

The Imbalanced Data Problem

Imagine a dataset of 100 patients, where 99 are healthy (0) and 1 has cancer (1). If a lazy model predicts "0" every single time, it is 99% accurate! But it completely failed its medical purpose.

The Confusion Matrix

To truly evaluate a model, we look at 4 outcomes:

True Positives (TP): Model predicted 1, actual is 1.
True Negatives (TN): Model predicted 0, actual is 0.
False Positives (FP): Model predicted 1, actual is 0. (Type I error)
False Negatives (FN): Model predicted 0, actual is 1. (Type II error - Dangerous in medical/spam filtering)

Metrics

Precision: Out of all 1s we predicted, how many were actually 1? (TP / (TP + FP))
Recall: Out of all actual 1s, how many did we find? (TP / (TP + FN))

main.py

OUTPUT

▶Click "Run Code" to execute…