Course 1 : Classical Machine Learning Algorithms

Categories Advanced AI

Wishlist

Course Curriculum

ML models – Evolution

Intro on Machine learning

supervised – Logistic regression
Logistic regression is a popular supervised learning algorithm used for binary classification tasks. It models the relationship between a dependent variable and one or more independent variables by estimating the probabilities of the target class.

Linear regression for classification (excel demo)

00:00
sigmoid function

00:00
Example demo of Logistic regression (sklearn)

00:00
Model evaluation – metrics (ROC, AUC)

00:00
Logit Function

00:00
Probability Threshold

00:00
type – Binary Logistic Regression

00:00
type – Multinomial Logistic Regression

00:00
Regularization : demo using sklearn

00:00
Data imbalance : demo using sklearn

00:00
Hyperparameter tuning – Grid search

00:00
Assumptions

00:00

supervised – Decision Trees
Decision trees are a popular supervised learning algorithm used for both classification and regression tasks. They provide a clear and interpretable representation of the decision-making process by constructing a tree-like model of decisions and their possible consequences.

whats a tree? Key terms used

00:00
Entropy : definition and examples

00:00
Gini index : definition and examples

00:00
Information gain : definition and example

00:00
Attribute Selection Measures:

00:00
Splitting Criteria:

00:00
Demo : attribute selection/splitting (excel)

00:00
Demo – using sklearn (classification)

00:00
How decision trees handle numeric features

00:00
Decision trees : regression

00:00
Hyperparameters of Decision Trees

00:00
Tuning a decision tree (PRUNING)

00:00
Data imbalance : dec trees

00:00
strengths and limitations of a decision tree

00:00
Summarize : Key points on Decision tree

00:00

Supervised – Random Forest
Random Forest is an ensemble learning method that combines multiple decision trees to make predictions. It leverages the wisdom of the crowd by aggregating the predictions of individual trees.

Supervised – Support Vector Machines
Support Vector Machines (SVM) is a powerful supervised learning algorithm used for classification and regression tasks.

background on Support Vector Machines

00:00
DEMO : basic usage of sklearn implementation of SVM

00:00
Margin Maximization:

00:00
Linear and Non-linear Classification

00:00
Support Vectors

00:00
Kernel Functions:

00:00
C Parameter and Soft Margin

00:00
Hyperparameter Tuning:

00:00
Multiclass Classification

00:00
Support Vector Regression

00:00

supervised – Naive Bayes model
The Naive Bayes model is a popular supervised learning algorithm commonly used for classification tasks. It is based on the Bayes' theorem and assumes that features are conditionally independent of each other given the class label. Despite its simplicity, Naive Bayes often performs well and is efficient in terms of training and prediction time

Unsupervised – Variations of K-means model

Unsupervised – Hierarchical models

Unsupervised – Density based models

Unsupervised – Gaussian Mixture Models (GMM)

Unsupervised – Spectral Clustering
Spectral clustering treats the data points as nodes in a graph and uses the eigenvectors of the graph Laplacian matrix to find clusters. It first constructs an affinity matrix to measure the similarity between data points and then performs dimensionality reduction on the affinity matrix. Finally, it applies K-means or another clustering algorithm on the reduced-dimensional space to assign the data points to clusters.

Unsupervised – Mean Shift Clustering
Mean Shift clustering is a density-based algorithm that iteratively moves a window (kernel) over the data points, shifting it towards the region of highest density. It aims to find the modes or peaks of the underlying density function, which correspond to the cluster centers. Mean Shift clustering does not require specifying the number of clusters in advance and can handle irregularly shaped clusters.

Unsupervised – Self-Organizing Maps (SOM)
SOM is an artificial neural network-based clustering technique that maps high-dimensional data onto a lower-dimensional grid. It organizes the grid nodes (neurons) based on the similarity of their weight vectors to the input data. SOM preserves the topological relationships between the data points and can reveal the underlying structure of the data. These are just a few examples of clustering models commonly used in machine learning. Each model has its strengths, limitations, and assumptions, and the choice of clustering algorithm depends on the nature of the data, the desired clustering outcome, and the specific requirements of the problem at hand.

Student Ratings & Reviews

No Review Yet

Course 1 : Classical Machine Learning Algorithms

Course Curriculum

ML models – Evolution

Intro on Machine learning

supervised – Logistic regression Logistic regression is a popular supervised learning algorithm used for binary classification tasks. It models the relationship between a dependent variable and one or more independent variables by estimating the probabilities of the target class.

Linear regression for classification (excel demo)

sigmoid function

Example demo of Logistic regression (sklearn)

Model evaluation – metrics (ROC, AUC)

Logit Function

Probability Threshold

type – Binary Logistic Regression

type – Multinomial Logistic Regression

Regularization : demo using sklearn

Data imbalance : demo using sklearn

Hyperparameter tuning – Grid search

Assumptions

whats a tree? Key terms used

Entropy : definition and examples

Gini index : definition and examples

Information gain : definition and example

Attribute Selection Measures:

Splitting Criteria:

Demo : attribute selection/splitting (excel)

Demo – using sklearn (classification)

How decision trees handle numeric features

Decision trees : regression

Hyperparameters of Decision Trees

Tuning a decision tree (PRUNING)

Data imbalance : dec trees

strengths and limitations of a decision tree

Summarize : Key points on Decision tree

Supervised – Random Forest Random Forest is an ensemble learning method that combines multiple decision trees to make predictions. It leverages the wisdom of the crowd by aggregating the predictions of individual trees.

What is ENSEMBLE in machine learning?

Decision trees – weaknesses and how ensemble can help

Describe BAGGING type ensemble

DEMO : sklearn implementation of Random Forest

Out-of-Bag (OOB) Error Estimation:

Hyperparameter tuning with Grid Search

sklearn version of bagging algorithms

Interpretability and Feature Importance

Supervised – Support Vector Machines Support Vector Machines (SVM) is a powerful supervised learning algorithm used for classification and regression tasks.

background on Support Vector Machines

DEMO : basic usage of sklearn implementation of SVM

Margin Maximization:

Linear and Non-linear Classification

Support Vectors

Kernel Functions:

C Parameter and Soft Margin

Hyperparameter Tuning:

Multiclass Classification

Support Vector Regression

Bayes’ Theorem : overview

Example using sklearn

Probability Estimation

Training phase

Classification phase

Laplace Smoothing

Assumptions and Limitations

Variants

MCQs on Naive Bayes

Unsupervised – Variations of K-means model

Unsupervised – Hierarchical models

Unsupervised – Density based models

Unsupervised – Gaussian Mixture Models (GMM)

Probability Distributions

Mixture Models

Model Representation

Model Training

Use case : Clustering

Use case : Generative Model

Student Ratings & Reviews

supervised – Logistic regression
Logistic regression is a popular supervised learning algorithm used for binary classification tasks. It models the relationship between a dependent variable and one or more independent variables by estimating the probabilities of the target class.

Supervised – Random Forest
Random Forest is an ensemble learning method that combines multiple decision trees to make predictions. It leverages the wisdom of the crowd by aggregating the predictions of individual trees.

Supervised – Support Vector Machines
Support Vector Machines (SVM) is a powerful supervised learning algorithm used for classification and regression tasks.