ML U3 Notes

Key Concepts in AdaBoost

Here are the important ideas in AdaBoost simplified:

1. Weak Learners

o These are simple models (like decision stumps) that perform slightly better than random
guessing.

o They are trained in sequence, focusing more on data points that previous models found
hard to classify.

2. Strong Classifier

o This is the final model created by combining the predictions of all weak learners.

o It is powerful and accurate because it uses the collective learning of all the weak
learners.

3. Weighted Voting

o Each weak learner gets a weight based on how well it performs.

o More accurate models have a bigger influence on the final prediction.

4. Error Rate

o Measures how many mistakes a weak learner makes.

o Models with fewer errors get higher weights in the ensemble.

5. Iterations

o AdaBoost trains weak learners in multiple rounds (iterations).

o The number of iterations is a key setting; too many can lead to overfitting.
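
In the classic binary AdaBoost, a weak learner with weighted error rate e_t receives voting weight a_t = 1/2 * ln((1 - e_t) / e_t), so a lower error rate directly translates into a larger vote. A minimal sketch of these ideas with scikit-learn, using decision stumps as the weak learners, is shown below; the synthetic dataset and parameter values are illustrative assumptions, and depending on the scikit-learn version the base learner is passed as estimator or base_estimator.

```python
# Illustrative sketch: AdaBoost with decision stumps as weak learners.
from sklearn.datasets import make_classification
from sklearn.ensemble import AdaBoostClassifier
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

# Synthetic data, split into train and test sets (illustrative only).
X, y = make_classification(n_samples=500, n_features=10, random_state=42)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)

# Weak learner: a decision stump (a tree with a single split).
stump = DecisionTreeClassifier(max_depth=1)

# n_estimators is the number of boosting iterations (weak learners trained in sequence).
model = AdaBoostClassifier(estimator=stump, n_estimators=50, random_state=42)
model.fit(X_train, y_train)

print("Test accuracy:", model.score(X_test, y_test))
# Each weak learner's voting weight; more accurate learners get larger weights.
print("First learner weights:", model.estimator_weights_[:5])
```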

Advantages of AdaBoost

Why AdaBoost is useful:

1. Better Accuracy

o Even with simple models, it can significantly improve accuracy by focusing on tough-to-
classify data.

2. Versatile

o Works with many types of base models and can be applied to different problems.

3. Feature Selection

o When trained with simple base learners such as decision stumps, AdaBoost effectively selects an informative feature at each round, reducing the need for manual feature selection.

4. Less Overfitting

o It’s less likely to overfit compared to some other ensemble methods.

Limitations and Challenges

Things to be careful about:

1. Sensitive to Noisy Data

o Noisy data and outliers can mislead AdaBoost because it gives extra weight to
misclassified data points.

2. Computationally Expensive

o Training multiple models takes time, especially for large datasets or many iterations.

3. Overfitting Risk

o Too many iterations can lead to overfitting, especially on small datasets.

4. Complex Tuning

o Choosing the right weak learner and settings (like the number of iterations) can be
tricky.

Summary Table for Quick Memorization

Concept/Advantage/Challenge Key Idea

Weak Learners Simple models trained on hard-to-classify data

Strong Classifier Combines all weak learners for accuracy

Weighted Voting More accurate models get higher influence

Error Rate Measures mistakes; low error = higher weight

Iterations Trains models in multiple rounds

Advantages Improved accuracy, versatility, feature selection

Challenges Sensitive to noise, slow training, overfitting risk

This breakdown makes it easier to recall key points about AdaBoost.

Bagging
• Bagging, an abbreviation for Bootstrap Aggregating, is a machine learning ensemble strategy for
enhancing the reliability and precision of predictive models.
• It entails generating numerous subsets of the training data by employing random sampling with replacement.
• These subsets train multiple base learners, such as decision trees, neural networks, or other
models.

Implementing bagging involves several steps. Here's a general overview:

1. Dataset Preparation: Prepare your dataset, ensuring it's properly cleaned and preprocessed.
Split it into a training set and a test set.

2. Bootstrap Sampling: Randomly sample from the training dataset with replacement to create
multiple bootstrap samples. Each bootstrap sample should typically have the same size as the
original dataset, but some data points may be repeated while others may be omitted.

3. Model Training: Train a base model (e.g., decision tree, neural network, etc.) on each bootstrap
sample. Each model should be trained independently of the others.

4. Prediction Generation: Use each trained model to make predictions on the test dataset.

5. Combining Predictions: Combine the predictions from all the models. You can use majority
voting to determine the final predicted class for classification tasks. For regression tasks, you can
average the predictions.

6. Evaluation: Evaluate the bagging ensemble's performance on the test dataset using appropriate
metrics (e.g., accuracy, F1 score, mean squared error, etc.).

7. Hyperparameter Tuning: If necessary, tune the hyperparameters of the base model(s) or the
bagging ensemble itself using techniques like cross-validation.

8. Deployment: Once you're satisfied with the performance of the bagging ensemble, deploy it to
make predictions on new, unseen data.
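
A hedged sketch of steps 1-6 using scikit-learn's BaggingClassifier; the dataset and hyperparameters are illustrative assumptions, and as above the base learner argument is estimator (base_estimator in older scikit-learn versions).

```python
# Illustrative sketch of the bagging workflow described above.
from sklearn.datasets import make_classification
from sklearn.ensemble import BaggingClassifier
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

# 1. Dataset preparation: synthetic data split into train and test sets.
X, y = make_classification(n_samples=1000, n_features=20, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=0)

# 2-3. Bootstrap sampling and model training: BaggingClassifier draws
#      bootstrap samples (with replacement) and fits one tree per sample.
bagging = BaggingClassifier(
    estimator=DecisionTreeClassifier(),  # base learner
    n_estimators=25,                     # number of bootstrap samples / models
    bootstrap=True,                      # sample with replacement
    random_state=0,
)
bagging.fit(X_train, y_train)

# 4-5. Prediction generation and combining: predict() majority-votes internally.
y_pred = bagging.predict(X_test)

# 6. Evaluation.
print("Accuracy:", accuracy_score(y_test, y_pred))
```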

Advantages

• Mainly reduces variance by averaging the predictions of independently trained models.

• Reduces the risk of overfitting to the training data.

• Makes the final model more stable: small changes in the training set have less effect on predictions.

Applications

• Random Forest (bagging over decision trees) for classification and regression.

• Any setting with an unstable base learner (e.g., decision trees) where variance reduction improves reliability.

Bagging and Subbagging are similar. The only difference is that Subbagging uses random sampling without replacement, whereas Bagging uses random sampling with replacement.
Differences Between Bagging and Subbagging

Subset Creation
• Bagging: Subsets are created with replacement.
• Subbagging: Subsets are created without replacement.

Sample Size
• Bagging: Each subset can have the same size as the original dataset.
• Subbagging: Subsets are usually smaller than the original dataset.

Data Redundancy
• Bagging: Data points can appear multiple times in a subset.
• Subbagging: Each data point appears at most once in a subset.

Complexity
• Bagging: More computationally intensive due to larger subsets.
• Subbagging: Less computationally intensive.

Overfitting Handling
• Bagging: Better at handling overfitting due to more diverse subsets.
• Subbagging: Less effective at handling overfitting in comparison.

Performance on Noisy Data
• Bagging: Performs better on noisy data due to its robustness.
• Subbagging: May struggle with noisy data.

Best Use Case
• Bagging: Works well with larger datasets and high computational resources.
• Subbagging: Suitable for smaller datasets or when resources are limited.

Examples of Use
• Bagging: Random Forest, Bagging Classifier.
• Subbagging: Simplified ensemble models with reduced data usage.

Summary

Bagging emphasizes diversity by allowing data repetition within subsets.

Subbagging is simpler, faster, and uses smaller, non-repeating subsets.
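
The sampling difference between the two can be shown in a few lines of NumPy; the array of ten indices and the half-size subsample are arbitrary illustrative choices.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 10
indices = np.arange(n)

# Bagging: sample n indices WITH replacement (duplicates likely, same size as original).
bagging_sample = rng.choice(indices, size=n, replace=True)

# Subbagging: sample a smaller subset WITHOUT replacement (no duplicates).
subbagging_sample = rng.choice(indices, size=n // 2, replace=False)

print("Bagging sample:   ", np.sort(bagging_sample))
print("Subbagging sample:", np.sort(subbagging_sample))
```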

Stumping
• Stumping is a technique where a decision stump (a very simple model) is used as a base learner
in an ensemble learning method like AdaBoost.
• A decision stump is a decision tree with just one split (or decision point).
• It means the model makes decisions based on a single feature.

Purpose of Stumping:

• It simplifies the learning process by focusing on just one feature at a time.


• Stumps are very fast to train because they are extremely simple.

Use in AdaBoost:
• In AdaBoost, many stumps are created sequentially.

• Each stump focuses on the data points that were misclassified by the previous stumps.
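
A short sketch of a single decision stump in scikit-learn (a tree capped at max_depth=1); inspecting the fitted tree confirms it decides using exactly one feature and one threshold. The synthetic data is an illustrative assumption.

```python
# A decision stump is a depth-1 tree: one split on a single feature.
from sklearn.datasets import make_classification
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=200, n_features=5, random_state=1)

stump = DecisionTreeClassifier(max_depth=1, random_state=1)
stump.fit(X, y)

# The stump bases its decision on exactly one feature and one threshold.
print("Chosen feature index:", stump.tree_.feature[0])
print("Split threshold:     ", stump.tree_.threshold[0])
```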

Bagging vs Boosting
Differences Between Bagging and Boosting

Type of Ensemble
• Bagging: Parallel ensemble method, where base learners are trained independently.
• Boosting: Sequential ensemble method, where base learners are trained sequentially.

Base Learners
• Bagging: Base learners are typically trained in parallel on different subsets of the data.
• Boosting: Base learners are trained sequentially, with each subsequent learner focusing more on correcting the mistakes of its predecessors.

Weighting of Data
• Bagging: All data points are equally weighted in the training of base learners.
• Boosting: Misclassified data points are given more weight in subsequent iterations to focus on difficult instances.

Reduction of Bias/Variance
• Bagging: Mainly reduces variance by averaging predictions from multiple models.
• Boosting: Mainly reduces bias by focusing on difficult instances and improving the accuracy of subsequent models.

Handling of Outliers
• Bagging: Resilient to outliers due to averaging or voting among multiple models.
• Boosting: More sensitive to outliers, especially in iterations where misclassified instances are given more weight.

Robustness
• Bagging: Generally robust to noisy data and outliers due to averaging of predictions.
• Boosting: May be less robust to outliers, since misclassified instances keep receiving more weight.

Model Training Time
• Bagging: Can be parallelized, allowing for faster training on multi-core systems.
• Boosting: Generally slower than bagging, as base learners are trained sequentially.

Examples
• Bagging: Random Forest is a popular bagging algorithm.
• Boosting: AdaBoost, Gradient Boosting Machines (GBM), and XGBoost are popular boosting algorithms.
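
One practical consequence of the parallel vs. sequential distinction: a bagging ensemble can train its base learners across CPU cores via n_jobs, while AdaBoost must fit them one after another. A small illustrative sketch (the dataset and settings are assumptions):

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import AdaBoostClassifier, BaggingClassifier
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=2000, n_features=20, random_state=0)

# Bagging: learners are independent, so training can be parallelized.
bagging = BaggingClassifier(
    estimator=DecisionTreeClassifier(), n_estimators=100, n_jobs=-1, random_state=0
)

# Boosting: each learner depends on its predecessor, so training is sequential.
boosting = AdaBoostClassifier(
    estimator=DecisionTreeClassifier(max_depth=1), n_estimators=100, random_state=0
)

for name, model in [("Bagging", bagging), ("AdaBoost", boosting)]:
    model.fit(X, y)
    print(name, "training accuracy:", model.score(X, y))
```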

KD Trees
Are KD Trees and KNN the Same?

No, KD Trees and KNN (k-Nearest Neighbors) are not the same, but they are related.

• KNN is an algorithm used for classification or regression, where we find the k-nearest neighbors
of a given data point.

• KD Trees are a data structure used to make finding those neighbors (in KNN) faster, especially in
high-dimensional data.
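
scikit-learn makes this relationship explicit: the KNN estimator can be told to use a KD Tree internally for its neighbor searches. A minimal sketch (the synthetic 3-dimensional data is an illustrative assumption):

```python
from sklearn.datasets import make_classification
from sklearn.neighbors import KNeighborsClassifier

X, y = make_classification(
    n_samples=1000, n_features=3, n_informative=3, n_redundant=0, random_state=0
)

# KNN is the algorithm; the KD Tree is the data structure it uses to find neighbors quickly.
knn = KNeighborsClassifier(n_neighbors=5, algorithm="kd_tree")
knn.fit(X, y)

print("Predicted class for one query point:", knn.predict(X[:1]))
```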

KD Tree Explained in Simple English

A KD Tree (K-Dimensional Tree) is a binary tree that organizes points in a space with multiple
dimensions (like 2D or 3D) for fast searching of neighbors.

Key Idea:

• Split the data points into smaller regions, where each region focuses on a specific part of the
dataset.

• At each level, split the data based on one dimension (like x, y, or z) and alternate dimensions at
each level.

How KD Tree Works

Building the KD Tree:

1. Start with All Points:

o Begin with a set of points (e.g., locations on a map: (x, y) coordinates).

2. Choose a Splitting Dimension:


o Split the points based on a chosen dimension (e.g., x-coordinate at the first level, y-
coordinate at the second level, etc.).

o Alternate dimensions at each level.

3. Find the Median:

o Sort the points by the chosen dimension and find the median.

o The median becomes the "root" of the current level.

4. Split into Left and Right:

o Points smaller than the median (on the chosen dimension) go to the left subtree.

o Points larger go to the right subtree.

5. Repeat Recursively:

o Continue splitting the remaining points in the same way until all points are in leaf nodes.

Algorithm for KD Tree Construction

1. Input: A set of points and the current depth d.

2. Choose Splitting Dimension:

o Split dimension = d mod k, where k is the total number of dimensions.

3. Find Median:

o Sort points along the splitting dimension and choose the median.

4. Create Node:

o The median becomes the current node.

5. Recursive Calls:

o Build left and right subtrees using points before and after the median.

6. Base Case:

o Stop when no points are left.
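
A compact recursive sketch of this construction in plain Python, using the example points from these notes; the nested-dictionary node layout is an assumption made for illustration.

```python
# Recursive KD Tree construction following the steps above.
def build_kdtree(points, depth=0, k=2):
    """Return a nested dict {point, left, right}, or None when no points remain."""
    if not points:                      # base case: no points left
        return None

    axis = depth % k                    # splitting dimension = depth mod k
    points = sorted(points, key=lambda p: p[axis])
    median = len(points) // 2           # index of the median point

    return {
        "point": points[median],        # median becomes the current node
        "left": build_kdtree(points[:median], depth + 1, k),
        "right": build_kdtree(points[median + 1:], depth + 1, k),
    }

points = [(3, 6), (2, 7), (17, 15), (6, 12), (13, 15), (9, 1), (10, 19)]
tree = build_kdtree(points)
print("Root:", tree["point"])           # (9, 1) for this point set
```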

Example of KD Tree (2D Example)

Points:

(3, 6), (2, 7), (17, 15), (6, 12), (13, 15), (9, 1), (10, 19)
Step-by-Step Construction: (Example in Notes)

1. Depth = 0 (Split by x-coordinate):

o Points sorted by x: (2, 7), (3, 6), (6, 12), (9, 1), (10, 19), (13, 15), (17, 15)

o Median: (9, 1) → Root of the tree.

2. Depth = 1 (Split by y-coordinate):

o Left subtree (points with x < 9): (2, 7), (3, 6), (6, 12)

▪ Median by y: (2, 7) → Root of left subtree.

o Right subtree (points with x > 9): (10, 19), (13, 15), (17, 15)

▪ Median by y: (13, 15) → Root of right subtree.

3. Continue Recursively:

o Repeat the process for each subset, alternating between x and y splits.

Searching in KD Tree (Nearest Neighbor Search)

Goal:

Find the closest point to a given query point.

Steps:

1. Start at the root and compare the query point to the splitting dimension.

2. Move to the left or right subtree based on the query point’s position relative to the current
node.

3. Once you reach a leaf node, calculate the distance to the query point.

4. Backtrack and check the other subtree if necessary (to ensure the closest point isn’t missed).
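
A sketch of these steps as a recursive function, continuing the build_kdtree example above (it reuses the tree built there); the backtracking test in step 4 compares the best distance found so far against the distance from the query to the splitting plane.

```python
import math

def nearest_neighbor(node, query, depth=0, k=2, best=None):
    """Return the tree point closest to the query point, following the steps above."""
    if node is None:
        return best

    point = node["point"]
    if best is None or math.dist(query, point) < math.dist(query, best):
        best = point                                    # update the best candidate

    axis = depth % k
    # Descend into the subtree on the query's side of the split first.
    if query[axis] < point[axis]:
        near, far = node["left"], node["right"]
    else:
        near, far = node["right"], node["left"]
    best = nearest_neighbor(near, query, depth + 1, k, best)

    # Backtrack: search the other side only if the splitting plane is closer
    # than the best distance so far (otherwise it cannot hide a closer point).
    if abs(query[axis] - point[axis]) < math.dist(query, best):
        best = nearest_neighbor(far, query, depth + 1, k, best)

    return best

print(nearest_neighbor(tree, (5, 10)))   # expected: (6, 12)
```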

Advantages of KD Tree

1. Fast Search: Reduces the number of distance calculations compared to brute-force KNN.

2. Efficient for Low Dimensions: Works well for datasets with a moderate number of dimensions.

3. Supports KNN: KD Trees make KNN searches more efficient.

Limitations of KD Tree
1. Curse of Dimensionality: Performance decreases as dimensions increase.

2. Uneven Splits: If the data isn’t evenly distributed, the tree may become unbalanced.

Example Use Case:

Imagine you have GPS data of cities and want to find the city closest to a given location. Instead of
calculating distances for all cities, a KD Tree organizes the cities for fast nearest neighbor searches.

For example:

• Query: (5, 10)

• The KD Tree quickly identifies (6, 12) as the closest point.
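
The same lookup with a ready-made library implementation, scipy.spatial.KDTree, using the city points from the notes:

```python
from scipy.spatial import KDTree

cities = [(3, 6), (2, 7), (17, 15), (6, 12), (13, 15), (9, 1), (10, 19)]
tree = KDTree(cities)

distance, index = tree.query((5, 10))     # nearest neighbor of the query point
print("Closest point:", cities[index], "at distance", round(distance, 3))
# Closest point: (6, 12) at distance 2.236
```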
