What Is Machine Learning

Uploaded by

chessyrohan

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

17 views

What Is Machine Learning

Uploaded by

chessyrohan

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 13

what is machine learning

Machine learning is a branch of artificial intelligence (AI) that enables systems to

automatically learn and improve from experience without being explicitly
programmed.

Features of Machine Learning:

• Machine learning uses data to detect various patterns in a given
dataset.
• It can learn from past data and improve automatically.
• It is a data-driven technology.
• Machine learning is much similar to data mining as it also deals with
the huge amount of the data.

How does Machine Learning work

A ML system builds prediction models, learns from previous data, and predicts the
output of new data whenever it receives it.
The amount of data helps to build a better model that accurately predicts the
output, which in turn affects the accuracy of the predicted output.

Applications:
• Online recommendation systems
• Search engine algorithms
• Image and speech recognition
• Email spam detection
• Customer segmentation
• Fraud detection
• Predictive maintenance
• Autonomous vehicles (self-driving cars)
• Virtual personal assistants (e.g., Siri, Alexa)
• Medical diagnosis and imaging
Key differences between Artificial Intelligence (AI) and Machine learning (ML):

Regression vs. Classification in Machine Learning

discuss briefly Finding the best fit line or performance evaluation

• R-Squared (R²): This statistical measure assesses how well the model's predictions match
actual outcomes. R² values range from 0 to 1 (or 0% to 100%), with higher values indicating that
the model accurately predicts data points. It is commonly used in regression to measure the
strength of the relationship between dependent and independent variables(L-3 Machine
Learning).

• Mean Squared Error (MSE): For linear regression, MSE calculates the average of the
squared differences between predicted and actual values. A lower MSE suggests a better fit, as it
indicates smaller deviations from the predicted regression line(L-3 Machine Learning).

• Gradient Descent: This optimization algorithm iteratively adjusts the model’s parameters to
minimize MSE, helping to identify the best fit line by reducing prediction error over each
iteration

discuss briefly the evaluation of a classification model as per the file

The evaluation of a classification model involves assessing its performance on various metrics to
ensure accuracy and reliability. Key evaluation metrics often used include:

1. Accuracy - The proportion of correct predictions out of the total predictions, showing the
overall effectiveness.
2. Precision - Measures how many positive predictions were actually correct, useful when
minimizing false positives is essential.
3. Recall (Sensitivity) - The ratio of true positives to actual positives, focusing on capturing
all relevant cases.
4. F1 Score - The harmonic mean of precision and recall, useful when there is an uneven
class distribution.
5. Confusion Matrix - A table summarizing true positives, false positives, true negatives,
and false negatives, providing a detailed error analysis.

what is unsupervised in ML

Unsupervised learning in machine learning is a type of learning where the model is trained on
data without labeled responses or predefined categories.

Types of Unsupervised Learning

1. Clustering: Groups data points into clusters based on similarity. For example, customer
segmentation, where similar customer behaviors are grouped together.
2. Association: Finds relationships between data points. An example is market basket
analysis, where it determines products often bought together.

Key Algorithms
• K-means clustering
• Hierarchical clustering
• Principal Component Analysis (PCA)
• Association Rule Learning (e.g., Apriori algorithm)

Reinforcement Learning
Reinforcement learning (RL) is a type of machine learning where an agent learns
to make decisions by interacting with an environment. It works on a feedback-
based system, rewarding the agent for beneficial actions and penalizing it
Types of Reinforcement Learning:
1. Positive Reinforcement Learning: This type strengthens behavior by
adding a positive outcome after a desired action, making it more likely the
agent will repeat the behavior.
2. Negative Reinforcement Learning: In contrast, this method encourages
behavior by avoiding a negative outcome when the correct action is taken,
promoting behavior modification through deterrence of undesirable results.
Most Used Algorithm
Q-learning
Examples:
Machine playing PACMAN or other similar game.
Use Cases: Used in Robotics for Industrial Automation
Used to create training systems that provides custom
instruction and materials according to the requirements
of students

How does a machine learning system work?

A machine learning (ML) system works through several key stages:
1. Data Collection: The system gathers relevant data from various sources.
This data will serve as the foundation for learning.
2. Data Preparation and Wrangling: The collected data is cleaned,
structured, and transformed to ensure it is in a usable format, which includes
handling missing or duplicate values and selecting appropriate variables.
3. Data Analysis: Various analytical techniques are applied to understand the
data. This includes choosing ML methods (e.g., classification, regression)
and building initial models.
4. Training the Model: The ML model is trained on this data to learn from
past examples, adjusting its parameters to improve accuracy in making
predictions or identifying patterns.
5. Testing the Model: The model is tested on a separate set of data to assess its
accuracy and performance.
6. Deployment: If the model meets performance requirements, it is deployed to
a real-world system where it continuously makes predictions on new data.
Why do we need Data Preprocessing?
Data preprocessing is essential because real-world data often contains noise,
missing values, and inconsistencies, making it unsuitable for direct use in machine
learning models. Preprocessing cleans and transforms this data to improve the
accuracy and efficiency of the model. It involves several key steps, such as
handling missing data, encoding categorical variables, splitting data into training
and test sets, and feature scaling. These steps help ensure that the model performs
well and generalizes effectively on new data
What is Overfitting?
Overfitting happens when a model learns the training data too well, including its noise, leading to poor
performance on new data. This results in high accuracy on the training set but low accuracy on the test
set, indicating poor generalization. To prevent overfitting, techniques like early stopping, cross-validation,
and regularization are commonly used
how to detect overfitting?
To detect overfitting in machine learning, you can compare the model’s performance on training and
testing datasets. If a model performs well on the training data but poorly on the test data, it’s a clear
indication of overfitting. For instance, if a model shows 85% accuracy on the training dataset but only
50% accuracy on the test dataset, it has likely memorized the training data rather than learning
generalizable patterns

how to prevent overfitting?

• Early Stopping: Stop training when performance on a validation set stops improving, to
prevent the model from learning noise.
• Train with More Data: Providing a larger, clean dataset helps the model generalize better.
• Feature Selection: Retain only the most relevant features to reduce complexity and noise.
• Cross-Validation: Use techniques like k-fold cross-validation to ensure the model performs
well across different subsets of data.
• Data Augmentation: Create modified versions of existing data, especially useful in image
data, to increase diversity.
• Regularization: Apply methods like L1 or L2 regularization to penalize complex models,
discouraging overfitting by simplifying the mode

Are machine learning models, and algorithms are thes ame?

According to the file, a machine learning algorithm is a procedure or method
applied to data to identify patterns and produce a model. On the other hand, a
machine learning model is the resulting program that, once trained with data
through the algorithm, can make predictions or generate outputs based on
new data
Evaluating a Classification Model
1.Log loss and cross entropy

Log loss and cross-entropy are both metrics used to evaluate the performance of a classification
model by measuring the difference between the predicted probability distribution and the actual
distribution.

In binary classification tasks, the cross-entropy, often called binary cross-entropy, is the
average of cross-entropy values for all data samples. For two possible classes (e.g., 0 and 1),
binary cross-entropy can be expressed as:

2.Confusion Matrics

Machine Learning Notes
100% (10)
Machine Learning Notes
19 pages
Machine Learning?
100% (2)
Machine Learning?
114 pages
Describing Data Numerically - Activity-SK
No ratings yet
Describing Data Numerically - Activity-SK
6 pages
DA4675 CFA Level II SmartSheet 2020 PDF
100% (3)
DA4675 CFA Level II SmartSheet 2020 PDF
10 pages
Machine Learning
No ratings yet
Machine Learning
42 pages
Machine Learning Models: by Mayuri Bhandari
No ratings yet
Machine Learning Models: by Mayuri Bhandari
48 pages
Machine Learning for Data Science Unit-4
No ratings yet
Machine Learning for Data Science Unit-4
16 pages
Unit I
No ratings yet
Unit I
44 pages
Machine Learning
No ratings yet
Machine Learning
24 pages
ML Unit1
No ratings yet
ML Unit1
25 pages
Introduction to Machine Learning
No ratings yet
Introduction to Machine Learning
19 pages
ML Unit-1 (CEC)
No ratings yet
ML Unit-1 (CEC)
108 pages
Unit 5 Intro To Machine Learning
No ratings yet
Unit 5 Intro To Machine Learning
25 pages
Introduction To ML
No ratings yet
Introduction To ML
55 pages
Machine Learning: BE Sixth Semester 20CS610
No ratings yet
Machine Learning: BE Sixth Semester 20CS610
211 pages
ML notes
No ratings yet
ML notes
16 pages
Unit 1
No ratings yet
Unit 1
62 pages
Introduction Class
No ratings yet
Introduction Class
134 pages
Machine Learning (Important QS) - Young Researchers
No ratings yet
Machine Learning (Important QS) - Young Researchers
81 pages
Unit 1 Machine Learning - PDF Lands
No ratings yet
Unit 1 Machine Learning - PDF Lands
5 pages
Module 2 - ML
No ratings yet
Module 2 - ML
53 pages
Disruptive Technologies AI Lecture 2
No ratings yet
Disruptive Technologies AI Lecture 2
12 pages
Inductive Learning and Machine Learning
100% (1)
Inductive Learning and Machine Learning
321 pages
Module 1 ML
No ratings yet
Module 1 ML
8 pages
Module 1 ML
No ratings yet
Module 1 ML
51 pages
Machine Learning Notes
100% (1)
Machine Learning Notes
8 pages
Untitled
No ratings yet
Untitled
11 pages
Machine Learning Tutorial_ Learn ML for Free
No ratings yet
Machine Learning Tutorial_ Learn ML for Free
9 pages
presenttion33
No ratings yet
presenttion33
2 pages
There Are Key Areas in The Process of Machine Learning, Like
No ratings yet
There Are Key Areas in The Process of Machine Learning, Like
45 pages
Machine learning_question bank
No ratings yet
Machine learning_question bank
45 pages
ML Unit 1
No ratings yet
ML Unit 1
21 pages
Chapter 01 machine learning
No ratings yet
Chapter 01 machine learning
22 pages
MachineLearning
No ratings yet
MachineLearning
16 pages
AIYA SESSION 4
No ratings yet
AIYA SESSION 4
42 pages
UNit 1 Introduction To ML
No ratings yet
UNit 1 Introduction To ML
225 pages
ML-1-PPT-UNIT-1
No ratings yet
ML-1-PPT-UNIT-1
93 pages
Unit 3
No ratings yet
Unit 3
13 pages
Supervised and Deep Learning
No ratings yet
Supervised and Deep Learning
83 pages
ML Unit 1
No ratings yet
ML Unit 1
9 pages
Introduction To Machine Learning Notes
No ratings yet
Introduction To Machine Learning Notes
26 pages
LECTURE-2
No ratings yet
LECTURE-2
36 pages
Machine Learning - Brief
No ratings yet
Machine Learning - Brief
12 pages
Module2 ch2
No ratings yet
Module2 ch2
36 pages
ML Unit-1
No ratings yet
ML Unit-1
39 pages
Machine Learning.
No ratings yet
Machine Learning.
50 pages
Social Media Analytics Techniques[1] (1)
No ratings yet
Social Media Analytics Techniques[1] (1)
77 pages
Workflow of A Machine Learning Project
No ratings yet
Workflow of A Machine Learning Project
12 pages
ML Doc1
No ratings yet
ML Doc1
14 pages
Basic of Machine Learning
No ratings yet
Basic of Machine Learning
7 pages
Machine Learning
No ratings yet
Machine Learning
54 pages
University Institute of Engineering Department of Computer Science and Engg
No ratings yet
University Institute of Engineering Department of Computer Science and Engg
27 pages
Types of ML
No ratings yet
Types of ML
4 pages
What Is Machine Learning
No ratings yet
What Is Machine Learning
4 pages
Unit 1ML
No ratings yet
Unit 1ML
12 pages
dbms-10 marks
No ratings yet
dbms-10 marks
32 pages
MLT Unit 1
No ratings yet
MLT Unit 1
15 pages
5.3 Model
No ratings yet
5.3 Model
26 pages
ML & DL
No ratings yet
ML & DL
19 pages
Chapter 01 Introduction To Machine Learning
No ratings yet
Chapter 01 Introduction To Machine Learning
59 pages
DATA MINING and MACHINE LEARNING. PREDICTIVE TECHNIQUES: ENSEMBLE METHODS, BOOSTING, BAGGING, RANDOM FOREST, DECISION TREES and REGRESSION TREES.: Examples with MATLAB
From Everand
DATA MINING and MACHINE LEARNING. PREDICTIVE TECHNIQUES: ENSEMBLE METHODS, BOOSTING, BAGGING, RANDOM FOREST, DECISION TREES and REGRESSION TREES.: Examples with MATLAB
César Pérez López
No ratings yet
MACHINE LEARNING FOR BEGINNERS: A Practical Guide to Understanding and Applying Machine Learning Concepts (2023 Beginner Crash Course)
From Everand
MACHINE LEARNING FOR BEGINNERS: A Practical Guide to Understanding and Applying Machine Learning Concepts (2023 Beginner Crash Course)
Elaine Tate
No ratings yet
Linear Approximations and The Cox Model
No ratings yet
Linear Approximations and The Cox Model
40 pages
JMP for Chemistry and Chemical Engineering
No ratings yet
JMP for Chemistry and Chemical Engineering
2 pages
Course Schedule - Summer 2020 MGMT 67000-Y02-DY2 DIS
No ratings yet
Course Schedule - Summer 2020 MGMT 67000-Y02-DY2 DIS
1 page
Research Paper
No ratings yet
Research Paper
10 pages
10 - Quantiles or Fractiles-1
No ratings yet
10 - Quantiles or Fractiles-1
26 pages
LP Pearson Correlation Coefficient
No ratings yet
LP Pearson Correlation Coefficient
11 pages
Name: Treshea Leih Deocampo 1.) 20 Points: Solution: 23. 666666666667
No ratings yet
Name: Treshea Leih Deocampo 1.) 20 Points: Solution: 23. 666666666667
7 pages
Jurnal Kepimpinan Pendidikan
No ratings yet
Jurnal Kepimpinan Pendidikan
21 pages
U I
No ratings yet
U I
13 pages
N X N X: Chapter 8 Selected Problem Solutions
No ratings yet
N X N X: Chapter 8 Selected Problem Solutions
6 pages
SB11 - Group 1
100% (1)
SB11 - Group 1
33 pages
6 The Relationship Between Job Satisfaction and Intention To Stay in Taiwanese Nurse Practitioners
No ratings yet
6 The Relationship Between Job Satisfaction and Intention To Stay in Taiwanese Nurse Practitioners
1 page
Marketing Analysis Homework II
No ratings yet
Marketing Analysis Homework II
15 pages
Correlation & Regression: (DP IB Maths: AA SL)
No ratings yet
Correlation & Regression: (DP IB Maths: AA SL)
1 page
psych-110-chap-3-4-notes
No ratings yet
psych-110-chap-3-4-notes
8 pages
SB Test Bank Chapter 7
No ratings yet
SB Test Bank Chapter 7
77 pages
Sta 312 Regression Analysis and Analysis of Variance
No ratings yet
Sta 312 Regression Analysis and Analysis of Variance
5 pages
One-Sample Kolmogorov-Smirnov Test
No ratings yet
One-Sample Kolmogorov-Smirnov Test
7 pages
Wisavifaxafowakob
No ratings yet
Wisavifaxafowakob
4 pages
BRM Unit-3
No ratings yet
BRM Unit-3
22 pages
BBA Assignment 2
No ratings yet
BBA Assignment 2
6 pages
Sample Size Calculation
No ratings yet
Sample Size Calculation
6 pages
Leejoedel Cruz ASSESSMENT TASK NO.2
No ratings yet
Leejoedel Cruz ASSESSMENT TASK NO.2
2 pages
Sta 316 Cat2 PDF
No ratings yet
Sta 316 Cat2 PDF
2 pages
Maximum Likelihood Estimation
No ratings yet
Maximum Likelihood Estimation
6 pages
IB9JB0 Marketing and R Analytics Assignment
No ratings yet
IB9JB0 Marketing and R Analytics Assignment
36 pages
AIML
No ratings yet
AIML
2 pages