Machine Learning - 1 (UNIT - 1)
In the real world, humans learn from their experiences, while computers and machines simply follow our instructions. But can a machine also learn from experience or past data the way a human does? This is where Machine Learning comes in.
Machine learning algorithms build a mathematical model from sample historical data, known as training data, and use it to make predictions or decisions without being explicitly programmed. Machine learning brings together statistics and computer science to develop predictive models; it constructs or applies algorithms that learn from historical data, and its performance generally improves as the amount of data we provide grows.
In short, a machine can learn if it can use additional data to improve its performance.
How does Machine Learning work?
A machine learning system learns from previous data, builds prediction models, and predicts the output for new data whenever it receives it. The more data it has, the better the model it can build, and the more accurate its predictions become.
Suppose we have a complex problem that requires predictions. Instead of writing explicit rules in code, we feed the data to generic algorithms, which build the logic from the data and predict the output. Machine learning has changed our perspective on such problems: the workflow is to collect past data, train a model on it, and use the trained model to predict the output for new data.
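To make this concrete, the short sketch below shows that workflow with scikit-learn (the library choice, the feature, and the numbers are illustrative assumptions, not something specified above): historical input-output pairs are fed to a generic algorithm, and the fitted model is then asked to predict the output for new data.

    # Sketch of the machine learning workflow described above.
    # Assumes scikit-learn is installed; the data is invented for illustration.
    from sklearn.linear_model import LinearRegression

    # Historical (training) data: hours studied -> exam score
    X_train = [[1], [2], [3], [4], [5]]   # inputs
    y_train = [52, 57, 66, 70, 78]        # known outputs

    # Feed the data to a generic algorithm; the model builds the logic itself
    model = LinearRegression()
    model.fit(X_train, y_train)

    # Predict the output for new, unseen data
    print(model.predict([[6]]))           # predicted score for 6 hours of study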
AI vs ML vs DL
o Definition: AI simulates human intelligence to perform tasks and make decisions. ML is a subset of AI that uses algorithms to learn patterns from data. DL is a subset of ML that employs artificial neural networks for complex tasks.
o Data requirements: AI may or may not require large datasets and can use predefined rules. ML relies heavily on labeled data for training and making predictions. DL requires extensive labeled data and performs exceptionally well with big datasets.
o Human intervention: AI can be rule-based, requiring human programming and intervention. ML automates learning from data and requires less manual intervention. DL also automates feature extraction, reducing the need for manual engineering.
o Scope of tasks: AI can handle various tasks, from simple to complex, across domains. ML specializes in data-driven tasks like classification, regression, etc. DL excels at complex tasks like image recognition, natural language processing, and more.
o Algorithms: AI algorithms can be simple or complex, depending on the application. ML employs various algorithms like decision trees, SVM, and random forests. DL relies on deep neural networks, which can have numerous hidden layers for complex learning.
o Training time and resources: AI may require less training time and fewer resources for rule-based systems. ML training time varies with algorithm complexity and dataset size. DL training demands substantial computational resources and time for deep networks.
o Interpretability: AI systems may offer interpretable results based on human rules. ML models can be more or less interpretable depending on the algorithm. DL models are often considered less interpretable due to their complex network architectures.
o Applications: AI is used in virtual assistants, recommendation systems, and more. ML is applied in image recognition, spam filtering, and other data tasks. DL is utilized in autonomous vehicles, speech recognition, and advanced AI applications.
Poor quality and quantity of data:
The major issue in using machine learning algorithms is a lack of quality as well as quantity of data. Data plays a vital role in machine learning, and many data scientists report that inadequate, noisy, and unclean data severely hamper machine learning algorithms. For example, a simple task may require thousands of training examples, while an advanced task such as speech or image recognition may need millions of examples. Data quality is equally important for the algorithms to work well, yet poor-quality data is common in machine learning applications. Data quality can be affected by factors such as the following:
o Noisy data- It leads to inaccurate predictions and reduces accuracy in classification tasks.
o Incorrect data- It produces faulty models and results; hence, incorrect data can also reduce the accuracy of the results.
o Generalizing of output data- Sometimes generalizing from the output data becomes complex, which results in comparatively poor future actions.
As discussed above, data plays a significant role in machine learning, and it must also be of good quality. Noisy, incomplete, inaccurate, and unclean data lead to lower classification accuracy and low-quality results. Hence, poor data quality is one of the most common problems when working with machine learning algorithms.
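As a brief illustration of handling incomplete and noisy data before training (a sketch only; pandas, the column names, and the thresholds are assumptions not taken from the text above), one common approach is to fill missing values and filter out obviously implausible entries:

    # Sketch of basic data cleaning before training (assumed library: pandas).
    import pandas as pd

    # Hypothetical raw dataset with missing and noisy values
    df = pd.DataFrame({
        "height_cm": [170, 165, None, 180, 999],   # 999 is an obvious outlier
        "weight_kg": [70, None, 60, 82, 75],
    })

    # Incomplete data: fill missing numeric values with each column's median
    df = df.fillna(df.median(numeric_only=True))

    # Noisy data: keep only physically plausible heights
    df = df[(df["height_cm"] > 100) & (df["height_cm"] < 250)]

    print(df)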
Non-representative training data:
To make sure our model generalizes well, the sample training data must be representative of the new cases to which we want to generalize. The training data should cover the range of cases that have occurred and that are likely to occur.
If we use non-representative training data, the model will make less accurate predictions. A machine learning model is considered good if it generalizes well and makes accurate decisions on new cases. If there is too little training data, the sample will contain sampling noise and form a non-representative training set, and the resulting model will be biased toward or against particular classes or groups and will not predict accurately. Hence, we should use representative training data to protect against such bias and to make accurate predictions without drift.
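One simple way to keep a split of the data representative of its classes (a sketch, assuming scikit-learn; the fruit labels and counts are invented) is a stratified train/test split, which preserves the class proportions in both parts:

    # Sketch: a stratified split keeps class proportions representative
    # (assumed library: scikit-learn; data below is illustrative).
    from sklearn.model_selection import train_test_split

    X = [[i] for i in range(20)]
    y = ["apple"] * 12 + ["papaya"] * 8      # imbalanced classes

    X_train, X_test, y_train, y_test = train_test_split(
        X, y, test_size=0.25, stratify=y, random_state=42
    )

    # Both splits now contain apples and papayas in roughly the same ratio
    print(y_train.count("papaya"), y_test.count("papaya"))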
Overfitting:
Overfitting is one of the most common issues faced by machine learning engineers and data scientists. When a machine learning model is trained on a huge amount of data, it starts capturing the noise and inaccurate values present in the training data set, which negatively affects the performance of the model. Consider a simple example: a training set containing 1000 mangoes, 1000 apples, 1000 bananas, and 5000 papayas. With such a biased training set there is a considerable probability of an apple being identified as a papaya, so the predictions suffer. A common cause of overfitting is the use of highly flexible non-linear methods, which can fit unrealistic models to the data; overfitting can often be reduced by using simpler linear and parametric algorithms in the machine learning models.
Methods to reduce overfitting:
o Increase the amount and diversity of training data.
o Use cross-validation to check how well the model generalizes (see the sketch after this list).
o Simplify the model or use regularization to penalize overly complex models.
o Use early stopping so training halts before the model starts memorizing noise.
o Remove irrelevant features from the training data.
o Use ensembling techniques such as bagging and boosting.
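The sketch below illustrates two of these ideas, cross-validation and a simpler model (scikit-learn, the synthetic dataset, and the chosen depths are assumptions for illustration): a training score far above the cross-validated score suggests the model is memorizing noise, and limiting complexity narrows the gap.

    # Sketch: spotting overfitting by comparing training vs. cross-validation score
    # (assumed library: scikit-learn; the synthetic data is illustrative).
    from sklearn.datasets import make_classification
    from sklearn.model_selection import cross_val_score
    from sklearn.tree import DecisionTreeClassifier

    X, y = make_classification(n_samples=300, n_features=20, random_state=0)

    for depth in (None, 3):                  # unlimited depth vs. a simpler tree
        tree = DecisionTreeClassifier(max_depth=depth, random_state=0)
        tree.fit(X, y)
        train_acc = tree.score(X, y)
        cv_acc = cross_val_score(tree, X, y, cv=5).mean()
        print(f"max_depth={depth}: train={train_acc:.2f}, cross-val={cv_acc:.2f}")

    # A much higher training score than cross-validation score indicates
    # overfitting; restricting max_depth (a simpler model) usually reduces the gap.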
Underfitting:
Underfitting is just the opposite of overfitting. When a machine learning model is trained on too little data, or is too simple, it fails to capture the underlying patterns and gives inaccurate results, which hurts the accuracy of the machine learning model.
Underfitting occurs when the model is too simple to capture the underlying structure of the data, much like a pant that is undersized. It generally happens when we have limited data in the data set and try to fit a linear model to non-linear data. In such scenarios the model is not complex enough, its rules are too simple for the data set, and it starts making wrong predictions.
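To illustrate underfitting (a sketch under assumed NumPy/scikit-learn usage; the quadratic data is invented), a straight-line model fit to clearly non-linear data scores poorly, while adding polynomial features lets the same linear algorithm capture the structure:

    # Sketch: a linear model underfits non-linear (quadratic) data
    # (assumed libraries: NumPy and scikit-learn; data is synthetic).
    import numpy as np
    from sklearn.linear_model import LinearRegression
    from sklearn.pipeline import make_pipeline
    from sklearn.preprocessing import PolynomialFeatures

    X = np.linspace(-3, 3, 50).reshape(-1, 1)
    y = X.ravel() ** 2                       # clearly non-linear relationship

    linear = LinearRegression().fit(X, y)
    poly = make_pipeline(PolynomialFeatures(degree=2), LinearRegression()).fit(X, y)

    print("linear R^2:", round(linear.score(X, y), 2))   # low: the model underfits
    print("poly   R^2:", round(poly.score(X, y), 2))     # near 1.0: structure captured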
Irrelevant features:
Although machine learning models are intended to give the best possible outcome, if we feed garbage data as input, the result will also be garbage. Hence, we should use relevant features in our training sample. A machine learning model is considered good when its training data has a good set of features with few or no irrelevant features.
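As a brief sketch of weeding out irrelevant features (scikit-learn's SelectKBest is an assumed choice here, not something named in the text above), a univariate statistical test can score each feature and keep only the most informative ones before training:

    # Sketch: keep only the most relevant features before training
    # (assumed library: scikit-learn; synthetic data with known noise columns).
    from sklearn.datasets import make_classification
    from sklearn.feature_selection import SelectKBest, f_classif

    # 20 features, but only 5 are actually informative; the rest are noise
    X, y = make_classification(n_samples=200, n_features=20, n_informative=5,
                               n_redundant=0, random_state=0)

    selector = SelectKBest(score_func=f_classif, k=5)
    X_selected = selector.fit_transform(X, y)

    print("original shape:", X.shape)           # (200, 20)
    print("selected shape:", X_selected.shape)  # (200, 5)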