0% found this document useful (0 votes)

20 views

Machine Learning

The document provides an overview of regression analysis in machine learning including definitions of key terms and descriptions of different types of regression models. It explains what regression analysis is used for and gives examples of linear, logistic, and other types of regressions.

Uploaded by

R Muhammad

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

20 views

Machine Learning

Uploaded by

R Muhammad

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 7

https://www.youtube.

com/@Codanics/playlists

Turi Create documentation:

https://apple.github.io/turicreate/docs/api/
Install Turicreate using this doc:
https://medium.com/@malondireads/installing-turicreate-on-
windows-10-534e147a4792

Week 1:

Getting started with Python, Jupyter Notebook, & Turi Create

The learning approach in this specialization is to start from use cases and then dig into
algorithms and methods, what we call a case-studies approach. The first course is focused on
understanding how ML can be used in various cases studies, and the follow on courses will dig
into the details of algorithms and methods for each of the main ML areas.

Python
Python is a simple scripting language that makes it easy to interact with data. Python is widely
used in industry, and is becoming the de facto language for data science in industry

Jupyter notebook
. The Jupyter Notebook is a simple interactive environment for programming with Python, which
makes it really easy to share your results. Think about it as a combination of a Python terminal
and a wiki page.
How to install turicreate in windows
https://www.geeksforgeeks.org/guide-to-install-turicreate-in-python3-x/

SFrame
SFrame is a scalable, tabular, column-mutable dataframe object. The data
in SFrame is stored column-wise, and is stored on persistent storage (e.g.
disk) to avoid being constrained by memory size.
Why SFrame & Turi Create
There are many excellent machine learning libraries in Python. One of the most popular one
today is scikit-learn. Similarly, there are many tools for data manipulations in Python; a popular
example is Pandas. However, most of these tools do not scale to large
datasets, including some we will tackle in this specialization. In addition, in this
specialization, we will cover a wide range of ML models, feature engineering transformation,
and evaluation metrics. With most existing packages, you will have to install a combination of
packages to get the tools that we need to tackle the use cases in this course. This is possible,
but requires advanced knowledge of Python, which we feel will slow down most people's
learning of the core concepts.
Turi Create is a highly scalable machine learning library for Python, which also includes the
SFrame, a highly-scalable library for data manipulation. A huge advantage
of SFrame over Pandas is that with SFrame, you are not limited to datasets that fit in
memory, which allows you to deal with large datasets, even on a laptop. (The SFrame API is
very similar to Pandas' API. Here is a doc showing the relationship between the two of them.)

Install turicreate:
https://github.com/apple/turicreate/issues/3198

use this code to run jupyter notebook

cd $HOME
virtualenv venv
cd venv/
source bin/activate

use this one for jupyter notebook

source venv/bin/activate
jupyter notebook

Regression Analysis in Machine

learning
Use this link:
https://www.javatpoint.com/
regression-analysis-in-machine-
learning

Regression analysis is a statistical method to model the relationship between a dependent

(target) and independent (predictor) variables with one or more independent variables. More
specifically, Regression analysis helps us to understand how the value of the dependent variable
is changing corresponding to an independent variable when other independent variables are
held fixed. It predicts continuous/real values such as temperature, age, salary, price, etc.

Regression is a supervised learning technique which helps in finding the correlation between
variables and enables us to predict the continuous output variable based on the one or more
predictor variables. It is mainly used for prediction, forecasting, time series modeling, and
determining the causal-effect relationship between variables.

In Regression, we plot a graph between the variables which best fits the given datapoints, using
this plot, the machine learning model can make predictions about the data. In simple
words, "Regression shows a line or curve that passes through all the datapoints on target-
predictor graph in such a way that the vertical distance between the datapoints and the
regression line is minimum." The distance between datapoints and line tells whether a model
has captured a strong relationship or not.

Terminologies Related to the Regression

Analysis:
o Dependent Variable: The main factor in Regression analysis which we want to predict or
understand is called the dependent variable. It is also called target variable.
o Independent Variable: The factors which affect the dependent variables or which are
used to predict the values of the dependent variables are called independent variable,
also called as a predictor.
o Outliers: Outlier is an observation which contains either very low value or very high
value in comparison to other observed values. An outlier may hamper the result, so it
should be avoided.
o Multicollinearity: If the independent variables are highly correlated with each other
than other variables, then such condition is called Multicollinearity. It should not be
present in the dataset, because it creates problem while ranking the most affecting
variable.
o Underfitting and Overfitting: If our algorithm works well with the training dataset but
not well with test dataset, then such problem is called Overfitting. And if our algorithm
does not perform well even with training dataset, then such problem is
called underfitting.

There are various types of regressions which are used in data science and machine
learning. Each type has its own importance on different scenarios, but at the core, all the
regression methods analyze the effect of the independent variable on dependent
variables. Here we are discussing some important types of regression which are given
below:

o Linear Regression
o Logistic Regression
o Polynomial Regression
o Support Vector Regression
o Decision Tree Regression
o Random Forest Regression
o Ridge Regression
o Lasso Regression:

Linear Regression:
o Linear regression is a statistical regression method which is used for predictive analysis.
o It is one of the very simple and easy algorithms which works on regression and shows
the relationship between the continuous variables.
o It is used for solving the regression problem in machine learning.
o Linear regression shows the linear relationship between the independent variable (X-axis)
and the dependent variable (Y-axis), hence called linear regression.
o If there is only one input variable (x), then such linear regression is called simple linear
regression. And if there is more than one input variable, then such linear regression is
called multiple linear regression.
o The relationship between variables in the linear regression model can be explained using
the below image. Here we are predicting the salary of an employee on the basis of the
year of experience.

o Below is the mathematical equation for Linear regression:

1. Y= aX+b

Here, Y = dependent variables (target variables),

X= Independent variables (predictor variables),
a and b are the linear coefficients

Some popular applications of linear regression are:

o Analyzing trends and sales estimates

o Salary forecasting
o Real estate prediction
o Arriving at ETAs in traffic.

turicreate.SFrame.groupby
SFrame.groupby (key_column_names, operations, *args)

products.groupby('name',operations={'count':turicreate.aggregate.COUNT()})
https://apple.github.io/turicreate/docs/api/generated/turicreate.SFrame.groupby.html#turicreate-
sframe-groupby
roducts['word_count'] = turicreate.text_analytics.count_words(products['review'])

turicreate.SFrame.sort
SFrame.sort (key_column_names, ascending=True)

.sort('count',ascending=False)

turicreate.logistic_classifier.create
turicreate.logistic_classifier.create (dataset, target, features=None, l2_penalty=0.01, l1_penal
ty=0.0, solver='auto', feature_rescaling=True, convergence_threshold=0.01, step_size=1.
0, lbfgs_memory_level=11, max_iterations=10, class_weights=None, validation_set='auto
', verbose=True, seed=None)
sentiment_model = turicreate.logistic_classifier.create(train_data,target='sentiment',
features=['word_count'], validation_set=test_data)
sentiment = is column
word_count = is column
train_data,test_data = products.random_split(.8,seed=0)
train_data = 80% of data
test data = 20% of data

turicreate.linear_regression.LinearRegression.pre
dict
LinearRegression.predict (dataset, missing_value_action='auto')

products['predicted_sentiment'] = sentiment_model.predict(products, output_type = 'probability')

Machine Learning Interview Questions
From Everand
Machine Learning Interview Questions
Tech Interviews
4.5/5 (2)
Financial Econometrics Mathematics and Statistics Theory Method and Application Hardcovernbsped 1493994271 9781493994274 - Compress
100% (1)
Financial Econometrics Mathematics and Statistics Theory Method and Application Hardcovernbsped 1493994271 9781493994274 - Compress
657 pages
Ecology Global Insights and Investigations 2nd Edition by Stiling ISBN Test Bank
100% (38)
Ecology Global Insights and Investigations 2nd Edition by Stiling ISBN Test Bank
12 pages
Assignment 1:: Intro To Machine Learning
No ratings yet
Assignment 1:: Intro To Machine Learning
6 pages
ML Combined
No ratings yet
ML Combined
254 pages
Machine Learning
100% (3)
Machine Learning
46 pages
COMP1801 - Copy 1
No ratings yet
COMP1801 - Copy 1
18 pages
Presentation 2
No ratings yet
Presentation 2
9 pages
Unit 2 Notes - Final
No ratings yet
Unit 2 Notes - Final
32 pages
Machine Learning: Bilal Khan
100% (2)
Machine Learning: Bilal Khan
20 pages
Data Science
No ratings yet
Data Science
5 pages
Unit - Iii Data Analysis
No ratings yet
Unit - Iii Data Analysis
39 pages
Regression: UNIT - V Regression Model
100% (1)
Regression: UNIT - V Regression Model
21 pages
Unit 2
No ratings yet
Unit 2
67 pages
Machine learning notes
No ratings yet
Machine learning notes
12 pages
Python Theory Notes
No ratings yet
Python Theory Notes
28 pages
Module_2
No ratings yet
Module_2
5 pages
Lab Experiment 4 - AI
No ratings yet
Lab Experiment 4 - AI
7 pages
Unit 1 Machine Learning - PDF Lands
No ratings yet
Unit 1 Machine Learning - PDF Lands
5 pages
Unit - 2 MLA
No ratings yet
Unit - 2 MLA
57 pages
Unit 1 Machine Learning
No ratings yet
Unit 1 Machine Learning
10 pages
Module 1 Notes
100% (1)
Module 1 Notes
73 pages
9 Types of Regression Analysis
No ratings yet
9 Types of Regression Analysis
16 pages
TYPES OF SUPERVISED LEARNING2
No ratings yet
TYPES OF SUPERVISED LEARNING2
66 pages
Commonly Used Machine Learning Algorithms
No ratings yet
Commonly Used Machine Learning Algorithms
38 pages
UNIT-2 Material
No ratings yet
UNIT-2 Material
71 pages
CS601_Machine Learning_Unit 1_Notes_1672759748
No ratings yet
CS601_Machine Learning_Unit 1_Notes_1672759748
13 pages
4 ML
No ratings yet
4 ML
41 pages
ML-U2-Regression
No ratings yet
ML-U2-Regression
20 pages
Regression Analysis in Machine Learning - Javatpoint
No ratings yet
Regression Analysis in Machine Learning - Javatpoint
1 page
ML Unit 2
No ratings yet
ML Unit 2
27 pages
statistics for data science
No ratings yet
statistics for data science
4 pages
Regression Analysis in Machine Learning
No ratings yet
Regression Analysis in Machine Learning
26 pages
Data Science Deep Learning & Artificial Intelligence
No ratings yet
Data Science Deep Learning & Artificial Intelligence
9 pages
MODULE 5
No ratings yet
MODULE 5
31 pages
UNIT 3 Regression
No ratings yet
UNIT 3 Regression
5 pages
Machine Learning The Basics
No ratings yet
Machine Learning The Basics
158 pages
SML
No ratings yet
SML
8 pages
DOC-20240831-WA0023.
No ratings yet
DOC-20240831-WA0023.
22 pages
Seminar Presentation
No ratings yet
Seminar Presentation
25 pages
Unit1 6thsemCS
No ratings yet
Unit1 6thsemCS
22 pages
House Report
No ratings yet
House Report
26 pages
MLT Unit 2 Linear Regression
No ratings yet
MLT Unit 2 Linear Regression
26 pages
ml record
No ratings yet
ml record
21 pages
7 محاضرات
No ratings yet
7 محاضرات
36 pages
Supervised Learning
No ratings yet
Supervised Learning
24 pages
Regression Analysis in Machine Learning
No ratings yet
Regression Analysis in Machine Learning
9 pages
Aiml Unit 3
No ratings yet
Aiml Unit 3
9 pages
Intro To Machine Learning With PyTorch
No ratings yet
Intro To Machine Learning With PyTorch
48 pages
Machine Learning With Real Life Project: by - Rishabh Gaur
100% (2)
Machine Learning With Real Life Project: by - Rishabh Gaur
26 pages
ML 2 nd Unit
No ratings yet
ML 2 nd Unit
50 pages
Report
No ratings yet
Report
11 pages
Linear Regression
No ratings yet
Linear Regression
64 pages
AI lab7
No ratings yet
AI lab7
13 pages
Data Science Course in Hyderabad - Innomatics
No ratings yet
Data Science Course in Hyderabad - Innomatics
10 pages
ML 3 (1)
No ratings yet
ML 3 (1)
50 pages
ML Report 1
No ratings yet
ML Report 1
23 pages
22
No ratings yet
22
13 pages
Ch-2 Supervised Machine Learning
No ratings yet
Ch-2 Supervised Machine Learning
48 pages
CC02 Group6 Report
No ratings yet
CC02 Group6 Report
36 pages
Lecture 2
No ratings yet
Lecture 2
17 pages
Process Performance Models: Statistical, Probabilistic & Simulation
From Everand
Process Performance Models: Statistical, Probabilistic & Simulation
Vishnuvarthanan Moorthy
No ratings yet
Exponential Smoothing Excel Template
No ratings yet
Exponential Smoothing Excel Template
6 pages
Lab 9
No ratings yet
Lab 9
2 pages
100 Employee Data Set
No ratings yet
100 Employee Data Set
7 pages
Landau Theory Full - 3
No ratings yet
Landau Theory Full - 3
11 pages
Gas Law FLORES
No ratings yet
Gas Law FLORES
1 page
Regression
No ratings yet
Regression
14 pages
Logistic Regression & Practice
100% (1)
Logistic Regression & Practice
51 pages
20. AILSAINASSHABRINA(OUTPUTREGRESSIONINVRENING)
No ratings yet
20. AILSAINASSHABRINA(OUTPUTREGRESSIONINVRENING)
19 pages
AIC and BIC
No ratings yet
AIC and BIC
5 pages
PHYS09007 2016 May
No ratings yet
PHYS09007 2016 May
7 pages
Lampiran 4 (Output SPSS)
No ratings yet
Lampiran 4 (Output SPSS)
14 pages
Tutorial - Session Nine
0% (1)
Tutorial - Session Nine
3 pages
Lecture Notes by D. Arovas
No ratings yet
Lecture Notes by D. Arovas
440 pages
Cronbach's Alpha Calculator
No ratings yet
Cronbach's Alpha Calculator
5 pages
Nils Baker
No ratings yet
Nils Baker
11 pages
004 07 Roc Auc Eer W4L2 W5L1 PDF
No ratings yet
004 07 Roc Auc Eer W4L2 W5L1 PDF
12 pages
Margin of Error
No ratings yet
Margin of Error
5 pages
A Regression Analysis Investigating The Relationship Between Income and Happiness
No ratings yet
A Regression Analysis Investigating The Relationship Between Income and Happiness
7 pages
Chapter 2 Simple Linear Regression - Jan2023
No ratings yet
Chapter 2 Simple Linear Regression - Jan2023
66 pages
T test
No ratings yet
T test
17 pages
Analisis Autokorelasi Spasialtitik Panas Di Kalimantan Timur Menggunakan Indeks Moran PDF
No ratings yet
Analisis Autokorelasi Spasialtitik Panas Di Kalimantan Timur Menggunakan Indeks Moran PDF
8 pages
Introduction To The Theory of Ferromagnetism: Exact Solution of The Ising Model in One Dimension. Antiferromagnetism
No ratings yet
Introduction To The Theory of Ferromagnetism: Exact Solution of The Ising Model in One Dimension. Antiferromagnetism
5 pages
PRAC8_23BME053
No ratings yet
PRAC8_23BME053
2 pages
Gpa Salary
No ratings yet
Gpa Salary
14 pages
Pls Sem Model
No ratings yet
Pls Sem Model
97 pages
EXE - Forecasting
No ratings yet
EXE - Forecasting
3 pages
Nama: Ahmad Jordiansyah Kelas: A NPM:170610200049
No ratings yet
Nama: Ahmad Jordiansyah Kelas: A NPM:170610200049
8 pages
Output SmartPLS 27 September 2024 Brostrapping
No ratings yet
Output SmartPLS 27 September 2024 Brostrapping
153 pages

Machine Learning

Uploaded by

Machine Learning

Uploaded by

https://www.youtube.

Turi Create documentation:

Getting started with Python, Jupyter Notebook, & Turi Create

use this code to run jupyter notebook

use this one for jupyter notebook

Regression Analysis in Machine

Regression analysis is a statistical method to model the relationship between a dependent

Terminologies Related to the Regression

o Below is the mathematical equation for Linear regression:

Here, Y = dependent variables (target variables),

Some popular applications of linear regression are:

o Analyzing trends and sales estimates

products['predicted_sentiment'] = sentiment_model.predict(products, output_type = 'probability')

You might also like