0% found this document useful (0 votes)

5 views

ML_LAB_MANUAL

The document provides an overview of various Python libraries and techniques for data analysis and machine learning, including statistics, linear regression, decision trees, K-nearest neighbors, and logistic regression. It includes code examples for computing central tendency and dispersion measures, using libraries like NumPy, SciPy, Pandas, and Matplotlib, as well as implementing machine learning models with scikit-learn. Additionally, it covers performance analysis of classification algorithms using metrics such as accuracy and F1-score.

Uploaded by

lagishettisuresh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

5 views

ML_LAB_MANUAL

Uploaded by

lagishettisuresh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 12

1.

Compute Central Tendency and Dispersion Measures

import statistics as stats

# Sample Data
data = [10, 20, 20, 40, 50, 50, 50, 80, 90]

# Central Tendency
mean = stats.mean(data)
median = stats.median(data)
mode = stats.mode(data)

# Dispersion
variance = stats.variance(data)
std_dev = stats.stdev(data)

# Display Results
print(f"Mean: {mean}")
print(f"Median: {median}")
print(f"Mode: {mode}")
print(f"Variance: {variance}")
print(f"Standard Deviation: {std_dev}")

OUTPUT:

Mean: 45.55555555555556
Median: 50
Mode: 50
Variance: 727.7777777777778
Standard Deviation: 26.977356760397743
2. Study of Python Basic Libraries such as Statistics, Math, Numpy and Scipy.

 Python Math Library

The math module is a standard module in Python and is always available. To use mathematical functions
under this module, you have to import the module using import math. It gives access to the underlying C
library functions. This module does not support complex datatypes. The math module is the complex
counterpart.

List of Functions in Python Math Module

Function Description
ceil(x) Returnsthesmallestinteger greaterthanorequalto x.
copysign(x,y) Returnsxwith thesignofy

fabs(x) Returnstheabsolutevalueof x
factorial(x) Returnsthefactorialofx
floor(x) Returnsthelargestintegerlessthanorequaltox
fmod(x,y) Returnsthe remainderwhen xisdivided byy
frexp(x) Returnsthemantissa andexponent ofxasthe pair(m,
e)
fsum(iterable) Returnsan accuratefloatingpointsum of values in the
iterable
isfinite(x) Returns Trueif xis neither an infinitynor aNaN (Not
aNumber)
isinf(x) ReturnsTrueifxisa positiveornegativeinfinity
isnan(x) ReturnsTrueif x is a NaN
ldexp(x,i) Returnsx*(2**i)
modf(x) Returnsthefractionalandintegerpartsofx
trunc(x) Returnsthetruncated integervalueof x
exp(x) Returnse**x
expm1(x) Returnse**x-1

Program-1
 Python Scipy Library

SciPy is an Open Source Python-based library, which is used in mathematics, scientific computing,
Engineering, and technical computing. SciPy also pronounced as "SighPi."

SciPy contains varieties of sub packages which help to solve the most common issue related to Scientific
Computation.
SciPy is the most used Scientific library only second to GNU Scientific Library for C/C++ or Matlab's.
Easy to use and understand as well as fast computational power.
It can operate on an array of NumPylibrary.

Numpy VS SciPy

Numpy:
1. Numpy is written in C and used for mathematical or numerical calculation.
2. It is faster than other Python Libraries
3. Numpy is the most use full Library for Data Science to perform basic calculations.
4. Numpy contains nothing but array datatype which performs the most basic operation like
5. sorting,shaping,indexing,etc.

SciPy:

1. SciPy is built in top of the NumPy

2. SciPy is a fully-feature diversion of Linear Algebra while Numpy contains only a few features.
3. Most new Data Science features are available in Scipy rather than Numpy.

Linear Algebra with SciPy

1. Linear Algebra of SciPy is an implementation of BLAS and ATLASLA PACK libraries.
2. Performance of Linear Algebra is very fast compared to BLAS and LAPACK.
3. Linear algebra routine accepts two-dimensional array object and output is also a two-dimensional array.
4. Now let's do some test with scipy.linalg,

Calculating determinant of a two-dimensional matrix,

Program-1
3. Study of Python Basic Libraries such as Pandas and Matplotlib.

The primary two components of pandas are the Series and Data Frame.
A Series is essentially a column, and a Data Frame is a multi-dimensional table made up of a collection
ofSeries.
Data Frames and Series are quite similar in that many operations that you can do with one you can do with the
other, such as filling in null values and calculating the mean.

Reading data from CSVs

With CSV files all you need is a single line to loading the data:
df = pd.read_csv('purchases.csv')df

Let's load in the IMDB movies dataset to begin:

movies_df=pd.read_csv("IMDB-Movie-Data.csv",index_col="Title")
We're loading this dataset from a CSV and designating the movie titles to be our index.

Viewingyour data
The first thing to do when opening a new data set is print out a few rows to keep as a visual reference.We
accomplish this with head():
Movies_df.head()

Another fast and useful attribute is.shape,which outputs just a tuple of (rows,columns):
movies_df.shape
Note that. Shape has no parentheses and is a simple tuple of format (rows, columns). So we have1000 rows
and 11 columns in our movies Data Frame.
You'll be going to shape a lot when cleaning and transforming data.For example, you might filter some rows
based on some criteria and then want to know quickly how many rows were removed.

Program-1
We haven't defined an index in our example, but we see two columns in our output: The right column
contains our data, whereas the left column contains the index.Pandas created a default index starting with 0
going to 5, which is the length of the data minus 1.

dtype('int64'):The type int 64 tells us that Python is storing each value with in this column as a 64bitinteger

Program-2
Wecan directlyaccesstheindexandthe values of our Series S:

 Matplotlib Library

Pyplot is a module of Matplotlib which provides simple functions to add plot elements like lines, images,
text,etc. to the current axes in the current figure.

Make a simple plot

importmatplotlib.pyplotasplt import
numpy asnp
List of all the methods as they appeared.

plot(x-axis values,y-axis values)—plots a simple line graph with x-axis values against y-axis values
show()—displays the graph
title(―string ) — set the title of the plot as specified by the string
xlabel(―string )—set the label for x-axis as specified by the string
ylabel(―string )—set the label for y-axis as specified by thestring
figure()— used to control a figure level attributes
subplot(nrows,ncols,index)— Add a subplot to the current figure
suptitle(―string ) —It adds a common title to the figure specified by the string
subplots(nrows,ncols,figsize)—a convenient way to create sub plots, in a single call.It returns a figure
and number of axes.

set_title(―string )—an axes level method used to set the title of sub plots in a figure
bar(categorical variables, values, color) —used to create vertical bar graphs bar(categorical
variables, values, color) —used to create horizontal bar graphs legend(loc)—used to make
legend of the graph
xticks(index, categorical variables)—Get or set the current tick locations and labels of the x-axis
pie(value, categorical variables) —used to create a pie chart

hist(values,number of bins) —used to create a histogram

xlim(start value,endvalue)—used to set the limit of values of the x-axis
ylim(start value, end value)—used to set the limit of values of they-axis
scatter(x-axisvalues,y-axisvalues)—plots as catter plot with x-axisvalues against y-axisvalues axes()—
adds an axes to the current figure
set_xlabel(―string ) — axes level method used to set the x-label of the plot specified as a string
set_ylabel(―string )— axes level method used to set they-label of the plot specified as a string
scatter3D(x-axisvalues,y-axisvalues)—plots a three-dimensional scatter plot with x-axisvalues
against y-axisvalues
plot3D(x-axisvalues,y-axisvalues)—plots a three-dimensional line graph with x-axis values against y-
axis values

Here we import Matplotlib‘s Py plot module and Numpy library as most of the data that we will be working
with arrays only.

We pass two arrays as our input arguments to Pyplot‘s plot() method and use show()method to invoke the
required plot. Here note that the first array appears on the x-axis and second array appears on the y-axis of
the plot. Now that our first plot is ready,let us add the title and namex-axis and y-axis using methods title(),
x label() and y label()respectively.
4. Simple Linear Regression
import numpy as np
import matplotlib.pyplot as plt
from sklearn.linear_model import LinearRegression

# Sample Data
x = np.array([1, 2, 3, 4, 5]).reshape(-1, 1)
y = np.array([1, 3, 5, 7, 9])

# Linear Regression Model

model = LinearRegression()
model.fit(x, y)

# Prediction
y_pred = model.predict(x)

# Plot
plt.scatter(x, y, color='blue', label='Actual')
plt.plot(x, y_pred, color='red', label='Predicted')
plt.legend()
plt.show()

OUTPUT:
5. Multiple Linear Regression for House Price Prediction
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import train_test_split
from sklearn.metrics import mean_squared_error
import pandas as pd

data = pd.DataFrame({
"Size": [1400, 1600, 1700, 1875],
"Bedrooms": [3, 3, 4, 3],
"Age": [20, 15, 18, 12],
"Price": [245000, 312000, 279000, 308000]
})

X = data[["Size", "Bedrooms", "Age"]]

y = data["Price"]

# Train/Test Split
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2)

# Model
model = LinearRegression()
model.fit(X_train, y_train)

# Prediction
y_pred = model.predict(X_test)
print(f"Mean Squared Error: {mean_squared_error(y_test, y_pred)}")

OUTPUT:

<ipython-input-5-04f1b0d55f88>:4: DeprecationWarning:
Pyarrow will become a required dependency of pandas in the next major release of pandas (pand
as 3.0),
(to allow more performant data types, such as the Arrow string type, and better interoperability w
ith other libraries)
but was not found to be installed on your system.
If this would cause problems for you,
please provide us feedback at https://github.com/pandas-dev/pandas/issues/54466

import pandas as pd
Mean Squared Error: 1419700880.4400368
6. Decision Tree and Parameter Tuning
from sklearn.tree import DecisionTreeClassifier
from sklearn.model_selection import GridSearchCV

X = [[1, 1], [2, 2], [3, 3],[4,4],[5,5]]

y = [0, 0, 1,1,0]

# Decision Tree
clf = DecisionTreeClassifier()
parameters = {"max_depth": [1, 2, 3], "criterion": ["gini", "entropy"]}
grid_search = GridSearchCV(clf, parameters,cv=2)
grid_search.fit(X, y)

print(f"Best Parameters: {grid_search.best_params_}")

OUTPUT:

Best Parameters: {'criterion': 'gini', 'max_depth': 1}

7. K-Nearest Neighbors
from sklearn.neighbors import KNeighborsClassifier

# Data
X = [[1], [2], [3], [6], [7], [8]]
y = [0, 0, 0, 1, 1, 1]

# KNN Model
model = KNeighborsClassifier(n_neighbors=3)
model.fit(X, y)

# Prediction
print(model.predict([[4]]))

OUTPUT:

[0]
8. Logistic Regression

from sklearn.linear_model import LogisticRegression

# Data
X = [[1], [2], [3], [6], [7], [8]]
y = [0, 0, 0, 1, 1, 1]

# Logistic Regression
model = LogisticRegression()
model.fit(X, y)

# Prediction
print(model.predict([[4]]))

OUTPUT:
[0]

9. K-Means Clustering
from sklearn.cluster import KMeans
import numpy as np
X = np.array([[1, 2], [2, 3], [3, 4], [8, 9], [9, 10], [10, 11]])

# K-Means Clustering
kmeans = KMeans(n_clusters=2)
kmeans.fit(X)

print(f"Cluster Centers: {kmeans.cluster_centers_}")

OUTPUT:

Cluster Centers: [[ 2. 3.]

[ 9. 10.]]
10. Performance Analysis of Classification Algorithms (Mini Project)
import numpy as np
import pandas as pd
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.metrics import accuracy_score, precision_score, recall_score, f1_score
from sklearn.tree import DecisionTreeClassifier
from sklearn.neighbors import KNeighborsClassifier
from sklearn.linear_model import LogisticRegression

# Loading Datsets
data = load_iris()
X = pd.DataFrame(data.data, columns=data.feature_names)
y = data.target

# Split the data into training and testing sets

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

# Initialize classifiers
models = {
"Decision Tree": DecisionTreeClassifier(max_depth=3, random_state=42),
"KNN": KNeighborsClassifier(n_neighbors=3),
"Logistic Regression": LogisticRegression(max_iter=200, random_state=42)
}

# Train and evaluate each model

results = []
for name, model in models.items():
model.fit(X_train, y_train)
y_pred = model.predict(X_test)

# Evaluate metrics
accuracy = accuracy_score(y_test, y_pred)
precision = precision_score(y_test, y_pred, average="weighted")
recall = recall_score(y_test, y_pred, average="weighted")
f1 = f1_score(y_test, y_pred, average="weighted")

results.append({
"Model": name,
"Accuracy": accuracy,
"Precision": precision,
"Recall": recall,
"F1-Score": f1
})

results_df = pd.DataFrame(results)
print(results_df)

OUTPUT:

Model Accuracy Precision Recall F1-Score

0 Decision Tree 1.0 1.0 1.0 1.0
1 KNN 1.0 1.0 1.0 1.0
2 Logistic Regression 1.0 1.0 1.0 1.0

TCRC December Evaluation
No ratings yet
TCRC December Evaluation
53 pages
Developmental Psychology Reviewer
75% (4)
Developmental Psychology Reviewer
8 pages
Unit 5 PythonPackages(Matplotlib)
No ratings yet
Unit 5 PythonPackages(Matplotlib)
24 pages
Lab description file (4)
No ratings yet
Lab description file (4)
11 pages
EXP1-siddhant gupta (23_SE_148)
No ratings yet
EXP1-siddhant gupta (23_SE_148)
17 pages
ML Lab Manual
No ratings yet
ML Lab Manual
28 pages
Scipy,Matplotlib,Pandas
No ratings yet
Scipy,Matplotlib,Pandas
16 pages
ML LAB_MANUAL
No ratings yet
ML LAB_MANUAL
15 pages
Tutorial 2
No ratings yet
Tutorial 2
9 pages
fdsa lab manual final
No ratings yet
fdsa lab manual final
70 pages
CO-367 Machine Learning Lab File: Submitted To: Submitted by
No ratings yet
CO-367 Machine Learning Lab File: Submitted To: Submitted by
12 pages
ML3_Data_Analysis
No ratings yet
ML3_Data_Analysis
80 pages
Py PPT 06
No ratings yet
Py PPT 06
33 pages
Unit 5-Python Packages 240127 185930
No ratings yet
Unit 5-Python Packages 240127 185930
34 pages
Python Abstract
No ratings yet
Python Abstract
7 pages
Machine Learning Lab File: Submitted To: Submitted by
No ratings yet
Machine Learning Lab File: Submitted To: Submitted by
9 pages
Unit 5
No ratings yet
Unit 5
27 pages
Introduction To Python (Part III)
No ratings yet
Introduction To Python (Part III)
29 pages
PP&DS UNIT III
No ratings yet
PP&DS UNIT III
26 pages
Unit Vi
No ratings yet
Unit Vi
60 pages
3-numpy_pandas
No ratings yet
3-numpy_pandas
37 pages
Fds Lab Record
No ratings yet
Fds Lab Record
84 pages
Advance Data Analysis and Visualisation - With - Python For Executives and Business Management
No ratings yet
Advance Data Analysis and Visualisation - With - Python For Executives and Business Management
76 pages
unit 5
No ratings yet
unit 5
28 pages
Data Analysis and Visualisation With Python
No ratings yet
Data Analysis and Visualisation With Python
75 pages
Numpy_Data_Analysis_and_visualisation_with_Python
No ratings yet
Numpy_Data_Analysis_and_visualisation_with_Python
75 pages
Python-Unit-4
No ratings yet
Python-Unit-4
43 pages
NumPy and Pandas
No ratings yet
NumPy and Pandas
72 pages
Week 4- Introduction to Python #3
No ratings yet
Week 4- Introduction to Python #3
47 pages
Q-Step WS 06112019 Data Analysis and Visualisation With Python
No ratings yet
Q-Step WS 06112019 Data Analysis and Visualisation With Python
76 pages
MACHINE LEARNING LAB WORD 12-1-2025. DOCUMENT
No ratings yet
MACHINE LEARNING LAB WORD 12-1-2025. DOCUMENT
68 pages
Ad3411 - Student
No ratings yet
Ad3411 - Student
27 pages
Lesson 03 3.01 Python Libraries For Data Science
No ratings yet
Lesson 03 3.01 Python Libraries For Data Science
79 pages
PP_unit-5_notes
No ratings yet
PP_unit-5_notes
15 pages
Practical # 8
No ratings yet
Practical # 8
16 pages
ML LAB(R22) MANUAL (4)
No ratings yet
ML LAB(R22) MANUAL (4)
25 pages
r22-1-9-ml-lab-manual-r22-regulations
No ratings yet
r22-1-9-ml-lab-manual-r22-regulations
24 pages
12_Numpy&Matplotlib
No ratings yet
12_Numpy&Matplotlib
48 pages
Machine Learning - Manual
No ratings yet
Machine Learning - Manual
32 pages
Fundamentals of Data Science Lab Manual New1
No ratings yet
Fundamentals of Data Science Lab Manual New1
32 pages
Final Fds Manual Print
No ratings yet
Final Fds Manual Print
55 pages
ANL252 SU3 Jul2022
No ratings yet
ANL252 SU3 Jul2022
23 pages
Final Fds Manual
No ratings yet
Final Fds Manual
77 pages
pythonlibraries[1]
No ratings yet
pythonlibraries[1]
20 pages
4 Introduction to Python Part 3 (2)
No ratings yet
4 Introduction to Python Part 3 (2)
48 pages
01 Introduction to Python
No ratings yet
01 Introduction to Python
36 pages
Ip Chapter 1
No ratings yet
Ip Chapter 1
36 pages
Data Analysis Lab - Final - 23-24
No ratings yet
Data Analysis Lab - Final - 23-24
11 pages
LAB 2 DWM
No ratings yet
LAB 2 DWM
13 pages
Exp1-ref-doc-installation
No ratings yet
Exp1-ref-doc-installation
6 pages
Data Handling Python NCERT
No ratings yet
Data Handling Python NCERT
36 pages
Panda Ncert 1
No ratings yet
Panda Ncert 1
36 pages
CH 2
No ratings yet
CH 2
36 pages
FODS_LAB_MANUAL
No ratings yet
FODS_LAB_MANUAL
26 pages
12.1 - 12.9 Introduction To Modules - Libraries For DataScience
No ratings yet
12.1 - 12.9 Introduction To Modules - Libraries For DataScience
54 pages
Data Visualization1
No ratings yet
Data Visualization1
52 pages
Python Libraries
No ratings yet
Python Libraries
27 pages
DSF LAB EXP FULL (1) (1)
No ratings yet
DSF LAB EXP FULL (1) (1)
88 pages
IP Book 12 Question Bank
No ratings yet
IP Book 12 Question Bank
20 pages
Profound Python Data Science
From Everand
Profound Python Data Science
Onder Teker
No ratings yet
Python For Beginners
From Everand
Python For Beginners
Célio Azevedo
No ratings yet
Introduction to PHP, Part 2, Second Edition
From Everand
Introduction to PHP, Part 2, Second Edition
Adam Majczak
No ratings yet
JAVA_LAB_MANUAL
No ratings yet
JAVA_LAB_MANUAL
65 pages
DBMS_VIVA_Q&A
No ratings yet
DBMS_VIVA_Q&A
4 pages
COURSE FILE-PPL
No ratings yet
COURSE FILE-PPL
36 pages
Complex Integrity Constraints in SQL
No ratings yet
Complex Integrity Constraints in SQL
8 pages
Lg952 Wheel Loader Parts Catalog: Shandong Lingong Construction Machinery Co.,Ltd
100% (1)
Lg952 Wheel Loader Parts Catalog: Shandong Lingong Construction Machinery Co.,Ltd
252 pages
4 Merancang Jaringan Supply Chain
No ratings yet
4 Merancang Jaringan Supply Chain
2 pages
(Ebook) The Oxford Handbook of Time and Politics by Klaus Goetz ISBN 9780190862084, 0190862084download
100% (2)
(Ebook) The Oxford Handbook of Time and Politics by Klaus Goetz ISBN 9780190862084, 0190862084download
57 pages
Deck Machinery
100% (5)
Deck Machinery
18 pages
Grade 7 EMS Case Study - Term 1 - 2023 Memorandum…
No ratings yet
Grade 7 EMS Case Study - Term 1 - 2023 Memorandum…
1 page
Diode
100% (2)
Diode
32 pages
Rentlandedproperty Fine
No ratings yet
Rentlandedproperty Fine
7 pages
The Practical Lubrication of Clocks and Watches
0% (1)
The Practical Lubrication of Clocks and Watches
23 pages
Bloom's Taxonomy
No ratings yet
Bloom's Taxonomy
4 pages
Ribbon Blenders - A Best Practices Guide
No ratings yet
Ribbon Blenders - A Best Practices Guide
8 pages
GSTR1 Report - 07 - 23 - To - 07 - 23
No ratings yet
GSTR1 Report - 07 - 23 - To - 07 - 23
45 pages
Chapter2 Study Guide Key
No ratings yet
Chapter2 Study Guide Key
6 pages
Academic Writing April 2019 Test
No ratings yet
Academic Writing April 2019 Test
4 pages
Multi-Stage Fusion Algorithm For Estimation of Aerodynamic Angles in Mini Aerial Vehicle
No ratings yet
Multi-Stage Fusion Algorithm For Estimation of Aerodynamic Angles in Mini Aerial Vehicle
8 pages
Form 1 Exercises
100% (1)
Form 1 Exercises
160 pages
2020 CyberCamp Registration Web
No ratings yet
2020 CyberCamp Registration Web
20 pages
SignFlow Setup and Overview
No ratings yet
SignFlow Setup and Overview
6 pages
10 Steps For Scheduling With Appworx
100% (1)
10 Steps For Scheduling With Appworx
30 pages
Cps 330 C
No ratings yet
Cps 330 C
110 pages
MTP-410-091 Converter
No ratings yet
MTP-410-091 Converter
2 pages
Teaching The Unit Radian As A Physical Quantity: Íîâè Ïîäõîäè New Approaches
No ratings yet
Teaching The Unit Radian As A Physical Quantity: Íîâè Ïîäõîäè New Approaches
5 pages
Ijed 2022 03 s0340CracksGerdolleBrowet
No ratings yet
Ijed 2022 03 s0340CracksGerdolleBrowet
17 pages
Lesson Plan Science
No ratings yet
Lesson Plan Science
4 pages
QSB Audit1
No ratings yet
QSB Audit1
1 page
Standard Test Procedures Manual: 1. Scope 1.1. Description of Test
No ratings yet
Standard Test Procedures Manual: 1. Scope 1.1. Description of Test
6 pages
Instant download The American Promise Volume C A History of the United States Since 1890 James L. Roark pdf all chapter
No ratings yet
Instant download The American Promise Volume C A History of the United States Since 1890 James L. Roark pdf all chapter
65 pages
CS 6515 2025-1
No ratings yet
CS 6515 2025-1
8 pages
Fear and Peer Pressure - The Catalysts of Compliance in Shirley Jackson's "The Lottery"
No ratings yet
Fear and Peer Pressure - The Catalysts of Compliance in Shirley Jackson's "The Lottery"
2 pages

ML_LAB_MANUAL

Uploaded by

ML_LAB_MANUAL

Uploaded by

1.

Compute Central Tendency and Dispersion Measures

 Python Math Library

List of Functions in Python Math Module

1. SciPy is built in top of the NumPy

Linear Algebra with SciPy

Calculating determinant of a two-dimensional matrix,

Reading data from CSVs

Let's load in the IMDB movies dataset to begin:

Make a simple plot

hist(values,number of bins) —used to create a histogram

# Linear Regression Model

X = data[["Size", "Bedrooms", "Age"]]

X = [[1, 1], [2, 2], [3, 3],[4,4],[5,5]]

print(f"Best Parameters: {grid_search.best_params_}")

Best Parameters: {'criterion': 'gini', 'max_depth': 1}

from sklearn.linear_model import LogisticRegression

print(f"Cluster Centers: {kmeans.cluster_centers_}")

Cluster Centers: [[ 2. 3.]

# Split the data into training and testing sets

# Train and evaluate each model

Model Accuracy Precision Recall F1-Score

You might also like