0% found this document useful (0 votes)

7 views

ML LAB_MANUAL

The document outlines a lab course in machine learning, focusing on various techniques and their implementation using Python. It includes objectives, outcomes, a list of experiments, and detailed instructions for using Python libraries like NumPy, SciPy, Pandas, and Matplotlib for data analysis and visualization. The course covers statistical measures, regression models, and performance analysis of classification algorithms.

Uploaded by

Swathi Reddy

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

7 views

ML LAB_MANUAL

Uploaded by

Swathi Reddy

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 15

Department of Computer Science and Engineering

Course Objective: The objective of this lab is to get an overview of the various machine learning
techniques and can demonstrate them using python.

Course Outcomes:

 Understand modern notions in predictive data analysis

 Select data, model selection, model complexity and identify the trends
 Understand a range of machine learning algorithms along with their strengths and weaknesses
 Build predictive models from data and analyze their performance

List of Experiments:

1. Write a python program to compute Central Tendency Measures: Mean, Median, Mode Measure of
Dispersion: Variance, Standard Deviation

2. Study of Python Basic Libraries such as Statistics, Math, Numpy and Scipy

3. Study of Python Libraries for ML application such as Pandas and Matplotlib

4. Write a Python program to implement Simple Linear Regression

5. Implementation of Multiple Linear Regression for House Price Prediction using sklearn

6. Implementation of Decision tree using sklearn and its parameter tuning

7. Implementation of KNN using sklearn

8. Implementation of Logistic Regression using sklearn

9. Implementation of K-Means Clustering

10. Performance analysis of Classification Algorithms on a specific dataset (Mini Project)

WEEK1:

1. Write a python program to compute Central Tendency Measures: Mean, Median, Mode
Measure of Dispersion: Variance, Standard Deviation

Solution:

import statistics

import math

#This line imports Python's built-in statistics module.The statistics module provides functions to perform
statistical calculations

#This line imports Python’s built-in math module.The math module provides mathematical functions that
go beyond basic arithmetic

l = [1, 3, 8, 15]

print(statistics.mean(l))

#The mean value is the average value.To calculate the mean, find the sum of all values, and divide the sum
by the number of values:

6.75

import statistics

s=[5,6,7,8,9,11]

print(statistics.mean(s))

print(statistics.median(s))

print(statistics.mean([1, 3, 5, 7, 9, 11, 13]))

print(statistics.mean([1, 3, 5, 7, 9, 11]))

print(statistics.mean([-11, 5.5, -3.4, 7.1, -9, 22]))

7.666666666666667

7.5

5
7

1.8666666666666667

# Calculate the median from a sample of data

print(statistics.median([1, 3, 5, 7, 9, 11, 13]))

print(statistics.median([1, 3, 5, 7, 9, 11]))

print(statistics.median([-11, 5.5, -3.4, 7.1, -9, 22]))

6.0

1.05

# Calculate the mode from a sample of data

print(statistics.mode([1, 3, 3, 3, 3,5, 7, 9, 11]))

print(statistics.mode([1, 1, 3, -5, 7, -9, 11]))

print(statistics.mode(['red', 'green', 'blue', 'red']))

red

print(statistics.variance([1, 3, 5, 7, 9, 11]))

print(statistics.variance([2, 2.5, 1.25, 3.1, 1.75, 2.8]))

print(statistics.variance([-11, 5.5, -3.4, 7.1]))

print(statistics.variance([1, 30, 50, 100]))

0.47966666666666663

70.80333333333333
1736.9166666666667

import statistics

def compute_statistics(data):

mean = statistics.mean(data)

median = statistics.median(data)

mode = statistics.mode(data)

variance = statistics.variance(data)

std_dev = statistics.stdev(data)

print(f"Mean: {mean}")

print(f"Median: {median}")

print(f"Mode: {mode}")

print(f"Variance: {variance}")

print(f"Standard Deviation: {std_dev}")

if __name__ == "__main__":

data = [1, 2, 2, 3, 4, 5, 5, 5, 6, 7]

compute_statistics(data)

Mean: 4

Median: 4.5

Mode: 5

Variance: 3.7777777777777777

Standard Deviation: 1.9436506316151

WEEK2: Implementation of Python Basic Libraries such as Math, Numpy and Scipy

Theory/Description:

Python Libraries There are a lot of reasons why Python is popular among developers and one of them is
that it has an amazingly large collection of libraries that users can work with. In this Python Library, we
will discuss Python Standard library and different libraries offered by Python Programming Language:
scipy, numpy,etc. We know that a module is a file with some Python code, and a package is a directory for
sub packages and modules. A Python library is a reusable chunk of code that you may want to include in
your programs/ projects. Here, a library loosely describes a collection of core modules. Essentially, then, a
library is a collection of modules. A package is a library that can be installed using a package manager like
numpy. Python Standard Library The Python Standard Library is a collection of script modules accessible
to a Python program to simplify the programming process and removing the need to rewrite commonly
used commands. They can be used by 'calling/importing' them at the beginning of a script. A list of the
Standard Library modules that are most important time sys csv math random pip os statistics tkinter socket
To display a list of all available modules, use the following command in the Python console:
>>>help('modules') 

List of important Python Libraries

Python Libraries for Data Collection

 Beautiful Soup
 Scrapy
 Selenium

Python Libraries for Data Cleaning and Manipulation

 Pandas
 PyOD
 NumPy
 Scipy
 Spacy

Python Libraries for Data Visualization

 Matplotlib
 Seaborn
 Bokeh
 NumPy (Numerical Python): Efficient numerical operations, arrays, and mathematical
computations.

 SciPy (Scientific Python): Built on top of NumPy, providing additional functionalities for
optimization, integration, statistics, and signal processing.

1. NumPy: Numerical Computation & Array Operations

NumPy provides a powerful n-dimensional array object (ndarray) and functions for numerical
computation.

1.1 Installing NumPy

pip install numpy

1.2 Basic NumPy Operations

import numpy as np

# Creating arrays

arr1 = np.array([1, 2, 3, 4, 5])

arr2 = np.array([[1, 2, 3], [4, 5, 6]]) # 2D array

# Display arrays
print("1D Array:", arr1)

print("2D Array:\n", arr2)

# Array Properties

print("Shape:", arr2.shape) # (rows, columns)

print("Size:", arr2.size) # Total elements

print("Data Type:", arr2.dtype)

# Array Operations

print("Sum:", np.sum(arr1))

print("Mean:", np.mean(arr1))

print("Standard Deviation:", np.std(arr1))

# Element-wise Operations

print("Multiplication:", arr1 * 2)

print("Square Root:", np.sqrt(arr1))

# Creating Special Arrays

zeros = np.zeros((3,3)) # 3x3 matrix of zeros

ones = np.ones((2,2)) # 2x2 matrix of ones

identity = np.eye(3) # 3x3 identity matrix

# Random Numbers

rand_array = np.random.rand(3,3) # 3x3 random values

1.3 NumPy in Machine Learning

 Dataset Handling: Used to load, manipulate, and preprocess data.

 Linear Algebra: Matrix operations in deep learning and ML.

 Random Sampling: Initializing weights in neural networks.

2. SciPy: Scientific Computation & Advanced Operations

SciPy extends NumPy by adding modules for statistics, optimization, and signal processing.

2.1 Installing SciPy

pip install scipy

2.2 SciPy Modules & Examples

2.2.1 Optimization (scipy.optimize)

Used for solving mathematical optimization problems.

from scipy.optimize import minimize

# Define function to minimize (e.g., x^2 + 3x + 5)

def func(x):

return x**2 + 3*x + 5

result = minimize(func, x0=0) # Find minimum starting at x=0

print("Optimized Result:", result.x)

2.2.2 Linear Algebra (scipy.linalg)

from scipy.linalg import inv, det

A = np.array([[4, 7], [2, 6]])

print("Determinant:", det(A)) # Compute determinant

print("Inverse Matrix:\n", inv(A)) # Compute inverse

2.2.3 Statistics (scipy.stats)

from scipy import stats

data = [12, 15, 14, 10, 13, 18, 21, 19]

print("Mean:", np.mean(data))

print("Median:", np.median(data))

print("Mode:", stats.mode(data).mode[0])

print("Standard Deviation:", np.std(data))

2.2.4 Signal Processing (scipy.signal)

from scipy.signal import butter, filtfilt

# Low-pass filter

b, a = butter(3, 0.05) # 3rd order, cutoff 0.05

filtered_signal = filtfilt(b, a, np.sin(np.linspace(0, 10, 100)))

Week 3:
Study of Python Libraries for ML application such as Pandas and Matplotlib

1. Introduction to Python for ML

Machine Learning requires efficient data handling, processing, and visualization. Python provides several
libraries that make these tasks easier, among which Pandas (for data manipulation) and Matplotlib (for
visualization) are widely used.

2. Pandas: Data Handling & Manipulation

Pandas is a Python library used for data analysis and manipulation, built on top of NumPy.

2.1 Key Features

 DataFrames & Series: Core data structures for handling tabular and labeled data.

 Data Cleaning & Transformation: Handling missing values, filtering, merging, and reshaping
data.

 Descriptive Statistics: Mean, median, correlation, and other statistical operations.

 Integration: Works well with other ML libraries such as Scikit-learn, TensorFlow, and PyTorch.

2.2 Common Pandas Functions

import pandas as pd

# Creating a DataFrame

data = {'Name': ['Alice', 'Bob', 'Charlie'], 'Age': [25, 30, 35], 'Score': [85, 90, 95]}

df = pd.DataFrame(data)

# Display DataFrame

print(df)

# Basic Operations

print(df.describe()) # Summary statistics

print(df.head(2)) # First two rows

print(df.dtypes) # Data types of columns

# Data Manipulation

df['Age'] = df['Age'] + 1 # Modify values

df_filtered = df[df['Score'] > 85] # Filtering data

df_sorted = df.sort_values(by='Age') # Sorting data

2.3 Use Cases in ML

 Preprocessing: Cleaning, normalizing, and structuring datasets before feeding into ML models.

 Feature Engineering: Creating new features from existing data.

 Exploratory Data Analysis (EDA): Analyzing data distributions, correlations, and outliers.

3. Matplotlib: Data Visualization

Matplotlib is a powerful library for creating static, animated, and interactive visualizations.

3.1 Key Features

 Plotting Types: Line plots, bar charts, histograms, scatter plots, etc.

 Customization: Colors, labels, annotations, and styling.

 Integration: Works well with Pandas, NumPy, and Seaborn.

3.2 Common Matplotlib Functions

import matplotlib.pyplot as plt

# Sample Data

x = [1, 2, 3, 4, 5]

y = [10, 20, 25, 30, 50]

# Line Plot

plt.plot(x, y, marker='o', linestyle='-', color='b', label="Growth")

plt.xlabel("X-axis")

plt.ylabel("Y-axis")

plt.title("Simple Line Plot")

plt.legend()
plt.show()

# Scatter Plot

plt.scatter(x, y, color='r')

plt.title("Scatter Plot")

plt.show()

# Histogram

import numpy as np

data = np.random.randn(1000)

plt.hist(data, bins=30, color='g', alpha=0.7)

plt.title("Histogram")

plt.show()

3.3 Use Cases in ML

 Data Exploration: Understanding data distributions and trends.

 Feature Relationships: Identifying correlations between variables.

 Model Performance Evaluation: Visualizing errors, predictions, and accuracy.

4. Combining Pandas and Matplotlib for ML Applications

import pandas as pd

import matplotlib.pyplot as plt

# Load dataset (e.g., Titanic dataset)

df = pd.read_csv("https://raw.githubusercontent.com/datasciencedojo/datasets/master/titanic.csv")

# Data Preprocessing
df['Age'].fillna(df['Age'].median(), inplace=True)

# Plotting Age Distribution

plt.hist(df['Age'], bins=20, color='blue', alpha=0.7)

plt.xlabel("Age")

plt.ylabel("Count")

plt.title("Age Distribution of Titanic Passengers")

plt.show()

# Scatter Plot: Age vs Fare

plt.scatter(df['Age'], df['Fare'], alpha=0.5, color='red')

plt.xlabel("Age")

plt.ylabel("Fare")

plt.title("Age vs Fare")

plt.show()

Week 4:
Write a Python program to implement Simple Linear
Regression and plot the graph.
Linear Regression: Linear regression is defined as an algorithm that provides a linear relationship between
an independent variable and a dependent variable to predict the outcome of future events. It is a statistical
method used in data science and machine learning for predictive analysis. Linear regression is a supervised
learning algorithm that simulates a mathematical relationship between variables and makes predictions for
continuous or numeric variables such as sales, salary, age, product price, etc.

Maths Ukg
82% (84)
Maths Ukg
6 pages
Software Interface Control Document AHRS 8 DC 4E GEDC 6E
No ratings yet
Software Interface Control Document AHRS 8 DC 4E GEDC 6E
83 pages
ML Lab Manual
No ratings yet
ML Lab Manual
28 pages
Ml record_merged (1)
No ratings yet
Ml record_merged (1)
29 pages
ML LAB(R22) MANUAL (4)
No ratings yet
ML LAB(R22) MANUAL (4)
25 pages
r22-1-9-ml-lab-manual-r22-regulations
No ratings yet
r22-1-9-ml-lab-manual-r22-regulations
24 pages
MACHINE LEARNING LAB WORD 12-1-2025. DOCUMENT
No ratings yet
MACHINE LEARNING LAB WORD 12-1-2025. DOCUMENT
68 pages
AI/ML python modules
No ratings yet
AI/ML python modules
17 pages
ML_LAB_MANUAL
No ratings yet
ML_LAB_MANUAL
12 pages
Final Fds Manual
No ratings yet
Final Fds Manual
77 pages
Fds Lab Record
No ratings yet
Fds Lab Record
84 pages
Lab description file (4)
No ratings yet
Lab description file (4)
11 pages
ML-LAB-MANUAL (1)
No ratings yet
ML-LAB-MANUAL (1)
21 pages
Roadmap
No ratings yet
Roadmap
27 pages
Final Fds Manual Print
No ratings yet
Final Fds Manual Print
55 pages
ML Lab Manual
No ratings yet
ML Lab Manual
38 pages
CS3361 Data Science Lab Manual
No ratings yet
CS3361 Data Science Lab Manual
43 pages
Lab Manual - ML - RIT
No ratings yet
Lab Manual - ML - RIT
54 pages
CS3361-DATA SCIENCE LAB MANUAL
No ratings yet
CS3361-DATA SCIENCE LAB MANUAL
44 pages
FDS Lab Meterial CS3361
No ratings yet
FDS Lab Meterial CS3361
30 pages
22-ML Lab Expt 1.docx
No ratings yet
22-ML Lab Expt 1.docx
29 pages
Numpy Lib
No ratings yet
Numpy Lib
19 pages
Numpy
No ratings yet
Numpy
4 pages
FINAL FDS MANUAL print
No ratings yet
FINAL FDS MANUAL print
55 pages
Fds Record
No ratings yet
Fds Record
69 pages
Statistics and Machine Learning in Python
No ratings yet
Statistics and Machine Learning in Python
218 pages
machinelearning_lab manual
No ratings yet
machinelearning_lab manual
26 pages
ML3_Data_Analysis
No ratings yet
ML3_Data_Analysis
80 pages
StatisticsMachineLearningPythonDraft PDF
100% (1)
StatisticsMachineLearningPythonDraft PDF
219 pages
ml file syllabus
No ratings yet
ml file syllabus
43 pages
Statistics Machine Learning Python
No ratings yet
Statistics Machine Learning Python
399 pages
23CS302 - dslab - experiment 1
No ratings yet
23CS302 - dslab - experiment 1
5 pages
Machine Learning Lab Manualuuggiuuhhiuuuuu
No ratings yet
Machine Learning Lab Manualuuggiuuhhiuuuuu
51 pages
CS3362 - Data Science Laboratory - Manual - Final-1
No ratings yet
CS3362 - Data Science Laboratory - Manual - Final-1
76 pages
Cs229 Python Friday
No ratings yet
Cs229 Python Friday
38 pages
Data Preprocessing-AIML Algorithm1
No ratings yet
Data Preprocessing-AIML Algorithm1
47 pages
CS3361 - Data Science Laboratory
No ratings yet
CS3361 - Data Science Laboratory
31 pages
Machine Learning and Pattern Recognition Programming
No ratings yet
Machine Learning and Pattern Recognition Programming
4 pages
l9 Scientific Python Proc
No ratings yet
l9 Scientific Python Proc
30 pages
Machine Learning - Manual
No ratings yet
Machine Learning - Manual
32 pages
fdsa lab manual final
No ratings yet
fdsa lab manual final
70 pages
Programming For Data Science
No ratings yet
Programming For Data Science
48 pages
ML Notesv1
100% (1)
ML Notesv1
300 pages
Statistics Machine Learning Python
100% (1)
Statistics Machine Learning Python
389 pages
ML MANUAL
No ratings yet
ML MANUAL
21 pages
ML With Python Lab (MCA)
No ratings yet
ML With Python Lab (MCA)
36 pages
EXP1-siddhant gupta (23_SE_148)
No ratings yet
EXP1-siddhant gupta (23_SE_148)
17 pages
Practical # 8
No ratings yet
Practical # 8
16 pages
DSF LAB EXP FULL (1) (1)
No ratings yet
DSF LAB EXP FULL (1) (1)
88 pages
Machine Learning - Python Libraries
No ratings yet
Machine Learning - Python Libraries
12 pages
Machine Learning Lab File: Submitted To: Submitted by
No ratings yet
Machine Learning Lab File: Submitted To: Submitted by
9 pages
SMEC ML LAB MANUAL R22
No ratings yet
SMEC ML LAB MANUAL R22
21 pages
StatisticsMachineLearningPythonDraft
No ratings yet
StatisticsMachineLearningPythonDraft
329 pages
Grace Python Numpy MB Final
No ratings yet
Grace Python Numpy MB Final
55 pages
Data Science Lab Manual
No ratings yet
Data Science Lab Manual
18 pages
CO-367 Machine Learning Lab File: Submitted To: Submitted by
No ratings yet
CO-367 Machine Learning Lab File: Submitted To: Submitted by
12 pages
D P Lab Manual
No ratings yet
D P Lab Manual
54 pages
CS3361 - Data Science
No ratings yet
CS3361 - Data Science
56 pages
Numpy in Python
No ratings yet
Numpy in Python
7 pages
lab manual fds
No ratings yet
lab manual fds
44 pages
Python Programming: General-Purpose Libraries; NumPy,Pandas,Matplotlib,Seaborn,Requests,os & sys: Python, #2
From Everand
Python Programming: General-Purpose Libraries; NumPy,Pandas,Matplotlib,Seaborn,Requests,os & sys: Python, #2
e3
No ratings yet
Mastering Data Structures and Algorithms in Python & Java
From Everand
Mastering Data Structures and Algorithms in Python & Java
Sachin Naha
No ratings yet
GIS Data - Asset Management
No ratings yet
GIS Data - Asset Management
78 pages
RFID Reader Specifications
No ratings yet
RFID Reader Specifications
8 pages
20.UPS Rack Mounted 9130
No ratings yet
20.UPS Rack Mounted 9130
4 pages
Hoffmann Impinger Kit Selection Tool B.2023
No ratings yet
Hoffmann Impinger Kit Selection Tool B.2023
4 pages
1.3 Matrices and Matrix Operations
No ratings yet
1.3 Matrices and Matrix Operations
86 pages
All Main Laptop and Notebook Parts Explained
100% (1)
All Main Laptop and Notebook Parts Explained
13 pages
The Excellent Express - Js Handbook
No ratings yet
The Excellent Express - Js Handbook
31 pages
GHDB Reborn Dictionary - NEW ONLY - Exploit-DB - Com - 21sept2011
No ratings yet
GHDB Reborn Dictionary - NEW ONLY - Exploit-DB - Com - 21sept2011
39 pages
System Statistics Are A Little Complex: Neil Chandler's DB Blog
No ratings yet
System Statistics Are A Little Complex: Neil Chandler's DB Blog
16 pages
ASCII - Wikipedia, The Free Encyclopedia
No ratings yet
ASCII - Wikipedia, The Free Encyclopedia
13 pages
Project 11
No ratings yet
Project 11
6 pages
Scheduling in Linux and Windows 2000
No ratings yet
Scheduling in Linux and Windows 2000
34 pages
Computer Software: by Mr. Mohit Dhankhar Hindu Institute of Management
No ratings yet
Computer Software: by Mr. Mohit Dhankhar Hindu Institute of Management
19 pages
Part No. Description Use For N o QT Y: Data Barang New Bad
No ratings yet
Part No. Description Use For N o QT Y: Data Barang New Bad
3 pages
3409 Reading
No ratings yet
3409 Reading
11 pages
OOP Database Connectivity
No ratings yet
OOP Database Connectivity
4 pages
The_Maya_Cache_A_Storage-efficient_and_Secure_Fully-associative_Last-level_Cache
No ratings yet
The_Maya_Cache_A_Storage-efficient_and_Secure_Fully-associative_Last-level_Cache
13 pages
Unit 1
No ratings yet
Unit 1
4 pages
Facetime
No ratings yet
Facetime
13 pages
Hotlinx 21
No ratings yet
Hotlinx 21
16 pages
QWAS Sample Preparation Instructions GB
No ratings yet
QWAS Sample Preparation Instructions GB
14 pages
Design and Anlaysis of Algorithm
No ratings yet
Design and Anlaysis of Algorithm
206 pages
VMW 10Q3 PPT Library VMware Icons Diagrams R7 COMM 2 of 2
No ratings yet
VMW 10Q3 PPT Library VMware Icons Diagrams R7 COMM 2 of 2
42 pages
Vendor List - 2
No ratings yet
Vendor List - 2
4 pages
NEP 2020 Computer Science Syllabus
No ratings yet
NEP 2020 Computer Science Syllabus
15 pages
Smart Electrical Panel - EN
No ratings yet
Smart Electrical Panel - EN
16 pages
Lecture 1 introduction PM (1)
No ratings yet
Lecture 1 introduction PM (1)
21 pages
Blockchain Based Medical Records System and QR
No ratings yet
Blockchain Based Medical Records System and QR
7 pages