0% found this document useful (0 votes)

5 views

Electrical Machine Learning Tool

Machine learning data for Electrical engine

Uploaded by

Martins Richmond

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

5 views

Electrical Machine Learning Tool

Machine learning data for Electrical engine

Uploaded by

Martins Richmond

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 3

In [20]:

# Importing necessary libraries

import pandas as pd # Used for data manipulation and handling
import numpy as np # Useful for numerical operations
from sklearn.model_selection import train_test_split # For splitting the data
from sklearn.linear_model import LinearRegression # The ML model we will use
from sklearn.metrics import mean_squared_error, r2_score # For evaluating the

In [2]:

# Load the dataset

df = pd.read_csv('Electricity_Consumption_Dataset.csv')

In [4]:
df.info()

<class 'pandas.core.frame.DataFrame'>
RangeIndex: 5000 entries, 0 to 4999
Data columns (total 6 columns):
# Column Non-Null Count Dtype
--- ------ -------------- -----
0 Date 5000 non-null object
1 Hour 5000 non-null int64
2 Number_of_Appliances 4700 non-null float64
3 Usage_Duration 4700 non-null float64
4 Peak_Usage 4800 non-null float64
5 Electricity_Consumption 5000 non-null float64
dtypes: float64(4), int64(1), object(1)
memory usage: 234.5+ KB

In [5]:
df.head()

Out [5]:
Date Hour Number_of_Appliances Usage_Duration Peak_Usage Electricity_Consumption
2023-
0 0 5.0 1.118288 0.0 4.935174
01-01
2023-
1 1 4.0 1.737984 0.0 7.495992
01-01
2023-
2 2 4.0 3.350142 0.0 11.460053
01-01
2023-
3 3 5.0 4.893616 0.0 28.588596
01-01
2023-
4 4 5.0 1.030203 0.0 4.929359
01-01

In [6]:
df.describe()

Out [6]:
Hour Number_of_Appliances Usage_Duration Peak_Usage Electricity_Consumption
count 5000.000000 4700.000000 4700.000000 4800.000000 5000.000000
mean 11.487200 5.020426 2.750078 0.125000 15.082294
std 6.925332 2.220455 1.295222 0.330753 11.412605
min 0.000000 0.000000 0.500025 0.000000 0.000000
25% 5.000000 3.000000 1.613859 0.000000 6.768928
50% 11.000000 5.000000 2.751787 0.000000 12.560582
75% 17.000000 6.000000 3.854879 0.000000 20.362971
max 23.000000 15.000000 4.999052 1.000000 121.078379
In [7]:

# Data Preprocessing
# -------------------
# Convert 'Date' to datetime type for any time series analysis necessity
df['Date'] = pd.to_datetime(df['Date'])

In [11]:

# Handling missing values by filling them with the median of the column
for column in ['Number_of_Appliances', 'Usage_Duration', 'Peak_Usage']:
if df[column].isnull().any():
df[column].fillna(df[column].median(), inplace=True)

df.head()

Out [11]:
Date Hour Number_of_Appliances Usage_Duration Peak_Usage Electricity_Consumption
2023-
0 0 5.0 1.118288 0.0 4.935174
01-01
2023-
1 1 4.0 1.737984 0.0 7.495992
01-01
2023-
2 2 4.0 3.350142 0.0 11.460053
01-01
2023-
3 3 5.0 4.893616 0.0 28.588596
01-01
2023-
4 4 5.0 1.030203 0.0 4.929359
01-01

In [12]:

# Feature Engineering (if needed)

# -------------------------------
# For example, creating new features that might help improve model performance
# Here, we can think of extracting day of the week or month from the date if r
df['DayOfWeek'] = df['Date'].dt.dayofweek

In [13]:

# Modeling
# --------
# Define features and target variable
X = df[['Hour', 'Number_of_Appliances', 'Usage_Duration', 'Peak_Usage', 'DayOf
y = df['Electricity_Consumption']

In [14]:

# Split the data into train and test sets

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, rando

In [15]:

# Initialize the Linear Regression model

model = LinearRegression()

In [16]:

# Train the model

model.fit(X_train, y_train)
Out [16]: ▾ LinearRegression

LinearRegression()

In [17]:

# Predict on the test set

y_pred = model.predict(X_test)

In [21]:

# Evaluation
# ----------
# Calculate the Mean Squared Error and the R^2 score to evaluate the model
mse = mean_squared_error(y_test, y_pred)
r2 = r2_score(y_test, y_pred)

In [22]:

print(f'Mean Squared Error (MSE): {mse}')

print(f'R-squared Score: {r2}')

# The MSE provides a measure of how well the model predictions approximate the
# The R-squared score is a statistical measure of how close the data are to th

Mean Squared Error (MSE): 34.27105705807505

R-squared Score: 0.7447340948875754

Data Cleaning - Cheatsheet
100% (2)
Data Cleaning - Cheatsheet
8 pages
Disaggregation Using Nilmtk PDF
No ratings yet
Disaggregation Using Nilmtk PDF
12 pages
Business Mathematics 2nd Quarter 7th Week Lesson Presentation and Analysis of Business Data
75% (4)
Business Mathematics 2nd Quarter 7th Week Lesson Presentation and Analysis of Business Data
23 pages
Energy Consumption Time Series Forcasting 1681824033
No ratings yet
Energy Consumption Time Series Forcasting 1681824033
14 pages
Group-3 Report
No ratings yet
Group-3 Report
38 pages
Individual Household Electric Power Consumption
No ratings yet
Individual Household Electric Power Consumption
29 pages
Linear Regression and SVR
No ratings yet
Linear Regression and SVR
25 pages
Google Cluster Data Preprocessing - Updated
No ratings yet
Google Cluster Data Preprocessing - Updated
4 pages
Code
No ratings yet
Code
2 pages
data wrangling
No ratings yet
data wrangling
6 pages
Load Prediction With 20 Models
No ratings yet
Load Prediction With 20 Models
19 pages
Project Intern - Jupyter Notebook
No ratings yet
Project Intern - Jupyter Notebook
16 pages
WorkingWithData - Ipynb - Colaboratory
No ratings yet
WorkingWithData - Ipynb - Colaboratory
13 pages
s3950476 TimeSeriesAnalysis Assignment 3
No ratings yet
s3950476 TimeSeriesAnalysis Assignment 3
13 pages
DAP writeups_merged
No ratings yet
DAP writeups_merged
33 pages
Python Scripts For Machine Learning
No ratings yet
Python Scripts For Machine Learning
13 pages
CREATE DATABASE Energy
No ratings yet
CREATE DATABASE Energy
7 pages
Supermarket Sales Analysis Project
No ratings yet
Supermarket Sales Analysis Project
8 pages
Task 2 Exploratory Data Analysis
No ratings yet
Task 2 Exploratory Data Analysis
5 pages
task2-eda-cleaning
No ratings yet
task2-eda-cleaning
33 pages
Co Digit Ooo
No ratings yet
Co Digit Ooo
15 pages
27 Jupyter Notebook
No ratings yet
27 Jupyter Notebook
42 pages
Assignment_2 (1)
No ratings yet
Assignment_2 (1)
9 pages
Important Pandas Operations 1697910759
No ratings yet
Important Pandas Operations 1697910759
6 pages
Pandas Roadmap
No ratings yet
Pandas Roadmap
6 pages
Pandas Data Manipulation Extended CheatSheet 1731972219
No ratings yet
Pandas Data Manipulation Extended CheatSheet 1731972219
9 pages
index
No ratings yet
index
4 pages
Manufacturing Machine Learning Tool Mechanical
No ratings yet
Manufacturing Machine Learning Tool Mechanical
13 pages
Efficient Incremental Smart Grid Data Analytics: David Xi Cheng Wojciech Golab Paul A. S. Ward
No ratings yet
Efficient Incremental Smart Grid Data Analytics: David Xi Cheng Wojciech Golab Paul A. S. Ward
8 pages
Solar Data
No ratings yet
Solar Data
15 pages
Ele_pro
No ratings yet
Ele_pro
1 page
Interactive Data Analysis With Jupyter Cheatsheet 1731972443
No ratings yet
Interactive Data Analysis With Jupyter Cheatsheet 1731972443
10 pages
UCI Machine Learning Repository - Individual Household Electric Power Consumption Data Set
No ratings yet
UCI Machine Learning Repository - Individual Household Electric Power Consumption Data Set
1 page
L6 and 7-Data Preprocessing-coding
No ratings yet
L6 and 7-Data Preprocessing-coding
34 pages
Pandas Module (Part-I)
No ratings yet
Pandas Module (Part-I)
36 pages
Data Exploration Preparation
No ratings yet
Data Exploration Preparation
12 pages
Tutorial - Time Series Analysis With Pandas - Dataquest
No ratings yet
Tutorial - Time Series Analysis With Pandas - Dataquest
32 pages
Part A Assignment 6
No ratings yet
Part A Assignment 6
28 pages
PythonForMachineLearning
No ratings yet
PythonForMachineLearning
66 pages
Kedar Maheshwari
No ratings yet
Kedar Maheshwari
17 pages
Time Series Forecasting
No ratings yet
Time Series Forecasting
7 pages
Pandas-1
No ratings yet
Pandas-1
13 pages
Sunbase Data Assignment
No ratings yet
Sunbase Data Assignment
11 pages
Lab Exercise 2-CS0017
No ratings yet
Lab Exercise 2-CS0017
17 pages
Sample report
No ratings yet
Sample report
17 pages
1Demand
No ratings yet
1Demand
13 pages
Algorithm Current Situation
No ratings yet
Algorithm Current Situation
7 pages
assignment
No ratings yet
assignment
4 pages
Practical No. 09.ipynb - Colab
No ratings yet
Practical No. 09.ipynb - Colab
4 pages
Time Series Visualization From Raw Data To Insights
No ratings yet
Time Series Visualization From Raw Data To Insights
34 pages
Load Dataset: Import As
No ratings yet
Load Dataset: Import As
8 pages
Individual Household Electric Power Consumption Forecasting Using Machine Learning Algorithms
No ratings yet
Individual Household Electric Power Consumption Forecasting Using Machine Learning Algorithms
4 pages
Solar Power Generation Forecasting in Europe a Time Series Analysis
No ratings yet
Solar Power Generation Forecasting in Europe a Time Series Analysis
19 pages
41b Data Wrangling, Grouping and Aggregation
No ratings yet
41b Data Wrangling, Grouping and Aggregation
31 pages
Dataframe in Pandas - Cheatsheet
No ratings yet
Dataframe in Pandas - Cheatsheet
8 pages
DS (Pandas)
No ratings yet
DS (Pandas)
17 pages
exp3 python (1)
No ratings yet
exp3 python (1)
15 pages
Practical File IP Class 12 2024 25 Sharing Removed
No ratings yet
Practical File IP Class 12 2024 25 Sharing Removed
29 pages
Data Mining Models: Techniques and Applications
From Everand
Data Mining Models: Techniques and Applications
Ravi Deshpande
No ratings yet
Data Science Programming In Python
From Everand
Data Science Programming In Python
Anita Raichand
No ratings yet
Profound Python Libraries
From Everand
Profound Python Libraries
Onder Teker
No ratings yet
Ojukwu Chika Project - 0
No ratings yet
Ojukwu Chika Project - 0
101 pages
Evaluation of Electrical Power
No ratings yet
Evaluation of Electrical Power
16 pages
Simulation of Capacitor Bank For Improvement of Voltage Profile at Distribution Center (Review)
No ratings yet
Simulation of Capacitor Bank For Improvement of Voltage Profile at Distribution Center (Review)
5 pages
Optimal Sizing and Placement of Capacitor Banks in Distribution Networks Using A Genetic Algorithm
No ratings yet
Optimal Sizing and Placement of Capacitor Banks in Distribution Networks Using A Genetic Algorithm
18 pages
Electrical Engineering Siwes Report
0% (1)
Electrical Engineering Siwes Report
2 pages
PID987658
No ratings yet
PID987658
4 pages
Asuquo IT Report M
100% (1)
Asuquo IT Report M
42 pages
Industrial Training Report
100% (4)
Industrial Training Report
42 pages
Dedication: Table of Contents
No ratings yet
Dedication: Table of Contents
6 pages
Microsoft Word - WAVES
No ratings yet
Microsoft Word - WAVES
4 pages
An "Ancient" Approximation Technique
No ratings yet
An "Ancient" Approximation Technique
2 pages
International Journal of Pressure Vessels and Piping
No ratings yet
International Journal of Pressure Vessels and Piping
10 pages
Sahu - 2012 - Application of Computational Fluid Dynamics To Advanced Guided Munitions
No ratings yet
Sahu - 2012 - Application of Computational Fluid Dynamics To Advanced Guided Munitions
21 pages
Egd Assignment
No ratings yet
Egd Assignment
7 pages
Isothermal Semi-Batch Reaction Example (See Fogler 4 Ed. Problem 4 - 9)
No ratings yet
Isothermal Semi-Batch Reaction Example (See Fogler 4 Ed. Problem 4 - 9)
4 pages
Flame Radiation Characteristics of Open Hydrocarbon Pool Fires
No ratings yet
Flame Radiation Characteristics of Open Hydrocarbon Pool Fires
7 pages
MCNP5 Manual VOL I
No ratings yet
MCNP5 Manual VOL I
416 pages
MATPOWER Manual PDF
No ratings yet
MATPOWER Manual PDF
140 pages
Dividing by 10, 100 and 1000 Activity Sheet
No ratings yet
Dividing by 10, 100 and 1000 Activity Sheet
4 pages
ELL 100 Introduction To Electrical Engineering: L 8: N T S M P T
No ratings yet
ELL 100 Introduction To Electrical Engineering: L 8: N T S M P T
71 pages
(Baker A.) Representations of Finite Groups (BookFi)
No ratings yet
(Baker A.) Representations of Finite Groups (BookFi)
80 pages
Bridge Design Competition
No ratings yet
Bridge Design Competition
5 pages
Module #02 - Verilog HDL - Lexical Tokens
No ratings yet
Module #02 - Verilog HDL - Lexical Tokens
7 pages
Lab Report No. 7 - Group 6
No ratings yet
Lab Report No. 7 - Group 6
3 pages
MOde Frontier Tutorial
No ratings yet
MOde Frontier Tutorial
35 pages
Power System Simulation - Prof - Jain B. Marshel
No ratings yet
Power System Simulation - Prof - Jain B. Marshel
72 pages
Mechanics II Notes
No ratings yet
Mechanics II Notes
74 pages
F24 10423 Homework 4
No ratings yet
F24 10423 Homework 4
19 pages
ME101-Lecture10 - Friction and Wedge
No ratings yet
ME101-Lecture10 - Friction and Wedge
31 pages
Gravitational Field and Potential
No ratings yet
Gravitational Field and Potential
45 pages
MSI Functions Questions-1
No ratings yet
MSI Functions Questions-1
20 pages
Experiment No. 4: Aim: Measurement of Straightness by Wedge Method
No ratings yet
Experiment No. 4: Aim: Measurement of Straightness by Wedge Method
2 pages
BCS304 Super Important - 22SCHEME
No ratings yet
BCS304 Super Important - 22SCHEME
3 pages
Seme 101
No ratings yet
Seme 101
3 pages
6 Gradient
No ratings yet
6 Gradient
15 pages
Math Demo Dec 2 2019 Print
No ratings yet
Math Demo Dec 2 2019 Print
16 pages
Hasil Data Project Spasial - Summary
No ratings yet
Hasil Data Project Spasial - Summary
8 pages
Snake Robots Full Report
No ratings yet
Snake Robots Full Report
13 pages

Electrical Machine Learning Tool

Uploaded by

Electrical Machine Learning Tool

Uploaded by

In [20]:

# Importing necessary libraries

# Load the dataset

# Feature Engineering (if needed)

# Split the data into train and test sets

# Initialize the Linear Regression model

# Train the model

# Predict on the test set

print(f'Mean Squared Error (MSE): {mse}')

Mean Squared Error (MSE): 34.27105705807505

You might also like