0% found this document useful (0 votes)

78 views

Knime - Project Report

The document describes a project to predict a country's Environmental Performance Index score using regression analysis on various environmental measures. The analysis required importing an Excel dataset, removing unnecessary columns and duplicate rows, normalizing the data, and splitting it into training and test sets. A random forest regression model was trained on the training set and achieved 97.4% accuracy on the test set, demonstrating it can effectively predict a country's EPI score based on environmental indicators. Normalization of the data was found to improve model accuracy. The project aims to help identify factors contributing to high environmental performance.

Uploaded by

Ansh Rohatgi

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

78 views

Knime - Project Report

Uploaded by

Ansh Rohatgi

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 11

Business Intelligence and Data Visualization Lab

Manual
CSL 232

Knime Project Report

Faculty name: Dr. Poonam Chaudhary

Student name: Harshita Bhatia and Ankit Jhangu

Roll No.: 20csu305 & 20csu365

Semester: 5th

Group: DS B

Department of Computer Science and Engineering

The NorthCap University, Gurugram- 122001, India
Session 2022-23
BIDV Lab Manual (CSL 232) | 1
2022-23

Table of Contents
S.No Page No.

1. Project Description 2

2. Problem Statement
3
3. Analysis

3.1 Hardware Requirements

3.2 Software Requirements 3

4. Design 3

5. Implementation and Testing (stage/module wise) 4

6. Output (Screenshots) 5

7. Conclusion and Future Scope 10

BIDV Lab Manual (CSL 232) | 2
2022-23
1. Project Description
The Environmental Performance Index is a global rating system that ranks nations based on their
environmental health. It provides a data-driven evaluation of the global level of sustainability.

The Environmental Performance Index (EPI) ranks 180 countries on 32 performance indicators
in the following 11 issue categories: air quality, sanitation and drinking water, heavy metals,
waste management, biodiversity and habitat, ecosystem services, fisheries, climate change,
pollution emissions, agriculture, and water resources. These categories track performance and
progress on two broad policy objectives, environmental health, and ecosystem vitality.

EPI measures help to identify issues, define goals, follow trends, understand outcomes, and
identify effective policy methods.

The Environmental Performance Index (EPI) statistics show that financial resources, excellent
governance, human development, and regulatory quality all play a role in boosting a country’s
sustainability. EPI helps decision-makers to identify all these factors that contribute to top-tier
performance.

By emphasizing these connections, the EPI contributes to the promotion of sustainable

development in support of a more ecologically secure and equitable future.

About Dataset

The dataset contains 181 rows and 1352 columns, some of which are country
name, code, region, eu27, g20, environmental performance index, air quality,
environmental health, household solid fuels, ozone exposure, sanitation and
drinking water, ecosystem vitality, biodiversity and habitat, ecosystem services and
many other.
BIDV Lab Manual (CSL 232) | 3
2022-23
2. Problem Statement:

Predicting the Environmental Performance Index score from the given measures
using regression analysis.

3. Analysis

3.1. Hardware Requirements

A 64-bit operating system with at least 32GB RAM and 8 CPU cores as minimum

3.2. Software Requirements

Knime analytics platform

4. Design
The following steps were taken to get the best model accuracy:

 Importing excel dataset

 Removing unnecessary columns
 Removing duplicate rows
 Normalizing the dataset
 Splitting data into train and test data
 Using model learner
 Model prediction
 Checking model accuracy
BIDV Lab Manual (CSL 232) | 4
2022-23
5. Implementation and Testing (stage/module wise)

a) Excel Reader
Reading the excel file using this node.

b) Column Filter
Removing unnecessary columns

c) Normalizer
Normalizing the data using min-max normalization

d) Partitioning
Dividing the dataset into two parts: 80% of training data and 20% of test data

e) Random Forest Learner (Regression)

Applying random forest technique on the training dataset to train the model. The EPI
score is taken as the target variable.

f) Random Forest Predictor (Regression)

Applying model to the test data.

g) Numeric Scorer
Finding the accuracy of the model
BIDV Lab Manual (CSL 232) | 5
2022-23
6. Output (Screenshots)
File Table

Filtered table
BIDV Lab Manual (CSL 232) | 6
2022-23

Normalized table

Partitioning

- train data
BIDV Lab Manual (CSL 232) | 7
2022-23

-test data

Simple Regression Tree learner

BIDV Lab Manual (CSL 232) | 8
2022-23

Statistics:

Random Forest Learner

BIDV Lab Manual (CSL 232) | 9
2022-23

Statistics:
BIDV Lab Manual (CSL 232) | 10
2022-23

7. Conclusion

Firstly, we applied both the techniques (Random Forest and simple regression tree
learner) on our dataset without normalization. The accuracy was:
Random Forest: 94.8%
Simple Regression tree learner: 91.4%

After normalization, the accuracy changed to:

Random Forest: 97.4%
Simple Regression tree learner: 95%

Therefore, we need to normalize the data. We can clearly see from the above accuracy
scores that Random Forest is better.

Environmental Science Student Edition PDF
95% (21)
Environmental Science Student Edition PDF
683 pages
25 Energy Transfer in Living Organisms-Rennel Burgos
43% (37)
25 Energy Transfer in Living Organisms-Rennel Burgos
6 pages
Nutrient Cycles POGIL ANSWER KEY Yqaw69 1
69% (13)
Nutrient Cycles POGIL ANSWER KEY Yqaw69 1
7 pages
12 Ocean Tides Explore Learning Gizmo
57% (30)
12 Ocean Tides Explore Learning Gizmo
3 pages
Richter Et Al 2024 CRB Water Budget
100% (4)
Richter Et Al 2024 CRB Water Budget
12 pages
Plate Tectonics Gizmo Form PDF
85% (13)
Plate Tectonics Gizmo Form PDF
5 pages
Student Exploration: Greenhouse Effect
70% (10)
Student Exploration: Greenhouse Effect
3 pages
Useful Phrases Describing Weather
87% (238)
Useful Phrases Describing Weather
2 pages
Black Book of Crime
75% (12)
Black Book of Crime
39 pages
Apache Cassandra Administrator Associate - Exam Practice Tests
From Everand
Apache Cassandra Administrator Associate - Exam Practice Tests
Cristian Scutaru
No ratings yet
ISTQB Advanced Level Technical Test Analyst- Exam Insights: Q&A with Explanations
From Everand
ISTQB Advanced Level Technical Test Analyst- Exam Insights: Q&A with Explanations
SUJAN
No ratings yet
Thrive - Long-Term Wilderness Survival Guide Skills, Tips, and Gear For Living On The Land
100% (2)
Thrive - Long-Term Wilderness Survival Guide Skills, Tips, and Gear For Living On The Land
136 pages
The Prepper's Survival Bible - T - Richard Man
100% (1)
The Prepper's Survival Bible - T - Richard Man
128 pages
Managing the Testing Process: Practical Tools and Techniques for Managing Hardware and Software Testing
From Everand
Managing the Testing Process: Practical Tools and Techniques for Managing Hardware and Software Testing
Rex Black
4/5 (8)
Dust Bowls of Empire
No ratings yet
Dust Bowls of Empire
218 pages
Review of The Adam and Eve Story by Chan Thomas
100% (7)
Review of The Adam and Eve Story by Chan Thomas
17 pages
Free Ebook. Human Extermination For Reptilian Replacement Behind Pandemics and World War Three
82% (11)
Free Ebook. Human Extermination For Reptilian Replacement Behind Pandemics and World War Three
366 pages
Knime Project Report
No ratings yet
Knime Project Report
12 pages
Google Cloud Platform for Data Engineering: From Beginner to Data Engineer using Google Cloud Platform
From Everand
Google Cloud Platform for Data Engineering: From Beginner to Data Engineer using Google Cloud Platform
alasdair gilchrist
5/5 (1)
Data Science and Big Data Analytics: Discovering, Analyzing, Visualizing and Presenting Data
From Everand
Data Science and Big Data Analytics: Discovering, Analyzing, Visualizing and Presenting Data
EMC Education Services
No ratings yet
Rockhound PDF
100% (4)
Rockhound PDF
31 pages
Printable Article Rocks On The Beach
No ratings yet
Printable Article Rocks On The Beach
2 pages
نظام العقارات الالكتروني-2013
60% (5)
نظام العقارات الالكتروني-2013
95 pages
Stat Guid
No ratings yet
Stat Guid
97 pages
Defect Prediction in Software Development & Maintainence
From Everand
Defect Prediction in Software Development & Maintainence
Rudra Kumar
No ratings yet
Microsoft Certified: Power BI Data Analyst Associate PL 300 Practice Tests
From Everand
Microsoft Certified: Power BI Data Analyst Associate PL 300 Practice Tests
CertSquad Professional Trainers
No ratings yet
Data Mining Models: Techniques and Applications
From Everand
Data Mining Models: Techniques and Applications
Ravi Deshpande
No ratings yet
Each Stage of A Data Mining Project
No ratings yet
Each Stage of A Data Mining Project
5 pages
6th Semester Syllabus
No ratings yet
6th Semester Syllabus
20 pages
Syllabus BI Solutions
No ratings yet
Syllabus BI Solutions
7 pages
Exploring AutoCAD Map 3D 2022, 9th Edition
From Everand
Exploring AutoCAD Map 3D 2022, 9th Edition
Prof. Sham Tickoo
No ratings yet
Best Industry Outcomes
From Everand
Best Industry Outcomes
Terry Cooke-Davies
No ratings yet
BUSINESS INTELLIGENCE docs
No ratings yet
BUSINESS INTELLIGENCE docs
12 pages
ANSYS Workbench 2021 R1: A Tutorial Approach, 4th Edition
From Everand
ANSYS Workbench 2021 R1: A Tutorial Approach, 4th Edition
Prof. Sham Tickoo
No ratings yet
SIM - Chapters - DA T2
No ratings yet
SIM - Chapters - DA T2
5 pages
Advanced Backend Code Optimization
From Everand
Advanced Backend Code Optimization
Sid Touati
No ratings yet
Data Science Project - Flow Graph
No ratings yet
Data Science Project - Flow Graph
7 pages
Blended Data Cleaning
No ratings yet
Blended Data Cleaning
9 pages
Ansh20csu169bi DV
No ratings yet
Ansh20csu169bi DV
70 pages
Zhang Haoze 202112 MSC
No ratings yet
Zhang Haoze 202112 MSC
114 pages
MLM Report Customer Churn
No ratings yet
MLM Report Customer Churn
17 pages
DAV practical 2
No ratings yet
DAV practical 2
6 pages
Handbook on Microgrids for Power Quality and Connectivity
From Everand
Handbook on Microgrids for Power Quality and Connectivity
Asian Development Bank
No ratings yet
Introduction to Finite Element Analysis
From Everand
Introduction to Finite Element Analysis
Rahul Basu
No ratings yet
Thinespary Sitharam 841007106016-Supply Chain Management Data Analytic
No ratings yet
Thinespary Sitharam 841007106016-Supply Chain Management Data Analytic
6 pages
Exploratory Data Analysis-1 (EDA-1)
No ratings yet
Exploratory Data Analysis-1 (EDA-1)
38 pages
CS202 Assignment - 4- GIKI
No ratings yet
CS202 Assignment - 4- GIKI
3 pages
Case Study Data Science
No ratings yet
Case Study Data Science
7 pages
Report-4
No ratings yet
Report-4
50 pages
Rapport ML project
No ratings yet
Rapport ML project
26 pages
AutoCAD Electrical 2020 for Electrical Control Designers, 11th Edition
From Everand
AutoCAD Electrical 2020 for Electrical Control Designers, 11th Edition
Prof. Sham Tickoo
No ratings yet
10. CO DAB 2024-25
No ratings yet
10. CO DAB 2024-25
10 pages
4 11 Final Modified Chapter-4
No ratings yet
4 11 Final Modified Chapter-4
32 pages
SAS Data Analytic Development: Dimensions of Software Quality
From Everand
SAS Data Analytic Development: Dimensions of Software Quality
Troy Martin Hughes
No ratings yet
CAPM Exam Insights: Q&A with Explanations
From Everand
CAPM Exam Insights: Q&A with Explanations
SUJAN
No ratings yet
SVM
No ratings yet
SVM
12 pages
PMI-ACP Exam Companion : Q & A with Explanations
From Everand
PMI-ACP Exam Companion : Q & A with Explanations
SUJAN
No ratings yet
Statistics For Data Science - 1
100% (2)
Statistics For Data Science - 1
38 pages
Predictive Modelling Project 2
100% (4)
Predictive Modelling Project 2
32 pages
CAPM SURE SUCCESS: Expert Q&A with Detailed Explanations
From Everand
CAPM SURE SUCCESS: Expert Q&A with Detailed Explanations
SUJAN
No ratings yet
AML PRG Assign I
No ratings yet
AML PRG Assign I
3 pages
Implementing the Stakeholder Based Goal-Question-Metric (Gqm) Measurement Model for Software Projects
From Everand
Implementing the Stakeholder Based Goal-Question-Metric (Gqm) Measurement Model for Software Projects
Dr. Prashanth Harish Southekal
No ratings yet
AL801 Business Intelligence
No ratings yet
AL801 Business Intelligence
11 pages
MachineLearning
No ratings yet
MachineLearning
7 pages
Decision Support
No ratings yet
Decision Support
21 pages
2025 - Course Kit & Lesson Plan - Business Analytics for Decision Making(1)
No ratings yet
2025 - Course Kit & Lesson Plan - Business Analytics for Decision Making(1)
184 pages
L3 Overview of ML Model Development Lifecycle-1
No ratings yet
L3 Overview of ML Model Development Lifecycle-1
30 pages
Surabhi Charu Project
No ratings yet
Surabhi Charu Project
16 pages
Draft Xai
No ratings yet
Draft Xai
16 pages
General ML Notes
No ratings yet
General ML Notes
30 pages
vamshi ml-1,2
No ratings yet
vamshi ml-1,2
25 pages
DATA MINING and MACHINE LEARNING. CLASSIFICATION PREDICTIVE TECHNIQUES: NAIVE BAYES, NEAREST NEIGHBORS and NEURAL NETWORKS: Examples with MATLAB
From Everand
DATA MINING and MACHINE LEARNING. CLASSIFICATION PREDICTIVE TECHNIQUES: NAIVE BAYES, NEAREST NEIGHBORS and NEURAL NETWORKS: Examples with MATLAB
César Pérez López
No ratings yet
ANSYS Workbench 2023 R2: A Tutorial Approach, 6th Edition
From Everand
ANSYS Workbench 2023 R2: A Tutorial Approach, 6th Edition
Prof. Sham Tickoo
No ratings yet
EDA and Cleaning
No ratings yet
EDA and Cleaning
24 pages
Segmentation and Decision - Tree in Sas
No ratings yet
Segmentation and Decision - Tree in Sas
2 pages
AIML Record Batch 9
No ratings yet
AIML Record Batch 9
88 pages
Data Preprocessing Before Classification: Presented by
No ratings yet
Data Preprocessing Before Classification: Presented by
23 pages
Off The Grid Survival
No ratings yet
Off The Grid Survival
22 pages
Ten Laws of Boundaries
No ratings yet
Ten Laws of Boundaries
1 page
Inspection Toyotascion2005
No ratings yet
Inspection Toyotascion2005
3 pages
Holt Student Edition
100% (3)
Holt Student Edition
976 pages
Rock and Minerals of Pennsylvania
100% (1)
Rock and Minerals of Pennsylvania
37 pages
Hatchet Anticipation Guide
0% (1)
Hatchet Anticipation Guide
2 pages
Guide To Rocks and Minerals of Florida
No ratings yet
Guide To Rocks and Minerals of Florida
71 pages
Weather Modification Programs 1978 Document
No ratings yet
Weather Modification Programs 1978 Document
784 pages
PDF The Shamar Prophet 1st Edition John Eckhardt download
100% (2)
PDF The Shamar Prophet 1st Edition John Eckhardt download
37 pages
Biology Book
100% (8)
Biology Book
1,160 pages
How To Use Fibonacci
No ratings yet
How To Use Fibonacci
6 pages
Agate
100% (1)
Agate
6 pages
A Description of Some Oregon Rocks and Minerals
No ratings yet
A Description of Some Oregon Rocks and Minerals
52 pages
Instant ebooks textbook Treating Complex Trauma in Children and Their Families: An Integrative Approach – Ebook PDF Version download all chapters
100% (5)
Instant ebooks textbook Treating Complex Trauma in Children and Their Families: An Integrative Approach – Ebook PDF Version download all chapters
61 pages
Hydrometer Paper
No ratings yet
Hydrometer Paper
7 pages
Sylvania Homework Matrix
100% (1)
Sylvania Homework Matrix
7 pages
XC350 Final Exam Abdullah Khalid
No ratings yet
XC350 Final Exam Abdullah Khalid
5 pages
Hooked Book Nir Eyal Summary PDF
No ratings yet
Hooked Book Nir Eyal Summary PDF
19 pages
IRT 1300 Drive Manual
No ratings yet
IRT 1300 Drive Manual
54 pages
RAK3172 Datasheet
No ratings yet
RAK3172 Datasheet
8 pages
Literature Review On Industrial Automation
100% (2)
Literature Review On Industrial Automation
4 pages
Chapter1 3
No ratings yet
Chapter1 3
19 pages
29663
No ratings yet
29663
13 pages
Network Attacks and Defenses A Hands on Approach 1st Edition Zouheir Trabelsi (Author) - Download the full ebook now to never miss any detail
100% (1)
Network Attacks and Defenses A Hands on Approach 1st Edition Zouheir Trabelsi (Author) - Download the full ebook now to never miss any detail
57 pages
FortiManager Datasheet
No ratings yet
FortiManager Datasheet
8 pages
RADAR Case Study1
No ratings yet
RADAR Case Study1
2 pages
1 Introduction Python Programming For Data Science
No ratings yet
1 Introduction Python Programming For Data Science
11 pages
An Duong Vuong High School - Mock Test K1006 Full Name - 1. 2. 3. 4. 5. 6. 7. 8. 9. 10. 11
No ratings yet
An Duong Vuong High School - Mock Test K1006 Full Name - 1. 2. 3. 4. 5. 6. 7. 8. 9. 10. 11
5 pages
Java 1 Assignment
No ratings yet
Java 1 Assignment
12 pages
Basic Excel Terminology: Absolute Cell Reference Active Cell
No ratings yet
Basic Excel Terminology: Absolute Cell Reference Active Cell
4 pages
Lesson 8 - Online Creation Tools, Platforms and Applications For Ict Development
No ratings yet
Lesson 8 - Online Creation Tools, Platforms and Applications For Ict Development
18 pages
APOS Development Best Pratices WL
No ratings yet
APOS Development Best Pratices WL
28 pages
Comparative Research of AR and VR Technology Based On User Experience
No ratings yet
Comparative Research of AR and VR Technology Based On User Experience
9 pages
JGEC AICTE ATAL Online FDP
No ratings yet
JGEC AICTE ATAL Online FDP
2 pages
Susarla Et Al 2023 The Janus Effect of Generative Ai Charting The Path For Responsible Conduct of Scholarly Activities
No ratings yet
Susarla Et Al 2023 The Janus Effect of Generative Ai Charting The Path For Responsible Conduct of Scholarly Activities
11 pages
Control Statements in Python
No ratings yet
Control Statements in Python
8 pages
5G QoS Parameters
No ratings yet
5G QoS Parameters
2 pages
An Efficient Incremental Clustering Algorithm
No ratings yet
An Efficient Incremental Clustering Algorithm
3 pages
FI Archiving Objects and Archiving Conditions
No ratings yet
FI Archiving Objects and Archiving Conditions
4 pages
Rubiks Solution
No ratings yet
Rubiks Solution
31 pages
Mackie HUI Service Manual
No ratings yet
Mackie HUI Service Manual
45 pages
Ds-Module 5 Lecture Notes
No ratings yet
Ds-Module 5 Lecture Notes
12 pages
ELET442 - Artificial Neural Networks (ANNs)
No ratings yet
ELET442 - Artificial Neural Networks (ANNs)
56 pages

Knime - Project Report

Uploaded by

Knime - Project Report

Uploaded by

Business Intelligence and Data Visualization Lab

Knime Project Report

Faculty name: Dr. Poonam Chaudhary

Student name: Harshita Bhatia and Ankit Jhangu

Roll No.: 20csu305 & 20csu365

Department of Computer Science and Engineering

3.1 Hardware Requirements

3.2 Software Requirements 3

5. Implementation and Testing (stage/module wise) 4

7. Conclusion and Future Scope 10

By emphasizing these connections, the EPI contributes to the promotion of sustainable

3.1. Hardware Requirements

3.2. Software Requirements

 Importing excel dataset

e) Random Forest Learner (Regression)

f) Random Forest Predictor (Regression)

Simple Regression Tree learner

Random Forest Learner

After normalization, the accuracy changed to:

You might also like