A Machine Learning Proposal

This document is a thesis submitted by Olatayo Raymond Oluwafemi to the Department of Computer Science at the University of Abuja in June 2021. It examines using a machine learning approach for information security. Specifically, it investigates using the UNSW-NB 15 dataset and algorithms like Naive Bayes, C4.5 Decision Tree, and K-Nearest Neighbors (KNN) to classify network connections as normal or an attack for protecting information systems. The study analyzes existing security systems and the proposed machine learning models. It then details implementing the models and testing them on a dataset to classify connections and conclude on using machine learning for information security.

A MACHINE LEARNING APPROACH TO INFORMATION SECURITY

OLATAYO, Raymond Oluwafemi

REG NO: 16283260

SUBMITTED TO
MRS. M.M USMAN

DEPARTMENT OF COMPUTER SCIENCE

UNIVERSITY OF ABUJA

JUNE, 2021

CHAPTER ONE: INTRODUCTION

1.1 Background of the Study

Information security, or infosec for short, is the practice of shielding information against
attackers and hackers. It is the preservation of the confidentiality, integrity and availability
of information; other properties, such as authenticity, accountability, non-repudiation and
reliability of information, can also be involved (ISO/IEC 27000:2009). Infosec has also been
defined as the protection of information and information systems from unauthorized access, use,
disclosure, disruption, modification, or destruction in order to provide confidentiality,
integrity, and availability (CNSS, 2010).
1.2 Statement of Problem
Cyber-attacks on information systems are increasing. Advanced security measures are needed to
reduce or prevent them. Information security attacks and threats take various forms; some of the
most common today are software attacks, theft of intellectual property, theft of identity, theft
of equipment or information, sabotage, and information extortion.
1.3 Aim and Objectives of the Study

The aim of this study is to examine a machine learning approach to information security.
Specifically, it seeks to:
1. Investigate the use of the UNSW-NB15 dataset (University of New South Wales, NB 2015) for
the protection of information systems
2. Find out how Naive Bayes is used for the protection of information systems
3. Examine the use of the C4.5 Decision Tree machine learning algorithm for the protection of
information systems
4. Ascertain how KNN (K-Nearest Neighbour) is used for the protection of information
systems
1.4 Scope and Limitation of Study

The study examines a machine learning approach to information security. Machine learning
approaches are widely used to solve various types of information security problems. The
proposed project covers a machine learning based network intrusion detection system for the
protection of information systems, built on the UNSW-NB15 dataset and the Naive Bayes, KNN and
C4.5 Decision Tree models.

However, in carrying out this research, the researcher expects to face constraints of time and
finance.
1.5 Significance of the Study
The results of this study will help cyber security experts by directing them on how to
safeguard and secure an information system against the activities of hackers and cyber
attackers. The aim of this research work is to keep information systems secure, and to sustain
them in a secure state, throughout their period of use (lifetime).

1.6 Definition of Terms
Algorithm: a process or set of rules to be followed by a computer in solving a problem
Cyber attack: any attempt to expose, alter, disable, destroy, steal or gain information through
unauthorized means
Cyber security: the practice of protecting systems, networks, and programs from digital attacks
Machine learning: the study of computer algorithms that improve automatically through
experience and by the use of data
Information security: sometimes shortened to infosec, the practice of protecting information by
mitigating information risks. It is part of information risk management.
CHAPTER TWO: LITERATURE REVIEW
2.1 Machine Learning
Machine learning (ML) is the study of computer algorithms that improve automatically through
experience and by the use of data. It is seen as a part of artificial intelligence.
2.2 Machine Learning Approaches
Machine learning approaches are traditionally divided into three broad categories, depending on
the nature of the "signal" or "feedback" available to the learning system:

• Supervised learning: The computer is presented with example inputs and their desired
outputs, given by a "teacher", and the goal is to learn a general rule that maps inputs to
outputs.
• Unsupervised learning: No labels are given to the learning algorithm, leaving it on its own to
find structure in its input. Unsupervised learning can be a goal in itself (discovering hidden
patterns in data) or a means towards an end (feature learning).
• Reinforcement learning: A computer program interacts with a dynamic environment in
which it must perform a certain goal (such as driving a vehicle or playing a game against an
opponent). As it navigates its problem space, the program is provided feedback that's
analogous to rewards, which it tries to maximize.

2.3 Information Security

Information security, sometimes shortened to infosec, is the practice of protecting information by
mitigating information risks. It is part of information risk management.
2.4 Information Security Threats
Information security threats come in many different forms. Some of the most common threats
today are software attacks, theft of intellectual property, theft of identity, theft of equipment or
information, sabotage, and information extortion. Most people have experienced software attacks
of some sort.
2.5 Responses to Threats
Possible responses to a security threat or risk are:

• reduce/mitigate – implement safeguards and countermeasures to eliminate vulnerabilities or
block threats
• assign/transfer – place the cost of the threat onto another entity or organization, such as by
purchasing insurance or outsourcing
• accept – evaluate if the cost of the countermeasure outweighs the possible cost of loss due to
the threat
CHAPTER THREE: ANALYSIS AND DESIGN
3.1.0 ANALYSIS OF THE EXISTING SYSTEM
The existing system is a machine learning based network intrusion detection system for the
protection of information systems. It comprises the systems, tools and processes that are
designed and deployed to shield sensitive and confidential data from being compromised or
tampered with.
3.1.1 STRENGTH OF THE EXISTING SYSTEM
The strength of this system is that it safeguards and secures an information system against the
activities of hackers and cyber attackers.
3.1.2 WEAKNESSES OF THE EXISTING SYSTEM
The weakness of the existing system is that InfoSec was traditionally considered an IT
problem, which could not be further from the truth. Attacks can originate from any weak link in
the company, regardless of hierarchy or department, so it is imperative that the entire
enterprise is protected by seamless security programmes.
3.2 ANALYSIS OF THE PROPOSED SYSTEM
The UNSW-NB15 dataset has two attributes that can serve as the class label: label and
attack_cat. The label attribute is binary, with a value of 0 for a normal connection and a
value of 1 for an attack connection. The attack_cat attribute has 10 values: one for each of the
nine attack categories and one for normal connections.
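
The relationship between the two class attributes can be illustrated with a short Python sketch. The category names are the nine attack categories documented for UNSW-NB15; the function name binary_label and the sample records are hypothetical. The sketch assumes that normal records carry either the string "Normal" or an empty attack_cat field, and that category spellings match the set below exactly, which may not hold for every distribution of the dataset files.

```python
# Nine attack categories in UNSW-NB15; with the normal class this gives 10 values.
ATTACK_CATEGORIES = {
    "Fuzzers", "Analysis", "Backdoors", "DoS", "Exploits",
    "Generic", "Reconnaissance", "Shellcode", "Worms",
}


def binary_label(attack_cat: str) -> int:
    """Return 0 for a normal connection and 1 for any attack category."""
    name = attack_cat.strip()
    # Assumption: normal traffic is marked "Normal" or left blank.
    if name == "" or name.lower() == "normal":
        return 0
    if name not in ATTACK_CATEGORIES:
        raise ValueError(f"unknown attack_cat value: {attack_cat!r}")
    return 1


# Hypothetical records, reduced to the attack_cat attribute.
records = [
    {"attack_cat": "Normal"},
    {"attack_cat": "DoS"},
    {"attack_cat": "Worms"},
]
labels = [binary_label(r["attack_cat"]) for r in records]
print(labels)
```

A binary classifier would be trained against the derived 0/1 value, while a multi-class model would use attack_cat directly.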
3.3 METHODOLOGY
This section presents machine learning based information security intrusion detection models.
The work comprises several processing steps: exploring the security dataset, preparing raw data,
determining feature importance and ranking, and building the resultant models.
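
The feature ranking step can be sketched as follows. The text does not name a ranking method, so this example assumes one possible choice: ranking numeric features by the absolute Pearson correlation between each feature and the binary label. The helper names pearson and rank_features are hypothetical, and the values for the sttl and dur columns are invented for illustration.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0  # a constant column carries no ranking information
    return cov / math.sqrt(var_x * var_y)


def rank_features(columns, labels):
    """Sort feature names by |correlation with the label|, strongest first."""
    scores = {name: abs(pearson(values, labels)) for name, values in columns.items()}
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


# Invented values for two numeric features and a binary label.
columns = {
    "dur": [0.2, 0.9, 0.3, 0.8],
    "sttl": [31, 31, 254, 254],
}
labels = [0, 0, 1, 1]

for name, score in rank_features(columns, labels):
    print(f"{name}: {score:.3f}")
```

Correlation only captures linear relationships with the label; an entropy-based score would be an alternative for nominal attributes.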
3.4 SYSTEM DESIGN
System design is a solution to a problem; it demands the translation of the requirements
uncovered in analysis into possible ways of meeting them (E.O Nwachukwu).
3.4.1 INPUT AND OUTPUT SPECIFICATION
The inputs and outputs to a machine learning task may be of different kinds. Generally, they are
in the form of numeric (both discrete and real-valued) or nominal attributes. Numeric attributes
may have continuous numeric values, whereas nominal attributes take values from a pre-defined
set.
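
As a minimal sketch of how the two attribute kinds might be prepared for a model, the Python example below scales a numeric attribute to the range [0, 1] and one-hot encodes a nominal attribute. Both techniques are assumptions chosen for illustration, not steps stated in this proposal; the function names, the PROTOCOLS set and the sample values are hypothetical.

```python
def min_max_scale(column):
    """Scale numeric values linearly into [0, 1]."""
    lo, hi = min(column), max(column)
    if hi == lo:
        return [0.0 for _ in column]  # constant column: avoid division by zero
    return [(x - lo) / (hi - lo) for x in column]


def one_hot(value, categories):
    """Encode a nominal value as a 0/1 vector over a pre-defined set."""
    return [1.0 if value == c else 0.0 for c in categories]


# Hypothetical pre-defined set for a nominal attribute.
PROTOCOLS = ["tcp", "udp", "icmp"]

# Invented sample values: one numeric and one nominal attribute.
durations = [0.5, 2.0, 3.5]
protos = ["tcp", "udp", "tcp"]

scaled = min_max_scale(durations)
vectors = [[d] + one_hot(p, PROTOCOLS) for d, p in zip(scaled, protos)]
for v in vectors:
    print(v)
```

Distance-based models such as KNN are sensitive to attribute scale, which is one reason for normalising numeric inputs before training.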

CHAPTER FOUR: SYSTEM IMPLEMENTATION AND TESTING
4.1 Implementation
4.1.1 Naïve Bayes (NB).
Naive Bayes algorithms are probabilistic classifiers that make the a priori assumption that the
features of the input dataset are independent of each other given the class. They are scalable
and do not require huge training datasets to produce appreciable results.
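
The independence assumption means the score for a class is the class prior multiplied by one per-feature probability for each attribute. The Python sketch below shows this for nominal attributes, with Laplace (add-one) smoothing added as an assumption so that unseen combinations do not produce zero probabilities. The class name CategoricalNaiveBayes and the training rows are hypothetical and are not taken from the UNSW-NB15 data.

```python
import math
from collections import Counter, defaultdict


class CategoricalNaiveBayes:
    """Naive Bayes for nominal attributes with add-one smoothing."""

    def fit(self, rows, labels):
        self.class_counts = Counter(labels)
        self.total = len(labels)
        # value_counts[(feature_index, label)][value] -> number of occurrences
        self.value_counts = defaultdict(Counter)
        self.feature_values = [set() for _ in rows[0]]
        for row, y in zip(rows, labels):
            for i, v in enumerate(row):
                self.value_counts[(i, y)][v] += 1
                self.feature_values[i].add(v)
        return self

    def log_posterior(self, row, y):
        """Unnormalised log P(y | row) under the independence assumption."""
        score = math.log(self.class_counts[y] / self.total)
        for i, v in enumerate(row):
            count = self.value_counts[(i, y)][v]
            k = len(self.feature_values[i])  # number of distinct values seen
            score += math.log((count + 1) / (self.class_counts[y] + k))
        return score

    def predict(self, row):
        return max(self.class_counts, key=lambda y: self.log_posterior(row, y))


# Invented training rows: (protocol, service) with a class label.
rows = [("tcp", "http"), ("tcp", "http"), ("udp", "dns"), ("udp", "dns"), ("tcp", "dns")]
labels = ["normal", "normal", "attack", "attack", "normal"]

model = CategoricalNaiveBayes().fit(rows, labels)
print(model.predict(("udp", "dns")))
print(model.predict(("tcp", "http")))
```

Working in log space avoids numerical underflow when many per-feature probabilities are multiplied together.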
4.1.2 K-Nearest Neighbour (KNN).
KNN is used for classification and can handle multi-class problems. It has no explicit training
phase beyond storing the training samples, but its test phase is computationally demanding
because, to classify each test sample, it compares the sample against all the training samples.
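
The comparison against every training sample can be seen in a minimal Python sketch of the classifier. It assumes numeric, already scaled features, Euclidean distance and a simple majority vote; the function name knn_predict and the sample points are hypothetical.

```python
import math
from collections import Counter


def knn_predict(train_x, train_y, query, k=3):
    """Classify `query` by majority vote among its k nearest training samples."""
    # Every prediction measures the distance to all training samples,
    # which is why the test phase is the expensive part of KNN.
    distances = sorted(
        ((math.dist(x, query), y) for x, y in zip(train_x, train_y)),
        key=lambda pair: pair[0],
    )
    votes = Counter(y for _, y in distances[:k])
    return votes.most_common(1)[0][0]


# Invented two-feature samples, already scaled to [0, 1].
train_x = [(0.10, 0.20), (0.20, 0.10), (0.15, 0.15), (0.90, 0.80), (0.80, 0.90)]
train_y = ["normal", "normal", "normal", "attack", "attack"]

print(knn_predict(train_x, train_y, (0.85, 0.85)))
print(knn_predict(train_x, train_y, (0.10, 0.10)))
```

An odd value of k is a common choice for binary problems because it avoids tied votes.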
4.1.3 C4.5 Decision Tree
In this type of classification, the target concept is represented in the form of a tree.
The tree is built using the principle of recursive partitioning. At each step an attribute is
selected as the partitioning attribute (also referred to as a node) based on some criterion, such
as information gain [Mit97].
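
Recursive partitioning can be sketched in Python as below. C4.5 selects the partitioning attribute by gain ratio, which is information gain divided by the split information of the attribute. This sketch covers nominal attributes only and omits the handling of continuous attributes, missing values and pruning that a full C4.5 implementation includes. The function names and the training rows are hypothetical.

```python
import math
from collections import Counter


def entropy(items):
    total = len(items)
    return -sum((n / total) * math.log2(n / total) for n in Counter(items).values())


def gain_ratio(rows, labels, attr):
    """Information gain of `attr` divided by its split information."""
    values = [row[attr] for row in rows]
    subsets = {}
    for v, y in zip(values, labels):
        subsets.setdefault(v, []).append(y)
    remainder = sum(len(s) / len(labels) * entropy(s) for s in subsets.values())
    gain = entropy(labels) - remainder
    split_info = entropy(values)
    return gain / split_info if split_info > 0 else 0.0


def build_tree(rows, labels, attrs):
    """Return a class label (leaf) or {attribute: {value: subtree}}."""
    majority = Counter(labels).most_common(1)[0][0]
    if len(set(labels)) == 1 or not attrs:
        return majority
    best = max(attrs, key=lambda a: gain_ratio(rows, labels, a))
    if gain_ratio(rows, labels, best) <= 0:
        return majority  # no attribute separates the classes any further
    remaining = [a for a in attrs if a != best]
    branches = {}
    for value in {row[best] for row in rows}:
        idx = [i for i, row in enumerate(rows) if row[best] == value]
        branches[value] = build_tree(
            [rows[i] for i in idx], [labels[i] for i in idx], remaining
        )
    return {best: branches}


def classify(tree, row):
    # Raises KeyError for attribute values never seen during training.
    while isinstance(tree, dict):
        attr = next(iter(tree))
        tree = tree[attr][row[attr]]
    return tree


# Invented training rows with two nominal attributes.
rows = [
    {"proto": "tcp", "state": "FIN"},
    {"proto": "tcp", "state": "FIN"},
    {"proto": "udp", "state": "INT"},
    {"proto": "udp", "state": "CON"},
    {"proto": "tcp", "state": "INT"},
]
labels = ["normal", "normal", "attack", "normal", "attack"]

tree = build_tree(rows, labels, ["proto", "state"])
print(tree)
print(classify(tree, {"proto": "udp", "state": "INT"}))
```

Dividing by split information penalises attributes with many distinct values, which plain information gain tends to favour.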
4.2 Testing
The models were evaluated using the testing dataset.
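
The proposal does not state which evaluation measures are reported. As a sketch under that caveat, the Python example below computes measures commonly used for binary intrusion detection from the label convention 0 = normal and 1 = attack. The function names are hypothetical, and the actual and predicted lists are invented; they are not results from this study.

```python
def confusion_counts(actual, predicted):
    """Return (tp, tn, fp, fn), treating 1 (attack) as the positive class."""
    pairs = list(zip(actual, predicted))
    tp = sum(1 for a, p in pairs if a == 1 and p == 1)
    tn = sum(1 for a, p in pairs if a == 0 and p == 0)
    fp = sum(1 for a, p in pairs if a == 0 and p == 1)
    fn = sum(1 for a, p in pairs if a == 1 and p == 0)
    return tp, tn, fp, fn


def evaluate(actual, predicted):
    tp, tn, fp, fn = confusion_counts(actual, predicted)
    return {
        "accuracy": (tp + tn) / len(actual),
        "precision": tp / (tp + fp) if tp + fp else 0.0,
        "recall": tp / (tp + fn) if tp + fn else 0.0,
        # Share of normal connections wrongly flagged as attacks.
        "false_alarm_rate": fp / (fp + tn) if fp + tn else 0.0,
    }


# Invented labels for eight test connections.
actual = [1, 1, 1, 0, 0, 0, 0, 1]
predicted = [1, 1, 0, 0, 0, 1, 0, 1]

metrics = evaluate(actual, predicted)
for name, value in metrics.items():
    print(f"{name}: {value:.2f}")
```

Accuracy alone can be misleading when attack and normal records are imbalanced, so recall and false alarm rate are usually read alongside it.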
CHAPTER FIVE: SUMMARY, CONCLUSION AND RECOMMENDATION
5.1 Summary
5.2 Conclusion
5.3 Recommendation
5.4 Future work
REFERENCE
APPENDIX
