Cyber-threat

The document presents a synopsis on exploring open source information for cyber threat intelligence, focusing on the use of machine learning techniques to detect and classify cyber threats from Twitter data. It outlines the goals, objectives, and features of the proposed system, which utilizes various algorithms such as SVM, Decision Trees, Naive Bayes, Random Forest, and Artificial Neural Networks for analysis. The project aims to provide meaningful insights into new patterns of cyber-attacks and security threats through the analysis of social media data.

Uploaded by

bhivarkarsanket13

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

16 views4 pages

Cyber-threat

Uploaded by

bhivarkarsanket13

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 4

DEPARTMENT OF COMPUTER ENGINEERING

G.H.Raisoni College Of Engineering & Management, Pune

2021-2022

SYNOPSIS ON

EXPLORING OPEN SOURCE INFORMATION FOR CYBER

THREAT INTELLIGENCE
Submitted for partial fulfilment of the Requirement for the S.Y.M. Tech in

COMPUTER ENGINEERING
Submitted By:

Aishwarya Bhagwat (2020ACRE2101001)

Sign of Guide
Abstract
Cyberspace is one of the most complicated systems ever created by humanity; many people use cyber-
technology resources on a daily basis, yet the bulk of them have little understanding of it. To use of social
media cannot replace the requirement for security experts to conduct in-depth analyses of specific sorts
of attacks, such as detecting anomalies in network traffic, worms, and port scans, among other things.
Analysing social media data, on the other hand, can help discover new patterns of cyber threat and
security threats including data theft, carding, and hijacking. We used machine learning to predict cyber
threat in the proposed system. The best model is created by training a dataset of Twitter cyber-Threat
using the SVM, NB, DT, RF and ANN algorithms. Used best model for predicting cyber threats and
which categories.

Goals and Objectives

 To Detect Cyber Threat using machine learning techniques.
 To classify and Train dataset of cyber threat twitter dataset using Different Machine
Learning algorithm.
 To analysing social media data can provide meaningful insights in detecting new patterns
of cyber-attack and security threats such as data breach, carding, and hijacking.
 In tweets keywords includes username of selected cybersecurity organizations, list of
buzzwords related to cybersecurity terms (‘ciphertext’, ‘cryptography’, ‘hacked’, ‘breach’,
‘sniffer’, ‘firewall’, ‘hijacking’,‘Clickjacking’, ‘Malware’,‘Sphear phising’, ‘virus’, and
‘vulnerability’) from cybersecurity domain experts.

Features of System:
 Preparing the dataset
 Data Pre-processing
 Feature extraction
 Classification using Algorithm

Technologies and Tools

 Python
 Scikit-learn
 Pandas
 SVM
 DT
 NB
 RF
 ANN

Cyber Threat Intelligence project

This is divided into 3 parts:

1. Creating the dataset

2. Training a Different ML on the Twitter Cyber-threat dataset
3. Predicting the Tweets (Display Cyber-Threat Categories)

Dataset:-
In proposed system we have collect dataset of twitter (related to cyber threats) on kaggle website.
In Dataset A list of keywords was selected to filter the tweets retrieved from the stream listener.
These keywords includes username of selected cybersecurity organizations, list of buzzwords
related to cybersecurity terms (‘ciphertext’, ‘cryptography’, ‘hacked’, ‘breach’, ‘sniffer’,
‘firewall’, ‘hijacking’,‘Clickjacking’, ‘Malware’,‘Sphear phising’, ‘virus’, and ‘vulnerability’)
from cybersecurity domain experts.
Algorithm -
SVM (Support Vector Machine) :-
Support Vector Machine (SVM) is a controlled approach for machine learning that is suitable for both
classification and regression difficulties. It is employed largely in classification issues, however. Each
data item is defined in the SVM algorithm n-dimensional space point (where n is a number of features)
each feature value is the value of a specific coordinate. Then we carry out Support Vectors are merely
individual observation coordinates. The SVM is a boundary between both the two classes (hyper planes
/ rows). Categorization by finding the hyper-plane that distinguishes the classes very well.
DT (Decision Tree):
The goal of using a Decision Tree is to create a training model that can use to predict the class or value
of the target variable by learning simple decision rules inferred from prior data (training data). In
Decision Trees, for predicting a class label for a record we start from the root of the tree.

NB (Nave Bayes):
The number of parameters required by Nave Bayes classifiers is linear in the number of variables
(features/predictors) in a learning problem. Maximum-likelihood training can be done in linear time by
evaluating a closed-form expression, rather than the time-consuming iterative approximation required by
many other forms of classifiers.
RF (Random Forest):
Random forest is a supervised learning algorithm which is used for both classification as well as
regression. But however, it is mainly used for classification problems. As we know that a forest is made
up of trees and more trees means more robust forest. Similarly, random forest algorithm creates decision
trees on data samples and then gets the prediction from each of them and finally selects the best solution
by means of voting. It is an ensemble method which is better than a single decision tree because it reduces
the over-fitting by averaging the result.
ANN:
An Artificial Neural Network is an information processing technique. It works like the way human brain
processes information. ANN includes a large number of connected processing units that work together
to process information. They also generate meaningful results from it.
Artificial Neural network is typically organized in layers. Layers are being made up of many
interconnected ‘nodes’ which contain an ‘activation function’. A neural network may contain the
following 3 layers: a. Input layer, b. Hidden layer and c. Output layer.

REFERENCES:
[1] Wang, S. (2010). Crawling Deep Web using a GA-based set covering algorithm.
[2] Zhou, S., Long, Z., Tan, L., & Guo, H. (2018). Automatic identification of indicators of
compromise using neural-based sequence labelling. arXiv preprint arXiv:1810.10156.
[3] Guo, M.,& Wang, J. A. (2009, April). An ontology-based approach to model common
vulnerabilities and exposures in information security. In ASEE Southest Section Conference.
[4] Ninth Annual Cost if Cybercrime Study unlocking The Value of Improved Cybersecurity
Protection .The Cost of Cybercrime Contents.
[5] Ranade, P., Mittal, S., Joshi, A., & Joshi, K. (2018, November). Using deep neural networks
to translate multi-lingual threat intelligence. In 2018 IEEE International Conference on
Intelligence and Security Informatics (ISI) (pp. 238-243). IEEE.
[6] Dong, Y., Guo, W., Chen, Y., Xing, X., Zhang, Y., & Wang, G. (2019). Towards the detection
of inconsistencies in public security vulnerability reports. In 28th USENIX Security Symposium
(USENIX Security 19) (pp. 869-885).
[7] Rodriguez, A., & Okamura, K. (2020). Social Media Data Mining for Proactive Cyber Defense.
Journal of Information Processing, 28, 230- 238.

Arctic and Antarctic PDF
100% (1)
Arctic and Antarctic PDF
50 pages
Machine learning methods for secure internet of things against cyber threats synopsis (1)
No ratings yet
Machine learning methods for secure internet of things against cyber threats synopsis (1)
5 pages
Machine learning methods for secure internet of things against cyber threats synopsis
No ratings yet
Machine learning methods for secure internet of things against cyber threats synopsis
4 pages
GRPPRJCT
No ratings yet
GRPPRJCT
15 pages
Cyber Threat Detection Based On Artificial Neural Networks
No ratings yet
Cyber Threat Detection Based On Artificial Neural Networks
5 pages
Final Project
No ratings yet
Final Project
15 pages
IJHS-9745+1341-1349
No ratings yet
IJHS-9745+1341-1349
9 pages
DDOS Attack Final
No ratings yet
DDOS Attack Final
41 pages
Dr. Mujiono - MachineLearningApplicationsCyberSecurity-Final-MS
No ratings yet
Dr. Mujiono - MachineLearningApplicationsCyberSecurity-Final-MS
28 pages
Supervised Machine Learning Algorithms For Intrusion Detection
No ratings yet
Supervised Machine Learning Algorithms For Intrusion Detection
14 pages
Base Paper Interview
No ratings yet
Base Paper Interview
5 pages
Deep Learning Approach For Intelligent Intrusion Detection System
No ratings yet
Deep Learning Approach For Intelligent Intrusion Detection System
5 pages
Thesis Book
No ratings yet
Thesis Book
21 pages
Lab1
No ratings yet
Lab1
3 pages
КШ - 1.2 англ
No ratings yet
КШ - 1.2 англ
14 pages
Threat Detection Model Based On Machine
No ratings yet
Threat Detection Model Based On Machine
5 pages
mlns notes
No ratings yet
mlns notes
20 pages
AI Based Threat Detection System IEEE Report 1 1 1
No ratings yet
AI Based Threat Detection System IEEE Report 1 1 1
12 pages
IEEE-Ai For Cybersecurity
100% (1)
IEEE-Ai For Cybersecurity
3 pages
Machine Learning and Deep Learning Methods for Cybersecurity Ijariie24911
No ratings yet
Machine Learning and Deep Learning Methods for Cybersecurity Ijariie24911
4 pages
29927
No ratings yet
29927
5 pages
Dynamic File Analysis Using Ensemble of RNN and SVM
No ratings yet
Dynamic File Analysis Using Ensemble of RNN and SVM
24 pages
Cyber Threat Detection Synopsis
No ratings yet
Cyber Threat Detection Synopsis
14 pages
Literature Review
No ratings yet
Literature Review
2 pages
Manjunath_jusstuu
No ratings yet
Manjunath_jusstuu
11 pages
Final Progress
No ratings yet
Final Progress
22 pages
19bit0368 Capstone Final Review
No ratings yet
19bit0368 Capstone Final Review
48 pages
Intrusion Detection Using Self Training Vector
No ratings yet
Intrusion Detection Using Self Training Vector
35 pages
Machine Learning and Deep Learning 2nd Review1
No ratings yet
Machine Learning and Deep Learning 2nd Review1
8 pages
Ke 2021 J. Phys. Conf. Ser. 2113 012074
No ratings yet
Ke 2021 J. Phys. Conf. Ser. 2113 012074
14 pages
Fulltext
No ratings yet
Fulltext
123 pages
Cyber Threat Detection Based On Artificial Neural
No ratings yet
Cyber Threat Detection Based On Artificial Neural
20 pages
Deep_Convolutional_Neural_Networks_for_Intrusion_Detection_in_Automotive_Ethernet_Networks
No ratings yet
Deep_Convolutional_Neural_Networks_for_Intrusion_Detection_in_Automotive_Ethernet_Networks
6 pages
Detailed Explanations For Your Presentation
No ratings yet
Detailed Explanations For Your Presentation
5 pages
Semi Supervised
No ratings yet
Semi Supervised
13 pages
MMAKR
No ratings yet
MMAKR
13 pages
Sat - 100.Pdf - Prediction of Cyber Attacks Using Data Science Technique
No ratings yet
Sat - 100.Pdf - Prediction of Cyber Attacks Using Data Science Technique
11 pages
LSP Wireless network attacks using supervised machine learning techniques
No ratings yet
LSP Wireless network attacks using supervised machine learning techniques
28 pages
Network-Based Intrusion Detection With Support Vector Machines
No ratings yet
Network-Based Intrusion Detection With Support Vector Machines
14 pages
Network Intrusion Detection Using Machine Learning: Project Guide DR K Suresh
No ratings yet
Network Intrusion Detection Using Machine Learning: Project Guide DR K Suresh
40 pages
2312.17270
No ratings yet
2312.17270
28 pages
Evaluation of Cybersecurity Data Set Characteristics For Their Applicability To Neural Networks Algorithms Detecting Cybersecurity Anomalies
No ratings yet
Evaluation of Cybersecurity Data Set Characteristics For Their Applicability To Neural Networks Algorithms Detecting Cybersecurity Anomalies
10 pages
A Review of AI Based Threat Detection Enhancing Network Security With Machine Learning
No ratings yet
A Review of AI Based Threat Detection Enhancing Network Security With Machine Learning
9 pages
Cyber Defense11
No ratings yet
Cyber Defense11
59 pages
Sada
No ratings yet
Sada
11 pages
Detection of Cyber Attack in Network Using Machine Learning Techniques
No ratings yet
Detection of Cyber Attack in Network Using Machine Learning Techniques
73 pages
Machine_Learning_Algorithms_for_DoS_and_DDoS_Cyberattacks_Detection_in_Real-Time_Environment
No ratings yet
Machine_Learning_Algorithms_for_DoS_and_DDoS_Cyberattacks_Detection_in_Real-Time_Environment
2 pages
AI Based Threat Detection System_IEEE Report (1)
No ratings yet
AI Based Threat Detection System_IEEE Report (1)
10 pages
Project Paper Publication
No ratings yet
Project Paper Publication
10 pages
Explainable AI
No ratings yet
Explainable AI
4 pages
Machine Learning Based Intrusion Detection System
No ratings yet
Machine Learning Based Intrusion Detection System
5 pages
Multi Level Deep Learning Model For Network Anomal
No ratings yet
Multi Level Deep Learning Model For Network Anomal
12 pages
APP REPORT
No ratings yet
APP REPORT
19 pages
2407.06014v1
No ratings yet
2407.06014v1
6 pages
KNN 2
No ratings yet
KNN 2
4 pages
Applied Sciences: Fficient Distributed Preprocessing Model For
No ratings yet
Applied Sciences: Fficient Distributed Preprocessing Model For
19 pages
Ai and Machine Learning For Network Security - Applications and Case Studies
No ratings yet
Ai and Machine Learning For Network Security - Applications and Case Studies
13 pages
النسخة بعد الترقيم 6 بعد المراجعة
No ratings yet
النسخة بعد الترقيم 6 بعد المراجعة
89 pages
Journal
No ratings yet
Journal
11 pages
Review On Network Intrusion Detection Techniques Using Machine Learning
No ratings yet
Review On Network Intrusion Detection Techniques Using Machine Learning
6 pages
Top Networking Terms You Should Know
From Everand
Top Networking Terms You Should Know
JOHN SMITH
No ratings yet
What Is Sustainability and Why Is It Important
No ratings yet
What Is Sustainability and Why Is It Important
7 pages
Pipeline Deflection Inspection - Pipe Deflectometers
No ratings yet
Pipeline Deflection Inspection - Pipe Deflectometers
2 pages
P.E-Grade 12 DLL Week 2
100% (1)
P.E-Grade 12 DLL Week 2
4 pages
Zone 5 Packages
No ratings yet
Zone 5 Packages
5 pages
University of Professional Studies, Accra Bachelor of Business Administration
No ratings yet
University of Professional Studies, Accra Bachelor of Business Administration
14 pages
Bài Thuyết Trình NLBĐS
No ratings yet
Bài Thuyết Trình NLBĐS
21 pages
Post-Mining Land Use For The Function of Geotouris
No ratings yet
Post-Mining Land Use For The Function of Geotouris
7 pages
Week 2 Pharmacology
No ratings yet
Week 2 Pharmacology
3 pages
Planck Units: Natural Units and The Key Equations in Physics
No ratings yet
Planck Units: Natural Units and The Key Equations in Physics
38 pages
Digital Electronics and Computer Organization
No ratings yet
Digital Electronics and Computer Organization
3 pages
3 Day In-Service Training
No ratings yet
3 Day In-Service Training
2 pages
PARCC Style Test
No ratings yet
PARCC Style Test
8 pages
قوانین گازها
No ratings yet
قوانین گازها
2 pages
Piping Material External Surface
No ratings yet
Piping Material External Surface
1 page
IV Day6
No ratings yet
IV Day6
4 pages
Quillium Group Application
No ratings yet
Quillium Group Application
3 pages
Sirona Teneo Dental Unit - Maintenance Manual
No ratings yet
Sirona Teneo Dental Unit - Maintenance Manual
34 pages
1.2 - Word Formation
No ratings yet
1.2 - Word Formation
40 pages
Aee Electronics 1
No ratings yet
Aee Electronics 1
2 pages
Applied Improvisation Leading Collaborating And Creating Beyond The Theatre Theresa Robbins Dudeck Caitlin Mcclure download
No ratings yet
Applied Improvisation Leading Collaborating And Creating Beyond The Theatre Theresa Robbins Dudeck Caitlin Mcclure download
83 pages
Mtp-1368e User 0210
No ratings yet
Mtp-1368e User 0210
5 pages
Get (eBook PDF) Introduction to Forensic Psychology: Research and Application 5th Edition free all chapters
100% (14)
Get (eBook PDF) Introduction to Forensic Psychology: Research and Application 5th Edition free all chapters
45 pages
Practice Quiz Answers
No ratings yet
Practice Quiz Answers
16 pages
The Mayan Civilization: The Rise of The Maya
No ratings yet
The Mayan Civilization: The Rise of The Maya
4 pages
Cassidys Forward Planning Document
No ratings yet
Cassidys Forward Planning Document
11 pages
Brosur (Cummins)
No ratings yet
Brosur (Cummins)
2 pages
3b. Lecture slides - Asset Pricing Models
No ratings yet
3b. Lecture slides - Asset Pricing Models
35 pages
Restorative With Answers (9.2023) Students' Version
0% (1)
Restorative With Answers (9.2023) Students' Version
48 pages
Class 6 Full Year 6th Grade Review: Answer The Questions
No ratings yet
Class 6 Full Year 6th Grade Review: Answer The Questions
3 pages