hackathon1[1]

Uploaded by

sahgyan9

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

0 views

hackathon1[1]

Uploaded by

sahgyan9

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 6

Department Of Computer

Science &
Engineering
Enhanced Phishing Detection and Prevention System
Using Natural Language Processing (NLP)

Khyathisree Yarra- AP23110010215

Chittem Mahesh Babu-
AP23110010084
Praneeth gadipudi -
AP23110010292
Gottipati Harshith Sai-
AP23110010170
Problem Statement :
Phishing attacks have become increasingly sophisticated, often
bypassing traditional detection methods. Develop a next-gen phishing
detection system using NLP and machine learning that can analyze
message context, intent, and linguistic cues to identify and flag phishing
attempts with minimal false positives.
Introduction to Phishing:
Phishing: A type of cyber-attack where attackers impersonate legitimate
entities to deceive individuals into revealing sensitive information.
Proposed solution
1. Input Data Collection
1. Gather email content, URLs, and metadata like sender details and email
headers.
2. Preprocessing Module
1. Tokenization: Break down email text into individual words or tokens.
2. Cleaning: Remove irrelevant characters (HTML Tags, special characters)
3. Stop Word Removal: Filter out common words (e.g “and” , “the”) that
don’t contribute to meaning.
4. Stemming/Lemmatization: Standardize words to their root forms for
consistency.
5. Convert email text to lowercase for consistent analysis.
3. Feature Extraction
1. Text Analysis: Use NLP to analyze the email's content, identifying
suspicious words, phrases, and language patterns (e.g., urgent language or
abnormal greetings).
Proposed solution
2.URL Analysis: Analyze embedded URLs for:
1. Uncommon domain structures or misspelled URLs.
2. Reputation of domains using a third-party API or historical data.
3. Extract features like domain reputation, URL length, and presence of
uncommon characters.
3.Metadata and Header Analysis: Extract metadata such as sender’s email
domain, IP address , SPF, and DKIM verification to check for signs of spoofing
or impersonation.

4.Phishing Score Calculation

2. Each component (text, URL, and metadata analysis) produces an
independent score indicating the likelihood of phishing.
3. Combine these scores into a weighted aggregate score using a formula to
prioritize high-risk indicators (e.g., suspicious URLs or unverified metadata).
5
Proposed solution
5. Threshold Evaluation and Classification
1. Compare the aggregate score to a threshold:
1. If the score exceeds the threshold, classify the email as phishing.
2. Otherwise, mark it as legitimate.
6. Action Module
2. Based on the classification, initiate appropriate actions:
1. If phishing, quarantine the email, alert the user, and send notifications
to system administrators.
2. If legitimate, allow the email to be delivered to the inbox.
7. Continuous Learning and Feedback Loop
3. Gather user and admin feedback on the system's performance.
4. Feed any false positives or false negatives back into the system for
retraining and refining the model over time.
Start
Proposed Architecture:
Classification and
Threshold
Validation
If score>threshold

Input data
(Emails)

Data Preprocessing
->Tokenization
->Remove irrelevant Mark as Phishing
Mark as Legitimate
items ->Quarantine
->Deliver to inbox
->Alert user/admin

Feature Extraction
->Textual Features
->URL Features ->Update model
End of Detection
->Metadata Features based on user/admin
Process
input

Phishing Detection Model

->Training and Prediction
->Producing confidence
source Stop

Phising Detection project
No ratings yet
Phising Detection project
14 pages
AI-Generated Phishing Detection System
No ratings yet
AI-Generated Phishing Detection System
5 pages
Review of Related Literature
No ratings yet
Review of Related Literature
8 pages
security
No ratings yet
security
14 pages
Phishing Detection Tool
No ratings yet
Phishing Detection Tool
16 pages
Patent. US11483343 (EN)
No ratings yet
Patent. US11483343 (EN)
4 pages
Business Understanding
No ratings yet
Business Understanding
5 pages
Round 3 Presentation_compressed
No ratings yet
Round 3 Presentation_compressed
10 pages
Detecting Phishing Websites Using Machine Learning
No ratings yet
Detecting Phishing Websites Using Machine Learning
16 pages
Main Project (1)
No ratings yet
Main Project (1)
48 pages
Phishing Email Detection Abstract
No ratings yet
Phishing Email Detection Abstract
8 pages
main project
No ratings yet
main project
48 pages
Phishing-Detection-Tool
No ratings yet
Phishing-Detection-Tool
14 pages
PHISHING PPT FINAL
No ratings yet
PHISHING PPT FINAL
24 pages
final project
No ratings yet
final project
60 pages
PPT
No ratings yet
PPT
14 pages
AI-Generated Phishing
No ratings yet
AI-Generated Phishing
12 pages
updated_phishing_url_detection
No ratings yet
updated_phishing_url_detection
13 pages
Phishing
No ratings yet
Phishing
8 pages
Midterm Project Report
No ratings yet
Midterm Project Report
21 pages
Cream Neutral Minimalist New Business Pitch Deck Presentation
No ratings yet
Cream Neutral Minimalist New Business Pitch Deck Presentation
6 pages
a
No ratings yet
a
7 pages
CSSML
No ratings yet
CSSML
30 pages
Abstract
No ratings yet
Abstract
1 page
BSC Final Project PPT
No ratings yet
BSC Final Project PPT
8 pages
Detection of Phishing Website
No ratings yet
Detection of Phishing Website
23 pages
1822 B.E Cse Batchno 287
No ratings yet
1822 B.E Cse Batchno 287
65 pages
Project Docoment Merged
No ratings yet
Project Docoment Merged
86 pages
Apurva Sontakke - 2018 - IEEE - Detecting Phishing Attacks Using Natural Language Processing and Machine Learning
No ratings yet
Apurva Sontakke - 2018 - IEEE - Detecting Phishing Attacks Using Natural Language Processing and Machine Learning
2 pages
Synopsis
No ratings yet
Synopsis
13 pages
NIS Microproject
No ratings yet
NIS Microproject
10 pages
AI Powered Cybersecurity Phoshing Detection
No ratings yet
AI Powered Cybersecurity Phoshing Detection
7 pages
PhishNotCloud-Based ML
No ratings yet
PhishNotCloud-Based ML
11 pages
Anti-Phishing Mini Project
No ratings yet
Anti-Phishing Mini Project
7 pages
Phishing-Detection Using Ml[1]
No ratings yet
Phishing-Detection Using Ml[1]
14 pages
Phishing 094610
No ratings yet
Phishing 094610
26 pages
phishingreport (1)
No ratings yet
phishingreport (1)
19 pages
Innovative Nitesh
No ratings yet
Innovative Nitesh
11 pages
Phishing Website Detection
No ratings yet
Phishing Website Detection
19 pages
sensors-24-02077-v2
No ratings yet
sensors-24-02077-v2
19 pages
MINI PROJECT PHISHING WEBSITE DETECTION USING ML
No ratings yet
MINI PROJECT PHISHING WEBSITE DETECTION USING ML
45 pages
final ppt
No ratings yet
final ppt
26 pages
Bcck Nhom4 Baomattmdt Tiet789
No ratings yet
Bcck Nhom4 Baomattmdt Tiet789
26 pages
Innovative Nitesh
No ratings yet
Innovative Nitesh
14 pages
Various Methodological Approaches to Phishing Detection
No ratings yet
Various Methodological Approaches to Phishing Detection
8 pages
Report PUD
No ratings yet
Report PUD
20 pages
127_A Comparison of Natural Language Processing and Machine Learning Methods for Phishing Email Detection
No ratings yet
127_A Comparison of Natural Language Processing and Machine Learning Methods for Phishing Email Detection
12 pages
Detecting Phishing Website With Code Implementation
No ratings yet
Detecting Phishing Website With Code Implementation
13 pages
Applsci 13 08756 v2
No ratings yet
Applsci 13 08756 v2
19 pages
paper2
No ratings yet
paper2
10 pages
(IJCST-V12I3P8) :annie Florance V, Fathima G
No ratings yet
(IJCST-V12I3P8) :annie Florance V, Fathima G
6 pages
1NT21MC081 Research Report
No ratings yet
1NT21MC081 Research Report
5 pages
Presentation Slides
No ratings yet
Presentation Slides
42 pages
Jain 2018
No ratings yet
Jain 2018
14 pages
Major Proj Sumanthppt
No ratings yet
Major Proj Sumanthppt
13 pages
Final report scanned
No ratings yet
Final report scanned
100 pages
128 Submission
No ratings yet
128 Submission
7 pages
ISAA Report PDF
No ratings yet
ISAA Report PDF
24 pages
Spamfinal
No ratings yet
Spamfinal
10 pages
Practical Pentesting Guide: Preparation for Certification and Ethical Hacking
From Everand
Practical Pentesting Guide: Preparation for Certification and Ethical Hacking
Evan Blake
No ratings yet
Article 1
No ratings yet
Article 1
17 pages
Sex & Love Making Format For Yahoo - PDF Intimate Relationships PDF Scribd
100% (1)
Sex & Love Making Format For Yahoo - PDF Intimate Relationships PDF Scribd
1 page
2022 Supply Needs
No ratings yet
2022 Supply Needs
1 page
MIL-Q3-M2
No ratings yet
MIL-Q3-M2
12 pages
New Media and Assamese Literature An Introduction
No ratings yet
New Media and Assamese Literature An Introduction
3 pages
Start Vserver
100% (1)
Start Vserver
129 pages
Lecture Notes in Artificial Intelligence 3230
No ratings yet
Lecture Notes in Artificial Intelligence 3230
497 pages
Dse 數學聯盟 2020 p1
No ratings yet
Dse 數學聯盟 2020 p1
16 pages
Development and Evaluation of A Software System For Medical Students To Teach and Practice Anamnestic Interviews With Virtual Patient Avatars
No ratings yet
Development and Evaluation of A Software System For Medical Students To Teach and Practice Anamnestic Interviews With Virtual Patient Avatars
9 pages
HDB3 PDF
No ratings yet
HDB3 PDF
2 pages
Introduction To Optical Networking
No ratings yet
Introduction To Optical Networking
39 pages
Ransomware Guide From CISA - September 2020
No ratings yet
Ransomware Guide From CISA - September 2020
16 pages
DevOps UNIT-5
No ratings yet
DevOps UNIT-5
13 pages
C Programming and Data Structures - CS3353 2021 Regulation - Semester Question Paper 2022 Nov Dec
No ratings yet
C Programming and Data Structures - CS3353 2021 Regulation - Semester Question Paper 2022 Nov Dec
6 pages
Org Integration of Software Architecture in Requirements Elicitation For Rapid Software Development
No ratings yet
Org Integration of Software Architecture in Requirements Elicitation For Rapid Software Development
21 pages
1055 - Tejasvi Borole FP Assignement-4
No ratings yet
1055 - Tejasvi Borole FP Assignement-4
5 pages
Simply Production of Metal Parts: EOS M 290
No ratings yet
Simply Production of Metal Parts: EOS M 290
4 pages
Types of Consistency Models Used in DSM: 1.weak Consistency 2.release Consistency 3.entry Consistency
No ratings yet
Types of Consistency Models Used in DSM: 1.weak Consistency 2.release Consistency 3.entry Consistency
13 pages
class 10 project cycle
No ratings yet
class 10 project cycle
34 pages
TAS3251 Evaluation Module (Rev. B)
No ratings yet
TAS3251 Evaluation Module (Rev. B)
41 pages
C Patterns
No ratings yet
C Patterns
63 pages
SB18 (I/m) : User Manual
No ratings yet
SB18 (I/m) : User Manual
18 pages
Lesson Plan For TTL 1
No ratings yet
Lesson Plan For TTL 1
6 pages
Codebucket Solutions Private Limited Codebucket Solutions Private Limited
No ratings yet
Codebucket Solutions Private Limited Codebucket Solutions Private Limited
24 pages
TN TRB Assistant Professor Syllabus 2024 29 34
No ratings yet
TN TRB Assistant Professor Syllabus 2024 29 34
6 pages
A Success Story With Mikrotik and DMASoftlab RADIUS MANAGER (Glass Line PVT LTD
No ratings yet
A Success Story With Mikrotik and DMASoftlab RADIUS MANAGER (Glass Line PVT LTD
17 pages
FortiClient_ems-compatibility-matrix
No ratings yet
FortiClient_ems-compatibility-matrix
1 page
Top 65 Windows Server Interview Questions
No ratings yet
Top 65 Windows Server Interview Questions
11 pages
Iea1a121 PDF
No ratings yet
Iea1a121 PDF
474 pages
Final Report Ued102 Mohamad Hafizuddin Bin Mohd Bokhori
No ratings yet
Final Report Ued102 Mohamad Hafizuddin Bin Mohd Bokhori
21 pages

hackathon1[1]

Uploaded by

hackathon1[1]

Uploaded by

Department Of Computer

Khyathisree Yarra- AP23110010215

4.Phishing Score Calculation

Phishing Detection Model

You might also like