project

The document consists of three main files for a phishing website detection application using Flask. The 'index.html' file provides the front-end interface for users to input URLs for phishing detection, while 'app.py' handles the backend logic, including model loading and prediction. The 'train_model.py' script trains a machine learning model using XGBoost and LightGBM algorithms, evaluates their performance, and saves the best model for future predictions.
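Both scripts in the listing hard-code the same absolute Windows path for the saved model. The following is a minimal sketch of one way to resolve that path relative to the project folder instead, so that app.py and train_model.py agree on the location on any machine. The environment variable name PHISHING_MODEL_PATH and the helper resolve_model_path are assumptions made for illustration; neither appears in the original files.

```python
import os
from pathlib import Path

MODEL_FILENAME = "phishing_model.pkl"  # file name used in the listings
ENV_VAR = "PHISHING_MODEL_PATH"        # hypothetical override, not in the original


def resolve_model_path(base_dir, environ=None):
    """Return the override path if the env var is set, else base_dir/MODEL_FILENAME."""
    environ = os.environ if environ is None else environ
    override = environ.get(ENV_VAR)
    if override:
        return Path(override)
    return Path(base_dir) / MODEL_FILENAME


if __name__ == "__main__":
    # Inside app.py or train_model.py, base_dir would typically be
    # Path(__file__).resolve().parent (the folder containing the script).
    print(resolve_model_path(Path.cwd()))
```

With a helper like this, the training script would save to the returned path and the Flask app would load from it, so the two files cannot drift apart.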

Uploaded by

priyadharshinimurugesan29


index.html:
<!DOCTYPE html>
<html lang="en">
<head>
    <meta charset="UTF-8">
    <meta name="viewport" content="width=device-width, initial-scale=1.0">
    <title>Phishing Website Detection</title>
    <link rel="stylesheet" href="style.css">
</head>
<body>
    <div class="animated-background">
        <canvas id="background-canvas"></canvas>
    </div>

    <div class="content">

        <div class="header">
            <h1>⚡ Cybercrime Detection Portal ⚡</h1>
            <p>Secure your digital world with cutting-edge phishing detection.</p>
        </div>

        <div class="form-container">
            <form action="/predict" method="post" id="phishing-form">
                <label for="url">Enter URL:</label>
                <input type="text" id="url" name="url"
                       placeholder="https://example.com" required>
                <button type="submit" class="detect-btn">Detect Phishing</button>
            </form>
        </div>

        <!-- Minimal assumed markup: displays the prediction_text value passed by app.py -->
        {% if prediction_text %}
        <p>{{ prediction_text }}</p>
        {% endif %}
    </div>
</body>
</html>
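The form above only marks the URL field as required, so any non-empty text reaches the /predict route. As a hedged sketch, a server-side check along the following lines could reject obviously malformed input before feature extraction. The function name is_plausible_url is hypothetical and is not part of the original app.py.

```python
from urllib.parse import urlparse


def is_plausible_url(text):
    """Return True if text parses as an http(s) URL that has a host name."""
    try:
        parsed = urlparse(text.strip())
    except ValueError:
        # urlparse raises ValueError for some malformed inputs,
        # such as an unclosed IPv6 bracket.
        return False
    return parsed.scheme in ("http", "https") and bool(parsed.hostname)


if __name__ == "__main__":
    for candidate in ("https://example.com", "example.com", "http://[::1"):
        print(candidate, is_plausible_url(candidate))
```

This only checks the shape of the input. It says nothing about whether the site is legitimate, which remains the model's job.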

app.py:
from flask import Flask, request, render_template
import joblib
import numpy as np
import pandas as pd

# Initialize Flask app
app = Flask(__name__)

# Load the trained model
model_path = 'C:/Users/priya/phishingwebsite/phishing_model.pkl'  # Ensure this path is correct
model = joblib.load(model_path)

# Route for the homepage
@app.route('/')
def home():
    return render_template('index.html')

# Route for handling predictions
@app.route('/predict', methods=['POST'])
def predict():
    # Get the input URL from the form
    url = request.form['url']

    # Step 1: Extract features from the input URL
    # Implement this function based on your dataset
    features = extract_features_from_url(url)

    # Step 2: Ensure the features align with the training set
    features = pd.DataFrame([features])  # Convert to DataFrame

    # Step 3: Make a prediction
    prediction = model.predict(features)[0]
    result = "Phishing" if prediction == 1 else "Legitimate"

    # Return the result to the frontend
    return render_template('index.html', prediction_text=f"The URL is: {result}")

# Feature extraction logic (implement based on your dataset)
def extract_features_from_url(url):
    # Replace this with your actual feature extraction logic
    return {
        'url_length': len(url),
        'dot_count': url.count('.'),
        'slash_count': url.count('/'),
        # Add other features based on your dataset here
    }

if __name__ == "__main__":
    app.run(debug=True)
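Step 2 in predict() says the features must align with the training set, but the listing only wraps the dictionary in a DataFrame. The sketch below illustrates one way the alignment could be done: extract a few URL features, then order them to match a fixed column list. The list EXPECTED_COLUMNS, the two extra features (has_https, host_length) and the helper align_features are assumptions for illustration. In practice the column list has to match the columns the model was actually trained on (X.columns in train_model.py), and every one of those columns must be computable from the URL.

```python
from urllib.parse import urlparse

# Hypothetical column order, for illustration only. It must match the
# training columns of the saved model for predictions to be meaningful.
EXPECTED_COLUMNS = ["url_length", "dot_count", "slash_count", "has_https", "host_length"]


def extract_features_from_url(url):
    """Compute simple lexical features from a URL string."""
    parsed = urlparse(url)
    host = parsed.hostname or ""
    return {
        "url_length": len(url),
        "dot_count": url.count("."),
        "slash_count": url.count("/"),
        "has_https": int(parsed.scheme == "https"),  # illustrative extra feature
        "host_length": len(host),                    # illustrative extra feature
    }


def align_features(features, expected_columns, fill_value=0):
    """Return values in expected_columns order; fill missing, drop unknown keys."""
    return [features.get(name, fill_value) for name in expected_columns]


if __name__ == "__main__":
    row = align_features(
        extract_features_from_url("https://example.com/login"), EXPECTED_COLUMNS
    )
    print(row)  # [25, 1, 3, 1, 11]
```

The aligned row could then be passed to the model as pd.DataFrame([row], columns=EXPECTED_COLUMNS). Filling a missing feature with 0 keeps the request from failing, but it is a placeholder rather than a real measurement, so it is safer to treat a missing feature as a bug in the extractor.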

train_model.py:
# Step 1: Import libraries
import pandas as pd
import numpy as np
from sklearn.model_selection import train_test_split, GridSearchCV
from sklearn.metrics import accuracy_score, classification_report
import joblib
import matplotlib.pyplot as plt
import seaborn as sns
from xgboost import XGBClassifier
from lightgbm import LGBMClassifier
import warnings
warnings.filterwarnings('ignore') # Suppress warnings for cleaner output

# Step 2: Load the dataset

# Replace with the path to your dataset
data_path = 'C:/Users/priya/phishingwebsite/phishing_dataset.csv'
data = pd.read_csv(data_path)
print("Dataset loaded successfully.")

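# Step 3: Data Preprocessing
# Check for missing values and drop any rows that contain them
missing_count = int(data.isnull().sum().sum())
if missing_count > 0:
    data = data.dropna()
    print(f"Found {missing_count} missing values. Rows containing them were removed.")
else:
    print("No missing values found. Proceeding without removing rows.")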

# Step 4: Feature Selection
X = data.drop('class', axis=1)  # Features
# Target: remap labels from {-1, 1} to {0, 1}; XGBoost expects class labels starting at 0
y = data['class'].map({-1: 0, 1: 1})

# Step 5: Split the dataset
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3,
                                                    random_state=42)

# Step 6: Train models with advanced algorithms
# XGBoost ('logloss' is the binary log-loss metric; the target has two classes)
xgb_model = XGBClassifier(use_label_encoder=False, eval_metric='logloss',
                          random_state=42)
xgb_model.fit(X_train, y_train)
print("XGBoost model trained successfully.")

# LightGBM
lgbm_model = LGBMClassifier(random_state=42)
lgbm_model.fit(X_train, y_train)
print("LightGBM model trained successfully.")

# Step 7: Evaluate the models
y_pred_xgb = xgb_model.predict(X_test)
y_pred_lgbm = lgbm_model.predict(X_test)

accuracy_xgb = accuracy_score(y_test, y_pred_xgb)
accuracy_lgbm = accuracy_score(y_test, y_pred_lgbm)

print(f"XGBoost Accuracy: {accuracy_xgb * 100:.2f}%")
print(f"LightGBM Accuracy: {accuracy_lgbm * 100:.2f}%")

print("\nClassification Report (XGBoost):\n",
      classification_report(y_test, y_pred_xgb))
print("\nClassification Report (LightGBM):\n",
      classification_report(y_test, y_pred_lgbm))

# Step 8: Save the best model
if accuracy_xgb > accuracy_lgbm:
    best_model = xgb_model
    print("XGBoost is the best model based on test accuracy.")
else:
    best_model = lgbm_model
    print("LightGBM is the best model based on test accuracy.")

model_path = 'C:/Users/priya/phishingwebsite/phishing_model.pkl'
joblib.dump(best_model, model_path)
print(f"Best model saved to {model_path}.")

# Visualize feature importance
importances = best_model.feature_importances_
features = X.columns

# Sorting the feature importances for better visualization
sorted_indices = np.argsort(importances)[::-1]  # Sort in descending order
plt.figure(figsize=(10, 8))
sns.barplot(x=importances[sorted_indices], y=features[sorted_indices])
plt.title("Feature Importance (Sorted)")
plt.xlabel("Importance")
plt.ylabel("Feature")
plt.show()
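The script selects the model to save by test accuracy alone, while classification_report also prints per-class precision and recall. As a rough illustration of what those two numbers mean for the phishing class, the sketch below computes them from scratch on a small made-up label list. Treating label 1 as the positive (phishing) class follows app.py, which reports "Phishing" when the prediction is 1. The helper names and the example labels are hypothetical.

```python
def confusion_counts(y_true, y_pred, positive=1):
    """Count true positives, false positives, false negatives, true negatives."""
    tp = fp = fn = tn = 0
    for truth, pred in zip(y_true, y_pred):
        if pred == positive and truth == positive:
            tp += 1
        elif pred == positive:
            fp += 1
        elif truth == positive:
            fn += 1
        else:
            tn += 1
    return tp, fp, fn, tn


def precision_recall(y_true, y_pred, positive=1):
    """Precision = TP / (TP + FP); recall = TP / (TP + FN); 0.0 if undefined."""
    tp, fp, fn, _ = confusion_counts(y_true, y_pred, positive)
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return precision, recall


if __name__ == "__main__":
    # Made-up labels: 3 phishing URLs, 5 legitimate ones.
    y_true = [1, 1, 1, 0, 0, 0, 0, 0]
    y_pred = [1, 0, 0, 0, 0, 0, 0, 0]
    accuracy = sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)
    precision, recall = precision_recall(y_true, y_pred)
    print(accuracy, precision, recall)
```

In this made-up case accuracy is 0.75 even though only one of the three phishing URLs is caught (recall of one third), which is why recall on the phishing class may be worth checking alongside accuracy when comparing the two models.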
