
Project Report: Fake News Detector

1. Introduction

In recent years, the spread of fake news has become a significant problem in media and communication. This project aims to build a Fake News Detector using machine learning techniques. The model classifies news articles as either "Fake" or "Real" based on their textual content. The solution leverages Natural Language Processing (NLP) techniques and machine learning algorithms to achieve this.

2. Objectives

- To preprocess and analyze news article text data.
- To build a machine learning model for classifying news as "Fake" or "Real."
- To evaluate the performance of the model and test it on new data.

3. Tools and Technologies

- Programming Language: Python
- Libraries Used:
  - pandas for data manipulation
  - nltk for text preprocessing
  - scikit-learn for machine learning
  - TfidfVectorizer (from scikit-learn) for feature extraction
- Dataset: Fake and Real News Dataset (available on Kaggle)

4. Methodology
Step 1: Data Collection

The dataset used for this project consists of news articles labeled as "Fake" or "Real." It contains two columns:
- text: The content of the article.
- label: The classification label ("Fake" or "Real").
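
A minimal loading sketch is shown below. It assumes the dataset has been saved as a single CSV file with the two columns described above; the file name and the exact label strings are illustrative assumptions, not details taken from the original report.

```python
import pandas as pd

# Assumption: a single CSV file with 'text' and 'label' columns.
# The file name 'news.csv' is illustrative.
df = pd.read_csv('news.csv')

# Map the string labels to integers for the classifier (1 = Real, 0 = Fake),
# matching the prediction function shown in Section 6.
df['label'] = df['label'].map({'Real': 1, 'Fake': 0})
```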

Step 2: Data Preprocessing

The text data is preprocessed to remove noise and improve model performance:
- Removal of HTML tags and special characters.
- Conversion of text to lowercase.
- Tokenization and removal of stopwords using NLTK.

Step 3: Feature Extraction

The TfidfVectorizer is used to convert the text data into numerical features. This technique (term frequency-inverse document frequency) weights each word by how often it appears in an article and how rare it is across the rest of the dataset.
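
A short sketch of this step, assuming the cleaned articles are held in a pandas DataFrame column df['text'] and the labels in df['label']; the max_features value is an illustrative choice, not a setting from the original report.

```python
from sklearn.feature_extraction.text import TfidfVectorizer

# Assumption: df['text'] already contains articles cleaned by preprocess_text.
vectorizer = TfidfVectorizer(max_features=5000)
X = vectorizer.fit_transform(df['text'])  # Sparse TF-IDF feature matrix
y = df['label']
```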

Step 4: Model Training

A Logistic Regression model is trained on the preprocessed data to classify news articles. The dataset is split into training and testing sets in an 80-20 ratio.
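
A sketch of the training step under the same assumptions, using the 80-20 split described above; the random_state value is an illustrative choice.

```python
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LogisticRegression

# 80% of the data for training, 20% held out for testing.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42)

model = LogisticRegression(max_iter=1000)
model.fit(X_train, y_train)
```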

Step 5: Model Evaluation


The model is evaluated using metrics such as accuracy, precision, recall, and F1-score. These metrics help in understanding the model's performance.
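
These metrics can be computed with scikit-learn, as in the sketch below, which assumes the model and test split from the previous step.

```python
from sklearn.metrics import accuracy_score, precision_score, recall_score, f1_score

y_pred = model.predict(X_test)

print(f"Accuracy : {accuracy_score(y_test, y_pred):.2f}")
print(f"Precision: {precision_score(y_test, y_pred):.2f}")
print(f"Recall   : {recall_score(y_test, y_pred):.2f}")
print(f"F1-score : {f1_score(y_test, y_pred):.2f}")
```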

Step 6: Prediction Function

A function is implemented to predict whether a new article is "Fake" or "Real."

5. Results

The model achieved the following results on the test dataset:
- Accuracy: 93%
- Precision: 92%
- Recall: 94%
- F1-score: 93%

These results indicate that the model is effective in distinguishing between fake and real news articles.

6. Key Code Snippets

Preprocessing Function

```python
import re

from nltk.corpus import stopwords
from nltk.tokenize import word_tokenize

# Requires the NLTK 'punkt' and 'stopwords' resources (via nltk.download).

def preprocess_text(text):
    text = re.sub(r'<.*?>', '', text)    # Remove HTML tags
    text = re.sub(r'[^\w\s]', '', text)  # Remove punctuation and special characters
    text = text.lower()                  # Convert to lowercase
    tokens = word_tokenize(text)         # Tokenize
    tokens = [word for word in tokens if word not in stopwords.words('english')]  # Remove stopwords
    return ' '.join(tokens)
```

Prediction Function

```python
def predict_news(article):
    # 'vectorizer' and 'model' are the fitted TfidfVectorizer and the trained
    # Logistic Regression classifier from the steps above.
    processed_article = preprocess_text(article)
    article_vectorized = vectorizer.transform([processed_article])
    prediction = model.predict(article_vectorized)
    return "Real" if prediction[0] == 1 else "Fake"
```
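
A short usage example; the article text here is purely hypothetical.

```python
sample_article = "Breaking: celebrity endorses miracle cure, doctors stunned"  # hypothetical text
print(predict_news(sample_article))  # prints "Fake" or "Real"
```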

7. Conclusion

This project successfully implemented a machine learning-based Fake News Detector. The model demonstrated high accuracy and can be further enhanced by:
- Using advanced deep learning models such as BERT.
- Expanding the dataset to include more diverse articles.
- Deploying the model as a web application using Flask or Streamlit (a minimal sketch follows below).
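
As an illustration of the deployment idea, the sketch below shows a minimal Streamlit app. It is not part of the original project code; it assumes the trained model and vectorizer have been saved with joblib under the hypothetical names model.joblib and vectorizer.joblib.

```python
# streamlit_app.py -- deployment sketch only; file names are assumptions.
import joblib
import streamlit as st

model = joblib.load('model.joblib')            # trained Logistic Regression
vectorizer = joblib.load('vectorizer.joblib')  # fitted TfidfVectorizer

st.title("Fake News Detector")
article = st.text_area("Paste a news article:")

if st.button("Classify") and article:
    # In the full pipeline, the same preprocess_text step would be applied first.
    features = vectorizer.transform([article])
    label = model.predict(features)[0]
    st.write("Prediction:", "Real" if label == 1 else "Fake")
```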

8. References
- Dataset: Fake and Real News Dataset on Kaggle
- Libraries: Official documentation of pandas, nltk, and scikit-learn.
