Fake News detection Using Machine Learning | IEEE Conference Publication | IEEE Xplore

The document presents a system for detecting fake news using machine learning techniques, specifically Support Vector Machine (SVM) as a classifier and TF-IDF for feature extraction. It highlights the challenges in fake news detection due to limited datasets and proposes a novel method that includes text preprocessing and feature extraction from both fake and true news datasets. The results demonstrate the efficiency of the proposed system in accurately classifying news as real or fake.

Fake News detection Using Machine Learning

Publisher: IEEE

Nihel Fatima Baarir; Abdelhamid Djeffal


Abstract:
The phenomenon of fake news is growing rapidly with the evolution of the means of
communication and social media. Fake news detection is an emerging research area that is attracting great interest. It
faces some challenges, however, due to limited resources such as datasets and processing and analysis
techniques. In this work, we propose a system for fake news detection that uses machine learning techniques. We
use term frequency-inverse document frequency (TF-IDF) of bag of words and n-grams as the feature extraction
technique, and Support Vector Machine (SVM) as the classifier. We also propose a dataset of fake and true news to train
the proposed system. The results obtained show the efficiency of the system.

Published in: 2020 2nd International Workshop on Human-Centric Smart Environments for Health and Well-being
(IHSH)

Date of Conference: 09-10 February 2021
DOI: 10.1109/IHSH51661.2021.9378748
Date Added to IEEE Xplore: 19 March 2021
Publisher: IEEE
Conference Location: Boumerdes, Algeria

SECTION I.
Introduction
In the last decade, the fake news phenomenon has spread very significantly, favored by social
networks. Fake news can be broadcast for different purposes. Some is made only to increase the
number of clicks and visitors on a site; some aims to influence public opinion on political decisions or on
financial markets, for example by damaging the reputation of companies and institutions on the Web. Fake
news concerning health on social media represents a risk to global health. The WHO warned in February
2020 that the COVID-19 outbreak had been accompanied by a massive ‘infodemic’, or an overabundance of
information (some of which was accurate and some of which was not), which made it difficult for people to find
reliable sources and trustworthy information when they needed it. The consequences of disinformation
overload are the spread of uncertainty, fear, anxiety and racism on a scale not seen in previous epidemics [11].

In this paper, we present a novel method and tool for detecting fake news that uses:

Text preprocessing: stemming and cleaning the text by removing stop words and special
characters.

Encoding of the text: using bag of words and N-grams, then TF-IDF.

Extraction of the characteristics: this allows a precise identification of false information. We use the
source of a news item, its author, its date and the sentiment conveyed by the text as features.

Support vector machine: a supervised machine learning algorithm that allows the classification of
new information.

This paper is structured as follows: Section 2 presents some existing proposals for fake news detection.
Section 3 details our proposal and its different components. Section 4 presents the implementation of our
proposal as well as some of the obtained results. Section 5 concludes the paper and presents some
perspectives.

SECTION II.
Related Works
In the literature, many works address fake news detection.

Authors of [3] propose a typology of several methods of truth assessment emerging from two main categories:
linguistic cue approaches with machine learning and network analysis approaches, for detecting fake news.

In [5], authors present a simple approach to fake news detection using a naive Bayesian classifier. This
approach is tested on a set of data extracted from Facebook news posts. They claim to be able to achieve an
accuracy of 74%. The rate of this model is good but not the best, as many other works have achieved a better
rate using other classifiers. We discuss these works in the following.

Authors of [1] propose a fake news detection model that uses n-gram analysis and machine learning
techniques, comparing two different feature extraction techniques and six different classification
techniques. The experiments carried out show that the best performance is obtained with the TF-IDF
feature extraction method. They used the Linear Support Vector Machine (LSVM) classifier, which
gives an accuracy of 92%. However, LSVM is limited to the case of two linearly separable
classes.

Authors of [14] describe how users of social networks can ensure the truth of information. They also describe
the mechanisms that allow their validation and the role of journalists or what to expect from researchers and
official institutions. This work helps people see some of the truth behind the news on social media and not
believe everything they read.

Authors of [9] propose several strategies and types of indices relating to different modalities (text, image,
social information). They also explore the value of combining and merging these approaches to assess and
verify shared information.

In [8], the authors present an overall performance analysis of different approaches on three different datasets.
This work focuses on the text of the news and the sentiment it conveys, and ignores features such as the
source, the author or the date of publication, which can have a dramatic impact on the result. Besides, in
our work, we show that integrating sentiment into the detection process does not bring any valuable
information.

Authors of [16] created a new public dataset of valid news articles and proposed a text-processing based
machine learning approach for automatic identification of fake news with 87% accuracy. It appears that this
work focuses on the sentiment emerging from the text and not on the content of the text itself.

Authors of [17] introduced LIAR, a new dataset for automatic fake news detection. This corpus can also be
used for stance classification, argument mining, topic modeling, rumor detection, and political NLP research.
Most of the works in this area have used this benchmark. However, it is well known that this dataset is restricted
to political information, while other works have integrated information from various fields.

The overall drawback of these approaches is that the categorical data encoding may not be valid in reality.
Besides, the usual fake news classification is limited to two values (namely, Real or Fake), while in reality
we cannot say that a news item is 100% real or fake, but only to a certain degree of confidence. We consider
this point very important for classifying news in social media.

SECTION III.
Proposed System
The system we propose uses a news dataset to build a decision model based on the support vector machine
method. The model is then used to classify new news items as fake or real.

A. General Architecture of the Proposed System

The proposed system takes as input a dataset of comments and their related information, such as date, source
and author. It then transforms them into a features dataset that can be used in the learning phase. This
transformation, called preprocessing, performs a series of operations such as cleansing, filtering and
encoding. The preprocessed dataset is divided into two parts: the first for training and the second for testing.
The training module uses the training dataset and the support vector machine algorithm to build a decision model
that can be applied to the test dataset. If the model is accepted (i.e., it achieves an acceptable accuracy
rate), it is kept for use and training ends. Otherwise, the parameters of the learning algorithm are
revised in order to improve the accuracy rate. Figure 1 illustrates the general scheme of the proposed system.

Figure 1.
Architecture of the proposed fake news detection system

B. Preprocessing
In the news dataset, news characteristics are classified into three categories: textual data, categorical data and
numerical data. Each category is preprocessed through a set of operations, as illustrated in Figure 2:

Figure 2.
Preprocessing of the different categories of news characteristics

• Textual Data
Represents the text written by the author of a news item. It is preprocessed by the following operations:

1. Cleaning: eliminating stop words and special characters.

2. Stemming: reducing the useful words to their roots.

3. Encoding: transforming all the words of the comment into a numerical vector. This takes two steps: the
combination of two techniques, namely bag of words [13] and N-grams [4], then the application of the
TF-IDF method [12] to the result:

$$\mathrm{TF\text{-}IDF}_t = \mathrm{TF}_t \times \mathrm{IDF}_t = \frac{n}{k} \times \log\frac{D}{D_t}$$

Where:

$\mathrm{TF}_t = \frac{n}{k}$: the number of appearances $n$ of term $t$ in the document divided by the total number $k$ of
terms in the document, keeping the multiplicity of each term.

$\mathrm{IDF}_t = \log\frac{D}{D_t}$: the logarithm of the total number of documents $D$ divided by the number $D_t$ of documents citing
this term.
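
As an illustration of the cleaning and encoding steps, the following minimal Python sketch computes TF-IDF weights over single words and 2-grams using the formula above. It is not the authors' implementation (their experiments were run in WEKA). The function names, the tiny stop-word list, the toy documents and the choice of the natural logarithm are assumptions made for this example, and stemming is omitted for brevity.

```python
import math
import re
from collections import Counter

# Tiny illustrative stop-word list (an assumption; real lists are much longer).
STOP_WORDS = {"the", "a", "an", "of", "and", "is"}


def tokens_with_bigrams(text):
    """Clean the text, then return its words followed by its 2-grams."""
    # Cleaning: lowercase, replace special characters by spaces, drop stop words.
    cleaned = re.sub(r"[^a-z0-9\s]", " ", text.lower())
    words = [w for w in cleaned.split() if w not in STOP_WORDS]
    bigrams = [" ".join(pair) for pair in zip(words, words[1:])]
    return words + bigrams


def tf_idf(documents):
    """Return one {term: weight} dict per document, weight = (n / k) * log(D / D_t)."""
    tokenized = [tokens_with_bigrams(doc) for doc in documents]
    total_docs = len(tokenized)  # D

    # D_t: number of documents that contain term t at least once.
    doc_freq = Counter()
    for terms in tokenized:
        doc_freq.update(set(terms))

    weights = []
    for terms in tokenized:
        counts = Counter(terms)  # n for each term of this document
        k = len(terms)           # total number of terms, multiplicity kept
        weights.append({
            t: (n / k) * math.log(total_docs / doc_freq[t])
            for t, n in counts.items()
        })
    return weights


docs = ["The fake news spreads fast!", "The real news spreads slowly."]
scores = tf_idf(docs)
```

A term that occurs in every document, such as "news" here, receives a weight of zero because log(D / D_t) = log(1) = 0, while a term specific to one document keeps a positive weight.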

• Categorical Data
Represents the source of the news, such as a TV channel, newspaper or magazine, and its author. These
data are preprocessed in two steps:

Cleaning: eliminating special characters and transforming letters into lowercase.

Encoding: for sources we used label encoding. For authors, we created our own encoding to convert
authors' names into numbers, so that authors from the same source are close to each other
compared to authors from other sources.

We created a list containing two fields, the first for the source and the second for its authors, then we
replaced each author by its index number, computed by adding the sum of the sizes of the previous sources plus
one. Figure 3 shows an example where:

T is the number of authors of the source (size).

i_k is the index number of author k.

Figure 3.
Calculation of authors indices
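
One possible reading of this author encoding is sketched below in Python. The interpretation is an assumption: the first author of a source receives the total number of authors of all previous sources plus one, and the other authors of that source follow consecutively, so that authors of the same source get neighbouring codes. The function name and the example sources and authors are hypothetical.

```python
def encode_authors(authors_by_source):
    """Map (source, author) pairs to consecutive integers, grouped by source.

    Assumed reading of the scheme: the first author of a source gets
    (number of authors in all previous sources) + 1, and the remaining
    authors of that source follow consecutively.
    """
    codes = {}
    offset = 0  # sum of the sizes T of the sources already processed
    for source, authors in authors_by_source.items():
        for position, author in enumerate(authors, start=1):
            codes[(source, author)] = offset + position
        offset += len(authors)
    return codes


# Hypothetical sources and author names, for illustration only.
example = {
    "source_a": ["alice", "bob"],
    "source_b": ["carol", "dave", "erin"],
}
codes = encode_authors(example)
```

With this reading, two authors of the same source differ by less than the size T of that source, whereas authors of different sources are separated by at least one full block.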

• Numerical Data
Represents the date on which the comment was posted and the sentiment conveyed by the text. Since the date is already
represented by a numerical value, we only split it into three separate values: day, month and year. For the
sentiment conveyed by the text, we calculate the sum of the sentiment degrees of the words.

According to experts, each word has a degree of sentiment; the sum of these degrees allows the text to be classified into three
classes:

If the sum is less than 0, the sentiment is negative.

If the sum is greater than 0, the sentiment is positive.

If the sum is 0, the sentiment is neutral.
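
The two numerical features can be sketched in a few lines of Python. The lexicon below is a hypothetical stand-in for an expert-built sentiment lexicon, and the function names are assumptions made for this example.

```python
from datetime import date

# Hypothetical word-level sentiment degrees; a real lexicon would be much larger.
LEXICON = {"good": 1.0, "great": 2.0, "bad": -1.0, "terrible": -2.0}


def split_date(d):
    """Split a date into the three numerical features day, month and year."""
    return d.day, d.month, d.year


def sentiment_class(text, lexicon=LEXICON):
    """Sum the sentiment degrees of the words and map the sum to a class."""
    # Words that are absent from the lexicon contribute 0 to the sum.
    total = sum(lexicon.get(word, 0.0) for word in text.lower().split())
    if total < 0:
        return "negative"
    if total > 0:
        return "positive"
    return "neutral"
```

For instance, a text containing "great" (+2) and "bad" (-1) sums to +1 and is therefore classified as positive.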

C. Learning
The learning phase brings together two modules, namely training and validation.

1) Training
To train our model, we have chosen the support vector machine algorithm [15]. This allows us to use the value of
the decision function for a news item as a confidence level for its classification: a positive value of the
decision function designates a true news item as well as its degree of truth and, conversely, a
negative value designates a fake news item as well as its degree of fakeness. Figure 4
illustrates this idea.

Figure 4.
Degree of confidence for news classification using support vector machine decision function

The maximum and minimum of the decision function are therefore calculated during the training phase and
used to compute the degree of truth or fallacy by the following function:

$$p = \begin{cases} \dfrac{Dec \times 100}{Max_{dec}} & \text{if } Dec > 0 \\ \dfrac{Dec \times 100}{Min_{dec}} & \text{otherwise} \end{cases}$$

Where:

$Dec$ is the decision function value;

$Max_{dec}$ and $Min_{dec}$ are the maximum and minimum values of the decision function;

$p$ is the percentage of truth or fakeness.
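
The piecewise function above translates directly into code. The sketch below is illustrative only; the function name and the returned labels are assumptions, and it presumes that the maximum of the decision function is positive and the minimum is negative.

```python
def confidence_percentage(dec, max_dec, min_dec):
    """Return (label, p), where p is the degree of truth or fakeness in percent.

    dec      decision function value for one news item
    max_dec  maximum decision value seen during training (assumed > 0)
    min_dec  minimum decision value seen during training (assumed < 0)
    """
    if dec > 0:
        return "true", dec * 100 / max_dec
    # dec and min_dec are both non-positive here, so the ratio is non-negative.
    return "fake", dec * 100 / min_dec
```

A new item may fall outside the range of decision values seen during training, in which case p exceeds 100; clipping the result would be an additional design choice that the text does not describe.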

2) Validation
To measure the capacity of the model to recognize new examples, we set aside some of the examples to be
used as test examples. The features dataset is thus subdivided into two parts, a training part and a test part. The
purpose is to detect over-fitting, which would go unnoticed if the model were tested on the training dataset itself. The
subdivision is not done at random but follows the method of cross validation
[10].
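
A minimal Python sketch of a k-fold split is given below. The text does not say how the folds are formed, so the use of contiguous, non-shuffled blocks is an assumption, as is the function name.

```python
def k_fold_indices(n_samples, k):
    """Yield (train_indices, test_indices) for k contiguous, non-random folds."""
    # Spread the remainder so that fold sizes differ by at most one.
    fold_sizes = [n_samples // k + (1 if i < n_samples % k else 0) for i in range(k)]
    start = 0
    for size in fold_sizes:
        test = list(range(start, start + size))
        train = list(range(0, start)) + list(range(start + size, n_samples))
        yield train, test
        start += size


folds = list(k_fold_indices(10, 5))
```

Each example is used exactly once for testing and k - 1 times for training, so the reported accuracy is never measured on data that the corresponding model was trained on.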

D. Revision of Parameters
This operation aims to improve the model's accuracy by tuning the parameters of the support
vector machine algorithm, namely Cost, γ and ϵ, and by changing the cross-validation variant [2].
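
One common way to organize such a revision is an exhaustive grid search, sketched below in Python. The text does not state that a grid search was used, so this is only one possible procedure. The `toy_accuracy` function is a placeholder with an arbitrary peak standing in for "train an SVM with these parameters and measure its accuracy"; it is not a measured result, and all names and grid values are assumptions.

```python
from itertools import product


def grid_search(evaluate, grid):
    """Try every parameter combination and return (best_params, best_score).

    evaluate  callable taking keyword parameters and returning an accuracy
    grid      dict mapping each parameter name to the list of values to try
    """
    names = list(grid)
    best_params, best_score = None, float("-inf")
    for values in product(*(grid[name] for name in names)):
        params = dict(zip(names, values))
        score = evaluate(**params)
        if score > best_score:
            best_params, best_score = params, score
    return best_params, best_score


# Placeholder scoring function with an arbitrary peak (not a real experiment).
def toy_accuracy(cost, gamma, epsilon):
    return 1.0 - abs(cost - 10) / 100 - abs(gamma - 0.1) - abs(epsilon - 0.001)


grid = {"cost": [1, 10, 100], "gamma": [0.01, 0.1, 1], "epsilon": [0.001, 0.1]}
best, score = grid_search(toy_accuracy, grid)
```

In practice `evaluate` would wrap the training and cross-validation steps, which makes each call expensive; the grid is therefore usually kept coarse at first and refined around the best values.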

E. Use
This is the last and most important phase of our system. After reaching the best recognition rate, i.e., after
building the best model, we can use it on new unlabeled news, and the model predicts their
class, fake or true, with a degree of confidence.

SECTION IV.
Experiments and Results
The performance of the proposed system was tested using a dataset that we built by merging a true news
dataset with a fake news one.

A. Used Dataset
We have merged two existing datasets: “Getting Real about Fake News” [6], containing fake news, and “All the
news” [7], containing real news. Both datasets were obtained from the Kaggle site. The first contains text and
metadata extracted from 244 websites marked as false by Daniel Sieradski's BS Detector Chrome extension,
extracted using the webhose.io API. This dataset contains approximately 12,999 social media posts, divided
into 20 columns of different types: categorical, numeric and textual. The second dataset contains texts and
metadata taken from the New York Times, Breitbart, CNN, Business Insider, the Atlantic, Fox News, Talking Points
Memo, Buzzfeed News, National Review, New York Post, The Guardian, NPR, Reuters, Vox and the
Washington Post, retrieved using BeautifulSoup and stored in SQLite, then split into three separate CSV files. This
last dataset contains texts and metadata subdivided into 10 columns of different types: categorical, numeric
and textual.

After pre-processing the two datasets and testing the features one by one until reaching the best accuracy rate,
we obtained a dataset which contains the following features:

5 words obtained by the bag of words method,

3 compound words obtained by the N-gram method,

date: day, month and year,

sentiment,

source,

author,

class: fake or real.

B. Results and Discussion

To get the best decision model with highest accuracy, we tuned many parameters. First we tried to get the best
parameters from both bag of words and n-gram techniques which give the best recognition rate on our
dataset.

For the bag of words technique, we varied the number of most frequent words taken from each comment.
This operation was repeated several times until the best rate was reached, increasing the number
of most frequent words each time. Using the SMO library in the Weka software, with 10-fold cross validation,
we obtained the following results:

Figure 5.
Evolution of the rate according to the number of frequent words for the bag of words

As shown in Figure 5 the recognition rate increases with the number of most frequent words, up to 25 words,
then begins to decrease, which we explain by the phenomenon of over-fitting.
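
Selecting the most frequent words of a comment can be sketched as follows in Python. The experiments themselves were run in Weka; the function name is an assumption, and the comment is presumed to be already cleaned and split into words.

```python
from collections import Counter


def top_k_words(words, k):
    """Return the k most frequent words of one comment, most frequent first."""
    return [word for word, _ in Counter(words).most_common(k)]


comment = "news about news and more news about politics".split()
```

Increasing `k` adds progressively rarer words to the feature set, which is the quantity varied along the horizontal axis of Figure 5.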

For the n-gram technique, we varied the number of grams. This operation was also repeated several times,
increasing the value of n each time. We obtained these results:

Figure 6.
Evolution of the rate according to the n-gram

In Figure 6, we observe that beyond 2-grams the recognition rate starts to decrease. This is expected given
the small size of the news texts: a block of more than 2 words is unlikely to be repeated several times in
the same piece of news, which does not exceed 5 lines.

We stopped at 2-grams and proceeded to vary the number k of most frequent word blocks. We obtained the
following results:

Figure 7.
Evolution of the rate according to the k * (2-grams)

In Figure 7 we observe that the rate continued to increase without exceeding the rate obtained by the bag of
words technique. This is due, in our opinion, to one of two causes: either the small size of the news text, or the
incompatibility of n-grams with the TF-IDF method. We then decided to combine the two techniques. We
started by combining the 5 most frequent words with the 3 most frequent 2-grams, which gave us a rate of 52.30%. Then,
we added the other characteristics to measure the influence of each one on the recognition rate.
Figure 8 represents the evolution of the accuracy rate depending on the different features, obtained by testing on the training
data and using the RBF kernel of the LIBSVM [2] method in WEKA [18].

Figure 8.
Influence of different features on accuracy

In Figure 8, we notice that the influence of the feature “Sentiment” on the accuracy is almost negligible, which
seems logical: a negative sentiment in a comment does not mean that the comment is fake.
However, the feature “source” increased the accuracy to 89.27%, and “date” to 96%, while the author
feature pushed it to 100%, which shows the effectiveness of the encoding we have proposed.

Figure 9 shows the results obtained with the different LIBSVM kernels and their tuning in WEKA.

Figure 9.
Accuracy according to the kernel type

It is clear that linear and polynomial kernels give the best results. The linear kernel is parameterless and
faster; theoretically, however, it cannot model cases of complicated overlap between the two classes. On the
other hand, the Gaussian kernel makes it possible to model any type of overlap, but its accuracy depends on
the parameters C (Cost), ϵ and γ. We have studied the influence of these parameters on the accuracy of the
model.
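
To make the role of γ concrete, the Python sketch below implements the standard definitions of the linear kernel and of the Gaussian (RBF) kernel, K(x, y) = exp(-γ‖x − y‖²). The function names are assumptions; C and ϵ are parameters of the optimization rather than of the kernel, so they do not appear here.

```python
import math


def rbf_kernel(x, y, gamma):
    """Gaussian (RBF) kernel: exp(-gamma * ||x - y||^2)."""
    squared_distance = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-gamma * squared_distance)


def linear_kernel(x, y):
    """Linear kernel: the plain dot product, with no parameter to tune."""
    return sum(a * b for a, b in zip(x, y))
```

A small γ makes distant points still look similar (kernel values close to 1), giving a smooth decision boundary, while a large γ makes the similarity fall off quickly, so the model can follow individual training points and over-fit.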

Influence of Cost C: Figure 10 represents the evolution of accuracy according to the Cost C,
obtained by testing on the training dataset and using the RBF kernel of the LIBSVM method in WEKA.

At the start the cost is equal to 0 and the rate is 52%. Increasing the cost value, we
observe a rapid increase in the rate up to the value 150, then a stabilization of the rate around
82%, even though we continued to increase the cost to high values.

In our opinion, this is explained as follows: it is known that for high values of C, the optimization
chooses a hyper-plane with a smaller margin; conversely, a very small value of C causes the
optimization to seek a separating hyper-plane with a larger margin. In this case the two classes are
very close to each other, so the separation margin is small; it is reached at the value 150, and beyond
this value there is no data in the margin.

Influence of ϵ: Figure 11 represents the evolution of accuracy depending on ϵ, obtained by testing on the training
data and using the RBF kernel of the LIBSVM method in WEKA.

We observe a stabilization of the rate around 82% up to the value 0.1, then a slight, negligible drop in the rate
up to the value 1. This shows that the ϵ parameter does not have a great influence on
the recognition rate, which is logical because this parameter only determines the tolerance of the
termination criterion, that is, the allowed error.

Influence of γ: Figure 12 represents the evolution of accuracy depending on γ, obtained by
testing on the training data and using the RBF kernel of the LIBSVM method in WEKA.

With C = 300 and ϵ = 0.0001, the recognition rate increases up to γ = 0.001, then
stabilizes around 82%, and then decreases rapidly from γ = 0.01.

In the end, we obtained the best model accuracy with the following parameters: Cost
C = 300, ϵ = 0.0001 and γ = 0.001.

Figure 10.
Evolution of the accuracy according to the Cost C

Figure 11.
Evolution of the accuracy rate according to ϵ

Figure 12.
Evolution of the accuracy rate according to gamma

SECTION V.
Conclusion
This paper presents a method for detecting fake news using a support vector machine, and tries to determine the
best features and techniques for this task. We started by studying the field of fake news, its impact and
its detection methods. We then designed and implemented a solution that uses a dataset of news preprocessed
using cleaning, stemming, N-gram encoding, bag of words and TF-IDF to extract a set of features
for detecting fake news. We then applied the Support Vector Machine algorithm to our features dataset to
build a model for classifying new information.

Through the research carried out during this study, we obtained the following results:

The best features to detect fake news are, in order: text, author, source, date and sentiment.

The process followed resulted in a recognition rate of 100% (measured by testing on the training data).

The analysis of the sentiment conveyed by the text is interesting; however, it would be more influential in the
case of opinion mining.

The N-gram method gives a better result than bag of words with large datasets and long texts.

The support vector machine seems to be the best algorithm to detect fake news, because it gave a better
recognition rate and provides a degree of confidence for each classification.

The parameters influencing the support vector machine are, in order: Cost C, gamma γ and epsilon ϵ.

The work we have done could be completed and continued in different directions. It would be relevant to extend
this study with a larger dataset, and to replace its supervised learning with online learning for continuous
updating and automatic integration of new fake news.
