Classifying Fake News Articles Using NLP To Identify In-Article Attribution As A Supervised Learning Estimator
CONTENTS
ABSTRACT
1. INTRODUCTION
2. LITERATURE SURVEY
2.1 WHEN FAKE NEWS BECOMES REAL
2.2 THE IMPACT OF REAL NEWS ABOUT "FAKE NEWS"
2.3 SOFTWARE ENVIRONMENT
2.4 WHY CHOOSE PYTHON
3. SYSTEM ANALYSIS
3.1 EXISTING SYSTEM
3.2 PROPOSED SYSTEM
4. FEASIBILITY STUDY
4.1 ECONOMICAL FEASIBILITY
4.2 TECHNICAL FEASIBILITY
4.3 SOCIAL FEASIBILITY
5. SYSTEM REQUIREMENTS
6. SYSTEM DESIGN
6.1 SYSTEM ARCHITECTURE
6.2 DATA FLOW DIAGRAM
6.3 UML DIAGRAMS
7. IMPLEMENTATION
7.1 MODULES
7.2 SAMPLE CODE
8. SYSTEM TESTING
8.1 UNIT TESTING
8.2 INTEGRATION TESTING
8.3 ACCEPTANCE TESTING
9. INPUT AND OUTPUT DESIGN
9.1 INPUT DESIGN
9.2 OUTPUT DESIGN
10. SCREENSHOTS
11. FUTURE WORK
12. CONCLUSION
13. BIBLIOGRAPHY
ABSTRACT:
Intentionally deceptive content presented under the guise of legitimate journalism is a worldwide
information accuracy and integrity problem that affects opinion forming, decision making, and
voting patterns. Most so-called ‘fake news’ is initially distributed over social media conduits like
Facebook and Twitter and later finds its way onto mainstream media platforms such as traditional
television and radio news. The fake news stories that are initially seeded over social media
platforms share key linguistic characteristics such as making excessive use of unsubstantiated
hyperbole and non-attributed quoted content. In this paper, the results of a fake news
identification study that documents the performance of a fake news classifier are presented. The
TextBlob, Natural Language Toolkit (NLTK), and SciPy toolkits were used to develop a novel fake news detector
that uses quoted attribution in a Bayesian machine learning system as a key feature to estimate
the likelihood that a news article is fake. The resulting process achieves 63.333% precision when
assessing the likelihood that an article with quotes is fake. This process is called influence mining
and this novel technique is presented as a method that can be used to enable fake news and even
propaganda detection. In this paper, the research process, technical analysis, technical linguistics
work, and classifier performance and results are presented. The paper concludes with a
discussion of how the current system will evolve into an influence mining system.
1. INTRODUCTION
Intentionally deceptive content presented under the guise of legitimate journalism (or ‘fake news,’ as it is
commonly known) is a worldwide information accuracy and integrity problem that affects opinion
forming, decision making, and voting patterns. Most fake news is initially distributed over social media
conduits like Facebook and Twitter and later finds its way onto mainstream media platforms such as
traditional television and radio news. The fake news stories that are initially seeded over social media
platforms share key linguistic characteristics such as excessive use of unsubstantiated hyperbole and non-
attributed quoted content. The results of a fake news identification study that documents the performance
of a fake news classifier are presented and discussed in this paper.
2. LITERATURE SURVEY
2.1 When Fake News Becomes Real: Combined Exposure to Multiple News
Sources and Political Attitudes of Inefficacy, Alienation, and Cynicism
2.2 The Impact of Real News about "Fake News": Intertextual Processes and
Political Satire
Python 2.0 was released in 2000, and the 2.x versions were the prevalent releases
until December 2008. At that time, the development team made the decision to
release version 3.0, which contained a few relatively small but significant changes
that were not backward compatible with the 2.x versions. Python 2 and 3 are very
similar, and some features of Python 3 have been backported to Python 2, but in
general they remain not quite compatible.
Both Python 2 and 3 have continued to be maintained and developed, with periodic
release updates for both. As of this writing, the most recent versions available are
2.7.15 and 3.6.5. However, an official End of Life date of January 1, 2020 has been
established for Python 2, after which time it will no longer be maintained. If you
are a newcomer to Python, it is recommended that you focus on Python 3, as this
tutorial will do.
Python is maintained by a core development team, and Guido van Rossum long led the
project, having been given the title of BDFL (Benevolent Dictator For Life) by the
Python community, a role he stepped down from in 2018. The name Python, by the way,
derives not from the snake, but from the British comedy troupe Monty Python’s Flying
Circus, of which Guido was a fan. It is common to find references to Monty Python
sketches and movies scattered throughout the Python documentation.
If you’re going to write programs, there are literally dozens of commonly used
languages to choose from. Why choose Python? Here are some of the features that
make Python an appealing choice.
Python is Popular
Python has been growing in popularity over the last few years. The 2018 Stack
Overflow Developer Survey ranked Python as the 7th most popular and the number
one most wanted technology of the year. World-class software development
companies around the globe use Python every single day.
According to research by Dice, Python is also one of the hottest skills to have and
one of the most popular programming languages in the world, based on the Popularity
of Programming Language Index.
Python is interpreted
Many languages are compiled, meaning the source code you create needs to be
translated into machine code, the language of your computer’s processor, before it
can be run. Programs written in an interpreted language are passed straight to an
interpreter that runs them directly.
This makes for a quicker development cycle because you just type in your code and
run it, without the intermediate compilation step.
Python is Free
The Python interpreter is developed under an OSI-approved open-source license,
making it free to install, use, and distribute, even for commercial purposes.
A version of the interpreter is available for virtually any platform there is,
including all flavors of Unix, Windows, macOS, smart phones and tablets, and
probably anything else you ever heard of. A version even exists for the half dozen
people remaining who use OS/2.
Python is Portable
Because Python code is interpreted and not compiled into native machine
instructions, code written for one platform will work on any other platform that has
the Python interpreter installed. (This is true of any interpreted language, not just
Python.)
Python is Simple
As programming languages go, Python is relatively uncluttered, and the developers
have deliberately kept it that way.
A rough estimate of the complexity of a language can be gleaned from the number
of keywords or reserved words in the language. These are words that are reserved
for special meaning by the compiler or interpreter because they designate specific
built-in functionality of the language.
Python 3 has 33 keywords, and Python 2 has 31. By contrast, C++ has 62, Java has
53, and Visual Basic has more than 120, though these latter examples probably
vary somewhat by implementation or dialect.
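The keyword count is easy to verify from the standard library's `keyword` module; the exact number varies by interpreter version (33 on early Python 3 releases, 35 on recent ones), so treat the figures above as approximate:

```python
import keyword

# keyword.kwlist holds every reserved word for the running interpreter.
print(len(keyword.kwlist))   # e.g. 35 on recent Python 3 versions
print(keyword.kwlist[:5])    # e.g. ['False', 'None', 'True', 'and', 'as']
```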
Python code has a simple and clean structure that is easy to learn and easy to read.
In fact, as you will see, the language definition enforces code structure that is easy
to read.
Conclusion
This section gave an overview of the Python programming language. Python is a great
option, whether you are a beginning programmer looking to learn the basics, an
experienced programmer designing a large application, or anywhere in between. The
basics of Python are easily grasped, and yet its capabilities are vast.
Python has a very easy-to-read syntax. Some of Python's syntax comes from C,
because that is the language that Python was written in. But Python uses
whitespace to delimit code: spaces or tabs are used to organize code into groups.
This is different from C. In C, there is a semicolon at the end of each line and curly
braces ({}) are used to group code. Using whitespace to delimit code makes Python
a very easy-to-read language.
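To illustrate the point above: the grouping that C expresses with braces and semicolons, Python expresses with indentation alone. The `classify` function here is a made-up example, not code from the project:

```python
def classify(score):
    # The indented lines form the body of the function;
    # no braces or statement terminators are needed.
    if score > 0.5:
        label = "fake"
    else:
        label = "real"
    return label

print(classify(0.8))  # fake
print(classify(0.2))  # real
```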
Python is widely used in application areas such as:
Web development
Scientific programming
Desktop GUIs
Network programming
Game programming
3. SYSTEM ANALYSIS
3.1 EXISTING SYSTEM:
Up to now, most of the research on PDS has focused on how to enforce user privacy preferences
and how to secure data stored in the PDS. In contrast, the key issue of helping users
specify their privacy preferences on PDS data has not so far been deeply investigated. This is a
fundamental issue since average PDS users are not skilled enough to understand how to translate
their privacy requirements into a set of privacy preferences. As several studies have shown,
average users might have difficulties in properly setting potentially complex privacy preferences.
5. SYSTEM REQUIREMENTS
• Tool : PyCharm
• Database : MYSQL
• Web framework : Flask
6. SYSTEM DESIGN
(Data flow diagram: the system checks each user; an unauthorized user is logged out and the process ends.)
6.3 UML DIAGRAMS:
UML stands for Unified Modeling Language. UML is a standardized
general-purpose modeling language in the field of object-oriented software
engineering. The standard is managed, and was created by, the Object Management
Group.
The goal is for UML to become a common language for creating models of
object-oriented computer software. In its current form UML comprises two
major components: a meta-model and a notation. In the future, some form of
method or process may also be added to, or associated with, UML.
The Unified Modeling Language is a standard language for specifying,
visualizing, constructing, and documenting the artifacts of a software system, as
well as for business modeling and other non-software systems.
The UML represents a collection of best engineering practices that have
proven successful in the modeling of large and complex systems.
The UML is a very important part of developing object-oriented software
and the software development process. The UML uses mostly graphical notations
to express the design of software projects.
GOALS:
The Primary goals in the design of the UML are as follows:
1. Provide users with a ready-to-use, expressive visual modeling language so that
they can develop and exchange meaningful models.
2. Provide extendibility and specialization mechanisms to extend the core
concepts.
3. Be independent of particular programming languages and development
processes.
4. Provide a formal basis for understanding the modeling language.
5. Encourage the growth of the OO tools market.
6. Integrate best practices.
SEQUENCE DIAGRAM:
A sequence diagram in Unified Modeling Language (UML) is a kind of interaction
diagram that shows how processes operate with one another and in what order. It is
a construct of a Message Sequence Chart. Sequence diagrams are sometimes called
event diagrams, event scenarios, and timing diagrams.
ACTIVITY DIAGRAM:
Activity diagrams are graphical representations of workflows of stepwise activities
and actions with support for choice, iteration and concurrency. In the Unified
Modeling Language, activity diagrams can be used to describe the business and
operational step-by-step workflows of components in a system. An activity
diagram shows the overall flow of control.
7. IMPLEMENTATION
7.1 MODULES:
Upload News Articles
MODULES DESCRIPTION:
7.2 SAMPLE CODE
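The report does not reproduce the sample code itself, so the following is a minimal, self-contained sketch of the quoted-attribution idea described in the abstract. The regexes, the attribution verb list, the 60-character window, and the 2.0/0.5 likelihood ratios are all illustrative assumptions, not the study's actual parameters (the study itself used TextBlob, NLTK, and SciPy):

```python
import re

# Verbs that signal in-article attribution of a quote (illustrative list).
ATTRIB_VERBS = re.compile(r"\b(said|says|stated|according to|told|reported)\b", re.I)
# A "quote" here is any double-quoted span of at least 10 characters.
QUOTE = re.compile(r'"([^"]{10,})"')

def quote_attribution_features(text):
    """Return (total_quotes, attributed_quotes) for an article.

    A quote counts as attributed when an attribution verb occurs
    within a 60-character window around it."""
    quotes = list(QUOTE.finditer(text))
    attributed = 0
    for m in quotes:
        window = text[max(0, m.start() - 60):m.end() + 60]
        if ATTRIB_VERBS.search(window):
            attributed += 1
    return len(quotes), attributed

def fake_likelihood(text, prior_fake=0.5):
    """Bayesian-style odds update: each non-attributed quote doubles
    the odds that the article is fake; each attributed quote halves
    them. The 2.0 / 0.5 ratios are placeholders, not learned values."""
    n, attributed = quote_attribution_features(text)
    odds = prior_fake / (1 - prior_fake)
    odds *= 2.0 ** (n - attributed)
    odds *= 0.5 ** attributed
    return odds / (1 + odds)

fake = ('He warned that "the shocking truth they refuse to print" will '
        'come out, and an insider claims "everything about the vote was rigged".')
real = 'The mayor said "we will rebuild the bridge by next spring" at a briefing.'
print(fake_likelihood(fake))  # 0.8  (two quotes, neither attributed)
print(fake_likelihood(real))  # ~0.333 (one quote, attributed via "said")
```

In a full system, the hand-set likelihood ratios would be replaced by values estimated from a labeled training corpus.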
8. SYSTEM TESTING
TYPES OF TESTS
Unit testing:
Unit testing involves the design of test cases that validate that the internal
program logic is functioning properly, and that program inputs produce valid
outputs. All decision branches and internal code flow should be validated. It is the
testing of individual software units of the application, and is done after the
completion of an individual unit, before integration. This is structural testing that
relies on knowledge of the unit's construction and is invasive. Unit tests perform basic
tests at component level and test a specific business process, application, and/or
system configuration. Unit tests ensure that each unique path of a business process
performs accurately to the documented specifications and contains clearly defined
inputs and expected results.
Integration testing:
Integration tests are designed to test integrated software components to
determine if they actually run as one program. Testing is event driven and is more
concerned with the basic outcome of screens or fields. Integration tests demonstrate
that although the components were individually satisfactory, as shown by
successful unit testing, the combination of components is correct and consistent.
Integration testing is specifically aimed at exposing the problems that arise from
the combination of components.
Functional test:
Functional tests provide systematic demonstrations that functions tested are
available as specified by the business and technical requirements, system
documentation, and user manuals.
Functional testing is centered on the following items:
Valid Input : identified classes of valid input must be accepted.
Invalid Input : identified classes of invalid input must be rejected.
Functions : identified functions must be exercised.
Output : identified classes of application outputs must be exercised.
Systems/Procedures : interfacing systems or procedures must be invoked.
Organization and preparation of functional tests is focused on requirements,
key functions, or special test cases. In addition, systematic coverage pertaining to
identified business process flows, data fields, predefined processes, and successive
processes must be considered for testing. Before functional testing is complete,
additional tests are identified and the effective value of current tests is determined.
System Test:
System testing ensures that the entire integrated software system meets
requirements. It tests a configuration to ensure known and predictable results. An
example of system testing is the configuration oriented system integration test.
System testing is based on process descriptions and flows, emphasizing pre-driven
process links and integration points.
White Box Testing:
White Box Testing is testing in which the software tester has knowledge of the
inner workings, structure, and language of the software, or at least its purpose.
It is used to test areas that cannot be reached from a black-box level.
Black Box Testing:
Black Box Testing is testing the software without any knowledge of the inner
workings, structure or language of the module being tested. Black box tests, as
most other kinds of tests, must be written from a definitive source document, such
as a specification or requirements document. It is testing in which the software
under test is treated as a black box: you cannot “see” into it. The test provides
inputs and responds to outputs without considering how the software works.
8.1 Unit Testing:
Unit testing is usually conducted as part of a combined code and unit test
phase of the software lifecycle, although it is not uncommon for coding and unit
testing to be conducted as two distinct phases.
Test strategy and approach:
Field testing will be performed manually and functional tests will be written
in detail.
Test objectives:
All field entries must work properly.
Pages must be activated from the identified link.
The entry screen, messages and responses must not be delayed.
Features to be tested
Verify that the entries are of the correct format
No duplicate entries should be allowed
All links should take the user to the correct page.
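As a sketch of the objectives above, a unit test for a hypothetical `validate_entry` helper might check entry format and reject duplicates. The helper and its rules are illustrative, not the project's actual code:

```python
import unittest

def validate_entry(entry, existing):
    """Accept an entry with a non-empty title that is not already present."""
    title = entry.get("title", "").strip()
    if not title:
        return False            # wrong format: empty or blank title
    if title in existing:
        return False            # duplicate entries are not allowed
    return True

class TestValidateEntry(unittest.TestCase):
    def test_correct_format_accepted(self):
        self.assertTrue(validate_entry({"title": "Article A"}, set()))

    def test_empty_title_rejected(self):
        self.assertFalse(validate_entry({"title": "   "}, set()))

    def test_duplicate_rejected(self):
        self.assertFalse(validate_entry({"title": "Article A"}, {"Article A"}))

if __name__ == "__main__":
    # exit=False keeps the interpreter alive after the test run.
    unittest.main(exit=False, argv=["validate_entry_test"])
```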
8.2 Integration Testing
Software integration testing is the incremental integration testing of two or
more integrated software components on a single platform to produce failures
caused by interface defects.
The task of the integration test is to check that components or software
applications, e.g. components in a software system or – one step up – software
applications at the company level – interact without error.
Test Results: All the test cases mentioned above passed successfully. No defects
encountered.
10. SCREENSHOTS
11. FUTURE WORK
Future planned research efforts involve combining attribution feature extraction with other factors that
emerge from the research to produce tools that not only identify potential false content, but also influence-based
content designed to compel a reader or target audience to make inaccurate or altered decisions.
12. CONCLUSION
This paper presented the results of a study that produced a limited fake news detection system.
The work presented herein is novel in this topic domain in that it demonstrates the results of a
full-spectrum research project that started with qualitative observations and resulted in a working
quantitative model. The work presented in this paper is also promising, because it demonstrates a
relatively effective level of machine learning classification for large fake news documents with
only one extraction feature. Finally, additional research and work to identify and build additional
fake news classification grammars is ongoing and should yield a more refined classification
scheme for both fake news and direct quotes. Future planned research efforts involve combining
attribution feature extraction with other factors that emerge from the research to produce tools
that not only identify potential false content, but also influence-based content designed to compel a
reader or target audience to make inaccurate or altered decisions.
13. BIBLIOGRAPHY
[1] M. Balmas, “When Fake News Becomes Real: Combined Exposure to Multiple
News Sources and Political Attitudes of Inefficacy, Alienation, and Cynicism,”
Communic. Res., vol. 41, no. 3, pp. 430–454, 2014.
[2] C. Silverman and J. Singer-Vine, “Most Americans Who See Fake News
Believe It, New Survey Says,” BuzzFeed News, 06-Dec-2016.
[3] P. R. Brewer, D. G. Young, and M. Morreale, “The Impact of Real News about
‘“Fake News”’: Intertextual Processes and Political Satire,” Int. J. Public Opin.
Res., vol. 25, no. 3, 2013.
[4] D. Berkowitz and D. A. Schwartz, “Miley, CNN and The Onion,” Journal.
Pract., vol. 10, no. 1, pp. 1–17, Jan. 2016.
[5] C. Kang, “Fake News Onslaught Targets Pizzeria as Nest of Child-
Trafficking,” New York Times, 21-Nov-2016.
[6] C. Kang and A. Goldman, “In Washington Pizzeria Attack, Fake News
Brought Real Guns,” New York Times, 05-Dec-2016.
[7] R. Marchi, “With Facebook, Blogs, and Fake News, Teens Reject Journalistic
"Objectivity",” J. Commun. Inq., vol. 36, no. 3, pp. 246–262, 2012.
[8] C. Domonoske, “Students Have ‘Dismaying’ Inability to Tell Fake News From
Real, Study Finds,” Natl. Public Radio: The Two-Way, 2016.
[9] H. Allcott and M. Gentzkow, “Social Media and Fake News in the 2016
Election,” J. Econ. Perspect., vol. 31, no. 2, 2017.
[10] C. Shao, G. L. Ciampaglia, O. Varol, A. Flammini, and F. Menczer, “The
spread of fake news by social bots.”
[11] A. Gupta, H. Lamba, P. Kumaraguru, and A. Joshi, “Faking Sandy:
Characterizing and Identifying Fake Images on Twitter during Hurricane Sandy,”
in WWW 2013 Companion, 2013.
[12] E. Mustafaraj and P. T. Metaxas, “The Fake News Spreading Plague: Was it
Preventable?”
[13] M. Farajtabar et al., “Fake News Mitigation via Point Process Based
Intervention.”
[14] M. Haigh, T. Haigh, and N. I. Kozak, “Stopping Fake News,” Journal. Stud.,
vol. 19, no. 14, pp. 2062–2087, Oct. 2018.
[15] O. Batchelor, “Getting out the truth: the role of libraries in the fight against
fake news,” Ref. Serv. Rev., vol. 45, no. 2, pp. 143–148, Jun. 2017.
[16] B. D. Horne and S. Adalı, “This Just In: Fake News Packs a Lot in Title, Uses
Simpler, Repetitive Content in Text Body, More Similar to Satire than Real News,”
in NECO Workshop, 2017.
[17] V. L. Rubin, N. J. Conroy, Y. Chen, and S. Cornwell, “Fake News or Truth?
Using Satirical Cues to Detect Potentially Misleading News,” in Proceedings of
NAACL-HLT 2016, 2016, pp. 7–17.
[18] S. Volkova, K. Shaffer, J. Y. Jang, and N. Hodas, “Separating Facts from
Fiction: Linguistic Models to Classify Suspicious and Trusted News Posts on
Twitter,” in Proceedings of the 55th Annual Meeting of the Association for
Computational Linguistics, 2017, pp. 647–653.
[19] N. J. Conroy, V. L. Rubin, and Y. Chen, “Automatic Deception Detection:
Methods for Finding Fake News,” in Proceedings of ASIST, 2015.
[20] Y. Chen, N. J. Conroy, and V. L. Rubin, “News in an Online World: The Need
for an "Automatic Crap Detector",” in Proceedings of ASIST 2015, 2015.
[21] J. Kim, B. Tabibian, A. Oh, B. Schölkopf, and M. Gomez-Rodriguez,
“Leveraging the Crowd to Detect and Reduce the Spread of Fake News and
Misinformation.”
[22] B. Riedel, I. Augenstein, G. P. Spithourakis, and S. Riedel, “A simple but
tough-to-beat baseline for the Fake News Challenge stance detection task.”
[23] H. Rashkin, E. Choi, J. Y. Jang, S. Volkova, Y. Choi, and P. G. Allen, “Truth
of Varying Shades: Analyzing Language in Fake News and Political Fact-
Checking,” in Proceedings of the 2017 Conference on Empirical Methods in
Natural Language Processing, 2017, pp. 2931– 2937.
[24] Z. Jin, J. Cao, Y.-G. Jiang, and Y. Zhang, “News Credibility Evaluation on
Microblog with a Hierarchical Propagation Model,” in Proceedings of the IEEE
International Conference on Data Mining, 2014.
[25] K. Shu, A. Sliva, S. Wang, J. Tang, and H. Liu, “Fake News Detection on
Social Media: A Data Mining Perspective.”
[26] D. Saez-Trumper, “Fake Tweet Buster: A Webtool to Identify Users
Promoting Fake News on Twitter,” in Proceedings of HT’14, 2014.
[27] S. Pareti, T. O’keefe, I. Konstas, J. R. Curran, and I. Koprinska,
“Automatically Detecting and Attributing Indirect Quotations,” in Proceedings of
the 2013 Conference on Empirical Methods in Natural Language Processing, 2013,
pp. 18–21.
[28] T. O’keefe, S. Pareti, J. R. Curran, I. Koprinska, and M. Honnibal, “A
Sequence Labelling Approach to Quote Attribution,” in Proceedings of the 2012
Joint Conference on Empirical Methods in Natural Language Processing and
Computational Natural Language Learning, 2012, pp. 12– 14.
[29] G. Muzny, M. Fang, A. X. Chang, and D. Jurafsky, “A Two-stage Sieve
Approach for Quote Attribution,” in Proceedings of the 15th Conference of the
European Chapter of the Association for Computational Linguistics: Volume 1,
Long Papers, 2017, vol. 1, pp. 460–470.
[30] B. G. Glaser and A. L. Strauss, The discovery of grounded theory: strategies
for qualitative research. New Brunswick: Aldine, 1967.