Neural Processing Letters
https://doi.org/10.1007/s11063-020-10365-x
Integrating Machine Learning Techniques in Semantic Fake
News Detection
Adrian M. P. Bra¸ soveanu
1,3
· R˘ azvan Andonie
2,3
Accepted: 3 October 2020
© Springer Science+Business Media, LLC, part of Springer Nature 2020
Abstract
The nuances of languages, as well as the varying degrees of truth observed in news items,
make fake news detection a difficult problem to solve. A news item is never launched without
a purpose, therefore in order to understand its motivation it is best to analyze the relations
between the speaker and its subject, as well as different credibility metrics. Inferring details
about the various actors involved in a news item is a problem that requires a hybrid approach
that mixes machine learning, semantics and natural language processing. This article dis-
cusses a semantic fake news detection method built around relational features like sentiment,
entities or facts extracted directly from text. Our experiments are focused on short texts
with different degrees of truth and show that adding semantic features improves accuracy
significantly.
Keywords NLP · Semantics · Relation extraction · Deep learning
1 Introduction
Detecting fake news is an interdisciplinary problem, as it requires us to examine which
methods where used to disseminate the news (e.g., social networks [53]), the links between
the various actors involved (e.g., by using the information available in public Knowledge
Graphs like Wikipedia), the propaganda tools (e.g., language can often be examined through
the lens of semantics [6]) or even the geopolitics (e.g., as proven by the Camdridge Analytica
scandal, some news might be targeted to some specific groups who might be more likely to
respond to it). At a superficial level it is important to distinguish between satire and political
weapons (or any other kind of weapons built on top of deceptive news) [8] or between the
various news outlets that spread it, but when analyzing a news item it often helps to deploy a
B Adrian M. P. Bra¸ soveanu
adrian.brasoveanu@modul.ac.at
R˘ azvan Andonie
andonie@cwu.edu
1
MODUL Technology GmbH, Vienna, Austria
2
Computer Science Department, Central Washington University, Ellensburg, WA, USA
3
Electronics and Computers Department, Transilvania University of Bra¸ sov, Bra¸ sov, Romania
123