Auditing YouTube’s Recommendation Algorithm for Misinformation
Filter Bubbles
IVAN SRBA, Kempelen Institute of Intelligent Technologies, Slovakia
ROBERT MORO, Kempelen Institute of Intelligent Technologies, Slovakia
MATUS TOMLEIN, Kempelen Institute of Intelligent Technologies, Slovakia
BRANISLAV PECHER
∗
, Faculty of Information Technology, Brno University of Technology, Czechia
JAKUB SIMKO, Kempelen Institute of Intelligent Technologies, Slovakia
ELENA STEFANCOVA, Kempelen Institute of Intelligent Technologies, Slovakia
MICHAL KOMPAN
2
, Kempelen Institute of Intelligent Technologies, Slovakia
ANDREA HRCKOVA, Kempelen Institute of Intelligent Technologies, Slovakia
JURAJ PODROUZEK, Kempelen Institute of Intelligent Technologies, Slovakia
ADRIAN GAVORNIK, Kempelen Institute of Intelligent Technologies, Slovakia
MARIA BIELIKOVA
3
, Kempelen Institute of Intelligent Technologies, Slovakia
In this paper, we present results of an auditing study performed over YouTube aimed at investigating how fast a user can get into
a misinformation ilter bubble, but also what it takes to “burst the bubblež, i.e., revert the bubble enclosure. We employ a sock
puppet audit methodology, in which pre-programmed agents (acting as YouTube users) delve into misinformation ilter bubbles
by watching misinformation promoting content. Then they try to burst the bubbles and reach more balanced recommendations
by watching misinformation debunking content. We record search results, home page results, and recommendations for the
watched videos. Overall, we recorded 17,405 unique videos, out of which we manually annotated 2,914 for the presence of
misinformation. The labeled data was used to train a machine learning model classifying videos into three classes (promoting,
debunking, neutral) with the accuracy of 0.82. We use the trained model to classify the remaining videos that would not be
feasible to annotate manually.
Using both the manually and automatically annotated data, we observe the misinformation bubble dynamics for a range
of audited topics. Our key inding is that even though ilter bubbles do not appear in some situations, when they do, it is
possible to burst them by watching misinformation debunking content (albeit it manifests diferently from topic to topic). We
∗
Also with Kempelen Institute of Intelligent Technologies.
2
Also with slovak.AI.
3
Also with slovak.AI.
Authors’ addresses: Ivan Srba, Kempelen Institute of Intelligent Technologies, Bratislava, Slovakia, ivan.srba@kinit.sk; Robert Moro, Kempelen
Institute of Intelligent Technologies, Bratislava, Slovakia, robert.moro@kinit.sk; Matus Tomlein, Kempelen Institute of Intelligent Technologies,
Bratislava, Slovakia, matus.tomlein@kinit.sk; Branislav Pecher, Faculty of Information Technology, Brno University of Technology, Brno,
Czechia, branislav.pecher@kinit.sk; Jakub Simko, Kempelen Institute of Intelligent Technologies, Bratislava, Slovakia, jakub.simko@kinit.sk;
Elena Stefancova, Kempelen Institute of Intelligent Technologies, Bratislava, Slovakia, elena.stefancova@kinit.sk; Michal Kompan, Kempelen
Institute of Intelligent Technologies, Bratislava, Slovakia, michal.kompan@kinit.sk; Andrea Hrckova, Kempelen Institute of Intelligent
Technologies, Bratislava, Slovakia, andrea.hrckova@kinit.sk; Juraj Podrouzek, Kempelen Institute of Intelligent Technologies, Bratislava,
Slovakia, juraj.podrouzek@kinit.sk; Adrian Gavornik, Kempelen Institute of Intelligent Technologies, Bratislava, Slovakia, adrian.gavornik@
intern.kinit.sk; Maria Bielikova, Kempelen Institute of Intelligent Technologies, Bratislava, Slovakia, maria.bielikova@kinit.sk.
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that
copies are not made or distributed for proit or commercial advantage and that copies bear this notice and the full citation on the irst page.
Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy
otherwise, or republish, to post on servers or to redistribute to lists, requires prior speciic permission and/or a fee. Request permissions from
permissions@acm.org.
© 2022 Copyright held by the owner/author(s). Publication rights licensed to ACM.
2770-6699/2022/10-ART $15.00
https://doi.org/10.1145/3568392
ACM Trans. Recomm. Syst.