Using graph solutions to identify "troll farms" and fake news propagation channels
Patryk Sulej, Krzysztof Hryniów
DOI: http://dx.doi.org/10.15439/2023F2738
Citation: Proceedings of the 18th Conference on Computer Science and Intelligence Systems, M. Ganzha, L. Maciaszek, M. Paprzycki, D. Ślęzak (eds). ACSIS, Vol. 35, pages 1161–1166 (2023)
Abstract. This paper addresses the issue of fake news detection, with a particular focus on solutions derived from graph theory. It covers identifying channels, which are sources of fake news, and identifying users spreading false information, considering users deliberately misleading their audience, forming clusters called 'troll farms'. It proposes a solution using graph theory, which includes classifying users based on the social context extracted in graph centrality measures built from user interactions or networks built from followers on the social network Twitter. The solution includes not only the identification of trolls but also potential unintentional users spreading false information, users exposed to false information, or automated scripts spreading information (bots). Thorough research on the efficiency of different features and classifiers is conducted on MIB and FakeNewsNet datasets. Conducted research confirms general conclusions from previous studies and offers some improvements.
References
- X. Zhou and R. Zafarani, “A survey of fake news: Fundamental theories, detection methods, and opportunities,” ACM Comput. Surv., 9 2020. [Online]. Available: https://dl.acm.org/doi/pdf/10.1145/3395046
- C. Silverman, “This analysis shows how viral fake election news stories outperformed real news on facebook. buz- zfeed news.” [Online]. Available: https://www.buzzfeednews.com/article/craigsilverman/viral-fake-election-news-outperformed-real-news-on-facebook
- H. F. Gylfason, A. H. Sveinsdottir, V. Vésteinsdóttir, and R. Sigurvinsdottir, “Haters gonna hate, trolls gonna troll: The personality profile of a facebook troll,” Int J Environ Res Public Health, 2022. [Online]. Available: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8199376/#B1-ijerph-18-05722
- M. Marek, “Russian information war: the activities of the russian propaganda apparatus in the context of the war in ukraine (as of the first half of march 2022),” Bezpieczeństwo teoria i praktyka, 2022. [Online]. Available: https://btip.ka.edu.pl/btip-2022-nr3/
- S. Hangloo and B. Arora, “Fake news detection tools and methods – a review,” International Journal of Advance and Innovative Research, 6 2021. [Online]. Available: https://arxiv.org/ftp/arxiv/papers/2112/2112.11185.pdf
- K. Shu, D. Mahudeswaran, D. L. Suhang Wang, and H. Liu, “Fakenewsnet: A data repository with news content, social context and spatialtemporal information for studying fake news on social media,” 2018. [Online]. Available: https://arxiv.org/abs/1809.01286
- B. D. Horne and S. Adali, “This just in: Fake news packs a lot in title, uses simpler, repetitive content in text body, more similar to satire than real news.” The 2nd International Workshop on News and Public Opinion at ICWSM, 2017. [Online]. Available: https://arxiv.org/abs/1703.09398
- Y. Dou, K. Shu, C. Xia, P. S. Yu, and L. Sun, “User preference-aware fake news detection,” in Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021. [Online]. Available: https://arxiv.org/abs/2104.12259
- F. Monti, F. Frasca, D. Eynard, D. Mannion, and M. M. Bronstein, “Fake news detection on social media using geometric deep learning,” vol. abs/1902.06673, 2019. [Online]. Available: https://www.sciencedirect.com/science/article/pii/S1877050918318210
- A. Mehrotra, M. Sarreddy, and S. Singh, “Detection of fake twitter followers using graph centrality measures,” in 2016 2nd International Conference on Contemporary Computing and Informatics (IC3I), 2016. http://dx.doi.org/10.1109/IC3I.2016.7918016 pp. 499–504.
- Y. Zhao and J. Weber, “Detecting fake users on social media with a graph database,” vol. 12, 10 2021. [Online]. Available: https://doi.org/10.18357/tar121202120027
- A. Badawy, E. Ferrara, and K. Lerman, “Analyzing the digital traces of political manipulation: The 2016 russian interference twitter campaign,” in 2018 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM). IEEE, 8 2018. [Online]. Available: https://doi.org/10.1109\%2Fasonam.2018.8508646
- W. Lyon, “The story behind russian twitter trolls: How they got away with looking human – and how to catch them in the future,” 3 2018. [Online]. Available: https://neo4j.com/blog/story-behind-russian-twitter-trolls
- S. Cresci, R. Di Pietro, M. Petrocchi, A. Spognardi, and M. Tesconi, “Fame for sale: Efficient detection of fake twitter followers,” Decision Support Systems, vol. 80, pp. 56–71, 2015. [Online]. Available: https://www.sciencedirect.com/science/article/pii/S0167923615001803
- S. Cresci, R. Di Pietro, M. Petrocchi, A. Spognardi, and M. Tesconi, “The paradigm-shift of social spambots: Evidence, theories, and tools for the arms race.” Republic and Canton of Geneva, CHE: International World Wide Web Conferences Steering Committee, 2017, p. 963–972. [Online]. Available: https://doi.org/10.1145/3041021.3055135
- B. C. Ross, “Mutual information between discrete and continuous data sets,” PLOS ONE, vol. 9, no. 2, pp. 1–5, 02 2014. http://dx.doi.org/10.1371/journal.pone.0087357. [Online]. Available: https://doi.org/10.1371/journal.pone.0087357
- scikit learn.org, “Api reference.” [Online]. Available: https://scikit-learn.org/stable/modules/classes.html#module-sklearn.feature_selection
- B. Schölkopf, A. J. Smola, R. C. Williamson, and P. L. Bartlett, “New support vector algorithms,” Neural Comput., vol. 12, no. 5, p. 1207–1245, 5 2000. [Online]. Available: https://doi.org/10.1162/089976600300015565
- I. J. Good, “Rational decisions,” Journal of the Royal Statistical Society. Series B (Methodological), vol. 14, no. 1, pp. 107–114, 1952. [Online]. Available: http://www.jstor.org/stable/2984087
- M. Ojala and G. C. Garriga, “Permutation tests for studying classifier performance.” Journal of machine learning research, vol. 11, no. 6, 2010. [Online]. Available: https://www.jmlr.org/papers/volume11/ojala10a/ojala10a.pdf
- P. Refaeilzadeh, L. Tang, and H. Liu, “On comparison of feature selection algorithms,” in Proceedings of AAAI workshop on evaluation methods for machine learning II, vol. 3, no. 4. AAAI Press Vancouver, 2007, p. 5. [Online]. Available: https://www.aaai.org/Library/Workshops/2007/ws07-05-007.php