Abstract
Detecting fake news is currently one of the critical challenges facing modern societies. The problem is particularly relevant, as disinformation is readily used for political warfare but can also cause significant harm to the health of citizens, such as by promoting false data on the harmfulness of selected therapies. One way to combat disinformation is to treat fake news detection as a machine learning task. This paper presents such an approach, which additionally addresses an important problem related to the non-stationarity characteristics of the fake news. We elaborated a stream data with the simulation of domain shift based on two popular benchmark datasets dedicated to the fake news classification problem (Kaggle Fake News and Constraint@AAAI2021–COVID19 Fake News Detection). The proposed learning system works in a Continual Learning (CL) framework and integrates a self domain shift adaptation in a machine learning scheme. The method was built following state-of-the-art techniques, that includes Word2Vec as a feature extractor and the LSTM model as a classifier. The performance of the approach has been evaluated over the generated data stream. The convenience of our approach is showed in the results, where the accuracy gain with respect to a CL approach without domain adaptation is observed to be significant.
Original language | English |
---|---|
Title of host publication | Proceedings of the 2023 IEEE 10th International Conference on Data Science and Advanced Analytics (DSAA) |
Number of pages | 10 |
Publisher | IEEE |
Publication date | 2023 |
ISBN (Print) | 979-8-3503-4504-9 |
ISBN (Electronic) | 979-8-3503-4503-2 |
DOIs | |
Publication status | Published - 2023 |
Event | 2023 IEEE 10th International Conference on Data Science and Advanced Analytics - Thessaloniki, Greece Duration: 9 Oct 2023 → 13 Oct 2023 |
Conference
Conference | 2023 IEEE 10th International Conference on Data Science and Advanced Analytics |
---|---|
Country/Territory | Greece |
City | Thessaloniki |
Period | 09/10/2023 → 13/10/2023 |
Keywords
- Continual Learning
- Domain Shift
- Fake News
- Disinformation
- Word2Vec