Optimizing Fake News Classification Using Data Fusion and NLP-Based Machine Learning Techniques
- Title
- Optimizing Fake News Classification Using Data Fusion and NLP-Based Machine Learning Techniques
- Creator
- Khan, Zahir Abbas; Rekha, V.
- Description
- In this research, the performance of different machine learning algorithms for identifying fake news using a dataset of news articles labeled as fake or real. The dataset was preprocessed to remove stop words, punctuation, digits, and special characters, and text normalization was applied. Two feature extraction methods, BOW (Bag-of-Words) and TF-IDF, were utilized to convert text data into numerical features. The dataset was split into training and testing phases to train and evaluate models, including Support Vector Classifier, Logistic Regression, Decision Trees, Gradient Boosting Classifier, Random Forest, and Multinomial Naive Bayes. Ensemble models combining various classifiers were also tested. Performance metrics, including precision, recall, and F1-score, were assessed, and confusion matrices were analyzed. Results showed that TF-IDF generally outperformed BOW. The Random Forest model achieved the highest precision (93%) but had a lower recall (83%). The SVC model showed a balanced performance with a precision of 90%, recall of 87%, and an F1-score of 86%. Ensemble models like GB?+?RF exhibited high precision (99%) but lower recall. These findings highlight the strengths of different algorithms in fake news detection and inform the development of practical classification tools. The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2025.
- Source
- Lecture Notes in Networks and Systems;Volume;1355 LNNS;pp.315-326
- Date
- 01-01-2025
- Publisher
- Springer Science and Business Media Deutschland GmbH
- Subject
- Count vectorizer; Fake news detection; Machine learning algorithms; Natural language processing (NLP); Random forest classifier; Support vector classifier
- Coverage
- Khan Z.A., Department of Computer Science and Engineering, School of Engineering and Technology, Christ (Deemed to Be University), Bangalore, India; Rekha V., Department of Computer Science and Engineering, School of Engineering and Technology, Christ (Deemed to Be University), Bangalore, India
- Rights
- Restricted Access; Hardcopy may be available in the library
- Relation
- ISSN: 23673370; ISBN: 978-981964882-5;
- Format
- online
- Language
- English
- Type
- Conference paper
Collection
Citation
Khan, Zahir Abbas; Rekha, V., “Optimizing Fake News Classification Using Data Fusion and NLP-Based Machine Learning Techniques,” CHRIST (Deemed To Be University) Institutional Repository, accessed June 19, 2026, https://archives.christuniversity.in/items/show/25550.
