Machine Learning Based Spam E-Mail Detection Using Logistic Regression Algorithm
- Title
- Machine Learning Based Spam E-Mail Detection Using Logistic Regression Algorithm
- Creator
- Livia Shreenithi S.A.; Yougandar S.V.; Jayapandian N.
- Description
- The rise of spam mail, or junk mail, has emerged as a significant nuisance in the modern digital landscape. This surge not only inundates user's email inboxes but also exposes them to security threats, including malicious content and phishing attempts. To tackle this escalating problem, the proposed machine learning-based strategy that employs Logistic Regression for accurate spam mail prediction. This research is creating an effective and precise spam classification model that effectively discerns between legitimate and spam emails. To achieve this, we harness a meticulously labeled dataset of emails, each classified as either spam or non-spam. This model is to apply preprocessing techniques to extract pertinent features from the email content, encompassing word frequencies, email header data, and other pertinent textual attributes. The choice of Logistic Regression as the foundational classification algorithm is rooted in its simplicity, ease of interpretation, and appropriateness for binary classification tasks. To process train the model using the annotated dataset, refining its hyper parameters to optimize its performance. By incorporating feature engineering and dimensionality reduction methodologies, bolster the model's capacity to generalize effectively to unseen data. Our evaluation methodology encompasses rigorous experiments and comprehensive performance contrasts with other well-regarded machine learning algorithms tailored for spam classification. The assessment criteria encompass accuracy, precision, recall, and the F1 score, offering a holistic appraisal of the model's efficacy. Furthermore, we scrutinize the model's resilience against diverse forms of spam emails, in addition to its capacity to generalize to new data instances. This model is to findings conclusively demonstrated that our Logistic Regression-driven spam mail prediction model achieves a competitive performance standing when juxtaposed with cutting-edge methodologies. The model adeptly identifies and sieves out spam emails, thereby cultivating a more trustworthy and secure email environment for users. The interpretability of the model lends valuable insights into the pivotal features contributing to spam detection, thereby aiding in the identification of emerging spam patterns. 2023 IEEE.
- Source
- 3rd IEEE International Conference on ICT in Business Industry and Government, ICTBIG 2023
- Date
- 2023-01-01
- Publisher
- Institute of Electrical and Electronics Engineers Inc.
- Subject
- E-Mail; Logistic Regression; Machine Learning; Nae Bayes; Spam detection; Support Vector Machines
- Coverage
- Livia Shreenithi S.A., Christ (Deemed to Be University), Department of Cse, Bangalore, India; Yougandar S.V., Christ (Deemed to Be University), Department of Cse, Bangalore, India; Jayapandian N., Christ (Deemed to Be University), Department of Cse, Bangalore, India
- Rights
- Restricted Access
- Relation
- ISBN: 979-835034327-4
- Format
- Online
- Language
- English
- Type
- Conference paper
Collection
Citation
Livia Shreenithi S.A.; Yougandar S.V.; Jayapandian N., “Machine Learning Based Spam E-Mail Detection Using Logistic Regression Algorithm,” CHRIST (Deemed To Be University) Institutional Repository, accessed February 25, 2025, https://archives.christuniversity.in/items/show/19711.