Browse Items (9795 total)
Sort by:
-
Machine Learning Based Spam E-Mail Detection Using Logistic Regression Algorithm
The rise of spam mail, or junk mail, has emerged as a significant nuisance in the modern digital landscape. This surge not only inundates user's email inboxes but also exposes them to security threats, including malicious content and phishing attempts. To tackle this escalating problem, the proposed machine learning-based strategy that employs Logistic Regression for accurate spam mail prediction. This research is creating an effective and precise spam classification model that effectively discerns between legitimate and spam emails. To achieve this, we harness a meticulously labeled dataset of emails, each classified as either spam or non-spam. This model is to apply preprocessing techniques to extract pertinent features from the email content, encompassing word frequencies, email header data, and other pertinent textual attributes. The choice of Logistic Regression as the foundational classification algorithm is rooted in its simplicity, ease of interpretation, and appropriateness for binary classification tasks. To process train the model using the annotated dataset, refining its hyper parameters to optimize its performance. By incorporating feature engineering and dimensionality reduction methodologies, bolster the model's capacity to generalize effectively to unseen data. Our evaluation methodology encompasses rigorous experiments and comprehensive performance contrasts with other well-regarded machine learning algorithms tailored for spam classification. The assessment criteria encompass accuracy, precision, recall, and the F1 score, offering a holistic appraisal of the model's efficacy. Furthermore, we scrutinize the model's resilience against diverse forms of spam emails, in addition to its capacity to generalize to new data instances. This model is to findings conclusively demonstrated that our Logistic Regression-driven spam mail prediction model achieves a competitive performance standing when juxtaposed with cutting-edge methodologies. The model adeptly identifies and sieves out spam emails, thereby cultivating a more trustworthy and secure email environment for users. The interpretability of the model lends valuable insights into the pivotal features contributing to spam detection, thereby aiding in the identification of emerging spam patterns. 2023 IEEE. -
Machine Learning Based Time Series Analysis for COVID-19 Cases in India
The World Health Organization declared the Coronavirus Infection, or COVID-19, to be widespread. One of the most appropriate methodologies for COVID-19 is time series analysis. The most appropriate technique for COVID-19 is time series analysis. It can be applied to Recognizing Information Patterns and Predicting Insights. The paper summarises the components of time series using the COVID-19 dataset for India as an example of one of the most important methodologies in predictive analytics. Time series models are chosen because they can predict future outcomes, comprehend prior outcomes, provide strategy recommendations, and much more. These common goalrists of temporal arrangement modelling do not differ significantly from those of cross-sectional or board data modelling. Machine Learning may be a well-known fact that it is an excellent technique for imagining, discourse, and standard dialect management for a large clarified accessible dataset. The results for confirmed, recovered, and death cases are presented in this study. 2022 IEEE. -
Machine learning based Unique Perfume Flavour Creation Using Quantitative Structure-Activity Relationship (QSAR)
Artificial intelligence played a vital role in brings revolutionary changes in the field of perfumery. It is much evident with events including the success of Philyra, exhibitions showcasing the ideas of this concept. Machine learning made it user friendly and more comfortable for the users by means of suggestive interaction. Machine learning also benefited the perfumers in helping them to choose the best combinations and likely successful outcomes. With growing concern about a healthy lifestyle, the thoughts about having an artificial intelligence to predict the user friendliness could be a huge success. This definitely would require a huge database comprising a large detail about diseases and the causes and combinational results of the various chemicals used in perfumery. This system may not be a completely successful one but would be reliable to a better extent. It would gain a positive response from various governmental health departments and would be encouraged by the consumers. Also, another possible development would be Artificial intelligence that is able to predict how long a perfume can last. This would let the consumer choose the one that suits the need. Through this idea we could now get a clear idea about the progress that we have made till this day. Further we can also be driven into vague ideas about how the future of Artificial intelligence would likely grow into. Machine learning and deep learning is a major pillar of artificial intelligence with larger application. Coming to our domain of discussion, artificial intelligence changed the way that things were in the past centuries about fragrance. This article proposed Quantitative structure-activity relationship (QSAR) method is used to predict the best perfume flavour. The proposed system also reduces mean absolute error (MAE). The proposed QSAR is also reducing the chemical composition and increase the perfume quality. 2021 IEEE. -
Machine Learning Classifiers for Credit Risk Analysis
The modern world is a place of global commerce. Since globalization became popular, entrepreneurs of small and medium enterprises to large ones have looked up to banks, which have existed in various forms since antiquity, as their pillars of support. The risk of granting loans in various forms has significantly increased as a consequence of this, the businesses face financing difficulties. Credit Risk Analysis is a major aspect of approving the loan application that is done by analyzing different types of data. The goal is to minimize the risk of approving the loan for the Individuals or businesses who might not pay back on time. This research paper addresses this challenge by applying various machine learning classifiers to the German credit risk dataset. By evaluating and comparing the accuracy of these models to identify the most effective classifier for credit risk analysis. Furthermore, it proposes a contributory approach that combines the strengths of multiple classifiers to enhance the decision-making process for loan approvals. By leveraging ensemble learning techniques, such as the Voting Ensemble model, the aim is to improve the accuracy and reliability of credit risk analysis. Additionally, it explores tailored feature engineering techniques that focus on selecting and engineering informative features specific to credit risk analysis. 2024 Sudiksha et al., licensed to EAI. -
Machine Learning Enabled Financial Statements in Assessing a Business's Performance
Machine Learning Enabled Financial Statements (MLEFS) revolutionize corporate performance analysis. This study examines MLEFS's dramatic effects using data gathering, model creation, interpretability, deployment, and ethics. We found that MLEFS accurately predicts crucial financial measures, helping investors, lenders, and financial analysts make better judgments. The study emphasizes the importance of financial measures like Return on Assets (ROA) in supporting financial theories and models. The research also stresses interpretability and ethics, promoting responsible machine learning in finance. Future trends include enhanced interpretability, strong ethical frameworks, real-time analysis, big data integration, regulatory adaption, and industrial acceptance. This study opens the door to data-driven financial analysis and decision-making, improving strategic planning, risk reduction, and investor trust. 2024 IEEE. -
Machine Learning for Smart School Selection and Admissions
Choosing the best school for their kid is an important choice that parents must make, and it is sometimes stressful and unsure. Machine learning is a potential way to improve and streamline the admissions and school selection process in the current digital era. This study investigates the use of machine learning methods in the context of selective admissions and smart school selection. We propose a user-friendly, web-based tool in the early phases of our study that helps parents and guardians locate the ideal school for their kid by using machine learning algorithms. To provide individualized school recommendations, the platform gathers and analyses a range of data, such as extracurricular activity participation, academic achievement, regional preferences, and school reputation. This makes choosing a school easier and supports parents in making wise choices. This paper's second section explores the technical details of the machine learning techniques used, going into the nuances of feature selection, data preparation, and model assessment. We also draw attention to the difficulties and moral issues - such as maintaining impartiality and avoiding bias - that come with using machine learning to school selection. 2023 IEEE. -
Machine Learning in Cyber Threats Intelligent System
Cybercriminals disrupt services, exfiltrate sensitive data, and exploit victim machines and networks to perform malicious activities against organizations. A malicious adversary seeks to steal, destroy, or compromise business assets that have a specific financial, reputational, or intellectual value. As a result, organizations are complementing their perimeter defenses with threat intelligence platforms to address these security challenges and eliminate security blind spots for their systems. Any type of information useful for identifying, assessing, monitoring, and responding to cyber threats is considered cyber threat intelligence. Organizations can benefit from increased visibility into cyber threats and policy violations. An organizations threat intelligence allows them to prevent or mitigate various types of cyberattacks. The use of machine learning and artificial intelligence is a key component of cybersecurity conflict, which together allows attackers and defenders to function at new speeds and scales. In spear-phishing attacks, relatively frivolous machine learning algorithms have been used to overwhelming effect as adversarial artificial intelligence. This chapter discusses the various cyber threats, cyber security attack types, publicly available datasets for research work, and machine learning techniques in cyber-physical systems. 2024 selection and editorial matter, S. Vijayalakshmi, P. Durgadevi, Lija Jacob, Balamurugan Balusamy, and Parma Nand; individual chapters, the contributors. -
Machine Learning in Financial Distress: A Scoping Review
Predicting financial distress is crucial for stakeholders, policymakers, governments, and management in decision-making processes. Researchers have developed various prediction models encompassing both traditional and machine-learning approaches. Notably, recent attention has shifted towards employing machine learning models to address the limitations of traditional methods. This study seeks to offer insights into current trends, identify gaps, and suggest future research directions using machine learning models for financial distress prediction, employing the PRISMA Extension for Scoping Reviews methodology. To achieve this, a comprehensive search was conducted across three databasesScience Direct, EBSCO, and ProQuestspanning from 2020 to 2023, identifying 34 relevant articles for analysis. The findings underscore the prevalent use of Support Vector Machine in financial distress prediction, followed by the Random Forest Classifier and Artificial Neural Network, with little attention paid to other models. Furthermore, the study underscores the necessity for more research in developing countries, noting the predominance of studies from developed nations. While machine learning models hold promise for enhancing the accuracy and efficiency of financial distress prediction, additional research is imperative to evaluate their effectiveness and applicability across diverse contexts. This scoping review aims to furnish researchers, policymakers, and institutions with valuable insights and policy recommendations, shedding light on underexplored machine-learning techniques. 2024, Iquz Galaxy Publisher. All rights reserved. -
Machine learning in smart agriculture
Agriculture is the cultivation of the soil, the growth of crops and the raising of livestock. Agriculture is critical to the economic development of a country. Farming generates nearly 58% of a country's primary income. Previously, cultivators had accepted conventional farming practices. Because these methods were imprecise, they produced less and took longer time. Precise farming boosts productivity by precisely determining which steps must be completed at what time. Precision farming entails forecasting the weather, analyzing soil, recommending crops for cultivation and calculating the amount of fertilizer and pesticides that must be used. Precise farming uses advanced technologies such as IoT, data mining, data analytics, and machine learning (ML) to collect data, train systems and predict outcomes. Precision farming employs technology to reduce manual labor and boost productivity. Farmers have recently faced several difficulties, such as crop failure due to insufficient rainfall, soil infertility and so on. The proposed work in determining the soil, managing crops and harvesting efficiently can solve the problems caused by environmental changes. It guides a person's farming strategy to produce better results through a proper prediction process. The goal of this research is to assist an individual in efficiently cultivating crops, resulting in high productivity at a low cost. It also assists in estimating the total cost of cultivation and forecasting the likely economic barriers. This would help a person plan activities prior to cultivation, resulting in an integrated farming solution. 2023 River Publishers. All rights reserved. -
Machine learning insights into mental health risk factors associated with climate change: Impact on schoolchildren's cognitive abilities
In this chapter, we use machine learning techniques to investigate how the effects of climate change and certain risk factors for mental health affect students' cognitive skills in the classroom. The mental health of at-risk populations, especially students, must be considered in light of the fact that the world's environment is changing significantly. Using state-of-the-art machine learning algorithms, we analyze large datasets that include environmental variables, socio-economic characteristics, and markers of mental health among school-aged persons. We are primarily interested in identifying key relationships and trends that might help us understand the complex relationship between climate change and cognitive health in this population. In order to uncover complex insights, the chapter takes a holistic approach by combining feature selection, model training, and interpretability analysis. The cognitive capacities of school-aged children may be significantly impacted by some climate- related stresses, according to preliminary results. The findings add to our knowledge of the interconnected webs of environmental shifts, psychological susceptibilities, and cognitive consequences. Educators, legislators, and healthcare providers can benefit from this study's use of machine learning insights into the possible effects of climate change on students' mental health. It also paves the way for the creation of tailored treatments and adaptive techniques to deal with the highlighted dangers, fostering resilience and prosperity in the face of a changing environment. 2024, IGI Global. All rights reserved. -
Machine Learning Insights into Mobile Phone Usage and Its Effects on Student Health and Academic Achievement
The research intends to find how students' health and academic performance are affected by their smartphone use. Considering how widely smartphones are used among students, it is important to know how they could affect health and learning results. This study aims to create prediction models that can spot trends and links between smartphone usage, health ratings, and academic achievement, thereby offering insightful information for teachers and legislators to encourage better and more efficient use among their charges. Data on students' mobile phone use, health evaluations, and academic achievement were gathered for the study. Preprocessing of the dataset helped to translate categorical variables into numerical forms and manage missing values. Trained and assessed were many machine learning models: Random Forest, SVM, Decision Tree, Gradient Boosting, Logistic Regression, AdaBoost, and K-Nearest Neighbors (KNN). The models' performance was evaluated in line with their accuracy in influencing performance effects and health ratings. Predictive accuracy was improved by use of feature engineering and model optimization methods. With 63.33% of accuracy for estimating health ratings, the SVM model was most successful in capturing the link between smartphone usage and health results. With an accuracy of 50%, logistic regression performed very well in forecasting performance effect, therefore stressing important linear connections between consumption habits and academic success. Random Forest and Decision Tree models were less successful for performance impact even if they showed strong performance in health forecasts. These results highlight the need of customized treatments to reduce the detrimental consequences of too high mobile phone use on students' academic performance and health. 2024 IEEE. -
Machine Learning Insights into Predicting Crude Oil Prices
This research paper delves into the complexities of crude oil, highlighting its extraction, composition, and transformation into valuable derivatives. Examining the pricing dynamics, it explores the intricate interplay of social and economic factors that shape crude oil's value, emphasizing its critical role in global energy and industrial sectors. A forecasting model is introduced, focusing on key factors - heating oil, SPX, GPNY, and EU DOL EX - utilizing five machine learning models. Historical data reveals the efficacy of conventional models, particularly Random Forest, in predicting crude oil prices, enhanced by feature engineering techniques. The paper concludes by suggesting avenues for further exploration, offering valuable insights for readers in this dynamic research domain. 2024 IEEE. -
Machine Learning Integration for Enhanced Solar Power Generation Forecasting
This paper reviews the advancements in machine learning techniques for enhanced solar power generation forecasting. Solar energy, a potent alternative to traditional energy sources, is inherently intermittent due to its weather-dependent nature. Accurate forecasting of photovoltaic power generation (PVPG) is paramount for the stability and reliability of power systems. The review delves into a deep learning framework that leverages the long short-term memory (LSTM) network for precise PVPG forecasting. A novel approach, the physics-constrained LSTM (PCLSTM), is introduced, addressing the limitations of conventional machine learning algorithms that rely heavily on vast data. The PC-LSTM model showcases superior forecasting capabilities, especially with sparse data, outperforming standard LSTM and other traditional methods. Furthermore, the paper examines a comprehensive study from Morocco, comparing six machine learning algorithms for solar energy production forecasting. The study underscores the Artificial Neural Network (ANN) as the most effective predictive model, offering optimal parameters for real-world applications. Such advancements not only bolster the accuracy of solar energy forecasting but also pave the way for sustainable energy solutions, emphasizing the integration of these findings in practical applications like predictive maintenance of PV power plants. The Authors, published by EDP Sciences, 2024. -
Machine Learning Methods for Online Education Case
Online education has become a popular choice for learners of all ages and backgrounds due to its accessibility and flexibility. However, providing personalized learning experiences for a diverse range of students in online education can be challenging. Machine learning methods can be used to provide personalized learning experiences and improve student engagement in online education. In this case study, We're going to do some research on machine learning. methods in an online education platform. The platform provides courses in various subjects and is designed to be accessible to students from all over the world. The platform collects data on student behavior, such as the courses they enroll in, the time they spend on each course, and their performance on assignments and quizzes. We will explore several machine learning methods that can be applied to this data, including clustering, classification, and recommendation systems. Clustering algorithms can be used to group students based on their learning behavior and preferences, allowing instructors to provide personalized feedback and course recommendations. Classification algorithms can be used to predict student success in a particular course, allowing instructors to intervene and provide additional support if needed. Recommendation systems can be used to suggest courses to students based on their interests and past behavior. We will also discuss the potential benefits and challenges of using machine learning methods in online education. Benefits include increased student engagement, improved learning outcomes, and more efficient use of resources. Challenges include ensuring data privacy and security, preventing algorithmic bias, and maintaining transparency and fairness in the decision-making process. Overall, machine learning methods have the potential to transform online education by providing personalized learning experiences and improving student outcomes. By leveraging the vast amounts of data generated by online education platforms, we can create more effective and efficient learning experiences that meet the needs of students from diverse backgrounds and learning styles. 2023 IEEE. -
Machine Learning Methods leveraging ADFA-LD Dataset for Anomaly Detection in Linux Host Systems
Advancement in network technology and revolution in the global internet transformed the overall Information Technology (IT) infrastructure and its usage. In the era of the Internet of Things (IoT) and the Internet of Everything (IoE), most everyday gadgets and electronic devices are IT-enabled and can be connected over the internet. With the advancements in IT technologies, operating systems also evolved to leverage these advancements. Today's operating systems are more user-friendly and feature-rich to support current IT requirements and provide sophisticated functionalities. On the one hand, these features enabled operating systems accomplish all current requirements, but on the other hand, these modern operating systems increased their attack surface considerably. Intrusion detection systems play a significant role in providing security against the broad spectrum of attacks on host systems. Intrusion detection systems based on anomaly detection have become a prominent research area among diverse areas of cyber security. The traditional approaches for anomaly detection are inadequate to discover the operating system level anomalies. The advancement and research in Machine Learning (ML) based anomaly detection open new opportunities to tackle this challenge. The dataset plays a significant role in ML-based system efficacy. The Australian Defence Force Academy Linux Dataset (ADFA-LD) comprises thousands of normal and attack processes system call traces for the Linux platform. It is the benchmark dataset used for dynamic approach-based anomaly detection. This paper provided a comprehensive and structured study of various research works based on the ADFA-LD for host-based anomaly detection and presented a comparative analysis. 2022 IEEE. -
Machine Learning Methods to Identify Aggressive Behavior in Social Media
With the more usage of Internet and online social media, platforms creep with lot of cybercrimes. Texts in the online platforms and chat rooms are aggressive. In few instances, people target and humiliate them with the text. It affects victim mental health. Therefore, there is a need of detecting the abuse words in the text. In this paper, a study of machine learning methods is done to identify the aggressive behavior. Accuracy can be improved by incorporating additional features. 2023, The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. -
Machine Learning Model Enabled with Data Optimisation for Prediction of Coronary Heart Disease
Cardiovascular disorders remain leading cause for mortality worldwide, necessitating robust early risk assessment. Although machine learning models show promise, most rely on conventional preprocessing, which lacks model portability across datasets. We propose an integrated preprocessing pipeline enhancing model generalizability. Our methodology standardises features solely based on training statistics and then transforms test data identically to prevent leakage. We handle class imbalance through synchronised oversampling, enabling consistent performance despite distribution shifts. This framework was evaluated on an open-source dataset of clinical parameters from an African cohort using classifiers like support vector machines and gradient boosting. All models achieved upto 80% accuracy. Remarkably, evaluating the identical models on five external European and Asian datasets maintains 80% - 86% accuracy. Our reproducible data conditioning strategy enables precise and transportable heart disease risk prediction, overcoming population variability. The framework provides the flexibility to readily retrain models on new data or update risk algorithms for clinical implementation in diverse locales. Our work accelerates the safe translation of machine learning to guide cardiovascular screening worldwide. 2024 IEEE. -
Machine Learning Model for Depression Prediction during COVID-19 Pandemic
Depression is an unfamous mental health disorder that has affected half the population worldwide. In December 2019, the break of the COVID-19 pandemic was first spotted in Wuhan, China, and later spread to 212 countries and territories worldwide, impacting half the population. It took a significant toll on their physical health and their mental health. Many among the population lost their loved ones, businesses, and being in quarantine for years, completely shifted to the online mode made everyone's life miserable. Many may be dealing with escalated levels of alcohol and drug use, sleeplessness, and an anxious state of mind. So, the need to address this and help the severely affected ones is significant. Self-quarantine also causes additional stress and challenges the mental health of citizens. This paper intends to identify the people who were mentally affected by the pandemic using machine learning techniques. A survey was conducted among college-going students and professionals. The paper used classification techniques such as Naive Bayes, KNN, Random Forest, Logistic Regression, k-fold cross-validation to get results. Support Vector Machine gave the maximum accuracy of 99.35%. 2022 IEEE. -
Machine Learning Model to Detect Chronic Leukemia in Microscopic Blood Smear Images
Chronic leukemia is a slow-progressing form of disease, If not diagnosed on time can progress and increase the risk of life-threatening complications. It is essential to develop a fully automated system to recognize and categorize type of leukemia for proper evaluation and treatment. This paper aims to provide a machine learning model to identify and classify chronic lymphocytic leukemia, chronic myeloid Leukemia and healthy cells. Digital microscopic blood smear images were automatically cropped into single nucleus and segmented using watershed algorithm. Grey level co-occurrence matrix (GLCM) and geometrical features were extracted from the segmented nucleus images and random forest algorithm is used to classify chronic leukemia and healthy cells. This prognosis aids pathologists and physicians in identifying leukemic patients early and selecting the most effective course of action. 2023 IEEE. -
Machine Learning Observation on the Prediction of Diabetes Mellitus Disease
Diabetes disease has become as one of the common syndromes in many of the age groups. Diabetes can result in high blood sugar levels, a heart attack, or heart disease. This is one of the fastest developing illnesses, and it requires regular care. After seeing the doctor and being diagnosed, the patient is typically compelled to obtain their reports. Because this procedure is time-consuming and costly, we have the option of using ML approaches to solve this problem. Our research aims to foster a framework prepared to do all the more precisely foreseeing a patient's diabetes risk level. To develop models, classification methods such as Logistic Regression, K-Nearest Neighbor, Support Vector Machine, and Random Forest Classifier are employed. The results indicate that the techniques are quite accurate. The result showed that the prediction with the Logistic Regression model acquired the highest accuracy. 2023 IEEE.
