Browse Items (11855 total)
Sort by:
-
Exploring Shopping Opportunities and Elevating Customer Experiences Through AI-Powered E-Commerce Strategies
This research explores the efficacy of clustering algorithms in enhancing customer experiences within the e-commerce landscape. Through experiment trials utilizing K-means and DBSCAN clustering techniques, valuable insights have been gleaned. The trials yielded silhouette scores ranging from 0.55 to 0.72, indicating moderate to good clustering quality across different experiments. In K-means clustering, the number of clusters varied from 3 to 6, with inertia values spanning approximately 722.41456.8. Conversely, DBSCAN clustering resulted in varying cluster numbers, ranging from 2 to 4, contingent on the combinations of epsilon and min_samples values explored. These findings underscore the significance of judiciously selecting clustering algorithms and parameter settings to achieve meaningful segmentation of e-commerce data. Effective utilization of clustering algorithms empowers businesses to discern valuable insights into customer behavior, preferences, and patterns. Consequently, businesses can tailor their strategies to deliver personalized experiences, targeted marketing campaigns, and optimized product recommendations. This research propels the exploration of additional clustering techniques and parameter refinements for enhanced clustering performance in e-commerce applications. The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024. -
Utilizing Deep Learning Techniques for Lung Cancer Detection
Deep learning can extract meaningful insights from complex biomedical statistics, which includes Radiographs and virtual tomosynthesis. Traits in contemporary deep studying architectures have enabled faster and more correct mastering of the functions gifted in clinical imagery, main to better accuracy and precision in medical analysis and imaging. Deep studying strategies may be used to pick out patterns within the pics which may be indicative of illnesses like lung cancer. Those ailment patterns, which include small lung nodules, can be used for early detection and prognosis of the sickness. Recent studies have employed deep learning strategies consisting of Convolutional Neural Networks (CNNs) and switch learning to come across most lung cancers in CT pictures. The first step in this manner is to generate datasets of pictures of the lungs, each from wholesome people and those with most lung cancers. Those datasets can then be used to teach a deep knowledge of a set of rules that may be optimized to it should locate those styles. Once educated, the version can be used to come across styles indicative of lung most cancers from new take a look at images with high accuracy. For further accuracy and reliability, extra up-processing techniques, along with segmentation and records augmentation, may be used. Segmentation can be used to detect a couple of lung nodules in a photo, and records augmentation can be used to lessen fake high quality outcomes. 2024 IEEE. -
AdvanDNN: Deep Neural Network Analysis of Neuroimaging for Identifying Vulnerable Brain Regions in Autism Spectrum Disorder
Exploring the neurological framework of autism spectrum disorder (ASD) presents a significant challenge due to its diverse manifestations and cognitive impacts. This study introduces an innovative deep learning approach, employing an advanced deep neural network (AdvanDNN) model to identify and analyze brain regions vulnerable to ASD. Utilizing the AAL116 brain atlas for anatomical standardization, our model processes a comprehensive set of neuroimaging data, including structural and functional MRI scans, to discern distinct neural patterns associated with ASD. The AdvanDNN model, with its robust deep learning architecture, was meticulously trained and validated, demonstrating a notable accuracy of 91.17% in distinguishing between ASD-affected individuals and controls. This marks an improvement over the state of the art, contributing a significant advance to the diagnostic processes. Notably, the model identified a pronounced anticorrelation in brain function between anterior and posterior regions, corroborating existing empirical evidence of disrupted connectivity within ASD neurology. The analysis further pinpointed critical regions, such as the prefrontal cortex, amygdala, and temporal lobes, that exhibit significant deviations from typical developmental patterns. These findings illustrate the potential of deep learning in enhancing early detection and providing pathways for intervention. The application of the AdvanDNN model offers a promising direction for personalized treatment strategies and underscores the value of precision medicine in addressing neurodevelopmental disorders. The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024. -
Machine Learning Insights into Mobile Phone Usage and Its Effects on Student Health and Academic Achievement
The research intends to find how students' health and academic performance are affected by their smartphone use. Considering how widely smartphones are used among students, it is important to know how they could affect health and learning results. This study aims to create prediction models that can spot trends and links between smartphone usage, health ratings, and academic achievement, thereby offering insightful information for teachers and legislators to encourage better and more efficient use among their charges. Data on students' mobile phone use, health evaluations, and academic achievement were gathered for the study. Preprocessing of the dataset helped to translate categorical variables into numerical forms and manage missing values. Trained and assessed were many machine learning models: Random Forest, SVM, Decision Tree, Gradient Boosting, Logistic Regression, AdaBoost, and K-Nearest Neighbors (KNN). The models' performance was evaluated in line with their accuracy in influencing performance effects and health ratings. Predictive accuracy was improved by use of feature engineering and model optimization methods. With 63.33% of accuracy for estimating health ratings, the SVM model was most successful in capturing the link between smartphone usage and health results. With an accuracy of 50%, logistic regression performed very well in forecasting performance effect, therefore stressing important linear connections between consumption habits and academic success. Random Forest and Decision Tree models were less successful for performance impact even if they showed strong performance in health forecasts. These results highlight the need of customized treatments to reduce the detrimental consequences of too high mobile phone use on students' academic performance and health. 2024 IEEE. -
Sentiment Analysis on Live Webscraped YouTube Comments Using VADER Sentiment Analyzer
After the covid disease came in the beginning of 2020s, the amount of people using social medias has increased dramatically. So as an effect of that, the viewers and engagement in one of the worlds largest platform by google called YouTube also increased. So many new content creators also born during these times. So this project is getting the sentiment from the audience or user to the content creators by which they can improve their content quality. This research holds promise in harnessing the power of sentiment analysis to enhance the overall YouTube experience and inform content creators and platform administrators in their decision-making processes. Understanding these trends is vital for content creators, as it can offer invaluable insights into viewer engagement and preferences. By gaining a deeper understanding of how viewers react to content, creators can refine their strategies, tailor their content to their audience, and enhance the overall quality of videos. By incorporating sentiment information into recommendations, the platform can suggest videos that resonate more effectively with users, thereby increasing engagement and satisfaction. The identification of negative sentiment and harmful comments enables YouTubes content moderation systems to proactively address issues such as hate speech, harassment, and toxicity. This, in turn, contributes to a safer and more welcoming space for users to share their thoughts and opinions. The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024. -
Impact of Variable Viscosity and Gravity Variations on Rayleigh-Bard Instabilities of Viscoelastic Liquids in Energy Sustainable System
Energy sustainability systems are vital for transitioning to a low-carbon economy, addressing climate change, and ensuring a sustainable future for all. Rayleigh-Bard convection (RBC) in viscoelastic liquids is a crucial phenomenon in various industrial and environmental applications, including energy sustainability systems where fluid dynamics play a pivotal role in optimizing heat transfer and system efficiency. The study deals with the combined influence of variable viscosity and variable gravity on RBC in viscoelastic liquids. The influence of space-dependent gravity on the onset of convection is considered. The results are analyzed against the background of constant gravity RBC in viscoelastic/Newtonian liquids with constant/variable viscosity. The possibility of variable gravity accelerating/decelerating the onset of convective instability is examined in this paper. 2024 IEEE. -
Decoding Big Data: The Essential Elements Shaping Business Intelligence
In today's Business Intelligence (BI) world, Big Data Analytics integration has become critical, transforming company strategy and decision-making processes. This study investigates the complex influence of Big Data on business intelligence, focusing on important drivers of this transition. It investigates how Big Data's improved data processing capabilities, integration of advanced analytics techniques such as machine learning, and real-time data insights enable businesses to make more informed decisions and achieve a competitive advantage. Furthermore, the paper emphasizes the importance of personalized consumer insights, operational savings, and strategic benefits obtained from predictive analytics when adopting Big Data for BI. 2024 IEEE. -
Harnessing Machine Learning for Mental Health: A Study on Classifying Depression-Related Social Media Posts
This study is of particular relevance in the way it identifies depression-related content on social media using a machine learning model to classify posts and comments. This dataset, encompassing around 6500 entries from various platforms including Facebook, was rigorously annotated by four proficient English-speaking undergraduate students together with the final label which is established via majority voting. Data Preprocessing, initial cleaning, normalization and TF-IDF feature creation through vectorization for the output of POS tags. The different machine learning models that were trained and tested are Logistic Regression, Random Forest, SVM (Support Vector Machine), Naive Bayes Gradient Boosting Algorithm K-NN (K nearest Neighbors) AdaBoost Decision Tree. Authors evaluated the models and measured their accuracy, precision score, recall rate (also known as sensitivity) in addition to F1-score. Gradient Boost, Random Forest, and SVM were top performers among which Gradient boosting was found to be an overall best one with almost 98.5%. They show that machine learning model can successfully predict the label of social media posts, as a way for accurately identifying depression from text data. This detailed model performance evaluation is useful in understanding what each approach does well and poorly, shedding light into whether they are / would be actually suitable for real-world applications. This study not only developed discriminative classifiers, but also included detailed analysis of their performance which should hopefully guide future work and help in practical implementations for real-time mental health monitoring. Through this work, this study aim to facilitate timely identification of depression-related posts, ultimately supporting mental health awareness and intervention efforts on social media platforms. 2024 IEEE. -
Diabetes Mellitus Classification Using Machine Learning Algorithms with Hyperparameter Tuning
Diabetes Mellitus is a prevalent condition globally, marked by elevated blood sugar levels resulting from either insufficient production of insulin or the body cells' inability to respond appropriately to released insulin. For people with diabetes to lead healthy, normal lives, early identification and treatment of the condition are essential. With the need to move away from current traditional procedures, towards a noninvasive methodology, machine learning and data mining technologies can be very useful in the classification of diabetes. Creating an effective machine learning model for the classification of diabetes mellitus was the primary goal of this research. This work is primarily carried out on combined Pima Indian diabetes dataset and German Frankfurt diabetes dataset. The class imbalance issue has been resolved using Synthetic Minority Oversampling Technique. One-hot encoding is applied to convert categorial features to numerical and various single and ensemble classifiers with the best hyperparameters obtained using GridSearchCV method were employed on the pre-processed dataset. With an AUC of 0.98 and maximum accuracy of 98.79%, the Random Forest ensemble technique outperformed the other models, according to the experimental results. As a result, the algorithm might be used to predict diabetes and alert doctors to serious cases that call for emergency care. 2024 IEEE. -
Advanced Sentiment Analysis: From Lexicon-Enhanced BERT to Dimensionality Reduction Using NLP
Social media platforms serve as vital connections for communication, generating massive quantities of data that represent an array of perspectives. Efficient sentiment analysis is necessary for understanding public opinion, particularly in domains such as product reviews and socio-political discussion. This paper develops a novel sentiment analysis model that is customized for social media data by integrating machine learning algorithms, language processing techniques with part-of-speech tagging, and dimensionality reduction methods. The model will improve sentiment analysis performance by tackling challenges like noise and data domain variations. To further improve sentiment representation, it includes convolutional neural networks (CNNs), BERT embeddings, N-grams, and sentiment lexicons. The model's effectiveness is determined on a variety of datasets, which enhances sentiment analysis in social media discussion. This paper goes beyond sentiment analysis in code-mixed, multilingual text and highlights the importance of careful data before treatment and an extensive variety of ML algorithms. This study attempts to explain the nuances of sentiment analysis and its use in social media discussions through methodical research. 2024 IEEE. -
Predicting and Analyzing Early Onset of Stroke Using Advanced Machine Learning Classification Technique
Around the world, stroke is the leading cause of death. When blood vessels in the brain rupture, they cause damage. Alternatively, blockage in a blood vessel that supplies oxygen and other nutrients may also lead to this disease. This study uses various machine learning models to predict whether someone will have a stroke or not. Different physiological features were taken into account by this study while using Logistic Regression; Decision Tree Classification; Random Forest Classification; K-Nearest Neighbors (KNN); Support Vector Machine (SVM); Nae Bayes classifier algorithm; and XGBoost classification algorithm - these were used for six different models to ensure accurate predictions are made. We will accomplish the finest exactness with Bayes cv look which may be a hyper-tuning classifier with 92.87%. This consideration can be utilized for future work by doing the increase and include designing on the dataset. It is constrained to literary information, so it might not continuously be right for foreseeing stroke. so utilize the datasets that contain pictures and work on those datasets. 2024 IEEE. -
Leveraging Ensemble Methods for Accurate Prediction of Customer Spending Scores in Retail
This study primarily aims to estimate consumer spending trends in a retail context. The goal is to identify the best model for predicting Purchasing Scores, which indicate customer loyalty and potential income, using demographic and financial data. The dataset included information about customers' age, gender, and annual income, and the objective was to analyze their Spending Scores. Several regression models were tested, including Linear Regression, Random Forest, Gradient Boosting, K-Nearest Neighbors (KNN), and Lasso Regression. To improve the models, we engineered features like Age Squared, Income per Age, and Spending Score per Income. Each model was trained and tested using 3fold cross-validation. We evaluated their performance with Mean Absolute Error (MAE), Root Mean Squared Error (RMSE), and R-squared (R2) metrics. The results showed significant differences in model performance. The Random Forest model stood out, with the lowest Mean Absolute Error (MAE) of 0.33, Root Mean Square Error (RMSE) of 0.52, and the highest R-squared (R22) score of 0.9997. Gradient Boosting also performed well, achieving a Mean Absolute Error (MAE) of 1.77, Root Mean Square Error (RMSE) of 2.41, and an Rsquared (R2) score of 0.9930. While Linear Regression showed moderate accuracy, KNN and Lasso Regression had higher errors and lower R2 values, indicating less reliable predictions. The findings suggest that ensemble methods, particularly Random Forest, excel at predicting customer Spending Scores. The high accuracy and reliability of this model point to its potential for customer segmentation and targeted marketing strategies, ultimately enhancing customer relationship management and boosting business value. Further refinement and exploration of additional features could further improve these prediction capabilities. 2024 IEEE. -
Cross-Modal Ingredient Recognition and Recipe Suggestion using Computer Vision and Predictive Modeling
This paper is focused on the development of a novel system known as 'IngredEye.' It involves various approaches that can be grouped into categories, such as computer vision, including YOLOv8, a KNN prediction model, and a Flutter framework that hosts all of them in a mobile application environment. Previous studies have analyzed the application of computer vision and OpenCV recognition in cooking and proved that such approaches could enhance the level of convenience in the culinary field. This paper addresses issues like changes in lighting, occlusions, and other factors that have to be solved by the algorithms envisaged for real applications. The objective of this paper solely relies on integrating the OpenCV object detection method with comprehensive machine learning techniques specialized for the culinary field. Presenting the end-user with recipe recommendations based on the visual input they have given. 2024 IEEE. -
Predicting Player Engagement in Online Gaming: A Machine Learning Approach
The aim of this research is to make precise forecasts on player participation in online game using state-of-the-art machine learning algorithms. Player engagement plays a crucial element in determining the success of online games because it affects player retention, satisfaction and monetization. By understanding and predicting engagement levels, game developers and marketers can enhance the gaming experience and develop strategies to keep players invested. This research involves a comprehensive analysis of player behavior data from an online gaming platform. The dataset includes various demographic and behavioral features such as age, gender, location, game genre, playtime hours, in-game purchases, game difficulty, sessions per week, average session duration, player level, achievements unlocked, and engagement level. The data was preprocessed through handling missing values, normalizing numerical features, and encoding categorical variables. Exploratory Data Analysis (EDA) was conducted to understand the distribution and relationships between different features. Multiple machine learning models were evaluated to predict player engagement levels, including Random Forest, Gradient Boosting, XGBoost, and Support Vector Machine (SVM). These models were then compared through the accuracy, precision, recall, and F1-score metrics. In the comparison, XGBoost emerged as the best model. Since it is the best-performing model, we can make the feature importance analysis to identify the best factors for predicting engagement in the next step. The XGBoost model achieved the highest accuracy of 91%, demonstrating superior precision, recall, and F1-scores across all engagement levels (High, Medium, Low). Ensemble methods like XGBoost, Gradient Boosting, and Random Forest outperformed the SVM model, highlighting their effectiveness in handling complex datasets. 2024 IEEE. -
Comparative Study on GANs and VAEs in Credit Card Fraud Detection
In today's world, the major issue credit card sectors encounter is fraud. This comparative study deals with how GANs and VAEs detect fraudulent transactions. The dataset comprised 284807 transactions, of which 492 were fraudulent. These two models, GANs and VAEs, are trained on this dataset, during which, in the training process, the models are learned to deal with the imbalance in the dataset. VAEs are trained so that fraud transactions are considered anomalies, and only legitimate transactions are passed onto the model for training. Conversely, GANs generate synthetic data of fraud by addressing the problem of data imbalance and passed on to the ML model for classification. We can observe that Both the models have very good AUC-ROC scores of around 96%, which indicates their distinguishing capability between the classes. In all other aspects, GANs outperformed VAEs, which makes GANs a better option for fraud detection. 2024 IEEE. -
Sub-Optimization based Random Forest Algorithm for Accurate and Efficient Land use and Land Cover Classification using Landsat Time Series Data
The land use and land cover (LULC) play an essential role to investigate the impacts of environmental factors and socio-economic development in the Earth's surface. Extracting the hidden information from the remote sensing images in the observed earth environment is the challenging process. In this research, implemented a model that uses Landsat data to investigate the LULC changes. Utilized the Landsat 5,7 and 8 as inputs for the 1985 to 2019 by Google Earth Engine (GEE) is applied for the robust classification. This paper proposed a Sub-forest optimization based Random forest (SO-RF) classifier with faster diagnosis speed for LULC classification. Moreover, to increase the multispectral Landsat band's resolution from 30 m to 15 m, the pan-sharpening algorithm is utilized. In addition, analyzed the various image configurations grounded numerous spectral indices and other supplementary data such as land surface temperature (LST) and digital elevation model (DEM) on final classification accuracy. The proposed SO-RF produced higher accuracy (0.97 for kappa, 96.78% Overall accuracy (OA), 0.94 for f1-score) than Copernicus Global Land Cover Layers (CGLCL) map and state of art methods like K-Nearest Neighbor (KNN), Decision Tree (DT), and Multi-class Support Vector machine (MSVM). 2024 IEEE. -
Depth Wise Separable Convolutional Neural Network with Context Axial Reverse Attention Based Sentiment Analysis on Movie Reviews
Sentiment Analysis (SA) in movie reviews involves using natural language processing techniques to determine the sentiment expressed in reviews. This analysis helps in understanding the overall audience sentiment towards a movie, categorizing reviews as positive, negative, or neutral. It's useful for filmmakers, marketers, and audiences. The existing methods does not provide sufficient accuracy, error rate and complexity was increased. To overcome the aforementioned problem, Depth Wise Separable Convolutional Neural Networks with Context Axial Reverse Attention Network (DWSCNN-CARAN) is proposed for accurately classifying SA in movie reviews. In this input image is taken from two datasets such as IMDB dataset and Polarity dataset. The pre-processing is done using six steps namely, Cleaning, Tokenization, Case Folding, Normalization, Stop Word Elimination, and Stemming for the purpose of removing noises. Following that feature extraction are done using Bag-Of-Words and Term Frequency-Inverse Document Frequency (BOW-TF-IDF). After that classification are done using Depth Wise Separable Convolutional Neural Networks with Context Axial Reverse Attention Network (DWSCNN-CARAN)for classifying the AS in movie reviews. The efficiency of the proposed DWSCNN-CARAN-BOA is analyzed using a dataset and attains 99.94% accuracy, 98.76% recall and attains better results compared with the existing methods. In the future, this approach will use the adversarial instances it generated to conduct adversarial training and assess the potential improvement in classification performance. It also looks into the possibilities of creating adversarial examples at the word and sentence levels by combining structured knowledge from high-quality knowledge bases. 2024 IEEE. -
Improving Groundwater Forecasting Accuracy with a Hybrid ARIMA-XGBoost Approach.
In addressing the critical challenge of accurate groundwater level prediction, this study explores the comparative performance of various machine learning models. We implement a novel hybrid model combining ARIMA and Extreme Gradient Boosting (XGB) for the prediction of groundwater levels, and compare it against traditional models including ARIMA, XGBoost, LightGBM, Random Forest, and Decision Trees. Traditional approaches often rely on single models; however, our research seeks to delve into the intricacies of hybrid model architectures. Combining the strengths of ARIMA and XGB, we aim to build a highly accurate and efficient groundwater level prediction system. Comprehensive evaluations were conducted using metrics such as Mean Absolute Error (MAE) and Root Mean Squared Error (RMSE), The future scope of machine learning in water resource management includes integrating such models with real-time monitoring systems and expanding their applications to diverse environmental conditions and regions. 2024 IEEE. -
Seismic Performance Assessment of Reinforced Concrete Frames: Insights from Pushover Analysis
This paper offers a comprehensive exploration of the seismic response of Reinforced Concrete (RC) frames examined through pushover analysis. The frames analyzed are designed as per IS 13920 and IS 456 for different levels of earthquake intensities and different levels of axial loads. Nonlinear analysis techniques have gained prominence in assessing the response of RC frames, especially when subjected to extreme loading events or when accurate predictions of structural behavior are required beyond the linear elastic range. The study aims to delve into the structural behavior of RC frames under seismic influences, employing pushover analysis as the principal analytical tool. With a focus on assessing the effectiveness and reliability of pushover analysis, the research endeavors to elucidate the seismic performance of RC frames while considering their response to different seismic zones and axial loading scenarios. The methodology involves conducting a series of pushover analyses on RC frames using advanced structural analysis software. The results obtained are meticulously analyzed to discern the shear capacities and ultimate displacements of the frames, by investigating the displacement versus shear capacity relationship across varying seismic zones and axial loading scenarios. Through this comprehensive investigation, the paper aims to enhance our understanding of the seismic behavior of RC frames and will provide valuable insights for seismic design. The Author(s), under exclusive license to Springer Nature Switzerland AG 2024. -
An Innovative Method for Brain Stroke Prediction based on Parallel RELM Model
Strokes occur when blood supply to the brain is suddenly cut off or severely impaired. Stroke victims may experience cell death as a result of oxygen and food shortages. The effectiveness of various predictive data mining algorithms in illness prediction has been the subject of numerous studies. The three stages that make up this suggested method are feature selection, model training, and preprocessing. Missing value management, numeric value conversion, imbalanced dataset handling, and data scaling are all components of data preparation. The chi-square and RFE methods are utilized in feature selection. The former assesses feature correlation, while the latter recursively seeks for ever-smaller feature sets to choose features. The whole time the model was being trained, a Parallel RELM was used. This new method outperforms both ELM and RELM, achieving an average accuracy of 95.84%. 2024 IEEE.