Browse Items (16481 total)
Sort by:
-
Predicting Football Players Market Value via Machine Learning
Football, arguably the most popular sport in the world, has become much more than just a sport, it is a multibillion-dollar industry with its center in Europe. Every year millions of euros are spent in transfer window to buy and sell players and a common theme that has been seen is players not living up to the price the clubs paid for them. This research aims to predict football players market values using machine learning techniques. Departing from traditional methods that broadly categorize players into positions like Goalkeeper, Defender, Midfielder, and Forward, this study provides a more nuanced approach by classifying players into specific roles such as Center-back, Full-back, Defensive Midfielder, Attacking Midfielder, and Winger. By incorporating performance metrics tailored to each position and weighing the performance indicators based on the relevance to that specific position, the research aims to provide a robust method to predict players market value within a negotiation tolerance range. Using data from the past three seasons, including detailed player performance statistics and contractual details, models were developed to assist clubs in making data-driven transfer decisions. Machine learning algorithms, including Random Forest Regressor and Light GBM, were utilized, with RMSE and R2 Score as evaluation metrics. Both algorithms demonstrated robust performance, with some positional models predicting market values within an acceptable error range of 312million, enabling clubs to negotiate transfer fees with greater precision based on empirical evidence of player performance. The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2025. -
Predicting Graduate Admissions using Ensemble Machine Learning Techniques: A Comparative Study of Classifiers and Regressors
The goal of this research is to apply machine learning techniques to forecast a student's probability of being accepted into a graduate program. Applicants' GRE and TOEFL grades, university rankings, letters of recommendation, statements of purpose, cumulative grade point averages, and prior research experience are all included in the dataset utilized for this analysis. The goal is to calculate an applicant's expected acceptance rate. This study uses a combination of Classifiers and regressors. Different prediction models are contrasted in this study: Random Forest Classifier (RFC), Decision Tree Classifier (DTC), K-Neighbors Classifier (KNC), Support Vector Classifier (SVC), Gradient Boosting Classifier (GBC), Logistic regression (LR), Support vector Regressor (SVR), Random Forest Regressor(RFR), Gradient Boosting Regressor(GBR) and Decision Tree Regressor(DTR). Using these characteristics, the models are trained and evaluated. Evaluation criteria such as accuracy, kappa value, AUC-ROC, and confusion matrix are used to find the models' effectiveness. In order to determine which model performed the best, the assessment results are compared with one another. Based on study findings, the Gradient Boosting Classifier outperforms the other models tested by a significant margin (96 per cent). This model's AUC-ROC of 0.97 indicates it does a decent job at separating the positive and negative categories. 2023 IEEE. -
Predicting heart ailment in patients with varying number of features using data mining techniques
Data mining can be defined as a process of extracting unknown, verifiable and possibly helpful data from information. Among the various ailments, heart ailment is one of the primary reason behind death of individuals around the globe, hence in order to curb this, a detailed analysis is done using Data Mining. Many a times we limit ourselves with minimal attributes that are required to predict a patient with heart disease. By doing so we are missing on a lot of important attributes that are main causes for heart diseases. Hence, this research aims at considering almost all the important features affecting heart disease and performs the analysis step by step with minimal to maximum set of attributes using Data Mining techniques to predict heart ailments. The various classification methods used are Nae Bayes classifier, Random Forest and Random Tree which are applied on three datasets with different number of attributes but with a common class label. From the analysis performed, it shows that there is a gradual increase in prediction accuracies with the increase in the attributes irrespective of the classifiers used and Nae Bayes and Random Forest algorithms comparatively outperforms with these sets of data. 2019 Institute of Advanced Engineering and Science. -
Predicting Intention to Buy Organic Food during the COVID-19 Pandemic: A multi-group analysis based on the Health Belief Model
The ongoing COVID-19 pandemic has deeply affected physical and psychological health of people. It also had a huge impact on their dietary choices. This study specifically attempts to determine the impact of the constructs of health belief model on consumer purchase intention of organic food in the pandemic scenario. A survey was conducted among 413 Indian organic food consumers. The proposed hypotheses are tested by employing structural equation modeling. The findings highlight those perceived benefits is an important predictor of consumers behavioral intention to buy organic food, followed by cues to action and perceived threats. It is also found that consumers age moderates the impact of perceived threat and perceived barrier on consumers purchase intention, with a 22% difference in model prediction. In conclusion, the health belief model is found to be one of the most suitable models to predict consumer intention toward organic food purchase during the COVID-19 pandemic. 2022 Taylor & Francis Group, LLC. -
Predicting Job Risk from Artificial Intelligence in London Using Supervised Machine Learning Models
This study investigates the risk of job automation in London due to artificial intelligence (AI), applying supervised machine learning techniques to identify occupations most at risk. Leveraging a dataset encompassing job-specific features such as primary tasks, industry domains, and associated AI models, the research develops two predictive models. A Random Forest Classifier is used to categorize jobs as low, medium, or high automation risk, while a Linear Regression model estimates the proportion of each occupation's workload likely to be automated. The Random Forest model achieved a high accuracy rate of 97% in classifying job risk, indicating strong predictive capability. Meanwhile, the regression model explained 85% of the variance in the AI workload ratio, highlighting a significant relationship between job attributes and automation potential. These results suggest that job characteristics are reliable indicators of AI impact, particularly in routine, repetitive, and low-skilled roles that are more easily codified and replicated by algorithms. The findings align with broader economic theories such as creative destruction and technological waves, suggesting that AI not only displaces certain roles but also drives structural transformation within the labor market. By focusing on London, this study provides a localized understanding of how AI is reshaping employment patterns. It underscores the growing urgency for strategic workforce re-skilling and adaptive policy frameworks to mitigate negative outcomes and maximize opportunities presented by AI. Ultimately, this research contributes valuable insights into the interaction between AI technologies and employment, helping policymakers, employers, and educators anticipate change and prepare for a more resilient, inclusive labor market. 2025 IEEE. -
Predicting Liver Injury Risk from Chemical Properties and Drug Label Information Using Machine Learning Models
This research aims to create a drug-induced liver injury (DILI) severity prediction system based on machine learning to aid healthcare professionals in safety assessment. FDA's Liver Toxicity Knowledge Base supplied a drug dataset of 1042 drugs, and later, after pre-processing and API data extraction, each drug was defined by 16 chemical features such as molecular descriptors and pharmacokinetic properties. To improve uniformity and get quality input for training, data preparation involved correcting missing values, encoding categorical values, and normalising numerical data. Various machine learning models were trained and evaluated to forecast the levels of DILI severity, i.e., Random Forest, Gradient Boosting, and XGBoost. The importance of features was approximated for identifying the predictors that impacted the most. The best overall performance was recorded for XGBoost, and it had 81% accuracy when it was evaluated. Its acceptable discrimination was established for mild, moderate, and severe cases. The aptness of being applied to the medical sector is demonstrated by drastically lowering the principal misclassifications, especially from mild to severe. The application of machine learning in improving medicine safety assessment and reducing risks associated with pharmaceutical development is illustrated here. 2025 IEEE. -
Predicting Nitrogen Flavanol Index (NFI) in Mentha arvensis Using UAV Imaging and Machine Learning Techniques for Sustainable Agriculture
Crop growth monitoring at various growth stages is essential for optimizing agricultural inputs and enhancing crop yield. Nitrogen plays a critical role in plant development; however, its improper application can reduce productivity and, in the long term, degrade soil health. The aim of this study was to develop a non-invasive approach for nitrogen estimation through proxies (Nitrogen Flavanol Index) in Mentha arvensis using UAV-derived multispectral vegetation indices and machine learning models. Support Vector Regression, Random Forest, and Gradient Boosting were used to predict the Nitrogen Flavanol Index (NFI) across different growth stages. Among the tested models, Random Forest achieved the highest predictive accuracy (R2 = 0.86, RMSE = 0.32) at 75 days after planting (DAP), followed by Gradient Boosting (R2 = 0.75, RMSE = 0.43). Model performance was lowest during early growth stages (1530 DAP) but improved markedly from mid to late growth stages (4590 DAP). The findings highlight the significance of UAV-acquired data coupled with machine learning approaches for non-destructive nitrogen flavanol estimation, which can immensely contribute to improving real-time crop growth monitoring. 2025 by the authors. -
Predicting nitrous oxide contaminants in Cauvery basin using region-based convolutional neural network
Nitrous oxide (N2O) in riverbeds affects hydrological processes by contributing to the greenhouse effect, indicating poor water quality, disrupting biogeochemical cycling, and linking to eutrophication. Elevated N2O levels signal environmental issues, impacting aquatic life and necessitating precise forecasting for effective environmental management and reduced greenhouse gas emissions. Precisely forecasting nitrous oxide (N2O) emissions from riverbeds is paramount for effective environmental management, given its significant potency as a greenhouse gas. This study focuses on the difficulties related to spatial feature extraction and modeling accuracy in predicting N2O in riverbeds in Tamil Nadu. To address the obstacles, the research suggests utilizing the Deep Learning Based Prediction of Nitrous Oxide Contaminants (DL-PNOC), which studies the N2O contaminants in water using Region-based Convolutional Neural Network (RCNN) for spatial feature extraction, to predict nitrous oxide contaminants. The study is centered on the Cauvery River Basin located in Tamil Nadu, where the emission of N2O is a matter of environment. The outcomes encompass the specialized N2O contaminant model for riverbeds and the implementation of RCNN achieves precise N2O forecasting. The DL-PNOC approach combines a contaminant model with RCNN deep learning techniques to capture spatial characteristics and predict N2O pollutants accurately. Furthermore, using the River Bed Dynamics Simulator reinforces the dependability of the findings. The DL-PNOC approach has exhibited encouraging results, as evidenced by the following metrics: a high IoU of 88.66%, precision of 88.96%, recall of 90.03%, F1 score of 89.22%, and low RMSE and MAE values of 9.14% and 7.59%, respectively. The findings highlight the efficacy of the DL-PNOC approach in precisely forecasting N2O pollutants in river sediments. 2024 Elsevier B.V. -
Predicting of Credit Risk Using Machine Learning Algorithms
Credit risk management is one of the key processes for banks and is crucial to ensuring the banks stability and success. However, due to the need for more rigid forecasting models with strong mapping abilities, credit risk prediction has become challenging for the banking industry. Therefore, this paper attempts to predict commercial banks credit risk (CR) by using various machine learning algorithms. Machine learning algorithms, namely linear regression, KNN, SVR, DT, RF, XGB, and MLP, are compared with and without feature selection and feature extraction techniques to examine their prediction capabilities. Various determinants of credit risk (features) have been extracted to predict credit risk, and these features have been used to train machine learning models. Findings revealed that the decision tree algorithm had the highest performance, with the lowest mean absolute error (MSE) value of 0.1637 and the lowest root mean squared error (RMSE) value of 0.2158. The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024. -
Predicting of Open Source Software Component Reusability Level Using Object-Oriented Metrics by Taguchi Approach
Component-based software development (CBSD) is an efficient approach used by software developers to develop new software. The commercial off the shelf (COTS) and open-source software (OSS) are two styles to implement CBSD. The COTS provides the interface and depicts the black-box behavior, but does not support several software quality characteristics. On the other hard, OSS is a more efficient approach compared to COTS due to its source code availability. This research aims to identify the reusability level of OSS components from an online repository of OSS. The OSS components are classified based on Chidamber and Kemerer reusability metrics (CK-metrics). This paper proposed a mathematical model to establish the relationship between the reusability of CK-metrics. Reusability level of OSS component has been measured and most effective CK-metrics obtained by applying the Taguchi design and analysis of variance (ANOVA). The input parameters for the experimental design are evaluated based on the OSS repository. Performance analysis has been carried out based upon the interaction effect between the reusability of CK-metrics. Main effect plots are created to identify the most reusable component of the OSS. The genetic algorithm (GA) is used to predict the optimized value of the different control parameters. The results indicate that the OSS component reusability level is 0.698194. The reusability of software has a significant effect on the quality of software. The quality of software can be improved by increasing the reusability of software components. 2021 World Scientific Publishing Company. -
Predicting Player Engagement in Online Gaming: A Machine Learning Approach
The aim of this research is to make precise forecasts on player participation in online game using state-of-the-art machine learning algorithms. Player engagement plays a crucial element in determining the success of online games because it affects player retention, satisfaction and monetization. By understanding and predicting engagement levels, game developers and marketers can enhance the gaming experience and develop strategies to keep players invested. This research involves a comprehensive analysis of player behavior data from an online gaming platform. The dataset includes various demographic and behavioral features such as age, gender, location, game genre, playtime hours, in-game purchases, game difficulty, sessions per week, average session duration, player level, achievements unlocked, and engagement level. The data was preprocessed through handling missing values, normalizing numerical features, and encoding categorical variables. Exploratory Data Analysis (EDA) was conducted to understand the distribution and relationships between different features. Multiple machine learning models were evaluated to predict player engagement levels, including Random Forest, Gradient Boosting, XGBoost, and Support Vector Machine (SVM). These models were then compared through the accuracy, precision, recall, and F1-score metrics. In the comparison, XGBoost emerged as the best model. Since it is the best-performing model, we can make the feature importance analysis to identify the best factors for predicting engagement in the next step. The XGBoost model achieved the highest accuracy of 91%, demonstrating superior precision, recall, and F1-scores across all engagement levels (High, Medium, Low). Ensemble methods like XGBoost, Gradient Boosting, and Random Forest outperformed the SVM model, highlighting their effectiveness in handling complex datasets. 2024 IEEE. -
Predicting Price Direction of Bitcoin based on Hybrid Model of LSTM and Dense Neural Network Approach
Bitcoin is a rapidly growing but extremely risky cryptocurrency. It marks a watershed moment in the history of cash. These days, digital currency is preferred to actual money. Bitcoin has decentralized authority and placed it in the hands of its users. Many people are joining the largest and most well-known Bitcoin mining pools as the risk of working alone is too great. In order to enhance their chances of creating the next block in the Bitcoins blockchain and decrease the mining reward volatility, users can band together to form Bitcoin pools. This tendency toward consolidation may also be seen in the rise of large-scale mining farms equipped with powerful mining resources and speedy processing capability. Because of the risk of a 51% assault, this pattern shows that Bitcoin's pure, decentralized protocol is moving toward greater centralization in its distribution network. Not to be overlooked is the resulting centralization of the bitcoin network as a result of cloud wallets making it simple for new users to join. Because of the easily hackable nature of Bitcoin technologies, this could lead to a wide range of security vulnerabilities. The proposed approach uses normalization and filling missing values in preprocessing, PCA for feature Extraction and finally training the model using LSTM-DNN Models. The proposed approach outperforms other two models such as CNN and DNN. 2023 IEEE. -
Predicting Song Popularity Using Data Analysis
In today's music landscape, predicting a song's success is crucial for musicians, record labels, and streaming platforms. This paper introduces a methodology for estimating popularity using Spotify data, termed the 'Proxy Popularity Score.' Three models - Random Forest, LightGBM Regressor, and XGBoost Regressor - are utilized for prediction. Performance metrics including mean absolute error, mean squared error, root mean squared error, and R-squared error are employed to evaluate model accuracy. Correlation values of 99.85%, 99.87%, and 99.84% are achieved for XGBoost, LightGBM, and Random Forest respectively. The study concludes with a ranking of songs based on predicted popularity scores. 2024 IEEE. -
Predicting Stock Market Indexes with Artificial Intelligence
The forecasting of the Share market has been a popular research area, involving the analysis of input and output stock data using computer technology and algorithmic knowledge. This involves building unpredictable relationships among the data and analyzing the stock market trends to provide a reference for investors. The inception of artificial intelligence (AI) technology, blended with the web, immense data, and cloud computing has provided technical support for various industries. AI technology is employed to scrutinize and predict the equity market, exploring curvilinear associations amid stock market information, and furnishing a foundation for investors to formulate investment determinations. Predicting equity prices is a demanding undertaking due to diverse factors like governmental happenings, fiscal circumstances, business resolutions, investor mentality, and overseas currency hazards. The securities exchange is a vastly active and disordered framework, and producing precise projections of the securities exchange is of paramount significance. 2024 Sachi Nandan Mohanty, Preethi Nanjundan and Tejaswini Kar. -
Predicting Stock Market Movements Through Multisource Data Fusion Graphs: An Approach Employing Graph Convolutional Neural Network
The stock market plays an important role in the capital market, and investigating price fluctuations in the stock market has consistently been a prominent subject for researchers. The application of soft computing techniques to predict and categorize stock market movements is a significant research challenge that has gathered considerable attention from researchers. Although several studies highlight the significance of incorporating information from two sources in stock movement prediction, the potential of advanced graphical techniques for modeling and analyzing multi-source data remains an unattended research area. This study aims to address this gap by introducing a novel model that utilizes multi-source data fusion graphs to predict future market movements. The primary challenge involves establishing a model that can effectively gather the relationships among various data sources and employ this understanding to improve prediction performance. Compared to several existing methods relying only on historical data or sentiment data, which show limited predictive power and lack generality, the proposed approach seeks to overcome these limitations. The proposed model integrates various information sources, including historical prices, news data, Twitter data, and technical indicators for predicting future stock market trends. This presented method involves constructing a subgraph map for each data type to capture events from both rising and falling markets. Then, a Gated Recurrent Unit (GRU) is employed to aggregate the subgraph nodes. These aggregated nodes are then integrated with a Graph Convolutional Neural Network (GCNN) to classify the multi-source graph, therefore achieving stock market trend prediction effectively. To further validate its effectiveness, the presented model is applied to Indian stock market data, demonstrating its feasibility in fusing multi-source stock data and establishing its suitability for effectively predicting stock market movements. 2024 Seventh Sense Research Group -
Predicting Stock Market Price Movement Using Machine Learning Technique: Evidence from India
The stock market is uncertain, volatile, and multidimensional. Stock prices have been difficult to predict since they are influenced by a variety of factors. In order to make critical investment and financial decisions, investors and analysts are interested in predicting stock prices. Predicting a stock's price entails developing price pathways that a stock might take in the future. ANN and mathematical Geometric Brownian movement technique were employed in this study to forecast a stock market closing price of Indian companies. The comparative analysis indicates that the Geometric Brownian Method is better than ANN in giving better MAPE and RMSE Values. 2022 IEEE. -
Predicting Stock Market Trends: Machine Learning Approaches of a Possible Uptrend or Downtrend
This paper delves into a statistical analysis of the stock market, emphasizing the significance of accuracy in stock predictions. Large data sets can be handled by machine learning algorithms, which can also forecast outcomes based on past data and spot intricate patterns in financial data. They assist control risks, automate decision-making procedures, and adjust to changing circumstances. Multi-source data can be combined by ML models to provide a comprehensive picture of market circumstances. They can manage intricate, nonlinear interactions, provide impartial analysis, and lessen human bias. Models are able to adjust to shifting market conditions through ongoing learning and retraining. They must, however, exercise caution when deploying models in real-world situations and ensure that they are validated. Although machine learning has advantages for stock market analysis, it must be carefully evaluated for dangers and validated before being used in practical situations. The traditional machine learning model, Logistic Regression has been used in order to predict stock prices. It focuses on binary classification based on the trend of the stock. Through the model training and evaluation and additional analysis done on the results, this research contributes towards obtaining predictions and studying reasons of a possible uptrend or downtrend to further assist companies. The Author(s), under exclusive license to Springer Nature Switzerland AG 2025. -
Predicting Stock Market Trends: Machine Learning Approaches of a Possible Uptrend or Downtrend
This paper delves into a statistical analysis of the stock market, emphasizing the significance of accuracy in stock predictions. Large data sets can be handled by machine learning algorithms, which can also forecast outcomes based on past data and spot intricate patterns in financial data. They assist control risks, automate decision-making procedures, and adjust to changing circumstances. Multi-source data can be combined by ML models to provide a comprehensive picture of market circumstances. They can manage intricate, nonlinear interactions, provide impartial analysis, and lessen human bias. Models are able to adjust to shifting market conditions through ongoing learning and retraining. They must, however, exercise caution when deploying models in real-world situations and ensure that they are validated. Although machine learning has advantages for stock market analysis, it must be carefully evaluated for dangers and validated before being used in practical situations. The traditional machine learning model, Logistic Regression has been used in order to predict stock prices. It focuses on binary classification based on the trend of the stock. Through the model training and evaluation and additional analysis done on the results, this research contributes towards obtaining predictions and studying reasons of a possible uptrend or downtrend to further assist companies. The Author(s), under exclusive license to Springer Nature Switzerland AG 2025. -
Predicting sustainable equity indices using deep long short-term memory neural network: Evidence from developed and emerging markets
The present study aims to propose a predictive model to forecast the sustainable stock indices. For this, the Long Short-Term Memory (LSTM) neural network model is applied through Keras and TensorFlow to closing values of six developed and emergingmarkets: the US, the UK, Japan, Brazil, South Africa, and China. Further, the Adam optimiser and mean squared error loss function are used to train the model. To gauge the superiority of the LSTM model, a rolling window Autoregressive Integrated Moving Average (ARIMA) model is also employed. The performance accuracy of both models is evaluated using the Root Mean Square Error (RMSE), Mean Absolute Percentage Error (MAPE), and R-squared (R2). The LSTM model, with two LSTM and two dense layers, yields the best results, achieving the highest precision in predicting the values of sustainable indices. The values of RMSE and MAPE confirmed this, and the accuracy is also verified by the R2 values. LSTM shows superior predictive accuracy and is indicated to be fit for non-linear market patterns than rolling window ARIMA. The study enables policymakers and practitioners to forecast these indices and design policies to motivate related investments. 2026 The Author(s). Published by Informa UK Limited, trading as Taylor & Francis Group. -
Predicting the Cerebral Blood Flow Change Condition during Brain Strokes using Feature Fusion of FMRI Images and Clinical Features
By fusing clinical information with functional magnetic resonance imaging (fFMRI) pictures, this study describes a novel method for predicting changes in cerebral blood flow during brain strokes. The FMRI data and patient-specific variables, such as age, gender, and medical history, are combined via feature fusion in the proposed technique. As a result, the model developed can accurately forecast changes in cerebral blood flow that occur during brain strokes. The efficiency of the suggested strategy is shown by experimental findings. The performance of the model is greatly enhanced when FMRI data and clinical characteristics are combined as opposed to just one data source. The findings of this study have important ramifications for increasing the accuracy of stroke diagnosis and treatment and, eventually, for bettering patient outcomes. The experimental results showed that the proposed method a high level of accuracy in predicting changes in cerebral blood flow after brain strokes. The performance of the model was much enhanced by combining clinical characteristics with FMRI data as opposed to using only one of these data sources. This emphasizes the value of including pertinent clinical information in the diagnosis and management of stroke. 2023 IEEE.
