Browse Items (11809 total)
Sort by:
-
File Validation intheData Ingestion Process Using Apache NiFi
In the industries of today, development and maintenance of data pipelines is of paramount importance. With large volumes of data being generated across industries on a continuous basis, there is a growing need to process and store this ingested data in a fast, and efficient manner. Apache NiFi is one such tool which possesses crucial capabilities that can be used to enhance, modify, and automate data pipelines. However, automation of the ingestion process creates certain inherent issues which, without being resolved, tend to be detrimental to the entire ingestion process. These issues vary in nature, ranging from corrupted data to changes in the file schema, to name a few. In this paper, a solution to this problem is proposed. By exploiting Apache NiFis custom processor development capabilities, problem-specific processors can be designed and deployed which can ensure accurate validation of the ingestion process on a real-time basis. To demonstrate this, two processors were developed as a proof-of-concept, which tackle specific file-related validation issues in the ingestion processthat of the file size, and, the ingestion frequency. These custom-built processors are designed to be inserted into the pipeline at key points to ensure that the ingested data is validated against certain standards and requirements. Having successfully demonstrated its capabilities, the paper presents the exploitation of Apache NiFis custom processor capabilities as a potential way forward to resolve the plethora of ingestion issues in industry, today. The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024. -
Domain-Driven Summarization: Models for Diverse Content Realms
In todays information-rich landscape, automatic text summarization systems are pivotal in condensing extensive textual content into concise and informative summaries. The current study ventures into domain-agnostic summarization, delving into advanced models spanning various domains, such as business, entertainment, sports, politics, and technology. The study aims to uncover domain-specific enhancements, assess resource efficiency, and explore the boundaries of applicability. This study covers nine cutting-edge models, including Google Pegasus-Large, Facebook BART-Base, SSHLEIFER DistilBART-CNN-6-6, Facebook BART-Large, T5-Large, T5-Base, Facebook BART-Large-CNN, Facebook BART-Large-Xsum, and SSHLEIFER DistilBART-Xsum-12-1. Each model undergoes rigorous evaluation, revealing its efficacy within various domains. Google Pegasus-Large emerges as a standout choice for cross-domain summarization, while Facebook BART-Base demonstrates remarkable stability. Models like SSHLEIFER DistilBART-CNN-6-6, T5 variants, and others contribute to the evolving landscape of summarization. This study endeavors to establish a robust foundation for enhancing the efficiency and effectiveness of summarization techniques within various domains, thereby contributing valuable insights to the broader literature on text summarization. The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024. -
A Comprehensive Review of Linear Regression, Random Forest, XGBoost, and SVR: Integrating Machine Learning and Actuarial Science for Health Insurance Pricing
Actuarial science and data science are being studied as a fusion using Industry 4.0 technologies such as the Internet of Things, artificial intelligence, big data, and machine learning (ML) algorithms. When analyzing earlier components of actuarial science, it could have been more accurate and quick, but when later stages of AI and ML were integrated, the algorithms weren't up to the standard, and actuaries experienced some accuracy concerns. The company requires actuaries to be precise with analysis to acquire reliable results. As a result of the large amount of data these companies collect, a choice made manually may turn out to be incorrect. We will, therefore, examine alternative models in this article as part of the decision-making process. Once we have chosen the best path of action, we will use our actuarial expertise to evaluate the risk associated with specific charges features. The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024. -
Cybersecurity Threats Detection in Intelligent Networks using Predictive Analytics Approaches
The modern scenario of network vulnerabilities necessitates the adoption of sophisticated detection and mitigation strategies. Predictive analytics is surfaced to be a powerful tool in the fight against cybercrime, offering unparalleled capabilities for automating tasks, analyzing vast amounts of data, and identifying complex patterns that might elude human analysts. This paper presents a comprehensive overview of how AI is transforming the field of cybersecurity. Machine intelligence can bring revolution to cybersecurity by providing advanced defense capabilities. Addressing ethical concerns, ensuring model explainability, and fostering collaboration between researchers and developers are crucial for maximizing the positive impact of AI in this critical domain. 2024 IEEE. -
Enhancing Medical Decision Support Systems withtheTwo-Parameter Logistic Regression Model
The logistic regression model is an invaluable tool for predicting binary response variables, yet it faces a significant challenge in scenarios where explanatory variables exhibit multicollinearity. Multicollinearity hinders the models ability to provide accurate and reliable predictions. To address this critical issue, this study introduces innovative combinations of Ridge and Liu estimators tailored for the two-parameter logistic regression model. To evaluate the effectiveness of the combination of ridge and Liu estimators under the two-parameter logistic regression, a real-world dataset from the medical domain is utilized, and Mean Squared Errors are employed as a performance metric. The findings of our investigation revealed that the ridge estimator, denoted as k4, outperforms other Liu estimators when multicollinearity is present in the data. The significance of this research lies in its potential to enhance the reliability of predictions for binary outcome variables in the medical domain. These novel estimators offer a promising solution to the multicollinearity challenge, contributing to more accurate and trustworthy results, ultimately benefiting medical practitioners and researchers alike. The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024. -
Improved Acceptance model: Unblocking Potential of Blockchain in Banking Space
Over the past ten years, blockchain has emerged as the new buzzword in the banking sector.The new technology is being adopted globally in many industries, including the business sector,because of its unique uses and features. However, no adoption model is available to help with this process.This research paper examines the new technology known as blockchain, which powers cryptocurrencies like Bitcoin and others. It looks at what blockchain technology is, how it works especially in the banking sector, and how it can change and upend the financial services sector. It outlines the features of the technology and discusses why these can have a significant effect on the financial industry as a whole in areas like identity services, payments, and settlements in addition to spawning new products based on things like 'smart contracts'. The adoption variables found in the literature study were used to gather, test, and evaluate the official papers that are currently available from regulatory organizations, practitioners, and research bodies. This study was able to classify adoption factors into three categories - supporting, impeding, and circumstantial - identify a new adoption factor, and determine the relative relevance of the factors. Consequently, an institutional adoption paradigm for blockchain technology in the banking sector is put out. In light of this, it is advised to conduct additional research on using the suggested model at banks using the new technology in order to assess its suitability. 2024 IEEE. -
Facial Recognition Model Using Custom Designed Deep Learning Architecture
Facial Recognition is widely used in some applications such as attendance tracking, phone unlocking, and security systems. An extensive study of methodologies and techniques used in face recognition systems has already been suggested, but it doesn't remain easy in the real-world domain. Preprocessing steps are mentioned in this, including data collection, normalization, and feature extraction. Different classification algorithms such as Support Vector Machines (SVM), Nae Bayes, and Convolutional Neural Networks (CNN) are examined deeply, along with their implementation in different research studies. Moreover, encryption schemes and custom-designed deep learning architecture, particularly designed for face recognition, are also covered. A methodology involving training data preprocessing, dimensionality reduction using Principal Component Analysis, and training multiple classifiers is further proposed in this paper. It has been analyzed that a recognition accuracy of 91% is achieved after thorough experimentation. The performance of the trained models on the test dataset is evaluated using metrics such as accuracy and confusion matrix. The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024. -
From Text to Action: NLP Techniques for Washing Machine Manual Processing
This scientific research study focuses on the advancements in Natural Language Processing (NLP) driven by large-scale parallel corpora and presents a comprehensive methodology for creating a parallel, multilingual corpus using NLP techniques and semantic technologies, with a particular focus on washing machine manuals. The study highlights the significant progress made in NLP through the utilization of large-scale parallel corpora and advanced NLP techniques. The successful creation of a parallel, multilingual corpus for washing machine manuals, coupled with the integration of semantic technologies and ontology modeling, demonstrates the broad applicability and potential of NLP in diverse domains.The research covers various aspects, including text extraction, segmentation, and the development of specialized pipelines for question-answering, translation, and text summarization tailored for washing machine manuals. Translation experiments using fine-tuned models demonstrated the feasibility of providing washing machine manuals in local languages, expanding accessibility and understanding for users worldwide. Additionally, the study explored text summarization using a powerful transformer-based model, which exhibited remarkable proficiency in generating concise and coherent summaries from complex input texts. The implementation of a question-answering pipeline showcased the effectiveness of various language models in handling question-answering tasks with high accuracy and effectiveness.Additionally, the article discusses the processes of data collection, information preparation, ontology creation, alignment strategies, and text analytics. Furthermore, the study addresses the challenges and potential future developments in this field, offering insights into the promising applications of NLP in the context of washing machine manuals. 2024 Elsevier B.V.. All rights reserved. -
Insights on the Optical and Infrared Nature of MAXI J0709-159: Implications for High-Mass X-ray Binaries
In our previous study (Bhattacharyya et al., 2022), HD 54786, the optical counterpart of the MAXI J0709-159 system, was identified to be an evolved star, departing from the main sequence, based on comparisons with non-X-ray binary systems. In this paper, using color-magnitude diagram (CMD) analysis for High-Mass X-ray Binaries (HMXBs) and statistical t-tests, we found evidence supporting HD 54786s potential membership in both Be/X-ray binaries (BeXRBs) and supergiant X-ray binaries (SgXBs) populations of HMXBs. Hence, our study points towards dual optical characteristics of HD 54786, as an X-ray binary star and also belonging to a distinct evolutionary phase from BeXRB towards SgXB. Our further analysis suggests that MAXI J0709-159, associated with HD 54786, exhibits low-level activity during the current epoch and possesses a limited amount of circumstellar material. Although similarities with the previously studied BeXRB system LSI +61? 235 (Coe et al., 1994) are noted, continued monitoring and data collection are essential to fully comprehend the complexities of MAXI J0709-159 and its evolutionary trajectory within the realm of HMXBs. 2024 Societe Royale des Sciences de Liege. All rights reserved. -
GLANCEGuided Language Through Autoregression Establishing Natural and Classifier-Free Editing
In this study, researchers aimed to simplify text conversion into images using the latest text-to-image generation methods. While these methods have improved the quality and relevance of generated images, certain crucial questions remained unanswered, limiting their practicality and overall quality. To address these issues, the researchers introduced a novel text-to-image method. This method allows for better control of the scene depicted in the image through text, enhances the tokenization process by incorporating specific knowledge about key image regions such as faces and important objects, and provides guidance to the transformer model without needing a classifier. The outcome of this work was a model that achieved state-of-the-art results in terms of image quality and human evaluation, enabling the generation of high-fidelity 512?512-pixel images. Moreover, this method introduced new capabilities, including scene editing, text editing with reference scenes, handling out-of-distribution text prompts, and generating story illustrations. The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024. -
Study of the Balmer Decrements for Galactic Classical Be Stars Using the Himalayan Chandra Telescope of India
In a recent study, Banerjee et al. (2021) produced an atlas of all major emission lines found in a large sample of 115 Galactic field Be stars using the 2-m Himalayan Chandra Telescope (HCT) facility located at Ladakh, India. This paper presents our further exploration of these stars to estimate the electron density in their discs. Our study using Balmer decrement values indicate that their discs are generally optically thick in nature with electron density (ne) in their circumstellar envelopes (CEs) being in excess of 1013 cm-3 for around 65% of the stars. For another 19% stars, the average ne in their discs probably range between 1012 cm-3 and 1013 cm-3. We noticed that the nature of the H? and H? line profiles might not influence the observed Balmer decrement values (i.e. D34 and D54) of the sample of stars. Interestingly, we also found that around 50% of the Be stars displaying D34 greater than 2.7 are of earlier spectral types, i.e. within B0B3. 2024 Societe Royale des Sciences de Liege. All rights reserved. -
Traffic Optimization and Route Detection Based on Air Quality and Pollution Level
This research outlines the development of a groundbreaking Traffic Optimization and Route Detection system based on pollution and air quality. Urbanization has led to increased vehicular traffic, exacerbating concerns about air pollution and its adverse effects on public health. The proposed system aims to address this critical issue by integrating real-time environmental data into route recommendations, prioritizing routes that minimize exposure to high-pollution areas. Beyond improving air quality, the system promotes the health and well-being of commuters, encourages the adoption of eco-friendly transportation modes, and contributes to overall environmental sustainability. An air quality detection system is developed to gather data for the development of the system. This innovative approach aligns with the goals of efficient urban mobility, sustainable transportation, and data-driven decision-making. Through this research, we anticipate providing valuable insights into the potential impact of integrating pollution and air quality considerations into urban transportation systems, ultimately contributing to healthier and more sustainable urban environments. 2024 IEEE. -
Enhancing Educational Adaptability: A Review and Analysis of AI-Driven Adaptive Learning Platforms
This study explores the transformative potential of AI-powered adaptive learning platforms (ALPs) in education, specifically focusing on personalized learning paths and their impact on student engagement and outcomes. Through a comprehensive analysis of four prominent ALPs - Carnegie Learning, DreamBox Learning, Smart Sparrow, and Knewton - this study investigates their approaches to content tailoring and feedback delivery. The comparative analysis highlights each platform's strengths and limitations, providing educators with valuable insights for informed selection and implementation. This study also considers the broader landscape of ALPs, acknowledging concerns such as bias, data privacy, and the role of educators in the tech-driven educational environment. The findings contribute to our understanding of how ALPs can empower educators, personalize learning, and address achievement gaps, offering a nuanced perspective on the complex tapestry of AI in education. 2024 IEEE. -
Unsupervised Learning for Understanding Diversity: Applying Feature Engineering and Cluster Analysis to Deaf and Hard of Hearing Data
As e-Learning emerges as a promising tool for instruction delivery, personalizing the e-Learning platform for DHH learners will benefit them to improve their learning engagement and educational attainment. This study aims to collect and analyze the different features unique to DHH learners and analyze the significant features among them. This study highlights the importance of addressing the diversity among DHH learners, while creating a personalized learning environment for them. With this focus, we employ the K-Means clustering algorithm to group the learners based on similar needs and preferences and identified that distinguishing clusters can be formed within the DHH group. We also tried to understand the significant features contributing to forming well separated groups. These results provide valuable insights into the diverse preferences and requirements when they interact with the learning materials. These findings emphasize the significance of personalized approach for DHH learners in educational settings and serve as the stepping stone to develop a personalized learning environment for them. 2024 IEEE. -
Data Economy: Data and Money
The article explores the concept of data economy, which is based on the sharing of data across platforms and ecosystems. Data has evolved from factual information to a new asset for companies worldwide, and the article discusses its evolution from brittle paper records to complex databases and algorithms like blockchain. With a prediction of a data explosion of about 175 zettabytes by 2025, data is used extensively in various domains, from agriculture to healthcare. The article also discusses how the data economy is not domain-specific but is a universal shift as all companies transition to become technology-driven companies. The data network effect is a cycle that uses data to acquire service users and generate more data. This has become a B2B service model that has added profits to various tech giants balance sheets. The article concludes by exploring the current need for data sharing across organizations and the future scope of the data economy. The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024. -
Role of Artificial Intelligence and Robotics in Shaping the Students: A Higher Educational Perspective
An unprecedented shift in technology has begun in the modern era. Robotics and artificial intelligence (AI) advancements have created fresh positions while de-skilling or retraining many existing ones. Technical developments at higher education institutions (HEIs) protect students against potential changes in their field of study brought on by A) and prepare them for success in the workplace. This research aims to investigate how, over the past 150 years; globalization has fundamentally changed human civilization. Conventional education confronts enormous challenges as energy, the internet of things, and the cyber-physical systems they oversee diminish. One may argue that energy, the internet of things, and the cyber-physical systems that are under its jurisdiction are the foundations of all future education. The demise of these systems presents a significant threat to traditional schooling. Students' screen time is increased by this action, which has an impact on their mental health. Five-fold cross-validation with 210 students from Delhi NCR and abroad is beneficial for the classification techniques SVM, Naive Bayes, and Random Forest. The study examined the factors that contributed to an increased rate of mental health issues among undergraduate students in Delhi, India, following the introduction of the COVID-19 virus. The results have demonstrated that while technology's practical applications will likely have a positive influence on education in the future, there may be negative effects as well. This is an opportunity for educators and learners to support excellence and remove obstacles that prevent many kids and schools from achieving it. Therefore, in the future, every nation will need to create an education system that is more technologically sophisticated. 2024 IEEE. -
Emoji Sentiment Analysis of User Reviews on Online Applications Using Supervised Machine Learning
Analyzing the sentiment behind emojis can provide valuable insights into the emotional context and user sentiment associated with textual content. To conduct a comparative analysis of diverse supervised machine learning models that can achieve the highest level of accuracy in Emoji Sentiment Analysis is the purpose of this research. Five machine learning models used in this research are K-Nearest Neighbors (KNN), Artificial Neural Network (ANN), Logistic Regression, Naive Bayes, and Random Forest. The experimental process resulted in ANN and KNN models giving an accuracy of 92%. The ANN model shows its proficiency in effectively managing large datasets. ANN also supports fault tolerance. The KNN model refrains from conducting calculations during the training phase and only constructs a model when a query is executed on the dataset. This characteristic makes KNN particularly well-suited for data mining. Both ANN and K-NN excelled in the experimental study due to these distinctive attributes. The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024. -
Optimizing Antenna Structures for 60 GHz Systems Microstrip Patch vs Microstrip Slot
This paper conducts a thorough comparison between microstrip patch and microstrip slot antennas for 60 GHz wireless communication systems, excluding the meander line antenna. The design process involves meticulous selection of substrate material, antenna geometry, and feed mechanism to achieve a compact, efficient, and wideband antenna suitable for 60 GHz applications. Performance analysis, based on theoretical derivations and HFSS simulator simulations, covers key parameters like radiation pattern, gain, and bandwidth. Results demonstrate that the proposed microstrip antenna meets 60 GHz system requirements, indicating potential for further optimization. The study highlights the unique advantages and disadvantages of each antenna structure, emphasizing that selection should align with specific application needs. This comparative analysis aids researchers and engineers in making informed decisions regarding the most suitable antenna structure for their 60 GHz wireless communication requirements. 2024 IEEE. -
An Efficient Detection and Prediction of Intrusion in Smart Grids Using Artificial Neural Networks
In recent years, fraud identification on Internet of Things (IoT) devices has been essential to obtaining better results in all fields, such as smart cities, smart grids, etc. As a result, there are more IoT devices in the smart grid's power management sectors, and apart from these identifications, intrusion into the smart grid is very difficult. Hence, to overcome this, a proposed intrusion detection system in a smart grid using an artificial neural network (ANN) has been used to detect the intrusion and improve the prediction rate, and it has been very effective on various faults injected into the smart grids in ranges and seasons. As per the simulation result, the proposed method shows better results as compared to a conventional neural network (CNN) with respect to the root mean square error in terms of weekly, monthly, and seasonal terms of 0.25%, 0.15%, and 0.26%, respectively. The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024. -
Emotion Detection Using Machine Learning Technique
Face Emotion Recognition (FER) is an emerging and crucial topic today; since much research has been done in this field, there are still many things to explore. In daily life, where people dont have time to fill out feedback, emotion detection plays an important role, which helps to know customer feedback by analyzing expressions and gestures. Analyzing current studies in emotion recognition demonstrates notable advancements made possible by deep learning. A thorough overview of facial emotion recognition (FER) is provided in this publication. The literature cited in this study is taken from various credible research published in the last 10years. This study has built a model for emotion recognition using photos or a camera. The paper is based on the concepts of Machine Learning (ML), Deep Learning (DL), and Neural Networks (NN). A range of publicly available datasets have been used to evaluate evaluation metrics. The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024.