Introduction to Data Mining and Knowledge Discovery
- Title
- Introduction to Data Mining and Knowledge Discovery
- Creator
- Chakraborty S.; Islam S.H.; Samanta D.
- Description
- Data mining is a process of discovering some necessary hidden patterns from a large chunk of data that can be stored in multiple heterogeneous resources. It has an enormous use to make strategic decisions by business executives after analyzing the hidden truth of data. Data mining one of the steps in the knowledge-creation process. A data mining system consists of a data warehouse, a database server, a data mining engine, a pattern analysis module, and a graphical user interface. Data mining techniques include mining the frequent patterns and association learning rules with analysis, sequence analysis. Data mining technique is applicable on the top of various kinds of intelligent data storage systems such as data warehouses. It provides some analysis processes to make some useful strategic decisions. There are various issues and challenges faced by a data mining system in large databases. It provides a great place to work for data researchers and developers. Data mining is the process of classification, which can be executed based on the examination of training data (i.e., objects whose class label is predefined). With the help of an expert set of previous class objects with known class labels, it can find a model that can predict a class object with an unknown class label. These classification models can be classified into a variety of categories, including nearest neighbor, neural network, and others. Bayesian model, decision tree, neural network Random forest, decision trees Support vector machine, random forest SVM (support vector machine), for example. By analyzing the most common class among k closest samples, the K-Nearest Neighbor (KNN) technique aids in predicting of the class object with the unknown class label. Its an easy-to-use strategy that yields a solid classification result from any distribution. The Naive Bayes theory helps to perform the classification. It is one of the fastest classification algorithms, capable of efficiently handling real-world discrete data. 2022, The Author(s), under exclusive license to Springer Nature Switzerland AG.
- Source
- EAI/Springer Innovations in Communication and Computing, pp. 1-22.
- Date
- 2022-01-01
- Publisher
- Springer Science and Business Media Deutschland GmbH
- Coverage
- Chakraborty S., JIS University, Dum Dum Cantonment, India; Islam S.H., Indian Institute of Information Technology Kalyani, West Bengal, India; Samanta D., Department of Computer Science, CHRIST (Deemed to be University), Bangalore, India
- Rights
- Restricted Access
- Relation
- ISSN: 25228595
- Format
- Online
- Language
- English
- Type
- Book chapter
Collection
Citation
Chakraborty S.; Islam S.H.; Samanta D., “Introduction to Data Mining and Knowledge Discovery,” CHRIST (Deemed To Be University) Institutional Repository, accessed February 23, 2025, https://archives.christuniversity.in/items/show/18678.