Including category information as supplements in latent semantic analysis of Hindi documents
- Title
- Including category information as supplements in latent semantic analysis of Hindi documents
- Creator
- Krishnamurthi K.; Panuganti V.R.; Bulusu V.V.
- Description
- Latent semantic analysis (LSA) is a mathematical model that is used to capture the semantic structure of documents by using the correlations between the textual elements in them. LSA captures the semantic structure very well being independent of external sources of semantics. However, the model's performance increases when it is supplemented with extra information. The work presented in this paper is to modify the model to analyse word correlations in documents by considering the document category information as supplements in the process. This enhancement is called supplemented latent semantic analysis (SLSA). SLSA's performance is empirically evaluated in a document classification application by comparing the accuracies of classification against plain LSA for various term weighting schemes. An increment of 1.14%, 1.30% and 1.63% is observed in the classification accuracies when SLSA is compared with plain LSA for tf, idf and tfidf respectively in the initial term-bydocument matrix. Copyright 2017 Inderscience Enterprises Ltd.
- Source
- International Journal of Computational Science and Engineering, Vol-15, No. 45689, pp. 138-145.
- Date
- 2017-01-01
- Publisher
- Inderscience Publishers
- Subject
- Dimensionality reduction; Document classification; Latent semantic analysis; LSA; Semantic structure; Singular value decomposition
- Coverage
- Krishnamurthi K., Department of Computer Science, Christ University, Bangalore, India; Panuganti V.R., Department of Computer Science and Engineering, Gokaraju Rangaraju Institute of Engineering and Technology, Hyderabad, India; Bulusu V.V., Department of Computer Science and Engineering, JNTUHCEJ, Karimnagar, India
- Rights
- Restricted Access
- Relation
- ISSN: 17427185
- Format
- Online
- Language
- English
- Type
- Article
Collection
Citation
Krishnamurthi K.; Panuganti V.R.; Bulusu V.V., “Including category information as supplements in latent semantic analysis of Hindi documents,” CHRIST (Deemed To Be University) Institutional Repository, accessed February 25, 2025, https://archives.christuniversity.in/items/show/17195.