<?xml version="1.0" encoding="UTF-8"?>
<item xmlns="http://omeka.org/schemas/omeka-xml/v5" itemId="1258" public="1" featured="1" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://omeka.org/schemas/omeka-xml/v5 http://omeka.org/schemas/omeka-xml/v5/omeka-xml-5-0.xsd" uri="https://archives.christuniversity.in/items/show/1258?output=omeka-xml" accessDate="2026-06-04T05:17:02+00:00">
  <fileContainer>
    <file fileId="1118">
      <src>https://archives.christuniversity.in/files/original/ac80541c744606dd8e266ad9201ca8de.pdf</src>
      <authentication>73c3f5ec669d12a4c7c20bb0757ace71</authentication>
    </file>
  </fileContainer>
  <collection collectionId="23">
    <elementSetContainer>
      <elementSet elementSetId="1">
        <name>Dublin Core</name>
        <description>The Dublin Core metadata element set is common to all Omeka records, including items, files, and collections. For more information see, http://dublincore.org/documents/dces/.</description>
        <elementContainer>
          <element elementId="50">
            <name>Title</name>
            <description>A name given to the resource</description>
            <elementTextContainer>
              <elementText elementTextId="64989">
                <text>MPHIL</text>
              </elementText>
            </elementTextContainer>
          </element>
        </elementContainer>
      </elementSet>
    </elementSetContainer>
  </collection>
  <itemType itemTypeId="18">
    <name>Mphil</name>
    <description>Mphil Thesis</description>
  </itemType>
  <elementSetContainer>
    <elementSet elementSetId="1">
      <name>Dublin Core</name>
      <description>The Dublin Core metadata element set is common to all Omeka records, including items, files, and collections. For more information see, http://dublincore.org/documents/dces/.</description>
      <elementContainer>
        <element elementId="50">
          <name>Title</name>
          <description>A name given to the resource</description>
          <elementTextContainer>
            <elementText elementTextId="6003">
              <text>Classification and Retrieval of Research

 
Classification and Retrieval of Research Papers: A Semantic Hierarchical Approach

</text>
            </elementText>
          </elementTextContainer>
        </element>
        <element elementId="39">
          <name>Creator</name>
          <description>An entity primarily responsible for making the resource</description>
          <elementTextContainer>
            <elementText elementTextId="6004">
              <text>Mirza Nazura  Abdulkarim</text>
            </elementText>
          </elementTextContainer>
        </element>
        <element elementId="40">
          <name>Date</name>
          <description>A point or period of time associated with an event in the lifecycle of the resource</description>
          <elementTextContainer>
            <elementText elementTextId="6005">
              <text>2010</text>
            </elementText>
          </elementTextContainer>
        </element>
        <element elementId="48">
          <name>Source</name>
          <description>A related resource from which the described resource is derived</description>
          <elementTextContainer>
            <elementText elementTextId="6006">
              <text>Computer Science</text>
            </elementText>
          </elementTextContainer>
        </element>
        <element elementId="41">
          <name>Description</name>
          <description>An account of the resource</description>
          <elementTextContainer>
            <elementText elementTextId="6007">
              <text>"Classification and Retrieval of Research papers: A Semantic Hierarchical Approach" demonstrates an effective and efficient technique for classification of Research documents pertaining to Computer Science. The explosion in the number of documents and research publications in electronic form and the need to perform a semantic search for their retrieval has been the incentive for this research.



The popularity and the widespread use of electronic documents and publications, has necessitated the development of an efficient document archival and retrieval mechanism. Categorizing journal papers by assigning them relevant and meaningful classes, predicting the latent concept or the topic of research, based on the relevant terms and assigning the appropriate Classification labels is the objective of this thesis.



This thesis takes a semantic approach and applies the text mining techniques in a hierarchical manner in order to classify the documents.



The use of a lexicon containing domain specific terms (DSL) adds a semantic dimension to classification and document retrieval. The Concept Prediction based on Term Relevance (CPTR) technique demonstrates a semantic model for assigning concepts or topics to papers.

This Thesis proposes a conceptual framework for organizing and classifying the research papers   pertaining   to   Computer   Science.   The   efficacy  of   the   proposed   concepts   is demonstrated with the help of Classification experiments. Classification experiments reveal that  the  DSL  technique  of  training  works  efficiently  when  categorization  is  based  on keywords. The CPTR technique, on the other hand, shows very high accuracy; when the classification is based on the contents of the document.

Both these techniques lend a semantic dimension to classification.



Narrowing down the scope of search at each level of hierarchy enables time efficient retrieval and access of the goal documents. The hierarchical interface for Document Retrieval enables retrieval of the target documents by gradually restricting the scope of search at each level of hierarchy

 

This work comprises of two main components.



1. The Framework for Hierarchical Classification.



2. The Hierarchical Interface for Document Retrieval.



Two distinct techniques for classification are proposed in this thesis. These include



1.   The use of Domain Specific Lexicon (DSL) which is comparable to a Domain Specific Ontology.



2.   The Concept Prediction Based on Term Relevance (CPTR) technique



These techniques lend a semantic dimension to classification.



Keywords: Text Mining, Classification, Document Retrieval, Hierarchical, Domain specific lexicon (DSL), Probabilistic Latent Semantic Analysis (PLSA) , Concept Prediction based on Term frequency (CPTR)

 </text>
            </elementText>
          </elementTextContainer>
        </element>
      </elementContainer>
    </elementSet>
  </elementSetContainer>
</item>
