<?xml version="1.0" encoding="UTF-8"?>
<item xmlns="http://omeka.org/schemas/omeka-xml/v5" itemId="25369" public="1" featured="0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://omeka.org/schemas/omeka-xml/v5 http://omeka.org/schemas/omeka-xml/v5/omeka-xml-5-0.xsd" uri="https://archives.christuniversity.in/items/show/25369?output=omeka-xml" accessDate="2026-06-19T15:44:20+00:00">
  <collection collectionId="7">
    <elementSetContainer>
      <elementSet elementSetId="1">
        <name>Dublin Core</name>
        <description>The Dublin Core metadata element set is common to all Omeka records, including items, files, and collections. For more information, see http://dublincore.org/documents/dces/.</description>
        <elementContainer>
          <element elementId="50">
            <name>Title</name>
            <description>A name given to the resource</description>
            <elementTextContainer>
              <elementText elementTextId="3139">
                <text>Faculty Publications</text>
              </elementText>
            </elementTextContainer>
          </element>
        </elementContainer>
      </elementSet>
    </elementSetContainer>
  </collection>
  <itemType itemTypeId="28">
    <name>Conference Paper</name>
    <description>Faculty Publications - Conference Papers</description>
  </itemType>
  <elementSetContainer>
    <elementSet elementSetId="1">
      <name>Dublin Core</name>
      <description>The Dublin Core metadata element set is common to all Omeka records, including items, files, and collections. For more information, see http://dublincore.org/documents/dces/.</description>
      <elementContainer>
        <element elementId="39">
          <name>Creator</name>
          <description>An entity primarily responsible for making the resource</description>
          <elementTextContainer>
            <elementText elementTextId="249200">
              <text>Kokatnoor, Sujatha Arun; Shukla, Samiksha</text>
            </elementText>
          </elementTextContainer>
        </element>
        <element elementId="50">
          <name>Title</name>
          <description>A name given to the resource</description>
          <elementTextContainer>
            <elementText elementTextId="249201">
              <text>NLP and Topic Modeling in Healthcare: Identifying Diseases from Patient Histories</text>
            </elementText>
          </elementTextContainer>
        </element>
        <element elementId="40">
          <name>Date</name>
          <description>A point or period of time associated with an event in the lifecycle of the resource</description>
          <elementTextContainer>
            <elementText elementTextId="249202">
              <text>01-01-2026</text>
            </elementText>
          </elementTextContainer>
        </element>
        <element elementId="48">
          <name>Source</name>
          <description>A related resource from which the described resource is derived</description>
          <elementTextContainer>
            <elementText elementTextId="249203">
              <text>Smart Innovation, Systems and Technologies; Volume 454 SIST; pp. 501-516</text>
            </elementText>
          </elementTextContainer>
        </element>
        <element elementId="43">
          <name>Identifier</name>
          <description>An unambiguous reference to the resource within a given context</description>
          <elementTextContainer>
            <elementText elementTextId="249204">
              <text>&lt;a href="https://doi.org/10.1007/978-3-032-07837-7_38" target="_blank" rel="noreferrer noopener"&gt;https://doi.org/10.1007/978-3-032-07837-7_38&lt;/a&gt; &lt;br /&gt;&lt;br /&gt;&lt;a href="https://www.scopus.com/pages/publications/105036673854?origin=resultslist" target="_blank" rel="noreferrer noopener"&gt;https://www.scopus.com/pages/publications/105036673854?origin=resultslist&lt;/a&gt;</text>
            </elementText>
          </elementTextContainer>
        </element>
        <element elementId="38">
          <name>Coverage</name>
          <description>The spatial or temporal topic of the resource, the spatial applicability of the resource, or the jurisdiction under which the resource is relevant</description>
          <elementTextContainer>
            <elementText elementTextId="249205">
              <text>Kokatnoor S.A., Department of Computer Science and Engineering, School of Engineering and Technology, Christ University, Karnataka, Bengaluru, India; Shukla S., Department of Computer Science and Engineering, School of Engineering and Technology, Christ University, Karnataka, Bengaluru, India</text>
            </elementText>
          </elementTextContainer>
        </element>
        <element elementId="41">
          <name>Description</name>
          <description>An account of the resource</description>
          <elementTextContainer>
            <elementText elementTextId="249206">
              <text>Topic modeling and Natural Language Processing (NLP) have shown significant promise in the healthcare industry for extracting insightful information from unstructured patient histories, information that can help diagnose diseases and improve clinical decisions. In this study, patient histories are grouped into ten clusters using advanced K-Means clustering, with the Dunn Index used to validate clustering performance. After the clusters are formed, each cluster is subjected to topic modeling. Four topic modeling approaches are examined: Latent Dirichlet Allocation (LDA), Hierarchical Dirichlet Process (HDP), Latent Semantic Indexing (LSI), and Non-negative Matrix Factorization (NMF). These techniques are used to find disease-related terms in patient histories. The models are evaluated using coherence scores, which show the semantic significance of the terms produced, and execution times, which show the computational efficiency needed for real-time healthcare applications. According to experimental findings for the USMLE Step 2 Clinical Skills exam dataset, NMF and HDP generated the most cohesive terms, with NMF's faster execution time (1.67 s) making it appropriate for widespread healthcare applications. LDA and LSI, by contrast, offer a reasonable balance between coherence and computational demands. © The Author(s), under exclusive license to Springer Nature Switzerland AG 2026.</text>
            </elementText>
          </elementTextContainer>
        </element>
        <element elementId="49">
          <name>Subject</name>
          <description>The topic of the resource</description>
          <elementTextContainer>
            <elementText elementTextId="249207">
              <text>Dunn index; K-means clustering; Latent dirichlet allocation (LDA); Latent semantic indexing (LSI); Natural Language Processing (NLP); Non-negative matrix factorization (NMF); Personalized healthcare; Topic modeling</text>
            </elementText>
          </elementTextContainer>
        </element>
        <element elementId="45">
          <name>Publisher</name>
          <description>An entity responsible for making the resource available</description>
          <elementTextContainer>
            <elementText elementTextId="249208">
              <text>Springer Science and Business Media Deutschland GmbH</text>
            </elementText>
          </elementTextContainer>
        </element>
        <element elementId="46">
          <name>Relation</name>
          <description>A related resource</description>
          <elementTextContainer>
            <elementText elementTextId="249209">
              <text>ISSN: 2190-3018; ISBN: 978-3-032-07836-0</text>
            </elementText>
          </elementTextContainer>
        </element>
        <element elementId="44">
          <name>Language</name>
          <description>A language of the resource</description>
          <elementTextContainer>
            <elementText elementTextId="249210">
              <text>English</text>
            </elementText>
          </elementTextContainer>
        </element>
        <element elementId="51">
          <name>Type</name>
          <description>The nature or genre of the resource</description>
          <elementTextContainer>
            <elementText elementTextId="249211">
              <text>Conference paper</text>
            </elementText>
          </elementTextContainer>
        </element>
        <element elementId="47">
          <name>Rights</name>
          <description>Information about rights held in and over the resource</description>
          <elementTextContainer>
            <elementText elementTextId="249212">
              <text>Restricted Access; Hardcopy may be available in the library</text>
            </elementText>
          </elementTextContainer>
        </element>
        <element elementId="42">
          <name>Format</name>
          <description>The file format, physical medium, or dimensions of the resource</description>
          <elementTextContainer>
            <elementText elementTextId="249213">
              <text>online</text>
            </elementText>
          </elementTextContainer>
        </element>
      </elementContainer>
    </elementSet>
  </elementSetContainer>
</item>
