<?xml version="1.0" encoding="UTF-8"?>
<item xmlns="http://omeka.org/schemas/omeka-xml/v5" itemId="19801" public="1" featured="0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://omeka.org/schemas/omeka-xml/v5 http://omeka.org/schemas/omeka-xml/v5/omeka-xml-5-0.xsd" uri="https://archives.christuniversity.in/items/show/19801?output=omeka-xml" accessDate="2026-05-13T10:19:19+00:00">
  <collection collectionId="16">
    <elementSetContainer>
      <elementSet elementSetId="1">
        <name>Dublin Core</name>
        <description>The Dublin Core metadata element set is common to all Omeka records, including items, files, and collections. For more information see, http://dublincore.org/documents/dces/.</description>
        <elementContainer>
          <element elementId="50">
            <name>Title</name>
            <description>A name given to the resource</description>
            <elementTextContainer>
              <elementText elementTextId="51377">
                <text>Conference Papers</text>
              </elementText>
            </elementTextContainer>
          </element>
        </elementContainer>
      </elementSet>
    </elementSetContainer>
  </collection>
  <itemType itemTypeId="28">
    <name>Conference Paper</name>
    <description>Faculty Publications- Conference Papers</description>
  </itemType>
  <elementSetContainer>
    <elementSet elementSetId="1">
      <name>Dublin Core</name>
      <description>The Dublin Core metadata element set is common to all Omeka records, including items, files, and collections. For more information see, http://dublincore.org/documents/dces/.</description>
      <elementContainer>
        <element elementId="50">
          <name>Title</name>
          <description>A name given to the resource</description>
          <elementTextContainer>
            <elementText elementTextId="172816">
              <text>Preprocessing Big Data using Partitioning Method for Efficient Analysis</text>
            </elementText>
          </elementTextContainer>
        </element>
        <element elementId="49">
          <name>Subject</name>
          <description>The topic of the resource</description>
          <elementTextContainer>
            <elementText elementTextId="172817">
              <text>Data Analytics; Partitioning; Preprocessing; Smart Data</text>
            </elementText>
          </elementTextContainer>
        </element>
        <element elementId="41">
          <name>Description</name>
          <description>An account of the resource</description>
          <elementTextContainer>
            <elementText elementTextId="172818">
              <text>Big data collection is the process of gathering unprocessed and unstructured data from disparate sources. As data deluge, the large volume of data collected and integrated consist missing values, outliers, and redundant records. This makes the big dataset insignificant for processing and mining knowledge. Also, it unnecessarily consumes large amount of valuable storage for storing redundant data and meaningless data. The result obtained after applying mining techniques in this insignificant data lead to wrong inferences. This makes it inevitable to preprocess data in order to store and process big dataset effectively and draw correct inferences. When data is preprocessed before analytics the storage consumption is less and computation and communication complexity is reduced. The analytics result is of high quality and the needed time for processing is considerably reduced. Preprocessing data is inevitable for applying any analytics algorithm to obtain valuable pattern. The quality of knowledge mined from large volume of big data depends on the quality of input data used for processing. The major steps in big data preprocessing include data integration from disparate sources, missing value imputation, outlier detection and treatment, and handling redundant data. The process of integration includes steps such as extraction, transformation, and loading. The data extraction step gathers useful data used for analytics and the transformation process organize the collected data in structured format suitable for analytics. The role of load process is to store transformed data into secured storage so that data can be obtained and processed effectively in future. This work provides preprocessing techniques for big data that deals with missing values and outliers and results in obtaining quality data partitions.   2023 IEEE.</text>
            </elementText>
          </elementTextContainer>
        </element>
        <element elementId="39">
          <name>Creator</name>
          <description>An entity primarily responsible for making the resource</description>
          <elementTextContainer>
            <elementText elementTextId="172819">
              <text>Reena M.J.</text>
            </elementText>
          </elementTextContainer>
        </element>
        <element elementId="48">
          <name>Source</name>
          <description>A related resource from which the described resource is derived</description>
          <elementTextContainer>
            <elementText elementTextId="172820">
              <text>Proceedings of IEEE InC4 2023 - 2023 IEEE International Conference on Contemporary Computing and Communications</text>
            </elementText>
          </elementTextContainer>
        </element>
        <element elementId="45">
          <name>Publisher</name>
          <description>An entity responsible for making the resource available</description>
          <elementTextContainer>
            <elementText elementTextId="172821">
              <text>Institute of Electrical and Electronics Engineers Inc.</text>
            </elementText>
          </elementTextContainer>
        </element>
        <element elementId="40">
          <name>Date</name>
          <description>A point or period of time associated with an event in the lifecycle of the resource</description>
          <elementTextContainer>
            <elementText elementTextId="172822">
              <text>2023-01-01</text>
            </elementText>
          </elementTextContainer>
        </element>
        <element elementId="43">
          <name>Identifier</name>
          <description>An unambiguous reference to the resource within a given context</description>
          <elementTextContainer>
            <elementText elementTextId="172823">
              <text>&lt;a href="https://doi.org/10.1109/InC457730.2023.10262924" target="_blank" rel="noreferrer noopener"&gt;https://doi.org/10.1109/InC457730.2023.10262924&lt;/a&gt;
&lt;br /&gt;&lt;br /&gt;&lt;a href="https://www.scopus.com/inward/record.uri?eid=2-s2.0-85174734797&amp;amp;doi=10.1109%2FInC457730.2023.10262924&amp;amp;partnerID=40&amp;amp;md5=a801b60481d9b624c2c690700a4719d8" target="_blank" rel="noreferrer noopener"&gt;https://www.scopus.com/inward/record.uri?eid=2-s2.0-85174734797&amp;amp;doi=10.1109%2fInC457730.2023.10262924&amp;amp;partnerID=40&amp;amp;md5=a801b60481d9b624c2c690700a4719d8&lt;/a&gt;</text>
            </elementText>
          </elementTextContainer>
        </element>
        <element elementId="47">
          <name>Rights</name>
          <description>Information about rights held in and over the resource</description>
          <elementTextContainer>
            <elementText elementTextId="172824">
              <text>Restricted Access</text>
            </elementText>
          </elementTextContainer>
        </element>
        <element elementId="46">
          <name>Relation</name>
          <description>A related resource</description>
          <elementTextContainer>
            <elementText elementTextId="172825">
              <text>ISBN: 979-835033577-4</text>
            </elementText>
          </elementTextContainer>
        </element>
        <element elementId="42">
          <name>Format</name>
          <description>The file format, physical medium, or dimensions of the resource</description>
          <elementTextContainer>
            <elementText elementTextId="172826">
              <text>Online</text>
            </elementText>
          </elementTextContainer>
        </element>
        <element elementId="44">
          <name>Language</name>
          <description>A language of the resource</description>
          <elementTextContainer>
            <elementText elementTextId="172827">
              <text>English</text>
            </elementText>
          </elementTextContainer>
        </element>
        <element elementId="51">
          <name>Type</name>
          <description>The nature or genre of the resource</description>
          <elementTextContainer>
            <elementText elementTextId="172828">
              <text>Conference paper</text>
            </elementText>
          </elementTextContainer>
        </element>
        <element elementId="38">
          <name>Coverage</name>
          <description>The spatial or temporal topic of the resource, the spatial applicability of the resource, or the jurisdiction under which the resource is relevant</description>
          <elementTextContainer>
            <elementText elementTextId="172829">
              <text>Reena M.J., CHRIST (Deemed to Be University), Department of Computer Science and Engineering, Bangalore, India</text>
            </elementText>
          </elementTextContainer>
        </element>
      </elementContainer>
    </elementSet>
  </elementSetContainer>
</item>
