Classification by Clustering (CbC): An Approach of Classifying Big Data based on Similarities

UIU Institutional Repository

    • Login
    View Item 
    •   UIU DSpace Home
    • School of Science and Engineering (SoSE)
    • Department of Computer Science and Engineering (CSE)
    • B.Sc Thesis/Project
    • View Item
    •   UIU DSpace Home
    • School of Science and Engineering (SoSE)
    • Department of Computer Science and Engineering (CSE)
    • B.Sc Thesis/Project
    • View Item
    JavaScript is disabled for your browser. Some features of this site may not work without it.

    Classification by Clustering (CbC): An Approach of Classifying Big Data based on Similarities

    Thumbnail
    View/Open
    Classification by Clustering (CbC) An Approach of Classifying Big Data based on Similarities.pdf (968.6Kb)
    Date
    2019-01-30
    Author
    Khan, Sakib Shahriar
    Ahamed, Shakim
    Jannat, Miftahul
    Monwar, Irin
    Metadata
    Show full item record
    Abstract
    Data classification in supervised learning is the process of classifying data for data mining task that helps to analyses data for decision making. The objective of a classification model is to correctly predict the categorical class labels of known/ unknown instances. In machine learning for data mining applications, the classification models are trained based on labelled training data sets. In this paper, we have investigated if we can build a classification model based on the similarities of the instances instead of class labels of instances. Data labeling is always very costly and time consuming process, and it's become very difficult task if the data is big data. The proposed approach clusters the big data and builds the classifier based on the clusters without considering the class labels, which basically improve the performance of the classifier. However, we can relate the clusters with class labels. We have collected 10 big data from the UC Irvine machine learning repository for experimental analysis and applied three popular decision tree induction algorithms: ID3 (Iterative Dichotomiser 3), C4.5 (extension of ID3 algorithm), and CART (Classification & Regression Tree) for classifier construction.
    URI
    http://dspace.uiu.ac.bd/handle/52243/743
    Collections
    • B.Sc Thesis/Project [82]

    Copyright 2003-2017 United International University
    Contact Us | Send Feedback
    Developed by UIU CITS
     

     

    Browse

    All of DSpaceCommunities & CollectionsBy Issue DateAuthorsTitlesSubjectsThis CollectionBy Issue DateAuthorsTitlesSubjects

    My Account

    LoginRegister

    Copyright 2003-2017 United International University
    Contact Us | Send Feedback
    Developed by UIU CITS