Categorization of Dissertation using Machine Learning Techniques

2020 
Machine learning techniques are widely used to take intelligent decisions in industrial and educational domains. In the educational domain, when a research scholar submits a dissertation, then it has to be indexed and classified. The number of dissertations that are submitted in an educational institute is usually high and if done manually, it becomes difficult to index and classify correctly. This study applies machine learning techniques to automate the indexing and categorization of dissertations. We have focused on dissertations from the Engineering, Medical, Social Science, and General Science fields. We used the Bag of Words (BoW) method to extract features and K-means, Density-based spatial clustering of applications with noise (DBSCAN) and Expectation-Maximisation (EM) to train our model. Our experimental results reveal that the proposed K- means technique for indexing and categorization leads to higher accuracy and significant reduction in negative predictions as compared to DBSCAN and Expectation-Maximisation (EM).
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    14
    References
    0
    Citations
    NaN
    KQI
    []