Identifying and Removing Outlier Features Using Neighborhood Rough Set

2020 
The neighborhood rough set (NRS) is used to remove redundant features by identifying the neighborhood relation among samples with respect to each feature. In this study, a new NRS is proposed to identify and remove outlier features. An outlier score is calculated for each feature by measuring the neighborhood and non-neighborhood relations among samples with respect to that feature. Features whose outlier score falls below the average outlier score are removed from the data set. A support vector machine (SVM) and an extended SVM variant that reduces input features are used to evaluate the quality of the features selected by the proposed NRS. The experiment involves twelve real-world data sets. The results show that the proposed method effectively removes at least half of the features from these data sets. Although its classification accuracy is slightly lower than that of both SVM-based solutions, the proposed NRS with SVM removes significantly more input attributes and requires a much shorter execution time.
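The selection rule described above can be illustrated with a minimal sketch. The abstract does not give the scoring formula, so everything specific here is an assumption made for illustration: the score is taken to be the fraction of sample pairs that are not neighbors on a feature, two samples count as neighbors when their min-max scaled values differ by at most a hypothetical threshold `delta`, and the function names are invented. Only the final step, dropping features whose score is below the average score, comes from the abstract.

```python
from itertools import combinations


def non_neighbor_fraction(values, delta):
    """Assumed score: share of sample pairs farther apart than delta on one feature."""
    lo, hi = min(values), max(values)
    span = (hi - lo) or 1.0  # guard against constant features
    scaled = [(v - lo) / span for v in values]
    pairs = list(combinations(scaled, 2))
    far = sum(1 for a, b in pairs if abs(a - b) > delta)
    return far / len(pairs)


def select_features(rows, delta=0.1):
    """Keep features whose assumed score is at least the average score."""
    columns = list(zip(*rows))  # one tuple of sample values per feature
    scores = [non_neighbor_fraction(col, delta) for col in columns]
    mean_score = sum(scores) / len(scores)
    keep = [j for j, s in enumerate(scores) if s >= mean_score]
    return keep, scores


# Toy data: 4 samples, 3 features (feature 0 is constant).
rows = [
    [5, 0, 0.00],
    [5, 1, 0.05],
    [5, 0, 0.02],
    [5, 1, 1.00],
]
keep, scores = select_features(rows)
print(keep)  # [1, 2]
```

In this toy data the constant feature scores 0 and is dropped, while the other two score above the average and are kept. The paper's actual score may weigh neighborhood and non-neighborhood relations differently.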