Performance Analysis of Supervised Classifiers Using PCA Based Techniques on Breast Cancer

2019 
Focus of this paper is to recognize tumorous (malignant) and non-tumorous (benign) from the dataset. Wisconsin breast cancer data (WBCD) has been used and taken from UCI machine learning repository. Most popular supervised learning classifiers with PCA based dimensionality rebate techniques applied. Support Vector Machine, K Nearest Neighbor, Decision Tree, Naive Bayes and Logistic Regression used with Linear, Sigmoid, Cosine, Poly and Radial basis function based PCA's. Numerous performance metrics tested after getting confusion matrix. Among them accuracy, sensitivity, specificity, false positive rate, false omission rate, precision, prevalence, f1-score, negative predicted value, false negative rate, false discovery rate and markedness. Our best performing models then relatively compared with other existing models. Sigmoid based Naive Bayes exhibits best accuracy of 99.20%.K Nearest Neighbor also illustrate superb performance with all kernel PCA based techniques. Accuracy ranges from 96.4% to 97.8%
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    0
    References
    8
    Citations
    NaN
    KQI
    []