A New Bayesian Classification Algorithm for Non-Balance Datasets

2010 
Based on the idea of semi-supervised learning,a new Bayesian classifier model by using an improved EM (Expectation-Maximum) algorithm is proposed to classify and predict non-balance data gathered from mobile communication networks.Firstly,a statistical analysis is performed to calculate the priori probabilities based on the actual data.By using these priori probabilities as the initial values of the Bayesian model,we can speed up the convergence process of the EM algorithm.Secondly,a classifier based on the Bayesian network is constructed to learn the category characteristics of the historic communication data by improving the EM (Expectation-Maximum) steps.Thirdly,by using this classifier,the label of the current data sample is predicted.The experimental results demonstrate that,the proposed method highly increases the prediction accuracy of the negative label,and gains better performance than the traditional statistical methods.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    0
    References
    0
    Citations
    NaN
    KQI
    []