Predicting protein–protein interactions by fusing various Chou's pseudo components and using wavelet denoising approach

2019 
Abstract Research on protein–protein interactions (PPIs) not only helps to reveal the nature of life activities but also plays a driving role in understanding the mechanisms of disease activity and the development of effective drugs. The rapid development of machine learning provides new opportunities and challenges for understanding the mechanism of PPIs. It plays an important role in the field of proteomics research. In recent years, an increasing number of computational methods for predicting PPIs have been developed. This paper proposes a new method for predicting PPIs based on multi-information fusion. First, the pseudo-amino acid composition (PseAAC), auto-covariance (AC) and encoding based on grouped weight (EBGW) methods are used to extract the features of protein sequences, and the extracted three groups of feature vectors were fused. Secondly, the fused feature vectors are denoised by two-dimensional (2-D) wavelet denoising. Finally, the denoised feature vectors are input to the support vector machine (SVM) classifier to predict the PPIs. The ACC of PPIs of Helicobacter pylori ( H. pylori ) and Saccharomyces cerevisiae ( S. cerevisiae ) datasets were 95.97% and 95.55% by 5-fold cross-validation test and compared with other prediction methods. The experimental results show that the proposed multi-information fusion prediction method can effectively improve the prediction performance of PPIs. The source code and all datasets are available at https://github.com/QUST-AIBBDRC/PPIs-WDSVM/ .
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    158
    References
    26
    Citations
    NaN
    KQI
    []