Large-scale recognition of high-affinity protease–inhibitor complexes using topological autocorrelation and support vector machines

2016 
Several methods have been developed for the computer-aided discovery of new protease substrates and inhibitors. In this paper, we report a novel machine learning implementation to identify high-affinity protease–inhibitor complexes. The implemented proteochemometrics algorithm consists of creating topological autocorrelation descriptors for proteases and inhibitors, and then to develop support vector machine models to relate the feature vectors to the affinity class (high or low) of hypothetical protein–inhibitor complexes based on experimental inhibition constant (Ki) values. The approach based on the autocorrelation features surpassed an atom-centred (AC) approach using AC information of inhibitors. Unique to our approach is that our final classifier could recognise 80% of inhibition-complex of new ligands to be stable or unstable using only chemical connectivity of the ligands and sequence information of the targets. Moreover, the analysis of substructure classification showed a very homogenous behavio...
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    61
    References
    1
    Citations
    NaN
    KQI
    []