VSMP: A novel variable selection and modeling method based on the prediction
2003
The use of numerous descriptors that are indicative of molecular structure and topology is becoming more common in quantitative structure−activity relationship (QSAR). How to choose the adequate descriptors for QSAR studies is important but difficult because there are no absolute rules to govern this choice. A variety of variable selection techniques including stepwise, partial least squares/principal component analysis (PLS/PCA), neural network, and evolutionary algorithm such as genetic algorithm have been applied to this common problem. All-subsets regression (ASR) is capable of finding out the best variable subset from among a large pool. In this paper, a novel variable selection and modeling method based on the prediction, for short VSMP, has been developed. Here two controllable parameters, the interrelation coefficient between the pairs of the independent variables (rint) and the correlation coefficient (q2) obtained using the leave-one-out (LOO) cross-validation technique, are introduced into the ...
Keywords:
- Principal component analysis
- Feature selection
- Variables
- Partial least squares regression
- Genetic algorithm
- Evolutionary algorithm
- Correlation coefficient
- Statistics
- Mathematics
- Quantitative structure–activity relationship
- Artificial neural network
- Pattern recognition
- Combinatorics
- Machine learning
- Artificial intelligence
- Correction
- Source
- Cite
- Save
- Machine Reading By IdeaReader
25
References
91
Citations
NaN
KQI