Machine learning prediction of nanoparticle in vitro toxicity: a comparative study of classifiers and ensemble-classifiers using the Copeland Index

2019 
Abstract Nano-Particles (NPs) are well established as important components across a broad range of products from cosmetics to electronics. Their utilization is increasing with their significant economic and societal potential yet to be fully realized. Inroads have been made in our understanding of the risks posed to human health and the environment by NPs but this area will require continuous research and monitoring. In recent years Machine Learning (ML) techniques have exploited large datasets and computation power to create breakthroughs in diverse fields from facial recognition to genomics. More recently, ML techniques have been applied to nanotoxicology with very encouraging results. In this study, categories of ML classifiers (rules, trees, lazy, functions and bayes) were compared using datasets from the Safe and Sustainable Nanotechnology (S2NANO) database to investigate their performance in predicting NPs in vitro toxicity. Physicochemical properties, toxicological and quantum-mechanical attributes and in vitro experimental conditions were used as input variables to predict the toxicity of NPs based on cell viability. Voting, an ensemble meta-classifier, was used to combine base models to optimize the classification prediction of toxicity. To facilitate inter-comparison, a Copeland Index was applied that ranks the classifiers according to their performance and suggested the optimal classifier. Neural Network (NN) and Random forest (RF) showed the best performance in the majority of the datasets used in this study. However, the combination of classifiers demonstrated an improved prediction resulting meta-classifier to have higher indices. This proposed Copeland Index can now be used by researchers to identify and clearly prioritize classifiers in order to achieve more accurate classification predictions for NP toxicity for a given dataset.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    92
    References
    17
    Citations
    NaN
    KQI
    []