Maximal Information Coefficient and Support Vector Regression Based Nonlinear Feature Selection and QSAR Modeling on Toxicity of Alcohol Compounds to Tadpoles of Rana temporaria
文献类型: 外文期刊
第一作者: Wang, Cong
作者: Wang, Cong;Zhou, Xiaomao;Bai, Lianyang;Wang, Lifeng;Wang, Cong;Zhou, Xiaomao;Bai, Lianyang;Xing, Pengwei;Dai, Zhijun
作者机构:
关键词: alcohol compounds; Rana temporaria; feature selection; support vector regression (SVR); qualitative structure-activity relationship (QSAR)
期刊名称:JOURNAL OF THE BRAZILIAN CHEMICAL SOCIETY ( 影响因子:1.838; 五年影响因子:1.9 )
ISSN: 0103-5053
年卷期: 2019 年 30 卷 2 期
页码:
收录情况: SCI
摘要: Efficient evaluation of biotoxicity of organics is of vital significance to resource utilization and environmental protection. In this study. toxicity of 110 alcohol compounds to tadpoles of Rana temporaria is adopted as the dependent variable and 1388 physiochemical parameters (features) calculated by PCLIENT are used for representing each compound. A feature selection pipeline with three steps is developed to refine the feature subset: 282 features that significantly correlated with biotoxicity of chemical compounds are preliminarily selected via the maximum information coefficient (MIC); 138 descriptors that have positive contribution to the model's performance are reserved after a support vector regression (SVR) based backward elimination; 18 descriptors are finally selected via a forward selection process that integrated minimal redundancy maximal relevance (mRMR), MIC and SVR. In terms of feature subsets with different numbers of variables, quantitative structure activity relationship (QSAR) models are built using multiple linear regression (MLR), partial least square regression (PLS) and SVR, respectively. The independent prediction evaluation index. Q(2), increases from -74.787, 0.824 and 0.868 to 0.892. 0.878 and 0.940, for the three regression models, respectively. Results suggest that nonlinear feature selection methods involved in MIC and SVR can effectively eliminate irrelevant descriptors. SVR outperforms classical statistical models to QSAR modeling on high-dimensional data containing nonlinear relationship between features. The methods proposed in this study have a potential application in the QSAR research field such as biotoxicity compounds.
分类号:
- 相关文献
作者其他论文 更多>>
-
No-tillage practice enhances soil total carbon content in a sandy Cyperus esculentus L. field
作者:Wang, Cong;Hu, Yuxiang;Wu, Hui;Wang, Zhirui;Cai, Jiangping;Wang, Zhengwen;Li, Hui;Wang, Cong;Hu, Yuxiang;Wu, Hui;Liu, Heyong;Jiang, Yong;Ren, Wei;Yang, Ning
关键词:No-tillage; Bacterial community composition; Bacterial function prediction; Aeolian sandy soil;
Cyperus esculentus L. -
The UDP-glycosyltransferase UGT352A3 contributes to the detoxification of thiamethoxam and imidacloprid in resistant whitefly
作者:Du, Tianhua;Xue, Hu;Zhang, Youjun;Yang, Xin;Du, Tianhua;Zhou, Xiaomao;Gui, Lianyou;Belyakova, Natalia A.
关键词:UDP-glycosyltransferases; Enzyme activity; Bemisia tabaci; Insecticide resistance; Molecular docking
-
Design, synthesis, and biological evaluation of oxime ether derivatives containing 1,5-dimethyl-6-thioxo-1,3,5-triazinane-2,4-dione as protoporphyrinogen IX oxidase inhibitors
作者:Luo, Dingfeng;Wang, Yingying;Ma, Changsheng;Li, Hao;Yan, Sheng;Bai, Zhendong;Bai, Lianyang;Li, Zuren;Luo, Dingfeng;Wang, Yingying;Ma, Changsheng;Bai, Lianyang;Li, Zuren;Wan, Yuanhui;Bai, Lianyang;Li, Zuren
关键词:herbicidal activity; transcriptomics; substructure splicing; PPO inhibitor
-
Production and Subsequent Application of Different Biochar-based Organic Fertilizers to Enhance Vegetable Quality and Soil Carbon Stability
作者:Zhang, Jining;Zhang, Xianxian;Wang, Cong;Sun, Huifeng;Zhou, Sheng;Zhang, Jining;Zhang, Xianxian;Wang, Cong;Sun, Huifeng;Zhou, Sheng;Zhang, Jining;Zhang, Xianxian;Wang, Cong;Sun, Huifeng;Zhou, Sheng;Ge, Li-ao;Chen, Honghui;Huang, Jian;Yang, Yuxiang
关键词:Biochar; Biochar-based organic fertilizer; Carbon stability; Compost; Vegetable
-
Impact of irrigation strategies on methane emission and absorption characteristics at different interfaces in rice field systems
作者:Liu, Lei;Zhou, Sheng;Wang, Cong;Sun, Huifeng;Zhang, Xianxian;Zhang, Jining;Jiang, Zheng;Zhou, Sheng;Wang, Cong;Sun, Huifeng;Zhang, Xianxian;Zhang, Jining;Jiang, Zheng;Zhou, Sheng;Wang, Cong;Sun, Huifeng;Zhang, Xianxian;Zhang, Jining;Jiang, Zheng;Zhou, Sheng
关键词:Rice paddy; Irrigation regimes; Methane; Stable isotope; Interface exchange
-
A Supramolecular Material for Controlling Kiwifruit Bacterial Canker
作者:Deng, Xile;Bai, Lianyang;Zhang, Jichuan;Liu, Tianqi;Zhang, Jiaheng;Bian, Qiang;Zhang, Yizhuo;Zhang, Li;Zhou, Mingqing;Xie, Le
关键词:bactericidal activity; kiwifruit bacterial canker (KBC); molecular simulations; nano pesticide; supramolecular nanocarrier
-
Research on the Economic Loss Model of Invasive Alien Species Based on Multidimensional Data Spatialization-A Case Study of Economic Losses Caused by Hyphantria cunea in Jiangsu Province
作者:Li, Cheng;Zhou, Yongbin;Wang, Cong;Pan, Xubin;Wang, Ying;Qi, Xiaofeng;Wan, Fanghao
关键词:
Hyphantria cunea ; economic loss assessment model; DID model; SDMs; MaxEnt; future climate conditions