Quantitative Sequence- Activity Model Analysis of Oligopeptides Coupling an Improved High- Dimension Feature Selection Method with Support Vector Regression

文献类型: 外文期刊

第一作者: Dai, Zhijun

作者: Dai, Zhijun;Zhang, Hongyan;Yuan, Zheming;Wang, Lifeng;Dai, Zhijun;Zhang, Hongyan;Bai, Lianyang;Yuan, Zheming;Bai, Lianyang

作者机构:

关键词: feature selection;high-dimension feature;oligopeptides;quantitative sequence-activity model;support vector machine

期刊名称:CHEMICAL BIOLOGY & DRUG DESIGN ( 影响因子:2.817; 五年影响因子:2.631 )

ISSN: 1747-0277

年卷期: 2014 年 83 卷 4 期

页码:

收录情况: SCI

摘要: Five hundred and thirty-one physicochemical property parameters of amino acids were directly used as descriptors to characterize the structure of oligopeptides. Based on support vector regression (SVR), a novel rapid selection method called binary matrix resetting filter (BMRF) was proposed to nonlinearly select high-dimensional features and then multiround last-elimination (MRLE) was used for subtle screening. The reserved descriptors were used to construct the regression model with SVR, which was then applied to the quantitative sequence-activity model (QSAM) analysis for two oligopeptide systems. Compared with the widely used 16 kinds of amino acid descriptors, four QSAM modeling methods and four feature selection methods, our work shows a significant improvement in modeling performance, especially in external prediction. Furthermore, the real biochemical significance corresponding to reserved descriptors can be given directly, and the interpretability of the established QSAM model is improved significantly. This novel method has a high potential to become an available tool for regression analysis of high-dimension data, such as QSAM modeling of peptides or even proteins.

分类号:

  • 相关文献

[1]QSAR modeling of E. coli promoters with parameters selected by binary matrix shuffling filter. Wang, Li-Feng,Dai, Zhi-Jun,Yuan, Zhe-Ming,Wang, Kai,Wang, Li-Feng,Dai, Zhi-Jun,Yuan, Zhe-Ming,Bai, Lian-Yang.

[2]Optimization of Enzymatic Production of Oligopeptides from Apricot Almonds Meal with Neutrase and N120P. Wang, Chunyan,Wang, Qiang,Tian, Jinqiang. 2010

[3]An Improved Combination of Spectral and Spatial Features for Vegetation Classification in Hyperspectral Images. Fu, Yuanyuan,Zhao, Chunjiang,Yang, Guijun,Song, Xiaoyu,Feng, Haikuan,Fu, Yuanyuan,Zhao, Chunjiang,Yang, Guijun,Song, Xiaoyu,Feng, Haikuan,Fu, Yuanyuan,Zhao, Chunjiang,Yang, Guijun,Song, Xiaoyu,Feng, Haikuan,Fu, Yuanyuan,Zhao, Chunjiang,Yang, Guijun,Song, Xiaoyu,Feng, Haikuan,Wang, Jihua,Jia, Xiuping. 2017

[4]A COMPARATIVE ANALYSIS OF MUTUAL INFORMATION BASED FEATURE SELECTION FOR HYPERSPECTRAL IMAGE CLASSIFICATION. Fu, Yuanyuan,Fu, Yuanyuan,Jia, Xiuping,Huang, Wenjiang,Wang, Jihua. 2014

[5]A pipeline for improved QSAR analysis of peptides: physiochemical property parameter selection via BMSF, near-neighbor sample selection via semivariogram, and weighted SVR regression and prediction. Wang, Lifeng,Chen, Yuan,Yuan, Zheming,Dai, Zhijun,Wang, Lifeng,Chen, Yuan,Bai, Lianyang,Yuan, Zheming,Wang, Haiyan,Bai, Lianyang.

[6]How do temporal and spectral features matter in crop classification in Heilongjiang Province, China?. Hu Qiong,Wu Wen-bin,Song Qian,Lu Miao,Chen Di,Yu Qiang-yi,Tang Hua-jun,Song Qian. 2017

[7]Design and Implementation of Novel Agricultural Remote Sensing Image Classification Framework through Deep Neural Network and Multi-Feature Analysis. Zhang, Youzhi. 2015

[8]Dynamic monitoring and driving power analysis of LUCC based on remote sensing in Beijing in recent thirty years. Gu, Xiaohe,Guo, Wei,Dong, Yansheng,Wang, Yanchang. 2013

[9]Survey of Support Vector Machine in the Processing of Remote Sensing Image. Li, Su,Wang, Wenchao. 2013

[10]Comparative Study on Remote Sensing Invertion Methods for Estimating Winter Wheat Leaf Area Index. Xie Qiao-yun,Huang Wen-jiang,Peng Dai-liang,Zhang Qing,Xie Qiao-yun,Liang Dong,Huang Lin-sheng,Zhang Dong-yan,Cai Shu-hong,Yang Gui-jun. 2014

[11]Support-Vector-Machine-Based Models for Modeling Daily Reference Evapotranspiration With Limited Climatic Data in Extreme Arid Regions. Wen, Xiaohu,Si, Jianhua,He, Zhibin,Yu, Haijiao,Wu, Jun,Shao, Hongbo,Shao, Hongbo. 2015

[12]A New Strategy in Observer Modeling for Greenhouse Cucumber Seedling Growth. Qiu, Quan,Qiao, Xiaojun,Zheng, Chenfei,Wang, Wenping,Yu, Jingquan,Shi, Kai,Bai, He. 2017

[13]Retrieving Soybean Leaf Area Index from Unmanned Aerial Vehicle Hyperspectral Remote Sensing: Analysis of RF, ANN, and SVM Regression Models. Yuan, Huanhuan,Yang, Guijun,Wang, Yanjie,Liu, Jiangang,Yu, Haiyang,Feng, Haikuan,Xu, Bo,Zhao, Xiaoqing,Yang, Xiaodong,Yuan, Huanhuan,Li, Changchun,Wang, Yanjie,Yuan, Huanhuan,Yang, Guijun,Liu, Jiangang,Feng, Haikuan,Yang, Xiaodong,Yang, Guijun,Yu, Haiyang,Xu, Bo,Zhao, Xiaoqing,Yang, Xiaodong. 2017

[14]Geographic Characterization of Leccinum rugosiceps by Ultraviolet and Infrared Spectral Fusion. Yao, Sen,Liu, Hong-Gao,Li, Jie-Qing,Yao, Sen,Wang, Yuan-Zhong,Li, Tao,Wang, Yuan-Zhong. 2017

[15]Verification and predicting temperature and humidity in a solar greenhouse based on convex bidirectional extreme learning machine algorithm. Zou, Weidong,Yao, Fenxi,Zhang, Baihai,Guan, Zixiao,He, Chaoxing.

[16]Monitoring Plastic-Mulched Farmland by Landsat-8 OLI Imagery Using Spectral and Textural Features. Hasituya,Chen, Zhongxin,Wang, Limin,Wu, Wenbin,Li, He,Jiang, Zhiwei. 2016

[17]DIAGNOSTIC MODEL FOR WHEAT LEAF CONDITIONS USING IMAGE FEATURES AND A SUPPORT VECTOR MACHINE. Du, K.,Sun, Z.,Li, Y.,Zheng, F.,Chu, J.,Su, Y.. 2016

[18]Application of support vector machine for detecting rice diseases using shape and color texture features. Yao, Qing,Guan, Zexin,Zhou, Yingfeng,Tang, Jian,Hu, Yang,Yang, Baojun. 2009

[19]Feature Extraction and Classification of Animal Blood Spectra with Support Vector Machine. Lu Peng-fei,Fan Ya,Zhou Lin-hua,Gao Bin,Qian Jun,Liu Lin-na,Zhao Si-yan,Kong Zhi-feng. 2017

[20]Kharif Dryland Crop Identification Based on Synthetic Aperture Radar in the North China Plain. Dong Zhaoxia,Wang Di,Zhou Qingbo,Chen Zhongxin. 2015

作者其他论文 更多>>