QSAR modeling of E. coli promoters with parameters selected by binary matrix shuffling filter

文献类型: 外文期刊

第一作者: Wang, Li-Feng

作者: Wang, Li-Feng;Dai, Zhi-Jun;Yuan, Zhe-Ming;Wang, Kai;Wang, Li-Feng;Dai, Zhi-Jun;Yuan, Zhe-Ming;Bai, Lian-Yang

作者机构:

关键词: Quantitative sequence-activity model;feature selection;support vector regression;promoter

期刊名称:JOURNAL OF THE INDIAN CHEMICAL SOCIETY ( 影响因子:0.284; )

ISSN:

年卷期:

页码:

收录情况: SCI

摘要: The 1123 topological structure parameters of DNA bases were directly used as descriptors to characterize the sequence of 38 E. coli promoters. For the correspondingly generated high-dimensional feature set, the correlation analysis and binary matrix shuffling filter (BMSF) were successively used to remove the redundancy or useless features, and only 20 features were finally reserved, with definite meanings. Based on reserved features and support vector regression (SVR), a quantitative structure-activity relationship (QSAR) model was established for the analysis of 38 E. coli promoters, and the leave-one-out (LOO) prediction accuracy of this model was of 0.838, superior to that of reference model, i.e. partial least squares (PLS). Referring to the SVR interpretation system, the established QSAR model in this work has extremely significant nonlinear regression, and the relationship between real promoter strength and 11 significant reserved features was directly given out. This work provides an efficient tool for the QSAR analysis of promoters and other similar molecular sequences.

分类号: O6

  • 相关文献

[1]A pipeline for improved QSAR analysis of peptides: physiochemical property parameter selection via BMSF, near-neighbor sample selection via semivariogram, and weighted SVR regression and prediction. Wang, Lifeng,Chen, Yuan,Yuan, Zheming,Dai, Zhijun,Wang, Lifeng,Chen, Yuan,Bai, Lianyang,Yuan, Zheming,Wang, Haiyan,Bai, Lianyang.

[2]Quantitative Sequence- Activity Model Analysis of Oligopeptides Coupling an Improved High- Dimension Feature Selection Method with Support Vector Regression. Dai, Zhijun,Zhang, Hongyan,Yuan, Zheming,Wang, Lifeng,Dai, Zhijun,Zhang, Hongyan,Bai, Lianyang,Yuan, Zheming,Bai, Lianyang. 2014

[3]An Improved Combination of Spectral and Spatial Features for Vegetation Classification in Hyperspectral Images. Fu, Yuanyuan,Zhao, Chunjiang,Yang, Guijun,Song, Xiaoyu,Feng, Haikuan,Fu, Yuanyuan,Zhao, Chunjiang,Yang, Guijun,Song, Xiaoyu,Feng, Haikuan,Fu, Yuanyuan,Zhao, Chunjiang,Yang, Guijun,Song, Xiaoyu,Feng, Haikuan,Fu, Yuanyuan,Zhao, Chunjiang,Yang, Guijun,Song, Xiaoyu,Feng, Haikuan,Wang, Jihua,Jia, Xiuping. 2017

[4]A COMPARATIVE ANALYSIS OF MUTUAL INFORMATION BASED FEATURE SELECTION FOR HYPERSPECTRAL IMAGE CLASSIFICATION. Fu, Yuanyuan,Fu, Yuanyuan,Jia, Xiuping,Huang, Wenjiang,Wang, Jihua. 2014

[5]How do temporal and spectral features matter in crop classification in Heilongjiang Province, China?. Hu Qiong,Wu Wen-bin,Song Qian,Lu Miao,Chen Di,Yu Qiang-yi,Tang Hua-jun,Song Qian. 2017

[6]Design and Implementation of Novel Agricultural Remote Sensing Image Classification Framework through Deep Neural Network and Multi-Feature Analysis. Zhang, Youzhi. 2015

[7]Characterization of the promoter of phosphate transporter TaPHT1.2 differentially expressed in wheat varieties. Miao, Jun,Sun, Jinghan,Liu, Dongcheng,Li, Bin,Zhang, Aimin,Li, Zhensheng,Tong, Yiping,Miao, Jun. 2009

[8]Regulatory mutations in the A2M gene are involved in the mastitis susceptibility in dairy cows. Wang, X. G.,Huang, J. M.,Feng, M. Y.,Ju, Z. H.,Wang, C. F.,Zhong, J. F.,Yang, G. W.,Yuan, J. D.. 2014

[9]Variants and Gene Expression of the TLR2 Gene and Susceptibility to Mastitis in Cattle. Huang, Jinming,Liu, Li,Wang, Hongmei,Zhang, Cuixia,Ju, Zhihua,Wang, Changfa,Zhong, Jifeng.

[10]Function identification of bovine Nramp1 promoter and intron 1. Hao, Linlin,Zhang, Libo,Liu, Songcai,Li, Mingtang,Zhong, Jifeng,Wang, Nan. 2011

[11]Identification and genetic effect of haplotypes in the promoter region of porcine myostatin gene. Liu, D.,Xu, Q.,Zang, L.,Liang, S.,Jiang, Y.,Wu, Y.,Wei, S.. 2011

[12]Specific Expression of Maize SBEIIb Promoter Mediated by Different Promoter Region in Transgenic Tobacco Plants. Sun Cui-xia,Li Meng,Wang Xiao-peng,Zhang Guo-dong,Tian Yan-chen,Wang Ze-li,Han Jing. 2009

[13]Identification and preliminary analysis of a new PCP promoter from Brassica rapa ssp chinensis. Zhang, Qiang,Cao, Jiashu,Zhang, Qiang,Liu, Huizhi. 2008

[14]p53 and NF kappa B regulate microRNA-34c expression in porcine ovarian granulosa cells. Xu Yuan,Xiao Guang,Zhang Zhe,Chen Zan-mou,Zhang Hao,Li Jia-qi,Zhang Ai-ling. 2016

[15]Expression regulation of a xylanase inhibitor gene riceXIP in rice (Oryza sativa L.). Zhan, Yihua,Sun, Xiangyu,Xu, Ying,Huang, Yingying,Jiang, Dean,Weng, Xiaoyan,Sun, Renjie,Hou, Chunxiao,Hou, Chunxiao. 2017

[16]Promoter Characterization and Expression Pattern Analysis of Porcine TCAP Gene. Qiao, Mu,Wu, Huayu,Huang, Jingshu,Peng, Xianwen,Liu, Guisheng,Feng, Zheng,Mei, Shuqi. 2012

[17]The SOD Gene Family in Tomato: Identification, Phylogenetic Relationships, and Expression Patterns. Feng, Kun,Zheng, Qingsong,Feng, Kun,Yu, Jiahong,Cheng, Yuan,Ruan, Meiying,Wang, Rongqing,Ye, Qingjing,Zhou, Guozhi,Li, Zhimiao,Yao, Zhuping,Yang, Yuejian,Wan, Hongjian,Yu, Jiahong. 2016

[18]Isolation of the endosperm-specific LPAAT gene promoter from coconut (Cocos nucifera L.) and its functional analysis in transgenic rice plants. Zheng, Yusheng,Wang, Zhekui,Li, Dongdong,Xu, Li,Zhou, Peng,Ye, Rongjian,Lin, Yongjun,Ye, Rongjian,Lin, Yongjun. 2010

[19]Functional analysis of the larval serum protein gene promoter from silkworm, Bombyx mori.. Tang, SM,Yi, YZ,Shen, XJ,Zhang, ZF,Li, YR,He, JL. 2003

[20]Identification of novel transcripts from the porcine MYL1 gene and initial characterization of its promoters. Li, Jiaqi,Wang, Liangliang,Zhang, Hao,Chen, Songling,Mei, Yingjie,Wang, Chong,Ling, Fei,Du, HongLi,Fang, Wei,Chen, Yaosheng,Liu, Xiaohui. 2010

作者其他论文 更多>>