A combined statistical model for multiple motifs search

Document type: Foreign-language journal

First author: Guan Shan

Authors: Guan Shan; Gao Li-Feng; Liu Xin

Author affiliations:

Keywords: transcription factor binding sites; motif; position weight matrix

Journal: CHINESE PHYSICS B (Impact factor: 1.494; Five-year impact factor: 1.262)

ISSN: 1674-1056

Year, volume, issue: 2008, Vol. 17, No. 12

Pages:

Indexed in: SCI

Abstract: Transcription factor binding sites (TFBS) play key roles in gene expression and regulation. They are short sequence segments with a definite structure and can be recognized correctly by the corresponding transcription factors. From a statistical viewpoint, TFBS candidates should differ markedly from segments formed by random combinations of nucleotides. This paper proposes a combined statistical model for finding over-represented short sequence segments in different kinds of data sets. When an over-represented short sequence segment is described by a position weight matrix, the nucleotide distribution at most sites of the segment should be far from the background nucleotide distribution. The central idea of this approach is to search for this kind of signal. The algorithm is tested on three data sets: the binding sites of the cyclic AMP receptor protein in E. coli; PlantProm DB, a non-redundant collection of proximal promoter sequences from different species; and the intergenic sequences of the whole E. coli genome. Even though the complexity of these three data sets differs considerably, the results show that the model is rather general and sensible.
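The idea that motif positions should sit far from the background nucleotide distribution can be illustrated with a minimal Python sketch. It is not the paper's combined statistical model, which the abstract does not specify. It assumes relative entropy (Kullback-Leibler divergence) as the distance measure, a uniform background, a pseudocount of 1, and a threshold of 0.4 bits. The function names and the toy sites are hypothetical.

```python
import math
from collections import Counter

ALPHABET = "ACGT"


def position_frequency_matrix(sites, pseudocount=1.0):
    """Per-position nucleotide frequencies for equal-length aligned sites."""
    length = len(sites[0])
    if any(len(s) != length for s in sites):
        raise ValueError("all sites must have the same length")
    total = len(sites) + pseudocount * len(ALPHABET)
    matrix = []
    for i in range(length):
        counts = Counter(s[i] for s in sites)
        matrix.append({b: (counts.get(b, 0) + pseudocount) / total for b in ALPHABET})
    return matrix


def relative_entropy(column, background):
    """Kullback-Leibler divergence (in bits) of one column from the background."""
    return sum(p * math.log2(p / background[b]) for b, p in column.items() if p > 0)


def divergent_positions(sites, background, threshold=0.4):
    """Score every position and count those at or above the (assumed) threshold."""
    pfm = position_frequency_matrix(sites)
    scores = [relative_entropy(col, background) for col in pfm]
    return scores, sum(score >= threshold for score in scores)


# Assumption: uniform background; a real analysis would estimate it from the data set.
background = {b: 0.25 for b in ALPHABET}
# Made-up toy sites, not taken from any of the data sets named in the abstract.
sites = ["TGTGA", "TGTGA", "TGTGC", "TGAGA"]
scores, n_far = divergent_positions(sites, background)
print([round(s, 3) for s in scores], n_far)
```

Fully conserved columns score higher than mixed ones here, so a candidate segment in which most positions clear the threshold would be the kind of signal the abstract describes. The threshold and the choice of divergence are illustrative only.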

Classification number:
