RNA-QC-chain: comprehensive and fast quality control for RNA-Seq data

Document type: Foreign-language journal

First author: Zhou, Qian

Authors: Zhou, Qian; Chen, Songlin; Su, Xiaoquan; Jing, Gongchao; Ning, Kang

Author affiliations:

Keywords: Quality control; RNA-Seq; Contamination identification; Alignment statistics; Parallel computing

Journal: BMC GENOMICS (Impact factor: 3.969; 5-year impact factor: 4.478)

ISSN: 1471-2164

Year, volume: 2018, vol. 19

Pages:

Indexed in: SCI

Abstract: Background: RNA-Seq has become one of the most widely used applications based on next-generation sequencing technology. However, raw RNA-Seq data may have quality issues, which can significantly distort analytical results and lead to erroneous conclusions. Therefore, the raw data must be subjected to rigorous quality control (QC) procedures before downstream analysis. Currently, an accurate and complete QC of RNA-Seq data requires a suite of different QC tools used consecutively, which is inefficient in terms of usability, running time, file usage, and interpretability of the results. Results: We developed a comprehensive, fast and easy-to-use QC pipeline for RNA-Seq data, RNA-QC-Chain, which involves three steps: (1) sequencing-quality assessment and trimming; (2) internal (ribosomal RNAs) and external (reads from foreign species) contamination filtering; (3) alignment statistics reporting (such as read number, alignment coverage, sequencing depth and paired-end read mapping information). This package was developed based on our previously reported tool for general QC of next-generation sequencing (NGS) data called QC-Chain, with extensions specifically designed for RNA-Seq data. It has several features that are not yet available in other QC tools for RNA-Seq data, such as RNA sequence trimming, automatic rRNA detection and automatic contaminating species identification. The three QC steps can run either sequentially or independently, making RNA-QC-Chain a comprehensive package with high flexibility and usability. Moreover, parallel computing and optimizations are embedded in most of the QC procedures, providing superior efficiency. The performance of RNA-QC-Chain has been evaluated with different types of datasets, including an in-house sequencing dataset, a semi-simulated dataset, and two real datasets downloaded from a public database. Comparisons of RNA-QC-Chain with other QC tools have demonstrated its advantages in both functional versatility and processing speed. Conclusions: We present here a tool, RNA-QC-Chain, which can be used to carry out the quality control of RNA-Seq data comprehensively, effectively and efficiently.
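
Illustrative sketch (not the authors' implementation): the idea of QC steps that can run either in sequence or on their own can be pictured with a few small Python functions. Every name here (`trim_3prime`, `filter_contaminants`, `summarize`, `run_chain`) is hypothetical, the k-mer lookup is only a crude stand-in for rRNA and foreign-species detection, and the final step reports simple read counts rather than true alignment statistics, which would need an aligner.

```python
from typing import Callable, Dict, List, Set, Tuple

Read = Tuple[str, str]  # (sequence, Phred+33 quality string)


def trim_3prime(reads: List[Read], min_q: int = 20) -> List[Read]:
    """Step 1 (assumed form): cut bases with quality below min_q from the 3' end."""
    trimmed = []
    for seq, qual in reads:
        end = len(seq)
        while end > 0 and ord(qual[end - 1]) - 33 < min_q:
            end -= 1
        trimmed.append((seq[:end], qual[:end]))
    return trimmed


def filter_contaminants(reads: List[Read], bad_kmers: Set[str], k: int = 8) -> List[Read]:
    """Step 2 (assumed form): drop reads sharing any k-mer with a contaminant set."""
    kept = []
    for seq, qual in reads:
        kmers = {seq[i:i + k] for i in range(len(seq) - k + 1)}
        if kmers.isdisjoint(bad_kmers):
            kept.append((seq, qual))
    return kept


def summarize(reads: List[Read]) -> Dict[str, int]:
    """Step 3 stand-in: report read and base counts (not real alignment statistics)."""
    return {"reads": len(reads), "bases": sum(len(seq) for seq, _ in reads)}


def run_chain(reads: List[Read], steps: List[Callable[[List[Read]], List[Read]]]) -> List[Read]:
    """Apply any subset of steps in order; each step also works independently."""
    for step in steps:
        reads = step(reads)
    return reads


raw = [("ACGTACGTAC", "IIIIIIII##"), ("TTTTTTTTTT", "IIIIIIIIII")]
clean = run_chain(raw, [trim_3prime, lambda r: filter_contaminants(r, {"TTTTTTTT"})])
report = summarize(clean)
```

Because each step takes and returns a plain list of reads, any one of them can be called alone, for example `trim_3prime(raw)`, or combined in a different order through `run_chain`.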

Classification number:
