DBG2OLC: Efficient Assembly of Large Genomes Using Long Erroneous Reads of the Third Generation Sequencing Technologies
文献类型: 外文期刊
第一作者: Ye, Chengxi
作者: Ye, Chengxi;Hill, Christopher M.;Ye, Chengxi;Ma, Zhanshan (Sam);Wu, Shigang;Ruan, Jue
作者机构:
期刊名称:SCIENTIFIC REPORTS ( 影响因子:4.379; 五年影响因子:5.133 )
ISSN: 2045-2322
年卷期: 2016 年 6 卷
页码:
收录情况: SCI
摘要: The highly anticipated transition from next generation sequencing (NGS) to third generation sequencing (3GS) has been difficult primarily due to high error rates and excessive sequencing cost. The high error rates make the assembly of long erroneous reads of large genomes challenging because existing software solutions are often overwhelmed by error correction tasks. Here we report a hybrid assembly approach that simultaneously utilizes NGS and 3GS data to address both issues. We gain advantages from three general and basic design principles: (i) Compact representation of the long reads leads to efficient alignments. (ii) Base-level errors can be skipped; structural errors need to be detected and corrected. (iii) Structurally correct 3GS reads are assembled and polished. In our implementation, preassembled NGS contigs are used to derive the compact representation of the long reads, motivating an algorithmic conversion from a de Bruijn graph to an overlap graph, the two major assembly paradigms. Moreover, since NGS and 3GS data can compensate for each other, our hybrid assembly approach reduces both of their sequencing requirements. Experiments show that our software is able to assemble mammalian-sized genomes orders of magnitude more quickly than existing methods without consuming a lot of memory, while saving about half of the sequencing cost.
分类号:
- 相关文献
作者其他论文 更多>>
-
Ubiquitination of OsCSN5 by OsPUB45 activates immunity by modulating the OsCUL3a-OsNPR1 module
作者:Zhang, Chongyang;Fang, Liang;He, Feng;You, Xiaoman;Wang, Min;Zhao, Tianxiao;Hou, Yanyan;Wang, Ruyi;Ning, Yuese;Zhang, Chongyang;Ruan, Jue;Fang, Liang;Francis, Frederic;Xiao, Ning;Li, Aihong;Yang, Jian;Wang, Guo-Liang
关键词:
-
Targeting conserved secreted effectors to control rice blast
作者:Zhang, Chongyang;Ruan, Jue;Zhang, Chongyang;Feng, Qin;You, Xiaoman;Ning, Yuese;Feng, Qin;Wang, Guo-Liang
关键词:
-
PFLO: a high-throughput pose estimation model for field maize based on YOLO architecture
作者:Pan, Yuchen;Liu, Bingwen;Wang, Li;Pan, Yuchen;Chang, Jianye;Dong, Zhemeng;Liu, Bingwen;Liu, Hailin;Ruan, Jue;Dong, Zhemeng
关键词:Plant pose estimation; Maize; Computer vision; Deep learning; In-field monitoring
-
Low-input PacBio sequencing generates high-quality individual fly genomes and characterizes mutational processes
作者:Jia, Hangxing;Tan, Shengjun;Cai, Yingao;Guo, Yanyan;Shen, Jieyu;Zhang, Yaqiong;Ma, Huijing;Zhang, Qingzhu;Qiao, Gexia;Zhang, Yong E.;Cai, Yingao;Guo, Yanyan;Shen, Jieyu;Zhang, Qingzhu;Chen, Jinfeng;Qiao, Gexia;Zhang, Yong E.;Chen, Jinfeng;Ruan, Jue
关键词:
-
HiTE: a fast and accurate dynamic boundary adjustment approach for full-length transposable element detection and annotation
作者:Hu, Kang;Ni, Peng;Xu, Minghua;Zou, You;Wang, Jianxin;Hu, Kang;Ni, Peng;Wang, Jianxin;Hu, Kang;Ni, Peng;Xu, Minghua;Zou, You;Wang, Jianxin;Chang, Jianye;Ruan, Jue;Gao, Xin;Gao, Xin;Li, Yaohang;Hu, Bin;Hu, Bin
关键词:
-
A compressive seeding algorithm in conjunction with reordering-based compression
作者:Ji, Fahu;Liu, Xianming;Zhou, Qian;Liu, Xianming;Ruan, Jue;Zhu, Zexuan
关键词:
-
NextDenovo: an efficient error correction and accurate assembly tool for noisy long reads
作者:Hu, Jiang;Wang, Zhuo;Sun, Zongyi;Liang, Fan;Li, Jingjing;Wang, Depeng;Hu, Benxia;Ayoola, Adeola Oluwakemi;Wu, Dong-Dong;Wang, Sheng;Sandoval, Jose R.;Cooper, David N.;Hu, Jiang;Ye, Kai;Ruan, Jue;Xiao, Chuan-Le;Wu, Dong-Dong;Wu, Dong-Dong;Wang, Sheng;Wu, Dong-Dong
关键词:Long reads; Genome assembly; Error-correction; Human genomes; Segmental duplication