LR_Gapcloser: a tiling path-based gap closer that uses long reads to complete genome assembly
文献类型: 外文期刊
第一作者: Xu, Gui-Cai
作者: Xu, Gui-Cai;Zhu, Rui;Zhang, Yan;Li, Shang-Qi;Wang, Hong-Wei;Li, Jiong-Tang;Xu, Gui-Cai;Zhu, Rui;Zhang, Yan;Li, Shang-Qi;Wang, Hong-Wei;Li, Jiong-Tang;Xu, Gui-Cai;Xu, Tian-Jun;Zhu, Rui
作者机构:
关键词: gap-closure; genome assembly; third-generation sequencing; next-generation sequencing; repetitive elements
期刊名称:GIGASCIENCE ( 影响因子:6.524; 五年影响因子:8.702 )
ISSN: 2047-217X
年卷期: 2019 年 8 卷 1 期
页码:
收录情况: SCI
摘要: Background: Completing a genome is an important goal of genome assembly. However, many assemblies, including reference assemblies, are unfinished and have a number of gaps. Long reads obtained from third-generation sequencing (TGS) platforms can help close these gaps and improve assembly contiguity. However, current gap-closure approaches using long reads require extensive runtime and high memory usage. Thus, a fast and memory-efficient approach using long reads is needed to obtain complete genomes. Findings: We developed LR_Gapcloser to rapidly and efficiently close the gaps in genome assembly. This tool utilizes long reads generated from TGS sequencing platforms. Tested on de novo assembled gaps, repeat-derived gaps, and real gaps, LR_Gapcloser closed a higher number of gaps faster and with a lower error rate and a much lower memory usage than two existing, state-of-the art tools. This tool utilized raw reads to fill more gaps than when using error-corrected reads. It is applicable to gaps in the assemblies by different approaches and from large and complex genomes. After performing gap-closure using this tool, the contig N50 size of the human CHM1 genome was improved from 143 kb to 19 Mb, a 132-fold increase. We also closed the gaps in the Triticum urartu genome, a large genome rich in repeats; the contig N50 size was increased by 40%. Further, we evaluated the contiguity and correctness of six hybrid assembly strategies by combining the optimal TGS-based and next-generation sequencing-based assemblers with LR_Gapcloser. A proposed and optimal hybrid strategy generated a new human CHM1 genome assembly with marked contiguity. The contig N50 value was greater than 28 Mb, which is larger than previous non-reference assemblies of the diploid human genome. Conclusions: LR_Gapcloser is a fast and efficient tool that can be used to close gaps and improve the contiguity of genome assemblies. A proposed hybrid assembly including this tool promises reference-grade assemblies. The software is available at http://www.fishbrowser.org/software/LR_Gapcloser/.
分类号:
- 相关文献
作者其他论文 更多>>
-
Influence of the 'painless' TRP channel on temperature-dependent escape and humidity-related pupation in Bactrocera dorsalis larvae
作者:Zhang, Yan;Zhang, Panpan;Luo, Zhicai;Wang, Qi;Zhang, Jie;Yang, Minghuan;Yan, Shanchun;Liu, Wei;Wang, Guirong
关键词:Bactrocera dorsalis; Bdorpainless; CRISPR/Cas9; extreme environments; escape behavior
-
Estimation of Processing Tomato Nutrient Uptake Based on the QUEFTS Model in Xinjiang
作者:Yibati, Halihashi;Gao, Jie;Yibati, Halihashi;Zhang, Yan;Li, Qingjun;Xu, Xinpeng;He, Ping;Yin, Xinhua
关键词:processing tomato; fruit yield; QUEFTS model; nutrient; internal efficiency (IE)
-
Impact of homogenization methods on the interfacial protein composition and stability of peanut oil body emulsion with sodium caseinate and maltodextrin
作者:Lin, Zihui;Zhou, Pengfei;Deng, Yuanyuan;Liu, Guang;Li, Ping;Zeng, Jiarui;Zhang, Yan;Tang, Xiaojun;Zhao, Zhihao;Zhang, Mingwei;Lin, Zihui
关键词:Peanut oil body; Homogenization methods; Emulsion; Interfacial protein; Stability
-
Nitrogen Deficiency Accelerates Rice Leaf Senescence Through ABA Signaling and Sugar Metabolic Shifts
作者:Asad, Muhmmad Asad Ullah;Guan, Xianyue;Zhang, Yan;Zhou, Lujian;Zhou, Weijun;Cheng, Fangmin;Asad, Muhmmad Asad Ullah;Bartas, Martin;Ullah, Najeeb;Cheng, Fangmin
关键词:
-
Effectiveness of Different Organic Solvent Additions to Water Samples for Reducing the Adsorption Effects of Organic Pesticides Using Ultra-High-Performance Liquid Chromatography-Tandem Mass Spectrometry
作者:Liu, Yucan;Xu, Xinyi;Wang, Ying;Zhang, Yan;Lu, Jianbo;Liu, Chengbin;Duan, Jinming;Sun, Hongwei
关键词:adsorption effect; UHPLC-ESI-MS/MS; organic pesticides; direct injection technique; detection signal intensity; organic solvent; addition ratio
-
Dual recombinase polymerase amplification system combined with lateral flow immunoassay for simultaneous detection of Staphylococcus aureus and Vibrio parahaemolyticus
作者:Zhang, Yan;Liu, Xiaofeng;Zhang, Yan;Luo, Jiawei;Liu, Hua;Li, You;Wang, Jinbin;Zeng, Haijuan;Zhang, Yan;Luo, Jiawei;Liu, Hua;Li, You;Wang, Jinbin;Zeng, Haijuan;Zeng, Haijuan;Liu, Juan;Zhu, Lemei
关键词:Recombinase polymerase amplification; Lateral flow immunoassays; Foodborne pathogens; Multiplex detection
-
Modulation of Gut Mycobiome and Serum Metabolome by a MUFA-Rich Diet in Sprague Dawley Rats Fed a High-Fructose, High-Fat Diet
作者:Zhao, Zhihao;Zhong, Lihuang;Zeng, Guangzhen;Liu, Songbin;Deng, Yuanyuan;Zhang, Yan;Tang, Xiaojun;Zhang, Mingwei;Wu, Jiajin
关键词:dietary fatty acids; oleic acid; high-fat; high-fructose diet; gut fungi; bile acid metabolism; CoA biosynthesis