RAfilter: an algorithm for detecting and filtering false-positive alignments in repetitive genomic regions
文献类型: 外文期刊
第一作者: Yang, Jinbao
作者: Yang, Jinbao;Pan, Weihua;Yang, Jinbao;Zhao, Xianjia;Jiang, Heling;Yang, Yingxue;Hou, Yuze;Pan, Weihua;Zhao, Xianjia
作者机构:
期刊名称:HORTICULTURE RESEARCH ( 影响因子:8.7; 五年影响因子:9.0 )
ISSN: 2662-6810
年卷期: 2023 年 10 卷 1 期
页码:
收录情况: SCI
摘要: Telomere to telomere (T2T) assembly relies on the correctness of sequence alignments. However, the existing aligners tend to generate a high proportion of false-positive alignments in repetitive genomic regions which impedes the generation of T2T-level reference genomes for more important species. In this paper, we present an automatic algorithm called RAfilter for removing the false-positives in the outputs of existing aligners. RAfilter takes advantage of rare k-mers representing the copy-specific features to differentiate false-positive alignments from the correct ones. Considering the huge numbers of rare k-mers in large eukaryotic genomes, a series of high-performance computing techniques such as multi-threading and bit operation are used to improve the time and space efficiencies. The experimental results on tandem repeats and interspersed repeats show that RAfilter was able to filter 60%-90% false-positive HiFi alignments with almost no correct ones removed, while the sensitivities and precisions on ONT datasets were about 80% and 50% respectively.
分类号:
- 相关文献
作者其他论文 更多>>
-
Haplotype-resolved assembly of auto-polyploid genomes via combining Hi-C and gametic data
作者:Zhang, Xiaohui;Li, Dongxi;Zhang, Xiaohui;Pan, Weihua
关键词:Haplotype-resolved assembly; Auto-polyploid; PacBio HiFi; Hi-C; Gametic data
-
MCSS: microbial community simulator based on structure
作者:Hui, Xingqi;Liu, Fang;Hui, Xingqi;Yang, Jinbao;Pan, Weihua;Yang, Jinbao;Sun, Jinhuan;Liu, Fang
关键词:metagenome; microbiome communities; long reads; simulator; assembly
-
Comprehensive Evaluation of Genome Gap-Filling Tools Utilizing Long Reads
作者:Zhao, Xianjia;Liu, Fang;Zhao, Xianjia;Pan, Weihua;Liu, Fang
关键词:gap-filling; long reads; genome assembly
-
Comprehensive assessment of 11 de novo HiFi assemblers on complex eukaryotic genomes and metagenomes
作者:Yu, Wenjuan;Luo, Haohui;Yang, Jinbao;Zhang, Shengchen;Jiang, Heling;Zhao, Xianjia;Hui, Xingqi;Sun, Da;Pan, Weihua;Li, Liang;Wei, Xiu-qing;Lonardi, Stefano;Zhao, Xianjia;Hui, Xingqi;Yang, Jinbao;Zhang, Shengchen
关键词:
-
Melon ripeness detection by an improved object detection algorithm for resource constrained environments
作者:Jing, Xuebin;Wang, Yuanhao;Li, Dongxi;Jing, Xuebin;Wang, Yuanhao;Pan, Weihua
关键词:Ripeness detection; Object detection; Melon; Deep learning
-
Leaf rolling detection in maize under complex environments using an improved deep learning method
作者:Wang, Yuanhao;Jing, Xuebin;Han, Xiaohong;Wang, Yuanhao;Jing, Xuebin;Pan, Weihua;Gao, Yonggang;Zhao, Cheng
关键词:Leaf rolling; Object detection; Maize; Deep learning
-
Comparison of Hi-C-Based Scaffolding Tools on Plant Genomes
作者:Hou, Yuze;Wang, Li;Hou, Yuze;Pan, Weihua
关键词:de novo assembly; scaffolding tools; scaffolding completeness; scaffolding accuracy; Hi-C