Bacteriophage classification for assembled contigs using graph convolutional network
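Document type: Foreign-language journal article
First author: Shang, Jiayu
Authors: Shang, Jiayu; Sun, Yanni; Jiang, Jingzhe
Author affiliations:
Journal: BIOINFORMATICS (impact factor: 6.937; five-year impact factor: 8.47)
ISSN: 1367-4803
Year/volume/issue: 2021, vol. 37
Pages:
Indexed in: SCI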
Abstract: Motivation: Bacteriophages (aka phages), which mainly infect bacteria, play key roles in the biology of microbes. Phages are the most abundant biological entities on the planet, and those discovered so far are only the tip of the iceberg. Recently, many new phages have been revealed using high-throughput sequencing, particularly metagenomic sequencing. Taxonomic classification of phages lags seriously behind the fast accumulation of phage-like sequences. The high diversity and abundance of phages, together with the limited number of known phages, pose great challenges for taxonomic analysis. In particular, alignment-based tools have difficulty classifying the fast-accumulating contigs assembled from metagenomic data. Results: In this work, we present a novel semi-supervised learning model, named PhaGCN, to conduct taxonomic classification for phage contigs. In this learning model, we construct a knowledge graph by combining the DNA sequence features learned by a convolutional neural network with the protein sequence similarity obtained from a gene-sharing network. We then apply a graph convolutional network that uses both the labeled and unlabeled samples in training to enhance the learning ability. We tested PhaGCN on both simulated and real sequencing data. The results clearly show that our method competes favorably against available phage classification tools.
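The semi-supervised idea in the abstract (a graph convolution that propagates features across a graph of contigs, with the loss computed only on labeled nodes) can be illustrated with a minimal sketch. This is not the PhaGCN implementation: the four-node graph, the feature vectors, the hand-picked weight matrix, and all function names are assumptions made for illustration, no training is performed, and the propagation rule is the commonly used normalized form `D^{-1/2} (A + I) D^{-1/2} H W`, which the abstract does not specify.

```python
import math

def normalize_adjacency(adj):
    """Return D^{-1/2} (A + I) D^{-1/2} for a symmetric 0/1 adjacency matrix."""
    n = len(adj)
    # Add self-loops so each node keeps its own features.
    a_hat = [[adj[i][j] + (1.0 if i == j else 0.0) for j in range(n)] for i in range(n)]
    deg = [sum(row) for row in a_hat]
    return [[a_hat[i][j] / math.sqrt(deg[i] * deg[j]) for j in range(n)] for i in range(n)]

def matmul(a, b):
    """Plain-Python matrix product of two lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def gcn_layer(a_norm, features, weights, activate=True):
    """One graph convolution: optional ReLU of (a_norm @ features @ weights)."""
    out = matmul(matmul(a_norm, features), weights)
    if activate:
        out = [[max(0.0, v) for v in row] for row in out]
    return out

def masked_cross_entropy(logits, labels):
    """Mean cross-entropy over labeled nodes only (label None = unlabeled)."""
    losses = []
    for row, label in zip(logits, labels):
        if label is None:
            continue  # unlabeled nodes contribute through the graph, not the loss
        m = max(row)
        log_z = m + math.log(sum(math.exp(v - m) for v in row))
        losses.append(log_z - row[label])
    return sum(losses) / len(losses)

# Hypothetical toy graph: 4 contigs in a chain 0-1-2-3.
adj = [[0, 1, 0, 0],
       [1, 0, 1, 0],
       [0, 1, 0, 1],
       [0, 0, 1, 0]]
# Hypothetical 2-dimensional node features (stand-ins for learned sequence features).
features = [[1.0, 0.0], [0.5, 0.5], [0.5, 0.5], [0.0, 1.0]]
# Hand-picked weights mapping 2 features to 2 classes (not trained).
weights = [[1.0, -1.0], [-1.0, 1.0]]
# Only the two end nodes carry labels; the middle nodes are unlabeled.
labels = [0, None, None, 1]

a_norm = normalize_adjacency(adj)
logits = gcn_layer(a_norm, features, weights, activate=False)
loss = masked_cross_entropy(logits, labels)
predicted = [row.index(max(row)) for row in logits]
print(predicted, round(loss, 4))
```

In this toy setup the two unlabeled middle nodes have identical features, so any class difference between them comes only from their graph neighbors; a real model would learn the weights by minimizing the masked loss.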
Classification number:
- Related literature
Other papers by the authors
-
Identification of a novel circovirus associated with turbot (Scophthalmus maximus) acute hemorrhage disease
Authors: Wang, Huilin;Huang, Zhihui;Li, Jie;Jiang, Jingzhe;Xu, Liming;Zhou, Yong;Qin, Qiwei;Wei, Jingguang;Wang, Qiyao;Xiao, Zhizhong;Li, Jie
Keywords: Turbot; Turbot acute hemorrhage disease; Circovirus; Pathogen
-
Biological characterization and genomic profiling of a novel Providencia phage isolated from farm effluents
Authors: Zhu, Mantong;Zhu, Mantong;Liu, Chang;Liu, Guangfeng;Zhang, Hongsai;Wang, Xing;Wang, Ying;Luo, Yongqi;Jiang, Jingzhe
Keywords:
-
Biological characteristics and genome analysis of Citrobacter freundii phage K1M
Authors: Xie, Keming;Liu, Chang;Liu, Guangfeng;Zhu, Peng;Jiang, Jingzhe;Xie, Keming;Yang, Zheng;Yuan, Lihong;Jiang, Jingzhe;Xie, Keming
Keywords: Citrobacter freundii; Lytic phage; Fredivirus; Polysaccharide depolymerase; Classification
-
Identification and classification of the genomes of novel microviruses in poultry slaughterhouse
Authors: Xie, Keming;Sun, Xinyu;Pan, Jingqi;Qiu, Suiping;Yuan, Xiaoqi;Liang, Mengshi;Jiang, Jingzhe;Yuan, Lihong;Xie, Keming;Zhu, Peng;Liu, Chang;Liu, Guangfeng;Jiang, Jingzhe;Lin, Benfu;Zhu, Peng;Cao, Xudong
Keywords: poultry slaughterhouse; microviruses; genome; clustering; host
-
Rice Yield and Nitrogen Use Efficiency: Different Responses to Soil Organic Matter between Early and Late Rice
Authors: Wang, Yong;Tang, Gang;Fu, Wentao;Huang, Shan;Sun, Yanni;Chen, Jin;Chen, Jin
Keywords: Soil Organic Carbon; Double rice; N Uptake; N Recovery Efficiency; N-15 Tracer
-
RNAVirHost: a machine learning-based method for predicting hosts of RNA viruses through viral genomes
Authors: Chen, Guowei;Sun, Yanni;Jiang, Jingzhe
Keywords: RNA virus; host prediction; machine learning; metagenomics
-
Liming reduces nitrogen uptake from chemical fertilizer but increases that from straw in a double rice cropping system
Authors: Liao, Ping;Liao, Ping;van Groenigen, Kees Jan;Liu, Lei;Sun, Yanni;Huang, Shan;Zeng, Yongjun;Chen, Jin;Chen, Jin
Keywords: Yield; N recovery rate; N losses; Soil acidification; 15N tracing