Prokaryotic virus host prediction with graph contrastive augmentaion

Du, Zhi-Hua and Zhong, Jun-Peng and Liu, Yun and Li, Jian-Qiang and Vega, Nic (2023) Prokaryotic virus host prediction with graph contrastive augmentaion. PLOS Computational Biology, 19 (12). e1011671. ISSN 1553-7358

[thumbnail of journal.pcbi.1011671.pdf] Text
journal.pcbi.1011671.pdf - Published Version

Download (2MB)

Abstract

Prokaryotic viruses, also known as bacteriophages, play crucial roles in regulating microbial communities and have the potential for phage therapy applications. Accurate prediction of phage-host interactions is essential for understanding the dynamics of these viruses and their impacts on bacterial populations. Numerous computational methods have been developed to tackle this challenging task. However, most existing prediction models can be constrained due to the substantial number of unknown interactions in comparison to the constrained diversity of available training data. To solve the problem, we introduce a model for prokaryotic virus host prediction with graph contrastive augmentation (PHPGCA). Specifically, we construct a comprehensive heterogeneous graph by integrating virus-virus protein similarity and virus-host DNA sequence similarity information. As the backbone encoder for learning node representations in the virus-prokaryote graph, we employ LGCN, a state-of-the-art graph embedding technique. Additionally, we apply graph contrastive learning to augment the node representations without the need for additional labels. We further conducted two case studies aimed at predicting the host range of multi-species phages, helping to understand the phage ecology and evolution.

Item Type: Article
Subjects: Science Repository > Biological Science
Depositing User: Managing Editor
Date Deposited: 10 Apr 2024 11:47
Last Modified: 10 Apr 2024 11:47
URI: http://research.manuscritpub.com/id/eprint/4068

Actions (login required)

View Item
View Item