Abstract
The type VI secretion system (T6SS) is an important mediator of microbe–microbe and microbe–host interactions. Gram-negative bacteria use the T6SS to inject T6SS effectors (T6Es), which are usually proteins with toxic activity, into neighboring cells. Antibacterial effectors have cognate immunity proteins that neutralize self-intoxication. Here, we applied novel structural bioinformatic tools to perform systematic discovery and functional annotation of T6Es and their cognate immunity proteins from a dataset of 17,920 T6SS-encoding bacterial genomes. Using structural clustering, we identified 517 putative T6E families, outperforming sequence-based clustering. We developed a logistic regression model to reliably quantify protein–protein interaction of new T6E-immunity pairs, yielding candidate immunity proteins for 231 out of the 517 T6E families. We used sensitive structure-based annotation which yielded functional annotations for 51% of the T6E families, again outperforming sequence-based annotation. Next, we validated four novel T6E-immunity pairs using basic experiments in E. coli. In particular, we showed that the Pfam domain DUF3289 is a homolog of Colicin M and that DUF943 acts as its cognate immunity protein. Furthermore, we discovered a novel T6E that is a structural homolog of SleB, a lytic transglycosylase, and identified a specific glutamate that acts as its putative catalytic residue. Overall, this study applies novel structural bioinformatic tools to T6E-immunity pair discovery, and provides an extensive database of annotated T6E-immunity pairs.
Original language | English |
---|---|
Pages (from-to) | 702-718 |
Number of pages | 17 |
Journal | Molecular Systems Biology |
Volume | 20 |
Issue number | 6 |
DOIs | |
State | Published - 3 Jun 2024 |
Keywords
- Alphafold-multimer
- Effector-immunity Pairs
- Foldseek
- Structural Bioinformatics
- Type VI Secretion System (T6SS)
All Science Journal Classification (ASJC) codes
- Information Systems
- General Immunology and Microbiology
- Applied Mathematics
- General Biochemistry,Genetics and Molecular Biology
- General Agricultural and Biological Sciences
- Computational Theory and Mathematics