Patterns of mutational correlations, learnt from patient-derived sequences of human immunodeficiency virus (HIV) proteins, are informative of biochemically linked networks of interacting sites that may enable viral escape from the host immune system. Accurate identification of these networks is important for rationally designing vaccines which can effectively block immune escape pathways. Previous approaches have partly identified such networks by examining the principal components (PCs) of the mutational correlation matrix of HIV Gag proteins. However, driven by a conservative approach, these methods analyze the few dominant (strongest) PCs, potentially missing information embedded within the sub-dominant (relatively weaker) ones that may be important for vaccine design.
By using sequence data for HIV Gag, complemented by model-based simulations, we revealed that certain networks of interacting sites that appear important for vaccine design purposes are not accurately reflected by the dominant PCs. Rather, these networks are encoded jointly by both dominant and sub-dominant PCs. By incorporating information from the sub-dominant PCs, we identified a network of interacting sites of HIV Gag that associated very strongly with viral control. Based on this network, we propose several new candidates for a potent T-cell-based HIV vaccine [1]. Remarkably, the recommended targets align with targets of a T cell-based vaccine developed and tested independently, which has shown to elicit potent T cell responses in rhesus macaques against these targets [2].
References:
[1] S. F. Ahmed, A. A. Quadeer, D. Morales-Jimenez, and M. R. McKay, “Sub-dominant principal components inform new vaccine targets for HIV Gag,” Bioinformatics, vol. 35, no. 20, pp. 3884–3889, Oct. 2019, doi: 10.1093/bioinformatics/btz524.
[2] D. K. Murakowski et al., “Adenovirus-vectored vaccine containing multidimensionally conserved parts of the HIV proteome is immunogenic in rhesus macaques,” Proc Natl Acad Sci U S A, vol. 118, no. 5, p. e2022496118, Feb. 2021, doi: 10.1073/pnas.2022496118.