Louis H. Miller, Guiyun Yan, Colby T. Ford, Anthony Ford, Richard D. Pearson, Julian C. Rayner, Delenasaw Yewhalaw, Daniel Kepple, Sisay Getachew, Beka Raya Abagero, Jordan Connors, Karthigayan Gunalan, Daniel Janies, Sarah Auburn, Eugenia Lo, Ford, Anthony [0000-0002-1615-2864], Kepple, Daniel [0000-0003-0566-3623], Abagero, Beka Raya [0000-0002-7205-9496], Connors, Jordan [0000-0003-4887-8026], Pearson, Richard [0000-0002-7386-3566], Auburn, Sarah [0000-0002-4638-536X], Ford, Colby [0000-0002-7859-3622], Rayner, Julian C. [0000-0002-9835-1014], Apollo - University of Cambridge Repository, Dinglasan, Rhoel Ramos, and Rayner, Julian C [0000-0002-9835-1014]
Plasmodium vivax malaria is much less common in Africa than the rest of the world because the parasite relies primarily on the Duffy antigen/chemokine receptor (DARC) to invade human erythrocytes, and the majority of Africans are Duffy negative. Recently, there has been a dramatic increase in the reporting of P. vivax cases in Africa, with a high number of them being in Duffy negative individuals, potentially indicating P. vivax has evolved an alternative invasion mechanism that can overcome Duffy negativity. Here, we analyzed single nucleotide polymorphism (SNP) and copy number variation (CNV) in Whole Genome Sequence (WGS) data from 44 P. vivax samples isolated from symptomatic malaria patients in southwestern Ethiopia, where both Duffy positive and Duffy negative individuals are found. A total of 123,711 SNPs were detected, of which 22.7% were nonsynonymous and 77.3% were synonymous mutations. The largest number of SNPs were detected on chromosomes 9 (24,007 SNPs; 19.4% of total) and 10 (16,852 SNPs, 13.6% of total). There were particularly high levels of polymorphism in erythrocyte binding gene candidates including merozoite surface protein 1 (MSP1) and merozoite surface protein 3 (MSP3.5, MSP3.85 and MSP3.9). Two genes, MAEBL and MSP3.8 related to immunogenicity and erythrocyte binding function were detected with significant signals of positive selection. Variation in gene copy number was also concentrated in genes involved in host-parasite interactions, including the expansion of the Duffy binding protein gene (PvDBP) on chromosome 6 and MSP3.11 on chromosome 10. Based on the phylogeny constructed from the whole genome sequences, the expansion of these genes was an independent process among the P. vivax lineages in Ethiopia. We further inferred transmission patterns of P. vivax infections among study sites and showed various levels of gene flow at a small geographical scale. The genomic features of P. vivax provided baseline data for future comparison with those in Duffy-negative individuals and allowed us to develop a panel of informative Single Nucleotide Polymorphic markers diagnostic at a micro-geographical scale., Author summary Plasmodium vivax is the most geographically widespread parasite species that causes malaria in humans. Although it occurs in Africa as a member of a mix of Plasmodium species, P. vivax is dominant in other parts of the world outside of Africa (e.g., Brazil). It was previously thought that most African populations were immune to P. vivax infections due to the absence of Duffy antigen chemokine receptor (DARC) gene expression required for erythrocyte invasion. However, several recent reports have indicated the emergence and potential spread of P. vivax across human populations in Africa. Compared to Southeast Asia and South America where P. vivax is highly endemic, data on polymorphisms in erythrocyte binding gene candidates of P. vivax from Africa is limited. Filling this knowlege gap is critical for identifying functional genes in erythrocyte invasion, biomarkers for tracking the P. vivax isolates from Africa, as well as potential gene targets for vaccine development. This paper examined the level of genetic polymorphisms in a panel of 43 potential erythrocyte binding protein genes based on whole genome sequences and described transmission patterns of P. vivax infections from different study sites in Ethiopia based on the genetic variants. Our analyses showed that chromosomes 9 and 10 of the P. vivax genomes isolated in Ethiopia had the most high-quality genetic polymorphisms. Among all erythrocyte binding protein gene candidates, the merozoite surface proteins 1 and merozoite surface protein 3 showed high levels of polymorphism. MAEBL and MSP3.8 related to immunogenicity and erythrocyte binding function were detected with significant signals of positive selection. The expansion of the Duffy binding protein and merozoite surface protein 3 gene copies was an independent process among the P. vivax lineages in Ethiopia. Various levels of gene flow were observed even at a smaller geographical scale. Our study provided baseline data for future comparison with P. vivax in Duffy negative individuals and help develop a panel of genetic markers that are informative at a micro-geographical scale.