APOBEC3F Diversity and its Influence on Lentiviral Resistance

viral antagonists drive the evolution of host proteins, and vice versa. consequently, restriction factors are remarkably variable, displaying polymorphism within species and divergence between species as a result of positive selection. this paper investigates diversity in the apobec3f (a3f) restriction factors of old world primates in order to determine whether they display evidence of involvement in an evolutionary “arms race.” we speculated that genetic variability in a3f could reflect evolutionary conflict with the vif proteins of primate lentiviruses, which are known to enhance viral replication by binding and degrading host a3 proteins. a3fs of several old world primate species were genotyped, and the sequences revealed both intra-species diversity and inter-species divergence. representative rhesus macaque (macaca mulatta) sequences were cloned and tested for sensitivity to vifs from various simian immunodeficiency viruses (sivs). evolution of a3f in the rhesus lineage is not due to selection by sivs, but may reflect antagonism by another retrovirus. a biological “arms race” APOBEC3F Diversity and its Influence on Lentiviral Resistance


introduction I. Restriction factors: barriers to cross species transmission
Acquired Immune Deficiency Syndrome (AIDS) was first identified as a disease in 1981, as increasing numbers of homosexual men were dying of opportunistic infections after becoming immunocompromised. 1 The causative agent was soon discovered to be a lentivirus, classified as Human Immunodeficiency Virus Type 1 (HIV-1). 2,3HIV-1 and its close relative HIV Type 2 (HIV-2) arose in human populations as a result of cross-species transmission events from natural simian hosts of the Simian Immunodeficiency Viruses (SIVs).The closest simian relative of HIV-1 is the strain naturally infecting chimpanzees (SIVcpz), 4 while the closest simian relative of HIV-2 is the strain naturally infecting sooty mangabeys (SIVsm). 5Over 40 species-specific SIVs have since been identified in nonhuman primate species.A fraction of these infections result in immunodeficiency in the host species, while most are nonpathogenic in their natural hosts. 6e SIV of a given primate species cannot readily infect a different species without undergoing numerous adaptations.Part of this adaptation process involves evading host antiviral proteins known as restriction factors.Restriction factors are host proteins that utilize numerous mechanisms to interfere with viral infections as part of the innate immune response.Some interact directly with viral factors, while others make the cellular environment unsuitable for sustained viral replication. 7These defensive genes are subject to relatively rapid evolution under the selective pressure of viral counter-restriction mechanisms, and variants that confer greater fitness to the host during a viral outbreak will become fixed in a population much faster than would normally be expected as a result of genetic drift alone.The virus, in turn, can adapt to infect the host carrying these more fit alleles, creating a new selective pressure on the host.Thus, over evolutionary time, a comparatively large number of non-synonymous, or protein-altering, mutations will be evident in genes that have participated in these virus-host "arms races." 8nes can be examined for evidence of positive selection by comparing the rate of fixation of non-synonymous mutations (N) to the rate of fixation of synonymous mutations (S).dN/dS analysis demonstrates whether fixation of nonsynonymous changes is occurring faster than expected under neutral selection alone.A dN/dS value greater than one indicates that a particular site is evolving under posi-tive selection, while a dN/dS value less than one indicates that a site is evolving under purifying selection. 9is idea of a virus-host "arms race" is a fairly simplified model.RNA viruses can diversify their genome at a much higher rate than their hosts can.Hosts keep up through diversity of antiviral mechanisms rather than rapid counterrevolution.A viral variant that efficiently evades one host restriction mechanism will likely become more susceptible to another. 10However, over evolutionary time and as a result of many interactions with diverse viruses, host restriction factors bear signatures of positive selection.
Another defining factor of restriction factors is their relationships with viral antagonist proteins.In many cases, viruses have evolved specific proteins to evade the mechanism of a particular restriction factor.This virus-host relationship is highly species-specific because of the rapid adaptation process.As a general rule, any given virus will have accessory proteins that are most effective at neutralizing the restriction mechanisms of its natural host, but may not be effective against the homologous restriction mechanisms of other organisms. 11variety of restriction factors have been implicated in differential host resistance to primate lentiviruses, like HIV and the SIVs.Lentiviruses are part of the retrovirus family, and their life cycle is well understood.In order to infect a new target cell, the virion will first bind to a CD4 receptor and co-receptors on the cell surface.The virion membrane fuses with the cell membrane and the capsid is released into the cytoplasm.The viral RNA within the capsid then undergoes reverse transcription in the cytoplasm to form DNA that is transported into the nucleus and integrated into the host genome.The viral DNA is transcribed and translated using host machinery, and then viral RNA and proteins are localized to the cell membrane to form new virions.The restriction factors of primate lentiviruses have evolved to interfere with nearly every aspect of this life cycle, and thus act as barriers to cross-species transmission.

II. The APOBEC3 subfamily: a cluster of potent lentiviral restriction factors
The Apolipoprotein B mRNA-editing catalytic polypetide-3 (APOBEC3) subfamily of cytidine deaminases represents one such group of restriction factors.Primates encode seven APOBEC3 (A3) genes that form a cluster on chromosome 22 in humans and chromosome 10 in rhesus macaque monkeys: A3A, A3B, A3C, A3D, A3F, A3G, and A3H.A3 proteins are most highly expressed in HIV-1 target cell types such as T-cells, macrophages, and dendritic cells, and they are active in restricting retroviruses as well as some other viruses that do not replicate via reverse transcription. 12,13Human A3D, A3F, A3G, and A3H are known to be involved with HIV restriction, and each of these proteins has a functionally conserved homolog in rhesus macaques. 14e structure of A3 proteins is characterized by a cytidine deaminase domain consisting of six α-helices coordinated around five β-strands.This domain sits between a short N-terminal α-helical domain and a short C-terminal peptide. 15The deaminase domain contains a conserved zincbinding catalytic HXEX23-28PCX2-4C motif that coordinates deaminase activity.A3A, A3C, and A3H contain a single deaminase domain, while A3B, A3D, A3F, and A3G contain two deaminase domains.In the four A3s with two deaminase domains, only the C-terminal domain (CTD) is catalytically active. 16,17A3G and A3F have the greatest sequence similarity of the seven A3s, and they share many structural features that have been elucidated from the crystal structures of their CTDs.The basic structure is maintained between the proteins, but differences paralleling the divergence in primary structure are evident. 18G is the most widely studied of the A3 subfamily, as it is believed to play the largest role in HIV-1 restriction. 19However, A3D, A3F, and A3H have also been implicated in viral restriction.A3s act by inducing cytidine-to-uracil mutations in the viral negative sense cDNA produced during reverse transcription, consistent with the proteins' known cytidine deaminase function. 20Studies have shown A3G to be the more potent editor in humans, producing about 10 times more mutations in HIV-1 reverse transcripts than A3F in cell culture. 21Yet, A3F is stably co-expressed with A3G in human cells and is also packaged into HIV-1 virions to inhibit infectivity. 22A3G preferentially deaminates cytidines on the 3' end of 5'-CCC-3' or 5'-CC-3' sequences, while A3F and other A3 proteins preferentially deaminate the 3' cytidine in 5'-TC-3' sequences. 23

III. Interaction between APOBEC3 and Vif
Soon after the discovery of HIV-1, the viral protein termed viral infectivity factor (Vif) was found to be necessary for successful infection. 24However, it was not necessary in every cell type; HIV-1Δvif could infect certain permissive cell lines, while infection was unsuccessful in others. 25It was only later determined that A3 proteins were responsible for imparting resistance to Vif-deficient virus in these nonpermissive cell types. 26Vif has two domains, one crucial for binding A3 proteins and one that contains a highly conserved sequence required for mediating degradation. 27Vif acts by targeting cytoplasmic A3 proteins for proteosomal degradation (Figure 1). 28The interaction between A3s and Vif is a crucial determinant of the species specificity of a virus, and it is likely that a Vif from a given viral strain has evolved precisely to counteract the A3 proteins of its host species. 29,30The critical sites involved in A3-Vif interactions have largely been elucidated via mutagenesis studies, and as little as a single amino acid can be responsible for conferring the ability of A3 proteins to resist degradation by Vif.In A3G, the Vif binding site has been localized to the N-terminal deaminase domain (NTD).It was found that the Vif of an SIV strain isolated from African green monkeys (SIVagm) was inactive against human A3G (hA3G) but active against African green monkey A3G (ag-mA3G).This discrepancy was mapped to amino acid 128, an aspartic acid in hA3G and a lysine in agmA3G. 31This interface was expanded upon to include the proline at position 129 and the aspartic acid at position 130. Together, these three residues create a crucial negatively charged binding surface for HIV-1 Vif in hA3G. 32Contrary to A3G, the HIV-1 Vif binding site in A3F is localized at the CTD.The region comprising amino acids 289EFLARH294 in hA3F and agmA3F has been shown to be a critical region for HIV-1 Vif interaction. 33Another study suggested that amino acid 324 is a critical residue that confers hA3F susceptibility to HIV-1 Vif and rhesus macaque A3F (rhA3F) resistance to HIV-1 Vif. 34More recently, the negatively charged surface of α-helices 3 and 4 in hA3F has been strongly implicated in HIV-1 Vif binding. 35In HIV-1 Vif, "Thus, over evolutionary time, a comparatively large number of non-synonymous, or protein-altering, mutations will be evident in genes that have participated in these virus-host 'arms races.'"highly conserved N-terminal motifs are involved with A3G and A3F binding. 36,37.Objective While much attention has been given to the primary structures of A3s in terms of their effect on cross-species transmission of primate lentiviruses, examination of the diversity in these genes has been largely unconsidered.The most widely utilized primate model for investigating lentiviral infection and disease is the SIV infecting rhesus macaques (SIVmac), with rhesus macaques of Indian origin being the best characterized species. 38Rhesus macaques are not natural hosts of SIV in the wild; rather, SIVmac emerged from cross-species transmission events in US primate centers and likely originated from infected sooty mangabeys. 39,40The adaptations allowing the virus to stably infect rhesus macaques likely occurred as a result of serial passages of the virus through multiple animals. 41nlike SIVsm in its natural host, SIVmac is pathogenic in rhesus macaques, resulting in an AIDS-like disease.Given the well-established history of its emergence, SIVmac is an excellent model for studying the adaptive processes that viruses must undergo in order to successfully infect a new host species.
Recent work has demonstrated that the rhesus macaque A3G gene (rhA3G) is polymorphic at an N-terminal site, where three sequence variations are apparent.The variants displayed differential susceptibilities to SIVsm Vifmediated degradation in cell culture, but all three could be degraded by SIVmac Vif. 42Thus, it is likely that SIVmac Vif had to adapt to counteract the SIVsm Vif-resistant rhesus macaque A3G allele in order for pathogenic SIVmac to emerge.It is also known that A3G has evolved under strong positive selection, and that Vif is not wholly responsible for this phenomenon. 43e present study is intended to further characterize the degree of variation in other A3 family members, more specifically A3F.Old World primate A3Fs were examined for the genetic signatures of a rapidly adapting restriction factor.In a manner analogous to A3G, polymorphic sites in A3F may indicate potential determinants of the ability of SIVmac to infect its novel host.Polymorphic sites were investigated for their potential influence on the ability of rhA3F to resist degradation by Vifs derived from various SIV strains.Examination of allelic variance in A3F provides valuable insight into the barriers to cross-species transmission of lentiviruses, and offers further support to the idea of an evolutionary "arms race" between restriction factors and their viral antagonists.

Reverse Transcriptase PCR (RT-PCR) and Cloning
RT-PCR reactions were performed on RNA samples (extracted previously at the New England Primate Research Center) using the Invitrogen (Carlsbad, CA) SuperScript ® III One-Step RT-PCR System or the QIAGEN (Hilden, Germany) OneStep RT-PCR Kit according their associated protocols.Gateway cloning forward and reverse primers were used (Table 1).Products were identified via gel electrophoresis and purified using QIAGEN QIAquick Gel Extraction Kit according to the protocols specified in the kit.Purified DNA samples were cloned using the Invitrogen Gateway ® Cloning system and its associated protocols.Purified PCR samples were cloned into Invitrogen pENTR ® /D-TOPO plasmids.The clones were transformed into Invitrogen One Shot ® Stbl3 TM chemically competent E.coli and transformants were selected for via kanamycin selective plating.Plasmid DNA was isolated using the QIAGEN QIAprep Spin Miniprep Kit and its associated protocols, and then sent to Eton Bioscience (Charlestown, MA) for sequencing to confirm the identity of A3F sequence.Selected clones were then subcloned into Invitrogen pcDNA3.2TM /V5-DEST plasmids (Figure 4).The clones were transformed into Invitrogen One Shot ® Stbl3 TM competent E.coli and transformants were selected for via ampicillin selective plating.Plasmid DNA was isolated as before, and clones were sequenced again to confirm A3F insertion.

dN/dS Analysis
A3F sequences from rhesus macaques and other primates obtained from our own clones and from the NCBI database (see Appendix for sequences) were trimmed and aligned using the Multiple Align function in Geneious 6.1.8software (created by Biomatters, available at http:// www.geneious.com).The trimmed sequences were edited such that extraneous sequence before or after the A3F open reading frame was deleted.The sequences were up-loaded to Datamonkey, a public server for comparative analysis of sequence alignments. 44,45,46All three available models for dN/dS analysis (SLAC, FEL, REL) were run for the relatively small data set of 15 sequences.As suggested by prior research, the SLAC and FEL models were run with P values of 0.25, while the REL model was run with the Datamonkey-suggested Bayes factor of 50. 47notyping PCR was performed on rhesus macaque genomic DNA samples obtained from multiple US primate research centers using Invitrogen Platinum ® PCR SuperMix and its associated protocols.The primers used were genotyping forward and reverse (Table 1).The PCR products were visualized via gel electrophoresis, purified using the QIAGEN QIAquick Gel Extraction Kit according to its associated protocols, and sent to Eton Bioscience for sequencing.The sequence chromatograms were aligned using Geneious 6.1.8software and examined for the presence of the SKEH/FQQY variants.

Mutagenesis
Around-the-horn PCR was used to mutate amino acids 60, 61, 63, and 64 from the FQQY variant to the SKEH variant (or vice versa) in select rhA3F clones.PCR was performed using Thermo Scientific (Waltham, MA) Phusion Flash High Fidelity PCR Master Mix and its associated protocols, and the patch mutant forward and reverse primers were used.PCR products were digested with 10U of DpnI (New England Biolabs; Ipswich, MA) and purified using the QIAGEN QIAquick PCR Purification Kit and its associated protocols.Ligation was performed using the Promega (Madison, WI) LigaFast TM Rapid Ligation System and its associated protocols.The clones were transformed into Invitrogen One Shot ® Stbl3 TM competent E.coli, and transformants were selected for via ampicillin selective plating.Plasmid DNA was isolated using the QIAGEN QIAprep Spin Miniprep Kit and its associated protocols, and then sent to Eton Bioscience for sequencing to confirm successful mutagenesis of A3F.

Immunoblotting
Plasmids to be used in cell culture were prepared using the Invitrogen PureLink ® HiPure Plasmid Maxiprep Kit and its associated protocols.HEK293T cells were seeded in 6-well plates at a density of 800,000-

I. Investigating diversity and positive selection in Old World primate A3F
In order to determine whether primate A3F displays the hallmarks of a restriction factor that has been antagonized over time, rhesus macaque (Macaca mulatta) A3F sequences were first investigated for evidence of polymorphism.
To examine polymorphism within species, RT-PCR was performed on RNA samples obtained from 10 different rhesus macaques.The resulting DNA was cloned and the sequences were examined for both silent and non-synonymous polymorphisms.The data showed that A3F is a highly polymorphic gene in rhesus macaques, and that many of the single nucleotide polymorphisms result in amino acid changes (Figure 2).
To examine inter-species diversity, RT-PCR was also performed on multiple RNA samples from sooty mangabeys (Cercocebus atys), crab-eating macaques (Macaca fascicularis), and pigtail macaques (Macaca nemestrina).A3F coding sequences from these related primates showed divergence from each other and from rhesus macaques, demonstrating that there are fixed differences between A3F genes in Old World primates (Figure 3).
After establishing that A3F is a polymorphic gene in rhesus macaques and that fixed differences exist between A3Fs of related Old World primates, a dN/dS analysis was performed in order to determine whether any codons have evolved under positive selection.The species that were included in this analysis were rhesus macaques, crab-eating macaques, pigtail macaques, sooty mangabeys, African green monkeys (Chlorocebus sabaeus), humans (Homo sapiens), chimpanzees (Pan troglodytes), bonobos (Pan paniscus), and drills (Mandrillus leucophaeus).These sequences were uploaded to Datamonkey, a public server for comparative analysis of sequence alignments (See dN/dS section in Materials and Methods).11 codons were found to be evolving under positive selection.The sites were mapped relative to the deaminase domains and catalytic motifs in A3F (Figure 4), and seven sites were found to overlap with those previously identified to be polymorphic in rhesus macaques.

II. A closer look at a polymorphic and positively selected site in rhesus macaques
Four codons at positions 60, 61, 63, and 64 in rhesus macaques showed evidence of both intra-species variability and positive selection.In the rhA3F clones, these sites were linked and appeared as either 60FQPQY64 (henceforth referred to as FQQY) or 60SKPEH64 (henceforth referred to as SKEH).20 of 35 clones showed the FQQY sequence, while 15 clones showed the SKEH sequence.
The patch change from FQQY to SKEH is fairly significant at the amino acid level: bulky hydrophobic phenylalanine is exchanged for a small polar serine, polar uncharged glutamine is exchanged for a positively charged lysine and then a negatively charged glutamic acid, and hydrophobic tyrosine is exchanged for a positively charged histidine.Thus, the SKEH variant has three charged amino acids, whereas the FQQY variant is a neutral surface.
A broader genomic survey was done in order to better understand the level of diversity in the patch.Targeted PCR was used to amplify an approximately 680 base pair (bp) stretch of A3F from genomic DNA samples representing 44 different rhesus macaques.15 animals appeared to be heterozygous FQQY/SKEH, 17 animals appeared to be homozygous FQQY/FQQY, and 12 animals appeared to be homozygous SKEH/SKEH.This frequency distribution correlated to the results from the cloned sequences.Thus, it is reasonable to conclude that this patch has maintained a high level of diversity within rhesus macaques, and that this diversity is not unique to a particular population of animals or the result of a small data set.

III. The functional significance of diversity
After demonstrating that there is a high level of diversity in primate A3F and that some sites in the gene have evolved under positive selection, experiments were done to determine whether the different A3F alleles displayed differential susceptibilities to Vifs of various SIV strains.First, A3F clones from rhesus macaques, crab-eating macaques, sooty mangabeys, and pigtail macaques were tested for expression in HEK293T cells.Of the 17 clones tested, 11 were found to express, as visualized via Western blot (Figure 5).They expressed at variable levels, and only a subset of clones that represented diverse sequences and were wellexpressed were chosen for further examination.
In addition to these clones representing naturally occurring rhA3Fs, three point mutants were made by exchanging the FQQY and SKEH patches by site-directed mutagenesis.The resulting clones contained identical sequence to their parent rhA3F clone except for the four nucleotides that are responsible for the amino acid changes at this patch.These point mutants were also tested for expression in HEK293T cells, and two of the three clones tested were found to be highly expressed as detected via Western blot (Figure 5).
In order to determine whether the rhA3F variation confers differential resistance to Vif-mediated degradation, HEK293T cells were simultaneously transfected with A3F expression constructs and Vif expression constructs containing the Vif coding sequences of different SIV strains.The expression of A3F in the presence of these Vif challenges was then detected via Western blot and compared between the different clones as an indication of susceptibility to Vif-mediated degradation.Two rhA3G expression constructs were also included as controls because their phenotypes against these Vifs have been published previously. 48A representative blot is shown in Figure 6, and it reveals three interesting observations.First, multiple rhA3Fs displayed resistance to the Vif derived from rhesus macaque-adapted SIVmac239.Second, nearly all of the rhA3Fs were degraded by a SIVsm Vif and by Vif from the stump-tailed macaque SIV strain (SIVstm).These are somewhat surprising results, considering that the A3F-Vif interaction tends to be species-specific.Viral antagonist proteins are generally well-adapted to the restriction factors of their hosts, so one would expect Vif from rhesus macaque-adapted SIVmac to degrade the rhA3Fs and Vifs from related SIVs to not be as effective.However, this was not the case; SIVmac Vif did not degrade rhA3Fs, while Vifs from related SIVsm and SIVstm did.Finally, the FQQY/SKEH patch genotype did not affect sensitivity to any of the Vifs tested.

discussion
Investigations into the diversity of primate A3F revealed that the gene displays the hallmarks of a restriction factor that has been subject to positive selective pressure by viral antagonists over evolutionary time.A3F is a highly polymorphic gene within rhesus macaques, and many singlenucleotide polymorphisms lead to significant amino acid changes.Charged residues are often implicated in proteinprotein interactions, and therefore in the context of a restriction factor.Polymorphisms that change the charge of the amino acid may be more likely to be involved in an evolutionary "arms race" than more functionally conservative polymorphisms.
Examination of the inter-species variation via dN/dS analysis indicated that multiple sites within the gene have been subject to positive selection, and some of these sites overlap with the polymorphic sites found in rhesus macaques.
It is important to note that the input for dN/dS analysis had 15 sequences representing 10 different primate species.This does not represent the full range of primate A3Fs, and the use of multiple A3F variants of certain species may have artificially influenced the results.However, eliminating variable alleles of a given species also would also have had the potential of producing artificially conservative results.Comparing results of multiple analyses that differ in their level of conservativeness helped to address this issue, and 11 sites in primate A3F were shown to be under the influence of positive selection.This is potentially only a subset of sites that are subject to positive selection, as the different analysis models identified as many as 25 positively selected sites.
An interesting patch of four linked amino acids was identified at codons 60, 61, 63, and 64, where two sequences were found at similar frequencies in rhA3F clones.dN/dS analysis suggested that these codons have been subject to positive selective pressure.The FQQY allele appeared at a slightly higher frequency than the SKEH allele in a broader genomic analysis of the region in rhesus macaques; however, the less common SKEH haplotype did appear across multiple animals and clones, unlike many other polymorphic sites where the less common allele appeared at a significantly lower frequency and sometimes only in one animal.The changes at the amino acid level in this patch are rather significant, as three charged amino acids are introduced in the SKEH haplotype while the patch is neutral in the FQQY haplotype.This patch was also notable because of its location in the structure of A3F.It borders the N-terminal catalytic motif, and is adjacent to a known determinant of strain-specific resistance to SIV-Vif mediated degradation in A3G.Polymorphism at residue 59 in rhA3G confers variable susceptibility to degradation by SIVmac Vif. 49Together, these observations suggest that this patch may play a key role in an interaction with a viral protein, particularly with SIV Vif.However, functional tests did not suggest that variation at this site plays a role in determining susceptibility to Vif, as rhA3F clones that contained variable patch alleles did not display differential resistance to Vifs from various SIV strains.
Despite this patch's proximity to residue 59, it is not necessarily surprising that it does not play a role in rhesus A3F interaction with Vif.It is known that HIV-1 Vif interacts with the CTD in A3F, unlike in A3G where it interacts with the NTD.While this interaction has not been precisely "The interaction between A3s and Vif is a crucial determinant of the species specificity of a virus, and it is likely that a Vif from a given viral strain has evolved precisely to counteract the A3 proteins of its host species."mapped with SIV Vifs, it is reasonable to believe that they will behave analogously to HIV-1 Vif and interact at the CTD in A3F.Therefore, this NTD patch would not be expected to play a direct role in interactions with Vif.
Functional tests did reveal multiple surprising phenotypes relating to the SIV Vifs.The rhA3F clones tested displayed resistance to degradation by SIVmac Vif.SIVmac239 is a cloned viral isolate that is well adapted to its rhesus macaque host.Thus, one would expect the Vif derived from this macaque-adapted virus to degrade the rhA3F clones, yet this was not observed.rhA3F function was further tested against a more diverse panel of macaqueadapted Vifs (data not shown), and similarly to the SIVmac239 isolate, the rhA3F alleles were resistant to degradation.Thus, this phenotype does not appear to be a specific phenomenon of the SIVmac239 isolate, but rather a more general property of rhA3F.It is important to note that the rhA3G controls confirm that this SIVmac239 Vif clone is active and behaves as expected with these rhA3G alleles.It is possible to speculate that SIVmac239 evolved to effectively counteract rhA3G, the more potent deaminase, but in doing so it traded off the ability to neutralize rhA3F.This proposal is supported by the fact that the more ancestral SIVsmE041 Vif completely degraded the rhA3F clones tested, with the exception of rhA3F-WI, which showed an intermediate phenotype.
A3F is known to restrict viruses other than lentiviruses, so it is reasonable to believe that a different retrovirus may be responsible for driving this high level of diversity in rhesus macaque A3F.A potential candidate is simian foamy virus (SFV), a member of the Spumavirus genus of retroviruses.FVs are distinct from other retroviruses in many aspects, including their gene expression, protein processing, and replication.They are found in non-human primates, cats, cows, and horses, as well as humans who have been infected by non-human primates.They cause a persistent infection, but there are no known pathologies in natural or human hosts. 50,51It has been suggested that SFVs have coevolved with their Old World primate hosts for over 30 million years, marking them as the oldest known RNA viruses in vertebrates. 52A3 proteins have been implicated in restriction of FVs, and the FV accessory protein Bet has been found to counteract A3 restriction mechanisms analogously to HIV-1 Vif. 53,54The relatively old age of FVs as compared to SIVs, their ubiquitous presence in Old World primates, and the existence of an accessory protein that may have anti-A3 properties together suggest that SFVs may have acted as a selective pressure on A3F.This investigation offered valuable insight into the genetic diversity of A3F in Old World primates.The presence of unusually high intra-and inter-species variability indicates that A3F has been continuously exposed to positive selective pressure.While SIV may not necessarily be the virus responsible for driving the polymorphism in rhesus macaque A3F, understanding the restriction factor's genetic variability opens a line of inquiry into other potential selective pressures.It is also important to note that only one linked polymorphic patch was investigated; there are several other sites that may play a role in interactions with Vif.Further mutagenesis studies will reveal whether any of these sites play a role in SIV restriction, or whether it truly is another virus altogether that has provided the selective pressure on rhA3F.In addition, the sooty mangabey, crabeating macaque, and pigtail macaque A3F clones have not been fully characterized in terms of their susceptibility to various Vifs.This information will provide a more complete picture of the role that Vif has played in driving A3F evolution, and further elucidate mechanisms of cross-species transmission of primate lentiviruses.figure 3: inter-species diversity of a3f in old world primates.the phylogenetic tree shows the degree of divergence between rhesus macaque a3f clones (rh), sooty mangabey a3f clones (sm), crab-eating macaque a3f clones (mf), and pigtail macaque a3f clones (mn). the species tend to group together into distinct lineages, and diversity within species is evident within these lineages.

figure 1 :
figure 1: a3 proteins interfere with lentiviral replication in the absence of vif. the top half of the figure exemplifies a cell infected with a virus that does not encode vif.host a3 proteins (triangles) incorporate into a budding virion, and then interfere with the process of reverse transcription in the target cell to prevent further viral replication.the bottom half of the figure exemplifies a cell infected with a virus that encodes vif.vif (¾ circles) prevents a3 proteins from incorporating into the budding virion by targeting them for proteosomal degradation, allowing replication to continue unhindered in the target cell.

figure 2 :
figure 2: distribution of polymorphic sites in rhesus macaque a3f. the dark gray boxes represent the cytidine deaminase domains in a3f, and the blue shading indicates the catalytic motif.the lines show the approximate location of sites found to contain non-synonymous polymorphisms in two or more rhesus macaque clones.the codon numbers and coded amino acids are indicated above or below the image.only polymorphisms that were seen in at least two clones are included.bolded lines and codon numbers indicate that the polymorphism was found in multiple animals.

figure 4 :
figure 4: distribution of positively selected sites in primate a3f. the grey boxes represent the cytidine deaminase domains in a3f, and the shaded areas indicates the catalytic motifs.the triangles show the approximate locations of positively selected codons identified via dn/ds analysis.these sites are distributed fairly evenly throughout the a3f gene.