|
||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
ARTICLE |
CORRESPONDENCE Stuart C. Ray: sray{at}jhmi.edu
|
|
|---|
A virus capable of genetic variation and of causing chronic infection will evolve to optimize its fitness in each host, a process which is the net sum of immune recognition (positive selection) and functional constraint on replication (negative selection). Because an estimated 1012 virions are produced each day through an error-prone, nonproofreading NS5B RNA polymerase, hepatitis C virus (HCV) is especially capable of viral evolution (1, 2). However, we previously showed that evolution is not driven by replication alone. In the acute phase of infection before adaptive immune responses (but after weeks of replication supporting a viral RNA level of more than 105 IU/ml), the same major viral variant was detected in each of a serial passage of eight chimpanzees (3). In contrast, the sequence of envelope genes, particularly HVR1, changes in virtually all humans who have been persistently infected (including the source of the inoculum passaged through this chimpanzee lineage [4]), a notable exception being persons with attenuated humoral immune responses (agammaglobulinemia), who have been shown to have reduced variability in HVR1 (5). Longitudinal studies of chimpanzees experimentally infected with HCV have revealed that amino acid replacements in immunodominant CD8+ T cell epitopes presented on MHC class I in an allele-restricted manner contribute to viral persistence (6). Thus, we hypothesized that the net evolution of HCV would demonstrate both functional constraint (reversion of sequences toward consensus) as well as positive pressure (and thus reveal immunodominant epitopes).
Although it required that persons be infected with the same inoculum, it was possible to test this hypothesis because between May 1977 and November 1978 over 500 women were inadvertently infected with HCV from a single acutely infected source, as a result of treatment with contaminated anti-D immune globulin (7). In a single amplicon, a 5.2-kb cDNA spanning 5'UTR through the NS3NS4A junction was cloned from serum collected from 22 women 1822 yr after infection, as well as in two specimens of frozen plasma from the inoculum donor.
| Results and Discussion |
|---|
|
|
|---|
|
|
Rather than convergent evolution, these results might have been due to shared selection and then divergence (at other sites) from a rare variant that we did not detect in the inoculum. We did detect one clone (clone #5) among 20 in the inoculum material that carried the RxTxxFxS motif at positions 394401, but it was highly divergent from all other sequences described here, as evident from its position in the phylogenetic tree (Fig. 1 B), and therefore less likely than the other 19 sequences to represent the founder strain for these women. Although it is possible that a less divergent RxTxxFxS clone was present in the inoculum at very low frequency, shared selection of such a rare variant would support the same conclusion.
Because HVR1 is a potential target of both humoral and cellular immunity and the precise recognition motifs remain difficult to identify due to the extreme variability, further examination of positive selection was focused on nonenvelope genes, and in particular, on known MHC class Irestricted epitopes. Consistent with immune selection hypothesis, the number of changes in sites of known epitopes associated with specific class I alleles was significantly greater than the number of changes in other sites (RR 1.6, P < 0.05), and greater than what was found in that same site for persons who did not possess the allele (Table I, with detail for all epitopes in Fig. S2). For HLA B*35, changes in epitopes were observed only in women having that allele, and for B*37, sequence changes were 8.5 times as likely to occur in an epitope associated with an allele in women having the allele as compared with those that lacked it (P < 0.001). An example of such an epitope is shown in a 38amino acid region that spans an HLA A2 motif (Fig. 3, A and B). Mutations from R to K were noted at position 1397 outside the A2 epitope, and mutations from G to S were noted within the epitope in 8 and 6 of 22 women, respectively. However, whereas R to K mutation was noted in a similar percentage of A2-positive and A2-negative women (41.7% vs. 30.0%, P > 0.10), all G to S mutations were observed in A2-positive women (P = 0.015), consistent with immune escape as has been observed in the simian immunodeficiency virus macaque model (10) and chimpanzees infected with HCV (6).
|
|
Collectively, these findings indicate that HCV sequence change is a nonrandom process that reflects negative selection (change is disadvantageous) as well as positive selection. Moreover, we find evidence that positive selection represents both the direct effect of pressure applied by immune responses in the current host (in this case, HLA class Irestricted CD8+ cytotoxic T lymphocytes) as well as reversion of sequence toward a consensus, as we saw with envelope sequences.
To independently evaluate this paradigm, we compared the amino acid sequences of these women with an HCV 1b consensus sequence. For the epitopes that showed evidence of HLA class Irestricted positive selection (a significantly increased risk of mutations from the inoculum occurred when the restricting allele was present), there was also an increased number of changes away from the 1b prototype consensus in women with one of these alleles, but not those without (Table II). In addition, for epitopes that showed the converse effect, i.e., evidence of positive selection when the allele was absent (a significantly lower risk of mutations from the inoculum when the restricting allele was present), there was also an increased number of changes toward consensus in those who lacked the allele versus those who had the allele, suggesting reversion (Table II). These findings are supported by an accompanying report, Cox et al. (11), which shows that amino acid substitutions in CD8+ T cell epitopes are associated with a loss of T cell recognition during acute infection, whereas nonepitope changes revert toward consensus at a rate much higher than expected by chance.
|
Indirectly, these results suggest that immune escape is costly to the virus. Fitness is conventionally measured as a competition among genetic variants, and when the viral population size is large the most successful variants present at any one time in a host are by definition the most fit under the overall selection pressures. We assume that over very long periods of time (relative to the viral life cycle) the residues in the viral genome have the opportunity to vary substantially, and independently to a first approximation (this is a fundamental assumption of phylogenetic analysis). Therefore, although there may be some covariation, analytically the positions in the sequence can be considered independent variables. Finding a strong positive correlation between the presence of HLA alleles and substitutions in allele-associated epitopes suggests that those changes increase viral fitness in the presence of the associated immune response. Likewise, a strong association between the absence of alleles and substitutions in allele-associated epitopes, particularly when found to represent reversion to consensus, suggests that positive selection in a previous host carried a fitness cost in terms of viral replicative capacity.
Prior studies have demonstrated reversion of CTL escapevariant sequences in macaques experimentally infected with SIV (13), reversion of one epitope each of HIV-1 and HCV in humans (14, 15), and evidence of HIV-1 adaptation to common HLA alleles (16). This is the first report of viral adaptation to multiple HLA alleles across multiple genes in HCV, and provides additional support for the suggestion, based on minimizing differences between vaccine and circulating strains, that vaccine effectiveness may be enhanced by using a consensus (17) or ancestral (18) sequence.
The ability of viruses to restrict adaptive immune responses and evade those that are formed contributes to persistence and is a major barrier to vaccine development. These data suggest that escape variants have greater fitness in the presence of an individual host's immune response, and that immune evasion contributes to the sequence divergence observed in a each persistently infected host. Nonetheless, this divergence may actually reduce the fitness of the virus in the population (that is, in other hosts). From an evolutionary perspective, these forces maintain the virus as a distinct pathogen. However, the data also suggest that immune responses to consensus sequences (rather than a product based on the sequence in a given host) may establish the highest barrier to viral escape and consequently the most effective protection against chronic infection.
| Materials and methods |
|---|
|
|
|---|
Hemigenomic cDNA cloning.
The region encoding Core, E1, E2, p7, NS2, and NS3 was amplified and cloned as previously described (20), and 40 clones per specimen were stored. For each specimen, envelope sequences from 10 random randomly selected clones were determined using primer H77-1868a21 (20) on a PRISM version 3100 sequencer (ABI). These sequence data are available from GenBank/EMBL/DDBJ under accession nos. DQ061331 through DQ061378.
Estimation of consensus sequence.
An alignment of full-length HCV subtype 1b sequences was obtained from the Los Alamos National Laboratories HCV database (http://hcv.lanl.gov). The alignment was edited by hand to remove gaps introduced for alignment to other genotypes, and to remove duplicate sequences from the same human source and those obtained from nonhuman sources. The resulting alignment included 83 sequences. A majority-rule consensus sequence was formed, with residues occurring in less than 42 sequences flagged as nonconsensus. Changes in the anti-D recipients were then classified as "toward" (change results in a residue matching the consensus) or "away" (change results in a residue not matching the consensus, or residue is nonconsensus).
Estimation of the likelihood of convergence.
The expected frequency of covariation assuming independence was calculated as the product of the marginal frequencies, and compared with the observed value using the Chi-squared distribution with three degrees of freedom.
If we assume that all amino acid replacements are equally likely over a time period that is very long with respect to the rate of mutation, then sharing of amino acids at four variable sites in just two study subjects would be expected to occur at a frequency of 1/204 or 0.00000625. Of course, all amino acid replacements are not equally likely, even in the highly variable HVR1 (8); therefore, the likelihood of sharing of four variable sites by two subjects would be higher, e.g., 1/34 or 0.012 if each site is equally likely to have one of three residues. Because the likelihood of shared residues at variable sites in multiple study subjects is the product of such probabilities, the observed findings in this study are clearly incompatible with random substitution and most consistent with convergent evolution.
Phylogenetic analysis
Sequences were aligned using ClustalX (21), codon boundaries were restored by hand in BioEdit (22), and phylogenetic analysis was performed using PAUP* version 4b10 (Sinauer Associates) using a HKY85+G model and parameters (Ti/Tv 2.78,
= 0.37) selected with the aid of ModelTest (23). Initial results from one specimen were consistent with subtype 1a, and that specimen was not examined further. Reference sequences included 3 from subtype 1a, 83 from subtype 1b (including AF313916), 2 from subtype 1c, 6 from subtype 2a, 8 from subtype 2b, 1 each from subtypes 2c and 2k, 4 from subtype 3a, and 1 each from subtypes 3b, 3k, 4a, 5a, 6a, 6b, 6d, 6g, 6h, and 6k (obtained from http://hcv.lanl.gov).
VarPlot was used to calculate nonsynonymous and synonymous distances using the method of Nei and Gojobori, in a sliding window 20 codons wide, moving in 1 codon steps, as previously described (24).
Sequence logos.
A sequence logo is a graphical representation of a group of aligned sequences, at each position of which the frequency of each residue is represented by the height of the single-letter representation of that residue (25). The sequence logos in Fig. 2 were generated using a novel software program, VisSPA (Visual Sequence Pattern Analysis, available on request from the author S.C. Ray). The algorithm is identical to that described for type 2 logos by Gorodkin et al. (26), except that the a priori distributions for the logo are calculated empirically from input sequences, and missing values in the a priori distribution are assigned the lowest frequency of residues at that site (if more than one state is represented) or 1/20 if the a priori distribution has only one residue at that site.
Online supplemental material
Fig. S1 shows the variability of HVR1 in each study subject. Fig. S2 shows the sequences at the sites of known MHC class Irestricted alleles in each study subject. Online supplemental material is available at http://www.jem.org/cgi/content/full/jem.20050122/DC1.
| Acknowledgments |
|---|
This research was funded by National Institutes of Health grants U19 AI40035 and R01 DA016078, and Irish Health Research Board grant HC08/97.
The authors have no conflicting financial interests.
Submitted: 13 January 2005
Accepted: 5 May 2005
| References |
|---|
|
|
|---|
1 Neumann, A.U., N.P. Lam, H. Dahari, D.R. Gretch, T.E. Wiley, T.J. Layden, and A.S. Perelson. 1998. Hepatitis C viral dynamics in vivo and the antiviral efficacy of interferon-alpha therapy. Science. 282:103107.
2 Martell, M., J.I. Esteban, J. Quer, J. Genesca, A. Weiner, R. Esteban, Guardia, and J. Gomez. 1992. Hepatitis C virus (HCV) circulates as a population of different but closely related genomes: quasispecies nature of HCV genome distribution. J.Virol. 66:32253229.
3 Ray, S.C., Q. Mao, R.E. Lanford, S. Bassett, O. Laeyendecker, Y.M. Wang, and D.L. Thomas. 2000. Hypervariable region 1 sequence stability during hepatitis C virus replication in chimpanzees. J. Virol. 74:30583066.
4 Ogata, N., H.J. Alter, R.H. Miller, and R.H. Purcell. 1991. Nucleotide sequence and mutation rate of the H strain of hepatitis C virus. Proc. Natl. Acad. Sci. USA. 88:33923396.
5 Gaud, U., B. Langer, T. Petropoulou, H.C. Thomas, and P. Karayiannis. 2003. Changes in hypervariable region 1 of the envelope 2 glycoprotein of hepatitis C virus in children and adults with humoral immune defects. J. Med. Virol. 69:350356.[CrossRef][Medline]
6 Erickson, A.L., Y. Kimura, S. Igarashi, J. Eichelberger, M. Houghton, J. Sidney, D. McKinney, A. Sette, A.L. Hughes, and C.M. Walker. 2001. The outcome of hepatitis C virus infection is predicted by escape mutations in epitopes targeted by cytotoxic T lymphocytes. Immunity. 15:883895.[CrossRef][Medline]
7 Kenny-Walsh, E. 1999. Clinical outcomes after hepatitis C infection from contaminated anti-D immune globulin. Irish Hepatology Research Group. N. Engl. J. Med. 340:12281233.
8 McAllister, J., C. Casino, F. Davidson, J. Power, E. Lawlor, P.L. Yap, P. Simmonds, and D.B. Smith. 1998. Long-term evolution of the hypervariable region of hepatitis C virus in a common-source-infected cohort. J. Virol. 72:48934905.
9 Penin, F., C. Combet, G. Germanidis, P.O. Frainais, G. Deleage, and J.M. Pawlotsky. 2001. Conservation of the conformation and positive charges of hepatitis C virus E2 envelope glycoprotein hypervariable region 1 points to a role in cell attachment. J. Virol. 75:57035710.
10 Allen, T.M., D.H. O'Connor, P. Jing, J.L. Dzuris, B.R. Mothe, T.U. Vogel, E. Dunphy, M.E. Liebl, C. Emerson, N. Wilson, et al. 2000. Tat-specific cytotoxic T lymphocytes select for SIV escape variants during resolution of primary viraemia. Nature. 407:386390.[CrossRef][Medline]
11 Cox, A.L., T. Mosbruger, Q. Mao, Z. Liu, X.-H. Wang, H.-C. Yang, J. Sidney, A. Sette, D. Pardoll, D.L. Thomas, and S.C. Ray. Cellular immune selection with hepatitis C virus persistence in humans. J. Exp. Med. 201:17411752.
12 Gandhi, R.T., A. Wurcel, E.S. Rosenberg, M.N. Johnston, N. Hellmann, M. Bates, M.S. Hirsch, and B.D. Walker. 2003. Progressive reversion of human immunodeficiency virus type 1 resistance mutations in vivo after transmission of a multiply drug-resistant virus. Clin. Infect. Dis. 37:16931698.[CrossRef][Medline]
13 Friedrich, T.C., E.J. Dodds, L.J. Yant, L. Vojnov, R. Rudersdorf, C. Cullen, D.T. Evans, R.C. Desrosiers, B.R. Mothe, J. Sidney, et al. 2004. Reversion of CTL escape-variant immunodeficiency viruses in vivo. Nat. Med. 10:275281.[CrossRef][Medline]
14 Leslie, A.J., K.J. Pfafferott, P. Chetty, R. Draenert, M.M. Addo, M. Feeney, Y. Tang, E.C. Holmes, T. Allen, J.G. Prado, et al. 2004. HIV evolution: CTL escape mutation and reversion after transmission. Nat. Med. 10:282289.[CrossRef][Medline]
15 Timm, J., G.M. Lauer, D.G. Kavanagh, I. Sheridan, A.Y. Kim, M. Lucas, T. Pillay, K. Ouchi, L.L. Reyor, J.S. Zur Wiesch, et al. 2004. CD8 epitope escape and reversion in acute HCV infection. J. Exp. Med. 200:15931604.
16 Moore, C.B., M. John, I.R. James, F.T. Christiansen, C.S. Witt, and S.A. Mallal. 2002. Evidence of HIV-1 adaptation to HLA-restricted immune responses at a population level. Science. 296:14391443.
17 Gaschen, B., J. Taylor, K. Yusim, B. Foley, F. Gao, D. Lang, V. Novitsky, B. Haynes, B.H. Hahn, T. Bhattacharya, and B. Korber. 2002. Diversity considerations in HIV-1 vaccine selection. Science. 296:23542360.
18 Nickle, D.C., M.A. Jensen, G.S. Gottlieb, D. Shriner, G.H. Learn, A.G. Rodrigo, and J.I. Mullins. 2003. Consensus and ancestral state HIV vaccines. Science. 299:15151518.
19 Fanning, L.J., E. Kenny-Walsh, and F. Shanahan. 2004. Persistence of hepatitis C virus in a white population: associations with human leukocyte antigen class 1. Hum. Immunol. 65:745751.[CrossRef][Medline]
20 Liu, Z., D.M. Netski, Q. Mao, O. Laeyendecker, J.R. Ticehurst, X.H. Wang, D.L. Thomas, and S.C. Ray. 2004. Accurate representation of the hepatitis C virus quasispecies in 5.2-kilobase amplicons. J. Clin. Microbiol. 42:42234229.
21 Jeanmougin, F., J.D. Thompson, M. Gouy, D.G. Higgins, and T.J. Gibson. 1998. Multiple sequence alignment with Clustal X. Trends Biochem. Sci. 23:403405.[CrossRef][Medline]
22 Hall, T.A. 2001. BioEdit: Biological sequence alignment editor for Windows 95/98/NT version 5.0.7. software. http://www.mbio.ncsu.edu/RNaseP/info/programs/BIOEDIT/bioedit.html (accessed March 5, 2003).
23 Posada, D., and K.A. Crandall. 1998. MODELTEST: testing the model of DNA substitution. Bioinformatics. 14:817818.
24 Ray, S.C., Y.M. Wang, O. Laeyendecker, J. Ticehurst, S.A. Villano, and D.L. Thomas. 1998. Acute hepatitis C virus structural gene sequences as predictors of persistent viremia: hypervariable region 1 as decoy. J. Virol. 73:29382946.
25 Schneider, T.D., and R.M. Stephens. 1990. Sequence logos: a new way to display consensus sequences. Nucleic Acids Res. 18:60976100.
26 Gorodkin, J., L.J. Heyer, S. Brunak, and G.D. Stormo. 1997. Displaying the information contents of structural RNA alignments: the structure logos. Comput. Appl. Biosci. 13:583586.
27 Chang, K.M., B. Rehermann, J.G. McHutchison, C. Pasquinelli, S. Southwood, A. Sette, and F.V. Chisari. 1997. Immunological significance of cytotoxic T lymphocyte epitope variants in patients chronically infected by the hepatitis C virus. J. Clin. Invest. 100:23762385.[Medline]
28 Ward, S., G. Lauer, R. Isba, B. Walker, and P. Klenerman. 2002. Cellular immune responses against hepatitis C virus: the evidence base 2002. Clin. Exp. Immunol. 128:195203.[CrossRef][Medline]
This article has been cited by other articles:
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| TABLE OF CONTENTS |
|