<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
	<ui>1743-422X-3-103</ui>
	<ji>1743-422X</ji>
	<fm>
		<dochead>Research</dochead>
		<bibl>
			<title>
				<p>Comparative analysis of hepatitis C virus phylogenies from coding and non-coding regions: the 5' untranslated region (UTR) fails to classify subtypes</p>
			</title>
			<aug>
				<au id="A1" ca="yes">
					<snm>Hraber</snm>
					<mi>T</mi>
					<fnm>Peter</fnm>
					<insr iid="I1"/>
					<email>phraber@lanl.gov</email>
				</au>
				<au id="A2">
					<snm>Fischer</snm>
					<fnm>William</fnm>
					<insr iid="I1"/>
					<email>wfischer@lanl.gov</email>
				</au>
				<au id="A3">
					<snm>Bruno</snm>
					<mi>J</mi>
					<fnm>William</fnm>
					<insr iid="I1"/>
					<email>billb@lanl.gov</email>
				</au>
				<au id="A4">
					<snm>Leitner</snm>
					<fnm>Thomas</fnm>
					<insr iid="I1"/>
					<email>tkl@lanl.gov</email>
				</au>
				<au id="A5">
					<snm>Kuiken</snm>
					<fnm>Carla</fnm>
					<insr iid="I1"/>
					<email>kuiken@lanl.gov</email>
				</au>
			</aug>
			<insg>
				<ins id="I1">
					<p>Theoretical Biology and Biophysics, T-10 MS K710, Los Alamos National Laboratory, Los Alamos NM 87545 USA</p>
				</ins>
			</insg>
			<source>Virology Journal</source>
			<issn>1743-422X</issn>
			<pubdate>2006</pubdate>
			<volume>3</volume>
			<issue>1</issue>
			<fpage>103</fpage>
			<url>http://www.virologyj.com/content/3/1/103</url>
			<xrefbib>
				<pubidlist>
					<pubid idtype="pmpid">17169155</pubid>
					<pubid idtype="doi">10.1186/1743-422X-3-103</pubid>
				</pubidlist>
			</xrefbib>
		</bibl>
		<history>
			<rec>
				<date>
					<day>06</day>
					<month>11</month>
					<year>2006</year>
				</date>
			</rec>
			<acc>
				<date>
					<day>14</day>
					<month>12</month>
					<year>2006</year>
				</date>
			</acc>
			<pub>
				<date>
					<day>14</day>
					<month>12</month>
					<year>2006</year>
				</date>
			</pub>
		</history>
		<cpyrt>
			<year>2006</year>
			<collab>Hraber et al; licensee BioMed Central Ltd.</collab>
			<note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
		</cpyrt>
		<abs>
			<sec>
				<st>
					<p>Abstract</p>
				</st>
				<sec>
					<st>
						<p>Background</p>
					</st>
					<p>The duration of treatment for HCV infection is partly indicated by the genotype of the virus. For studies of disease transmission, vaccine design, and surveillance for novel variants, subtype-level classification is also needed. This study used the Shimodaira-Hasegawa test and related statistical techniques to compare phylogenetic trees obtained from coding and non-coding regions of a whole-genome alignment for the reliability of subtyping in different regions.</p>
				</sec>
				<sec>
					<st>
						<p>Results</p>
					</st>
					<p>Different regions of the HCV genome yield inconsistent phylogenies, which can lead to erroneous conclusions about classification of a given infection. In particular, the highly conserved 5' untranslated region (UTR) yields phylogenetic trees with topologies that differ from the HCV polyprotein and complete genome phylogenies. Phylogenetic trees from the NS5B gene reliably cluster related subtypes, and yield topologies consistent with those of the whole genome and polyprotein.</p>
				</sec>
				<sec>
					<st>
						<p>Conclusion</p>
					</st>
					<p>These results extend those from previous studies and indicate that, unlike the NS5B gene, the 5' UTR contains insufficient variation to resolve HCV classifications to the level of viral subtype, and fails to distinguish genotypes reliably. Use of the 5' UTR for clinical tests to characterize HCV infection should be replaced by a subtype-informative test.</p>
				</sec>
			</sec>
		</abs>
	</fm>
	<meta>
		<classifications>
			<classification type="bmc" subtype="user_supplied_xml" id="endnote"/>
		</classifications>
	</meta>
	<bdy>
		<sec>
			<st>
				<p>Background</p>
			</st>
			<p>In treating infection with hepatitis C virus (HCV), knowledge of a patient's viral genotype informs the choice of appropriate therapy <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr></abbrgrp>. Although the HCV subtype afflicting a patient is not currently used to make clinical treatment decisions, knowing the viral subtype is important for studies of its origin, transmission, and evolution <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr></abbrgrp>. For example, newly emerging variants can be characterized better when they can be assigned an unequivocal subtype classification <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>. Molecular epidemiology analyses rely on information about sequence variation at the subtype level <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr></abbrgrp>. Vaccine-design strategies are informed by the diversity of HCV variants and the antigenic determinants (epitopes) therein <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr></abbrgrp>. The risk of hepatocellular carcinoma, a frequent complication of HCV infection, might be assessed better in light of HCV subtype <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>. Thus, effective methods for both genotype and subtype classification are important tools to manage HCV infections.</p>
			<p>Techniques to infer phylogenies combine an optimality criterion with an algorithm to search for the best tree. Optimality criteria quantify how well the tree describes the data, and are either distance-based or character-based <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr></abbrgrp>. A distance-based algorithm, such as neighbor joining, can quickly construct a single tree that minimizes all the pairwise distances among taxa. However, this approach is less able to use information from different taxa to model variation in evolutionary rates across sites than the optimality criterion of maximum likelihood (<abbrgrp><abbr bid="B9">9</abbr></abbrgrp>, p. 175). Search algorithms are deployed by character-based methods to find trees that best explain the data, given an evolutionary model with known assumptions. The search algorithms of character-based methods take more time to evaluate alternative candidate trees than rapid distance-based methods. Perhaps for this reason, many more distance-based than character-based phylogenies of HCV genotypes have been published. However, maximum-likelihood phylogenetic inference is known to outperform distance-based methods when such complications as substitution rate heterogeneity or covariation between sites are present <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr></abbrgrp>. Formal comparisons between topologies are thus more appropriate for maximum-likelihood phylogenies than for the approximations that result from distance-based methods.</p>
			<p>This study evaluates phylogenies derived from coding (NS5B) and non-coding (5' UTR) regions of whole-genome HCV sequences for consistent classification of viral subtypes into distinct genetic groups, or clades, with the aim of evaluating their suitability for genotype and subtype classification. Concordance with the whole-genome phylogeny is desired. Nucleotide characters in NS5B are over five times more abundant than in the 5' UTR, though only a small portion of this region is amplified for subtyping. To compensate for this, we also considered a smaller, oft-studied portion of NS5B that we call the "Okamoto region" (from nt 8282 to 8610 in the H77 reference genome) for its ability to represent the phylogeny of NS5B and the entire HCV genome. We tested the hypothesis that phylogenetic trees obtained from different genomic regions of HCV differ significantly. We also compared tree topologies for their ability to group genotypes and subtypes consistently into clades.</p>
		</sec>
		<sec>
			<st>
				<p>Results</p>
			</st>
			<sec>
				<st>
					<p>Phylogenetic inferences</p>
				</st>
				<p>Among the 38 whole-genome HCV sequences representing 18 confirmed subtypes as summarized in Table <tblr tid="T1">1</tblr>, the most general substitution model, the general time reversible model (GTR, also known as REV) with a discrete gamma approximation for rate heterogeneity, was consistently supported as superior among the twelve nucleotide substitution models evaluated (not shown). Models adjusted for rate heterogeneity consistently fit the data better than models that assume a fixed evolutionary rate across sites (not shown). Substitution models with fewer parameters or an assumption of equal base compositions performed significantly worse than GTR, regardless of whether or not the sequences analyzed contained protein-coding regions. Adding a parameter for the estimated proportion of invariant sites significantly improved the substitution model, yielding parameters as shown in Table <tblr tid="T2">2</tblr>. The same model was selected when the Akaike information criterion (AIC) was adjusted to compensate for a low ratio of sample data to parameters (not shown). Thus, GTR with a gamma distribution of evolutionary rates per site and accommodation of invariant sites (GTR+&#915;+I) is the best substitution model for HCV variation among those considered, and was used for maximum-likelihood phylogeny inference.</p>
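				<p>The model-selection step described above can be illustrated with a minimal sketch. It assumes the standard definitions of the AIC and its small-sample correction (AICc), counts only substitution-model parameters (not branch lengths), and takes the alignment length from Table 2 as the sample size. The log-likelihood values are hypothetical placeholders chosen only so that the script runs; they are not results from this study, and the four candidate models are an illustrative subset.</p>

```python
def aic(ln_l, k):
    """Akaike information criterion: 2k - 2 ln L."""
    return 2 * k - 2 * ln_l


def aicc(ln_l, k, n):
    """Small-sample AIC; the extra term vanishes as n grows."""
    return aic(ln_l, k) + (2 * k * (k + 1)) / (n - k - 1)


N_SITES = 9791  # whole-genome alignment length from Table 2

# name: (ln L, free substitution-model parameters).
# The ln L values are PLACEHOLDERS, not values from the study.
CANDIDATES = {
    "JC69": (-160000.0, 0),      # equal frequencies, one rate
    "HKY85+G": (-150500.0, 5),   # 3 frequencies + ts/tv ratio + gamma shape
    "GTR+G": (-150100.0, 9),     # 3 frequencies + 5 rates + gamma shape
    "GTR+G+I": (-150050.0, 10),  # as above + proportion of invariant sites
}

scores = {name: aicc(ln_l, k, N_SITES) for name, (ln_l, k) in CANDIDATES.items()}
best = min(scores, key=scores.get)  # lowest score is preferred

for name in sorted(scores, key=scores.get):
    print(f"{name:10s} AICc = {scores[name]:.1f}")
print("preferred model:", best)
```

				<p>Because the correction term depends on the ratio of parameters to sites, it matters most for short alignments such as the 5' UTR and the Okamoto region.</p>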
				<tbl id="T1">
					<title>
						<p>Table 1</p>
					</title>
					<caption>
						<p>Confirmed subtypes and accession numbers of HCV genomes studied.</p>
					</caption>
					<tblbdy cols="2">
						<r>
							<c ca="center">
								<p>
									<b>Subtype</b>
								</p>
							</c>
							<c ca="left">
								<p>
									<b>Database Accession Numbers</b>
								</p>
							</c>
						</r>
						<r>
							<c cspan="2">
								<hr/>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>1a</p>
							</c>
							<c ca="left">
								<p>[EMBL:<ext-link ext-link-type="embl" ext-link-id="AF009606">AF009606</ext-link>, EMBL:<ext-link ext-link-type="embl" ext-link-id="AF511950">AF511950</ext-link>, EMBL:<ext-link ext-link-type="embl" ext-link-id="D10749">D10749</ext-link>, EMBL:<ext-link ext-link-type="embl" ext-link-id="M62321">M62321</ext-link>]</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>1b</p>
							</c>
							<c ca="left">
								<p>[EMBL:<ext-link ext-link-type="embl" ext-link-id="AF483269">AF483269</ext-link>, EMBL:<ext-link ext-link-type="embl" ext-link-id="AJ000009">AJ000009</ext-link>, EMBL:<ext-link ext-link-type="embl" ext-link-id="D11168">D11168</ext-link>, EMBL:<ext-link ext-link-type="embl" ext-link-id="L02836">L02836</ext-link>]</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>1c</p>
							</c>
							<c ca="left">
								<p>[EMBL:<ext-link ext-link-type="embl" ext-link-id="AY051292">AY051292</ext-link>, EMBL:<ext-link ext-link-type="embl" ext-link-id="AY651061">AY651061</ext-link>, EMBL:<ext-link ext-link-type="embl" ext-link-id="D14853">D14853</ext-link>, EMBL:<ext-link ext-link-type="embl" ext-link-id="E08443">E08443</ext-link>]</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>2a</p>
							</c>
							<c ca="left">
								<p>[EMBL:<ext-link ext-link-type="embl" ext-link-id="AB047639">AB047639</ext-link>, EMBL:<ext-link ext-link-type="embl" ext-link-id="AF169003">AF169003</ext-link>, EMBL:<ext-link ext-link-type="embl" ext-link-id="AF169005">AF169005</ext-link>, EMBL:<ext-link ext-link-type="embl" ext-link-id="D00944">D00944</ext-link>]</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>2b</p>
							</c>
							<c ca="left">
								<p>[EMBL:<ext-link ext-link-type="embl" ext-link-id="AB030907">AB030907</ext-link>, EMBL:<ext-link ext-link-type="embl" ext-link-id="AF238486">AF238486</ext-link>, EMBL:<ext-link ext-link-type="embl" ext-link-id="AY232746">AY232746</ext-link>, EMBL:<ext-link ext-link-type="embl" ext-link-id="D10988">D10988</ext-link>]</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>2c</p>
							</c>
							<c ca="left">
								<p>[EMBL:<ext-link ext-link-type="embl" ext-link-id="D50409">D50409</ext-link>]</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>2k</p>
							</c>
							<c ca="left">
								<p>[EMBL:<ext-link ext-link-type="embl" ext-link-id="AB031663">AB031663</ext-link>]</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>3a</p>
							</c>
							<c ca="left">
								<p>[EMBL:<ext-link ext-link-type="embl" ext-link-id="AF046866">AF046866</ext-link>, EMBL:<ext-link ext-link-type="embl" ext-link-id="D17763">D17763</ext-link>, EMBL:<ext-link ext-link-type="embl" ext-link-id="D28917">D28917</ext-link>, EMBL:<ext-link ext-link-type="embl" ext-link-id="X76918">X76918</ext-link>]</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>3b</p>
							</c>
							<c ca="left">
								<p>[EMBL:<ext-link ext-link-type="embl" ext-link-id="D49374">D49374</ext-link>, EMBL:<ext-link ext-link-type="embl" ext-link-id="E10840">E10840</ext-link>]</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>3k</p>
							</c>
							<c ca="left">
								<p>[EMBL:<ext-link ext-link-type="embl" ext-link-id="D63821">D63821</ext-link>]</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>4a</p>
							</c>
							<c ca="left">
								<p>[EMBL:<ext-link ext-link-type="embl" ext-link-id="Y11604">Y11604</ext-link>]</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>5a</p>
							</c>
							<c ca="left">
								<p>[EMBL:<ext-link ext-link-type="embl" ext-link-id="Y13184">Y13184</ext-link>]</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>6a</p>
							</c>
							<c ca="left">
								<p>[EMBL:<ext-link ext-link-type="embl" ext-link-id="AY859526">AY859526</ext-link>, EMBL:<ext-link ext-link-type="embl" ext-link-id="Y12083">Y12083</ext-link>]</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>6b</p>
							</c>
							<c ca="left">
								<p>[EMBL:<ext-link ext-link-type="embl" ext-link-id="D84262">D84262</ext-link>]</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>6d</p>
							</c>
							<c ca="left">
								<p>[EMBL:<ext-link ext-link-type="embl" ext-link-id="D84263">D84263</ext-link>]</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>6g</p>
							</c>
							<c ca="left">
								<p>[EMBL:<ext-link ext-link-type="embl" ext-link-id="D63822">D63822</ext-link>]</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>6h</p>
							</c>
							<c ca="left">
								<p>[EMBL:<ext-link ext-link-type="embl" ext-link-id="D84265">D84265</ext-link>]</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>6k</p>
							</c>
							<c ca="left">
								<p>[EMBL:<ext-link ext-link-type="embl" ext-link-id="D84264">D84264</ext-link>]</p>
							</c>
						</r>
					</tblbdy>
				</tbl>
				<tbl id="T2">
					<title>
						<p>Table 2</p>
					</title>
					<caption>
						<p>Substitution model (GTR+&#915;+I) parameters and alignment properties.</p>
					</caption>
					<tblbdy cols="5">
						<r>
							<c ca="center">
								<p>
									<b>Model Parameter</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>Genome</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>Polyprotein</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>5' UTR</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>Okamoto</b>
								</p>
							</c>
						</r>
						<r>
							<c cspan="5">
								<hr/>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>A proportion</p>
							</c>
							<c ca="center">
								<p>0.2034</p>
							</c>
							<c ca="center">
								<p>0.2046</p>
							</c>
							<c ca="center">
								<p>0.1920</p>
							</c>
							<c ca="center">
								<p>0.2288</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>C proportion</p>
							</c>
							<c ca="center">
								<p>0.3261</p>
							</c>
							<c ca="center">
								<p>0.3267</p>
							</c>
							<c ca="center">
								<p>0.2913</p>
							</c>
							<c ca="center">
								<p>0.3302</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>G proportion</p>
							</c>
							<c ca="center">
								<p>0.2675</p>
							</c>
							<c ca="center">
								<p>0.2698</p>
							</c>
							<c ca="center">
								<p>0.3081</p>
							</c>
							<c ca="center">
								<p>0.2667</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>U proportion</p>
							</c>
							<c ca="center">
								<p>0.2030</p>
							</c>
							<c ca="center">
								<p>0.1989</p>
							</c>
							<c ca="center">
								<p>0.2086</p>
							</c>
							<c ca="center">
								<p>0.1743</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>A-C rate</p>
							</c>
							<c ca="center">
								<p>1.6280</p>
							</c>
							<c ca="center">
								<p>1.5920</p>
							</c>
							<c ca="center">
								<p>16.9081</p>
							</c>
							<c ca="center">
								<p>1.2156</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>A-G rate</p>
							</c>
							<c ca="center">
								<p>5.9755</p>
							</c>
							<c ca="center">
								<p>5.8823</p>
							</c>
							<c ca="center">
								<p>56.7130</p>
							</c>
							<c ca="center">
								<p>3.5749</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>A-U rate</p>
							</c>
							<c ca="center">
								<p>2.7662</p>
							</c>
							<c ca="center">
								<p>2.7764</p>
							</c>
							<c ca="center">
								<p>54.5047</p>
							</c>
							<c ca="center">
								<p>1.3329</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>C-G rate</p>
							</c>
							<c ca="center">
								<p>1.1295</p>
							</c>
							<c ca="center">
								<p>1.1087</p>
							</c>
							<c ca="center">
								<p>4.7757</p>
							</c>
							<c ca="center">
								<p>0.5330</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>C-U rate</p>
							</c>
							<c ca="center">
								<p>7.5166</p>
							</c>
							<c ca="center">
								<p>7.5910</p>
							</c>
							<c ca="center">
								<p>128.7054</p>
							</c>
							<c ca="center">
								<p>5.4729</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>G-U rate</p>
							</c>
							<c ca="center">
								<p>1.0000</p>
							</c>
							<c ca="center">
								<p>1.0000</p>
							</c>
							<c ca="center">
								<p>1.0000</p>
							</c>
							<c ca="center">
								<p>1.0000</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>Proportion of invariant sites (I)</p>
							</c>
							<c ca="center">
								<p>0.2693</p>
							</c>
							<c ca="center">
								<p>0.2549</p>
							</c>
							<c ca="center">
								<p>0.6637</p>
							</c>
							<c ca="center">
								<p>0.2881</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>&#915;-distribution shape parameter</p>
							</c>
							<c ca="center">
								<p>0.8357</p>
							</c>
							<c ca="center">
								<p>0.8601</p>
							</c>
							<c ca="center">
								<p>0.9055</p>
							</c>
							<c ca="center">
								<p>1.3298</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>Nucleotides in alignment</p>
							</c>
							<c ca="center">
								<p>9791</p>
							</c>
							<c ca="center">
								<p>9177</p>
							</c>
							<c ca="center">
								<p>300</p>
							</c>
							<c ca="center">
								<p>329</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>Conserved sites in alignment</p>
							</c>
							<c ca="center">
								<p>3473</p>
							</c>
							<c ca="center">
								<p>3028</p>
							</c>
							<c ca="center">
								<p>251</p>
							</c>
							<c ca="center">
								<p>223</p>
							</c>
						</r>
					</tblbdy>
				</tbl>
				<p>The 5' UTR is represented by the smallest number of aligned nucleotide sites (300 nt; the 5'-most 42 nt were excluded from analysis because of extensive gaps throughout the available sequence data), followed by the Okamoto region of NS5B (329 nt), then the polyprotein (9177 nt), and the whole genome (9791 nt, Table <tblr tid="T2">2</tblr>). The proportion of invariant nucleotide sites for the 5' UTR is 2/3, much higher than for the protein-coding regions, for which less than 1/3 of sites do not vary (Table <tblr tid="T2">2</tblr>). The 5' UTR is known to be less variable than protein-coding regions of HCV <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B6">6</abbr><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr></abbrgrp>.</p>
				<p>Tree topologies from the entire HCV genome and the polyprotein are identical (Figs. <figr fid="F1">1a, b</figr> and <figr fid="F2">2a, b</figr>). The tree from the Okamoto region of NS5B resembles trees from the whole genome and the polyprotein, except for rearrangements in the ordering of deeply rooted branches (Figs. <figr fid="F1">1d</figr> and <figr fid="F2">2d</figr>). Trees from sequences that include protein-coding regions clearly group subtypes from the same genotype into clades, while the tree from the non-coding terminus conflates subtypes of genotypes 1 and 6 with subtypes 4a and 5a, and subtypes of genotypes 1 and 6 cannot be distinguished (Figs. <figr fid="F1">1c</figr> and <figr fid="F2">2c</figr>). Thus, the phylogenetic trees of the 5' UTR are less able to group subtypes from the same genotype together into clades than trees from protein-coding sequences (Figs. <figr fid="F1">1</figr> and <figr fid="F2">2</figr>), regardless of the method used for phylogenetic inference. Parsimony analysis yields comparable results, with similar trees for the whole genome, polyprotein, and the Okamoto region of NS5B, while the tree from the 5' UTR contains a basal polytomy that does not resolve genotypes 1, 4, 5, or 6 (not shown).</p>
				<fig id="F1">
					<title>
						<p>Figure 1</p>
					</title>
					<caption>
						<p>Neighbor-joining phylogenies</p>
					</caption>
					<text>
						<p><b>Neighbor-joining phylogenies</b>. Unrooted neighbor-joining phylogenetic trees from (a) complete HCV genome, (b) polyprotein, (c) 5' UTR, and (d) the Okamoto region of NS5B. Due to our focus on the consistency of subtype classification and the relative branching topology among subtypes, each tree is scaled independently.</p>
					</text>
					<graphic file="1743-422X-3-103-1"/>
				</fig>
				<fig id="F2">
					<title>
						<p>Figure 2</p>
					</title>
					<caption>
						<p>Maximum-likelihood phylogenies</p>
					</caption>
					<text>
						<p><b>Maximum-likelihood phylogenies</b>. Unrooted maximum likelihood phylogenetic trees from (a) complete HCV genome, (b) polyprotein, (c) 5' UTR, and (d) the Okamoto region of NS5B. Taxon labels indicate HCV genotype and subtype from Table 1. Due to our focus on the consistency of subtype classification and the relative branching topology among subtypes, each tree is scaled independently.</p>
					</text>
					<graphic file="1743-422X-3-103-2"/>
				</fig>
			</sec>
			<sec>
				<st>
					<p>Hypothesis tests</p>
				</st>
				<p>Log-likelihood scores and SH-test results for alternative trees are summarized in Table <tblr tid="T3">3</tblr>. All tests yield the same outcomes, regardless of whether or not RELL optimization was used. Comparisons of alternative trees with the 5' UTR data fail to reject the null hypothesis of no difference in likelihoods (P > &#945;; see Methods). Comparisons among alternative trees with data from the Okamoto region of NS5B indicate that the 5' UTR tree has a significantly different likelihood (P &lt; 0.0001) than trees obtained from NS5B, polyprotein, or whole-genome data, which are statistically indistinguishable (P > &#945;). Comparing parsimony trees from 300-nt windows in NS5B with trees from the 5' UTR via the incongruence length difference test <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>, which uses the difference in tree lengths as a test statistic, rather than the likelihood difference, yielded the same pattern of significant differences (not shown).</p>
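				<p>To make the logic of the test concrete, the following is a minimal sketch of a RELL-style Shimodaira-Hasegawa test. It assumes that per-site log-likelihoods are already available for each candidate tree on one set of alignment columns. The function name, tree labels, and toy per-site scores are hypothetical, and the sketch is not the software or the settings used for Table 3. In the RELL variant the per-site scores are resampled without re-estimating model parameters, whereas full optimization re-estimates them for every replicate.</p>

```python
import random


def sh_test(site_lnl, replicates=1000, seed=0):
    """Return (delta, p_value) dictionaries keyed by tree name.

    site_lnl maps a tree name to its per-site log-likelihoods, all
    computed on the same alignment columns.
    """
    rng = random.Random(seed)
    names = list(site_lnl)
    n_sites = len(site_lnl[names[0]])
    total = {t: sum(site_lnl[t]) for t in names}
    best = max(total.values())
    delta = {t: best - total[t] for t in names}  # observed differences

    # RELL bootstrap: resample site indices and re-sum the fixed scores.
    boot = {t: [] for t in names}
    for _ in range(replicates):
        idx = [rng.randrange(n_sites) for _ in range(n_sites)]
        for t in names:
            boot[t].append(sum(site_lnl[t][i] for i in idx))

    # Centre each tree's replicates so they conform to the null hypothesis.
    centred = {}
    for t in names:
        mean = sum(boot[t]) / replicates
        centred[t] = [x - mean for x in boot[t]]

    # P = share of replicates whose difference reaches the observed one.
    p_value = {}
    for t in names:
        hits = 0
        for r in range(replicates):
            top = max(centred[u][r] for u in names)
            if top - centred[t][r] >= delta[t]:
                hits += 1
        p_value[t] = hits / replicates
    return delta, p_value


# Toy per-site scores (synthetic, for illustration only).
toy_rng = random.Random(1)
base = [toy_rng.uniform(-4.0, -1.0) for _ in range(300)]
toy = {
    "tree_A": base,
    "tree_B": [x - toy_rng.uniform(0.0, 0.02) for x in base],
    "tree_C": [x - toy_rng.uniform(0.0, 2.0) for x in base],
}
delta, p = sh_test(toy, replicates=500)
for name in toy:
    print(name, round(delta[name], 1), p[name])
```

				<p>The best-scoring tree has an observed difference of zero and is therefore never rejected; a tree is rejected only when its observed difference exceeds what resampling produces under the null hypothesis.</p>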
				<tbl id="T3">
					<title>
						<p>Table 3</p>
					</title>
					<caption>
						<p>Shimodaira-Hasegawa test results from 10,000 bootstrap replicates.</p>
					</caption>
					<tblbdy cols="5">
						<r>
							<c ca="center">
								<p>
									<b>Tree</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>-ln <it>L</it></b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>-ln &#916;</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>P<sub>RELL</sub></b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>P<sub>FULL</sub></b>
								</p>
							</c>
						</r>
						<r>
							<c cspan="5">
								<hr/>
							</c>
						</r>
						<r>
							<c cspan="5" ca="center">
								<p>
									<b>5' UTR sites</b>
								</p>
							</c>
						</r>
						<r>
							<c cspan="5">
								<hr/>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>5' UTR</p>
							</c>
							<c ca="center">
								<p>895</p>
							</c>
							<c ca="center">
								<p>0</p>
							</c>
							<c ca="center">
								<p>--</p>
							</c>
							<c ca="center">
								<p>--</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>Whole genome</p>
							</c>
							<c ca="center">
								<p>955</p>
							</c>
							<c ca="center">
								<p>61</p>
							</c>
							<c ca="center">
								<p>0.0225</p>
							</c>
							<c ca="center">
								<p>0.0153</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>Polyprotein</p>
							</c>
							<c ca="center">
								<p>956</p>
							</c>
							<c ca="center">
								<p>62</p>
							</c>
							<c ca="center">
								<p>0.0221</p>
							</c>
							<c ca="center">
								<p>0.0144</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>Okamoto region</p>
							</c>
							<c ca="center">
								<p>949</p>
							</c>
							<c ca="center">
								<p>54</p>
							</c>
							<c ca="center">
								<p>0.0323</p>
							</c>
							<c ca="center">
								<p>0.0215</p>
							</c>
						</r>
						<r>
							<c cspan="5">
								<hr/>
							</c>
						</r>
						<r>
							<c cspan="5" ca="center">
								<p>
									<b>Okamoto region sites</b>
								</p>
							</c>
						</r>
						<r>
							<c cspan="5">
								<hr/>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>Okamoto region</p>
							</c>
							<c ca="center">
								<p>5,226</p>
							</c>
							<c ca="center">
								<p>0</p>
							</c>
							<c ca="center">
								<p>--</p>
							</c>
							<c ca="center">
								<p>--</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>Whole genome</p>
							</c>
							<c ca="center">
								<p>5,256</p>
							</c>
							<c ca="center">
								<p>30</p>
							</c>
							<c ca="center">
								<p>0.2824</p>
							</c>
							<c ca="center">
								<p>0.2872</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>Polyprotein</p>
							</c>
							<c ca="center">
								<p>5,255</p>
							</c>
							<c ca="center">
								<p>29</p>
							</c>
							<c ca="center">
								<p>0.2981</p>
							</c>
							<c ca="center">
								<p>0.3050</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>5' UTR</p>
							</c>
							<c ca="center">
								<p>5,898</p>
							</c>
							<c ca="center">
								<p>672</p>
							</c>
							<c ca="center">
								<p>&lt; 0.0001</p>
							</c>
							<c ca="center">
								<p>&lt; 0.0001</p>
							</c>
						</r>
					</tblbdy>
				</tbl>
			</sec>
			<sec>
				<st>
					<p>Consistency and homoplasy indices</p>
				</st>
				<p>Increasing window sizes represent the consistency index (CI) as an increasingly smooth function, because larger windows approximate the whole-genome phylogeny better than smaller ones. However, increasing window size yields poorer resolution in the 5' UTR (Fig. <figr fid="F3">3a</figr>) because fewer windows are able to represent this region. Contrary to expectations, the rescaled homoplasy index is not constant. Despite large fluctuations within the 5' UTR, the rescaled homoplasy index is generally greater in the 5' UTR than in other regions of the HCV genome and particularly NS5B (Fig. <figr fid="F3">3b</figr>). After correcting for the substitution rate in this manner, the consistency of sites with the whole-genome phylogeny is lower in the 5' UTR than in NS5B.</p>
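				<p>The trade-off between smoothing and resolution can be sketched with a simple moving-window average. The function below is a hypothetical helper, not the authors' code, and it assumes one numeric score per alignment column. It also counts how many windows of each size used in Figure 3 fit entirely inside the 300-column 5' UTR alignment.</p>

```python
def window_means(values, width):
    """Mean of every full window, paired with its 1-based midpoint."""
    out = []
    running = sum(values[:width])
    last_start = len(values) - width
    for start in range(last_start + 1):  # empty when width exceeds the data
        if start > 0:
            # Slide by one column: add the new site, drop the oldest.
            running += values[start + width - 1] - values[start - 1]
        midpoint = start + (width + 1) / 2
        out.append((midpoint, running / width))
    return out


UTR_COLUMNS = 300  # 5' UTR alignment length from Table 2
dummy_scores = [1] * UTR_COLUMNS  # placeholder per-site scores

for width in (100, 300, 500):
    n_windows = len(window_means(dummy_scores, width))
    print(f"width {width}: {n_windows} windows lie entirely within the 5' UTR")
```

				<p>Larger windows average over more sites and so give smoother curves, but the number of windows contained wholly within a short region shrinks as the window grows, which is the loss of resolution noted for the 5' UTR.</p>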
				<fig id="F3">
					<title>
						<p>Figure 3</p>
					</title>
					<caption>
						<p>Consistency and homoplasy indices</p>
					</caption>
					<text>
						<p><b>Consistency and homoplasy indices</b>. Moving-window averages of (a) character consistency with the whole-genome phylogeny for windows of 100 (red), 300 (blue), or 500 (black) nucleotides and (b) proportion of informative sites (red) and rescaled homoplasy index (black) for windows of 100 nucleotides as a function of the window midpoint in the whole-genome alignment. Regions corresponding to the 5' UTR (left) and NS5B (right) are indicated with grey bands, with a white band in the middle of NS5B to indicate the 329 nt Okamoto region.</p>
					</text>
					<graphic file="1743-422X-3-103-3"/>
				</fig>
			</sec>
		</sec>
		<sec>
			<st>
				<p>Discussion</p>
			</st>
			<p>An earlier investigation of phylogenetic relations among 27 complete HCV genomes used maximum likelihood and careful determination of the appropriate nucleotide substitution model, and reported a star-like phylogeny among the six known HCV genotypes <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>. The best substitution model was also found to be the most general. In the earlier study, the 5' UTR was found to have lower phylogenetic signal, lower evolutionary rate, and greater phylogenetic noise than alternative regions of the HCV genome, including NS5B <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>. Our observations concur with those previously reported. Methodological refinements in our approach include the use of information-based model selection criteria to determine the best nucleotide substitution model, more complete HCV genomes, the revised nomenclature for subtypes <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>, and formal comparisons between alternative topologies for the purpose of subtype determination.</p>
			<p>The tree from the Okamoto region of NS5B is a significantly better fit to the HCV whole-genome and polyprotein data than the 5' UTR tree, regardless of the optimality criterion used for phylogenetic inference. Trees obtained from the 5' UTR perform worse at classifying HCV subtypes into clades of the same genotype than do trees from the whole genome, polyprotein, or the Okamoto region of NS5B. Discordant topologies of maximum-likelihood phylogenetic trees obtained from the 5' UTR and NS5B have been described for a subset of HCV genotypes <abbrgrp><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr></abbrgrp>. The inconsistent ordering of deeply rooted branches among trees from protein-coding regions indicates a basal polytomy whose resolution is contingent on the data available, which accords with the star-like phylogeny of all six known HCV genotypes reported previously <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B5">5</abbr><abbr bid="B12">12</abbr><abbr bid="B16">16</abbr></abbrgrp>.</p>
			<p>The same evolutionary model (GTR with a discrete-gamma distribution of rate variation) used here has been utilized previously for likelihood phylogenies of the hepatitis B virus <abbrgrp><abbr bid="B17">17</abbr></abbrgrp> and, with accommodation of invariant sites, for both HIV <abbrgrp><abbr bid="B18">18</abbr></abbrgrp> and HCV <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>. Instantaneous substitution rates (normalized to the G-U rate) are greater among sites in the non-coding 5' UTR than in the regions that encode proteins, despite the fact that overall sequence conservation is greater in the UTR (Table <tblr tid="T2">2</tblr>). In particular, the instantaneous substitution rate between cytidine and uridine is much greater for the 5' UTR than for protein-coding regions. The accelerated C-U (or C-T for DNA sequences) substitution rate has previously been reported and discussed for protein-coding regions <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>, though the rate is even greater for the non-coding terminus than for regions having codon usage constraints. Spontaneous deamination of cytosine to uracil may inflate the C-U substitution rate.</p>
			<p>Conservation of single-stranded RNA secondary structure in both coding and non-coding regions of HCV has already been reported <abbrgrp><abbr bid="B15">15</abbr><abbr bid="B20">20</abbr><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr><abbr bid="B23">23</abbr></abbrgrp>. The high C-U rate bias may additionally be explained by the formation of non-canonical base pairs between guanosine and uridine in single-stranded RNA molecules, which is consistent with selection to conserve secondary structure, because a mutation from cytosine to uridine is less disruptive to secondary structure formation than other point mutations <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>. The bias may also be explained by the fact that all rates are rescaled such that the G-U rate is unity. A low G-U substitution rate thus inflates other rates. A mutation between G and U is disruptive to RNA secondary structure, because it eliminates the possibility of bases pairing without a compensatory mutation elsewhere. Overall, the elevated C-U substitution rate seen for the 5' UTR probably results from several interacting factors.</p>
			<p>Though the same evolutionary model applies to the non-coding 5' UTR and the Okamoto region of NS5B, the two regions are subjected to different constraints. While coding sequences have codon-usage constraints and selective pressure for amino-acid mutations to escape detection by the host immune system, the UTR must preserve long-range interactions with complementary nucleotides at the other terminus of the viral genome if cyclization of the genome is essential to viral replication <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B20">20</abbr></abbrgrp>. Because of these differences in selective regimes, it should not be surprising that phylogenies of the two differ.</p>
			<p>HCV diagnostic technologies include serologic (antibody-based) and genetic (sequence-based) techniques to detect infected samples <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B6">6</abbr><abbr bid="B25">25</abbr></abbrgrp>. Population screens, the most commonly deployed genetic HCV tests, benefit from low false-positive rates because they target the conserved 5' UTR for PCR amplification. However, it is clear both from the results of this study and from previous investigations that the 5' UTR does not contain sufficient information to resolve subtypes <abbrgrp><abbr bid="B26">26</abbr><abbr bid="B27">27</abbr><abbr bid="B28">28</abbr><abbr bid="B29">29</abbr><abbr bid="B30">30</abbr><abbr bid="B31">31</abbr></abbrgrp>. Phylogenetic signal in protein-coding regions, such as NS5B, provides a useful alternative <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B32">32</abbr></abbrgrp>, but few commercial assays exploit this information at present. The "gold standard" for subtype determination is direct sequencing, which has a lower cost for reagents but requires more time than commercial assay kits <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B25">25</abbr></abbrgrp>.</p>
			<p>Subtype classification is further complicated by coinfection <abbrgrp><abbr bid="B30">30</abbr><abbr bid="B33">33</abbr><abbr bid="B34">34</abbr></abbrgrp>, recombination <abbrgrp><abbr bid="B35">35</abbr><abbr bid="B36">36</abbr></abbrgrp>, within-host evolution <abbrgrp><abbr bid="B37">37</abbr><abbr bid="B38">38</abbr></abbrgrp>, and compartmentalization of genotypes into different cell types <abbrgrp><abbr bid="B39">39</abbr></abbrgrp>. Diagnostic assays that are informed by the 5' UTR will be less able to accommodate these difficulties than methods that are able to resolve subtypes.</p>
		</sec>
		<sec>
			<st>
				<p>Conclusion</p>
			</st>
			<p>Ultimately, HCV infection outcome results from an interaction between the virus and its host. The current standard of care is limited in efficacy, and treatment outcome is contingent on viral genotype <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B6">6</abbr><abbr bid="B25">25</abbr><abbr bid="B34">34</abbr></abbrgrp>. To improve HCV therapies, perform effective public-health surveillance for new variants and modes of transmission, and further vaccine development efforts, detailed information about the interacting genotypes is needed. Diagnostic methods that assign viral subtype classifications are thus greatly desired. Such methods perform better when they are not informed by sequence variation from the non-coding 5' UTR, and should instead favor protein-coding regions, such as the Okamoto region of NS5B.</p>
		</sec>
		<sec>
			<st>
				<p>Methods</p>
			</st>
			<sec>
				<st>
					<p>Phylogenetic inference</p>
				</st>
				<p>We used multiple methods for phylogenetic inference, including neighbor joining (NJ), maximum parsimony (MP), and maximum likelihood (ML) <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr></abbrgrp>. This was done to evaluate whether the inferential technique has an influence on the ability of the resulting phylogenies to resolve subtypes into clades. We used PAUP*, version 4.0b10 <abbrgrp><abbr bid="B40">40</abbr></abbrgrp> for phylogenetic inference. Neighbor-joining trees were constructed with the F84 distance metric <abbrgrp><abbr bid="B41">41</abbr></abbrgrp> and the BioNJ algorithm <abbrgrp><abbr bid="B42">42</abbr></abbrgrp>. For parsimony analyses, uninformative invariant characters were excluded and gaps were treated as a fifth character state.</p>
				<p>To select an appropriate nucleotide substitution model, we used FindModel, an independent, online implementation of ModelTest <abbrgrp><abbr bid="B43">43</abbr></abbrgrp>. This approach uses an information-based goodness-of-fit criterion, in the sense that the best model minimizes the number of bits required to encode both the model and the model-encoded data for electronic transmission <abbrgrp><abbr bid="B44">44</abbr><abbr bid="B45">45</abbr><abbr bid="B46">46</abbr></abbrgrp>. Such an approach includes a penalty term for the number of parameters, and thus facilitates comparing models with varied numbers of parameters <abbrgrp><abbr bid="B44">44</abbr></abbrgrp>. The fit of each model to the data was evaluated both with and without a four-category discrete approximation to a gamma distribution of substitution rates per site. Because FindModel does not test models with invariant sites, we also used ModelTest (version 3.6) to evaluate nucleotide substitution models with invariant sites <abbrgrp><abbr bid="B43">43</abbr></abbrgrp>. Akaike's information criterion (AIC) was used to quantify the suitability of alternative models having varied numbers of parameters to fit the data <abbrgrp><abbr bid="B47">47</abbr></abbrgrp>.</p>
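				<p>As a hedged illustration of how AIC trades goodness of fit against the number of parameters, the following Python sketch ranks candidate models by AIC = 2<it>k</it> - 2 ln <it>L</it>. It is not the FindModel or ModelTest implementation; the function names, log-likelihoods, and parameter counts are hypothetical and were chosen only for illustration.</p>
				<p><![CDATA[
```python
def aic(log_likelihood, n_params):
    """Akaike's information criterion, 2k - 2 ln L (smaller is better)."""
    return 2.0 * n_params - 2.0 * log_likelihood


def rank_models(fits):
    """Sort (name, lnL, k) tuples by AIC; return (name, AIC, delta-AIC)."""
    scored = sorted((aic(lnl, k), name) for name, lnl, k in fits)
    best = scored[0][0]
    return [(name, score, score - best) for score, name in scored]


# Hypothetical log-likelihoods and free-parameter counts (illustration only).
# Branch lengths are left out of k here on the assumption that every
# candidate is scored on the same tree, so they add the same constant.
example_fits = [
    ("JC", -5230.0, 0),
    ("HKY+G", -5105.0, 5),
    ("GTR+G", -5090.0, 9),
]

for name, score, delta in rank_models(example_fits):
    print(f"{name:6s} AIC={score:.1f} dAIC={delta:.1f}")
```
]]></p>
				<p>In this sketch a richer model is preferred only if its gain in log-likelihood exceeds the penalty of one unit per additional parameter.</p>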
			</sec>
			<sec>
				<st>
					<p>Hypothesis tests</p>
				</st>
				<p>To evaluate the significance of differences in ML phylogenies obtained from different regions of the HCV genome, we used the Shimodaira-Hasegawa (SH) test <abbrgrp><abbr bid="B48">48</abbr></abbrgrp> as implemented in PAUP*, version 4.0b10 <abbrgrp><abbr bid="B40">40</abbr></abbrgrp>. The null hypothesis of the SH test is that none of the trees evaluated has a likelihood that differs significantly from any other. Rejecting the null hypothesis indicates a significant difference in likelihood scores, and thus in tree topologies <abbrgrp><abbr bid="B49">49</abbr></abbrgrp>.</p>
				<p>For a pair of trees defined a priori, the SH test computes the difference in their likelihoods (&#916;). This difference is compared with the null distribution of likelihood scores, obtained by building trees from character data generated by iterative bootstrap resampling with replacement of the nucleotide sites. A computationally efficient optimization (RELL) may be applied, which simply adds together per-site likelihoods over the resampled sites. Otherwise, the tree parameters are optimized on the resampled data (FULL). The resampled likelihood differences are denoted <m:math name="1743-422X-3-103-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:msup><m:mi>&#916;</m:mi><m:mo>&#8242;</m:mo></m:msup><m:mi>i</m:mi></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuqHuoargaqbamaaBaaaleaacqWGPbqAaeqaaaaa@2FA5@</m:annotation></m:semantics></m:math>, where <it>i </it>indexes the replicate, and they are subsequently transformed by subtracting the mean resampled difference &lt;&#916;'>, a procedure called centering. The original difference in likelihoods is compared with the null distribution in a one-tailed, non-parametric manner, whereby the rank of &#916; is evaluated against the centered, sorted &#916;' distribution. If the rank of &#916; is found to lie outside the interval of the null distribution between 0 and the (1-&#945;) &#215; 100 percentile, the difference in likelihoods is significant with (1-&#945;) &#215; 100% confidence, and the null hypothesis is rejected in favor of the alternative. (The acceptable type I, or false positive, error rate per test is denoted &#945;.)</p>
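				<p>The pairwise procedure described above can be sketched in Python, assuming that per-site log-likelihoods for both trees have already been computed on the same alignment. This is a simplified two-tree RELL illustration and not the PAUP* implementation; the function name <it>rell_sh_test</it>, its arguments, and the example values are hypothetical.</p>
				<p><![CDATA[
```python
import random


def rell_sh_test(site_lnl_a, site_lnl_b, n_reps=10000, seed=0):
    """One-tailed RELL comparison of two trees from per-site log-likelihoods.

    Returns (delta, p_value): delta is lnL(A) - lnL(B) on the original data,
    and p_value is the fraction of centered replicate differences >= delta.
    """
    if len(site_lnl_a) != len(site_lnl_b):
        raise ValueError("both trees need a log-likelihood for every site")
    diffs = [a - b for a, b in zip(site_lnl_a, site_lnl_b)]
    n_sites = len(diffs)
    delta = sum(diffs)

    rng = random.Random(seed)
    replicates = []
    for _ in range(n_reps):
        # RELL: resample sites with replacement and add up the stored
        # per-site values; no tree parameters are re-optimized.
        replicates.append(sum(rng.choices(diffs, k=n_sites)))

    # Centering: subtract the mean replicate difference.
    mean_rep = sum(replicates) / n_reps
    centered = [r - mean_rep for r in replicates]

    p_value = sum(1 for c in centered if c >= delta) / n_reps
    return delta, p_value


# Hypothetical per-site log-likelihoods for two trees (illustration only).
tree_a = [-2.1, -3.4, -1.9, -2.8, -3.0, -2.2]
tree_b = [-2.3, -3.3, -2.4, -2.9, -3.6, -2.1]
delta, p = rell_sh_test(tree_a, tree_b, n_reps=1000)
print(f"delta = {delta:.2f}, p = {p:.3f}")
```
]]></p>
				<p>Under these assumptions, the null hypothesis would be rejected when the returned p-value falls below the per-test &#945;. The FULL variant would instead re-optimize the tree parameters on every resampled data set, which this sketch does not attempt.</p>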
				<p>Here the tree topologies are ML phylogenies that represent different regions of the HCV genome. The reference alignment of 38 HCV whole-genome sequences representing 18 confirmed subtypes (Table <tblr tid="T1">1</tblr>) was obtained from the LANL HCV database <abbrgrp><abbr bid="B50">50</abbr></abbrgrp>. We conducted SH tests with data from the 5' UTR, the Okamoto region of NS5B, and whole genome. Topologies were paired such that the ML tree <m:math name="1743-422X-3-103-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>T</m:mi><m:mi>x</m:mi><m:mo>&#8727;</m:mo></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWGubavdaqhaaWcbaGaemiEaGhabaGaey4fIOcaaaaa@3072@</m:annotation></m:semantics></m:math> inferred from the data of region <it>x </it>(either the 5' UTR or Okamoto region) was compared with the ML tree <m:math name="1743-422X-3-103-i3" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>T</m:mi><m:mi>y</m:mi><m:mo>&#8727;</m:mo></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWGubavdaqhaaWcbaGaemyEaKhabaGaey4fIOcaaaaa@3074@</m:annotation></m:semantics></m:math> from data of region <it>y </it>representing each other region (either 5' UTR, Okamoto region, polyprotein, or whole genome, provided <it>y </it>&#8800; <it>x</it>), yielding the likelihood difference &#916; &#8801; <it>L</it><sub><it>x</it></sub>(<m:math name="1743-422X-3-103-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>T</m:mi><m:mi>x</m:mi><m:mo>&#8727;</m:mo></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWGubavdaqhaaWcbaGaemiEaGhabaGaey4fIOcaaaaa@3072@</m:annotation></m:semantics></m:math>) - <it>L</it><sub><it>x</it></sub>(<m:math name="1743-422X-3-103-i3" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>T</m:mi><m:mi>y</m:mi><m:mo>&#8727;</m:mo></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWGubavdaqhaaWcbaGaemyEaKhabaGaey4fIOcaaaaa@3074@</m:annotation></m:semantics></m:math>), where <it>L</it><sub><it>x</it></sub>(<m:math name="1743-422X-3-103-i3" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>T</m:mi><m:mi>y</m:mi><m:mo>&#8727;</m:mo></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWGubavdaqhaaWcbaGaemyEaKhabaGaey4fIOcaaaaa@3074@</m:annotation></m:semantics></m:math>) is the likelihood of the ML tree from region <it>y </it>evaluated with data from region <it>x</it>. We randomly resampled 10,000 replicate data sets for each pair of trees and compared the original difference in likelihoods with the null distribution that resulted. The type I error rate was reduced to accommodate six hypothesis tests (&#945; = 0.05/6 = 0.00833). This reduction preserves the experiment-wide false-positive rate by making each comparison more stringent.</p>
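				<p>The arithmetic of this correction can be written out in a few lines of Python. This is only a sketch of the standard Bonferroni adjustment; the function names and the p-values passed to them are hypothetical and are not results from this study.</p>
				<p><![CDATA[
```python
def bonferroni_alpha(family_alpha, n_tests):
    """Per-test type I error rate that keeps the experiment-wide rate."""
    if not n_tests > 0:
        raise ValueError("need at least one test")
    return family_alpha / n_tests


def reject_null(p_values, family_alpha=0.05):
    """True for each test whose p-value is below the corrected threshold."""
    per_test = bonferroni_alpha(family_alpha, len(p_values))
    return [per_test > p for p in p_values]


print(f"{bonferroni_alpha(0.05, 6):.5f}")  # prints 0.00833
```
]]></p>
				<p>With six tests, a comparison counts as significant in this sketch only if its p-value is below 0.05/6, which makes each individual test more stringent than an uncorrected test at 0.05.</p>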
			</sec>
			<sec>
				<st>
					<p>Consistency and homoplasy indices</p>
				</st>
				<p>To better understand phylogenetic inconsistencies across the HCV genome, we computed the character consistency index (CI) for each site in PAUP* with the whole-genome phylogeny, and summarized CI with a moving-window (running) average over 100, 300, and 500 nt. The 100 nt window size was used subsequently because it allows for clear visualization of the 342 nucleotides that constitute the 5' UTR. Because the consistency and homoplasy indices (HI) are complementary (CI+HI = 1), character consistency is high when homoplasy is low, and vice versa. Thus, we expect lower homoplasy to result from fewer informative sites. Further, homoplasy decreases rapidly with decreasing substitution rates. To control for variation in the number of informative sites across the genome, we rescaled the homoplasy index against the square of the proportion of informative sites in the window region. This was done because, in the limit of short branch lengths, the number of informative sites should be proportional to the substitution rate <it>r</it>, while the number of homoplasies should be proportional to <it>r</it><sup>2</sup>. The result was subsequently normalized against the maximum, to facilitate comparison with the proportion of informative sites. As a result, if all parts of the HCV genome are equally informative, one can expect the rescaled homoplasy index to be roughly constant over the viral genome.</p>
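				<p>One possible reading of this rescaling is sketched below in Python. It assumes that a per-site CI value and a flag marking parsimony-informative sites are already available, and that sites without a defined CI have been assigned CI = 1. The function names, the handling of windows that lack informative sites, and the toy input are assumptions made for illustration and may differ from the original analysis.</p>
				<p><![CDATA[
```python
def running_mean(values, window):
    """Mean of every full window of `window` consecutive values."""
    if window not in range(1, len(values) + 1):
        raise ValueError("window must be between 1 and the number of sites")
    return [
        sum(values[i:i + window]) / window
        for i in range(len(values) - window + 1)
    ]


def rescaled_homoplasy(site_ci, informative, window=100):
    """Windowed HI divided by the squared proportion of informative sites.

    Values are scaled so that the largest defined window equals 1; windows
    without informative sites are returned as None.
    """
    if len(site_ci) != len(informative):
        raise ValueError("need one CI value and one flag per site")
    mean_ci = running_mean(site_ci, window)
    prop_inf = running_mean([1.0 if f else 0.0 for f in informative], window)

    raw = []
    for ci, p in zip(mean_ci, prop_inf):
        hi = 1.0 - ci  # CI + HI = 1
        raw.append(hi / (p * p) if p > 0 else None)

    defined = [r for r in raw if r is not None]
    peak = max(defined, default=0.0)
    if peak == 0.0:
        return raw  # nothing to normalize against
    return [None if r is None else r / peak for r in raw]


# Hypothetical toy input: four sites, two of them informative.
toy_ci = [1.0, 0.5, 0.5, 1.0]
toy_informative = [False, True, True, False]
print(rescaled_homoplasy(toy_ci, toy_informative, window=2))
```
]]></p>
				<p>Dividing by the squared proportion reflects the expectation stated above that informative sites scale with <it>r</it> and homoplasies with <it>r</it><sup>2</sup>, so a region that is as informative as the rest of the genome should give a roughly flat profile.</p>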
			</sec>
		</sec>
		<sec>
			<st>
				<p>Competing interests</p>
			</st>
			<p>The authors declare that they have no competing interests.</p>
		</sec>
		<sec>
			<st>
				<p>Authors' contributions</p>
			</st>
			<p>All authors contributed equally to the conceptualization, experimental design, data analyses, and narrative presented herein.</p>
		</sec>
	</bdy>
	<bm>
		<ack>
			<sec>
				<st>
					<p>Acknowledgements</p>
				</st>
				<p>This work was supported by an NIH-DOE interagency agreement (Y1-A1-1500-04) and a LANL internal directed research grant for vaccine design. We thank T-10 and both the HCV and HIV database teams at LANL for sharing their resources and expertise, and particularly Bette Korber for helpful discussions. LA-UR 06-3473.</p>
			</sec>
		</ack>
		<refgrp>
			<bibl id="B1">
				<title>
					<p>Peginterferon alfa-2a plus ribavirin for chronic hepatitis C virus infection</p>
				</title>
				<aug>
					<au>
						<snm>Fried</snm>
						<fnm>MW</fnm>
					</au>
					<au>
						<snm>Shiffman</snm>
						<fnm>ML</fnm>
					</au>
					<au>
						<snm>Reddy</snm>
						<fnm>KR</fnm>
					</au>
					<au>
						<snm>Smith</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Marinos</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Goncales</snm>
						<fnm>FL</fnm>
					</au>
					<au>
						<snm>Haussinger</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Diago</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Carosi</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Dhumeaux</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Craxi</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Lin</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Hoffman</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Yu</snm>
						<fnm>J</fnm>
					</au>
				</aug>
				<source>N Engl J Med</source>
				<pubdate>2002</pubdate>
				<volume>347</volume>
				<issue>13</issue>
				<fpage>975</fpage>
				<lpage>982</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1056/NEJMoa020047</pubid>
						<pubid idtype="pmpid" link="fulltext">12324553</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B2">
				<title>
					<p>Peginterferon-alpha 2a and ribavirin combination therapy in chronic hepatitis C - A randomized study of treatment duration and ribavirin dose</p>
				</title>
				<aug>
					<au>
						<snm>Hadziyannis</snm>
						<fnm>SJ</fnm>
					</au>
					<au>
						<snm>Sette</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Morgan</snm>
						<fnm>TR</fnm>
					</au>
					<au>
						<snm>Balan</snm>
						<fnm>V</fnm>
					</au>
					<au>
						<snm>Diago</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Marcellin</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Ramadori</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Bodenheimer</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Bernstein</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Rizzetto</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Zeuzem</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Pockros</snm>
						<fnm>PJ</fnm>
					</au>
					<au>
						<snm>Lin</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Ackrill</snm>
						<fnm>AM</fnm>
					</au>
				</aug>
				<source>Ann Intern Med</source>
				<pubdate>2004</pubdate>
				<volume>140</volume>
				<issue>5</issue>
				<fpage>346</fpage>
				<lpage>355</lpage>
				<xrefbib>
					<pubid idtype="pmpid" link="fulltext">14996676</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B3">
				<title>
					<p>Genetic diversity and evolution of hepatitis C virus - 15 years on</p>
				</title>
				<aug>
					<au>
						<snm>Simmonds</snm>
						<fnm>P</fnm>
					</au>
				</aug>
				<source>J Gen Virol</source>
				<pubdate>2004</pubdate>
				<volume>85</volume>
				<fpage>3173</fpage>
				<lpage>3188</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1099/vir.0.80401-0</pubid>
						<pubid idtype="pmpid" link="fulltext">15483230</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B4">
				<title>
					<p>Molecular methods of hepatitis C genotyping</p>
				</title>
				<aug>
					<au>
						<snm>Weck</snm>
						<fnm>K</fnm>
					</au>
				</aug>
				<source>Expert Rev Mol Diagn</source>
				<pubdate>2005</pubdate>
				<volume>5</volume>
				<issue>4</issue>
				<fpage>507</fpage>
				<lpage>520</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1586/14737159.5.4.507</pubid>
						<pubid idtype="pmpid" link="fulltext">16013969</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B5">
				<title>
					<p>Consensus proposals for a unified system of nomenclature of hepatitis C virus genotypes</p>
				</title>
				<aug>
					<au>
						<snm>Simmonds</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Bukh</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Combet</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Del&#233;age</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Enomoto</snm>
						<fnm>N</fnm>
					</au>
					<au>
						<snm>Feinstone</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Halfon</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Inchausp&#233;</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Kuiken</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Maertens</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Mizokami</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Murphy</snm>
						<fnm>DG</fnm>
					</au>
					<au>
						<snm>Okamoto</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Pawlotsky</snm>
						<fnm>JM</fnm>
					</au>
					<au>
						<snm>Penin</snm>
						<fnm>F</fnm>
					</au>
					<au>
						<snm>Sablon</snm>
						<fnm>E</fnm>
					</au>
					<au>
						<snm>Shin-I</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Stuyver</snm>
						<fnm>LJ</fnm>
					</au>
					<au>
						<snm>Thiel</snm>
						<fnm>HJ</fnm>
					</au>
					<au>
						<snm>Viazov</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Weiner</snm>
						<fnm>AJ</fnm>
					</au>
					<au>
						<snm>Widell</snm>
						<fnm>A</fnm>
					</au>
				</aug>
				<source>Hepatology</source>
				<pubdate>2005</pubdate>
				<volume>42</volume>
				<issue>4</issue>
				<fpage>962</fpage>
				<lpage>973</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1002/hep.20819</pubid>
						<pubid idtype="pmpid" link="fulltext">16149085</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B6">
				<title>
					<p>Hepatitis C viruses</p>
				</title>
				<aug>
					<au>
						<snm>Major</snm>
						<fnm>ME</fnm>
					</au>
					<au>
						<snm>Rehermann</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>Feinstone</snm>
						<fnm>SM</fnm>
					</au>
				</aug>
				<source>Fields' Virology</source>
				<publisher>Philadelphia, Lippincott, Williams &amp; Wilkins</publisher>
				<editor>Knipe DM, Howley PM</editor>
				<edition>4th</edition>
				<pubdate>2001</pubdate>
				<fpage>1127</fpage>
				<lpage>1161</lpage>
			</bibl>
			<bibl id="B7">
				<title>
					<p>Los Alamos hepatitis C immunology database</p>
				</title>
				<aug>
					<au>
						<snm>Yusim</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Richardson</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Tao</snm>
						<fnm>N</fnm>
					</au>
					<au>
						<snm>Dalwani</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Agrawal</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Szinger</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Funkhouser</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Korber</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>Kuiken</snm>
						<fnm>C</fnm>
					</au>
				</aug>
				<source>Appl Bioinformatics</source>
				<pubdate>2005</pubdate>
				<volume>4</volume>
				<issue>4</issue>
				<fpage>217</fpage>
				<lpage>225</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.2165/00822942-200504040-00002</pubid>
						<pubid idtype="pmpid">16309340</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B8">
				<title>
					<p>Outcome of liver disease in a large cohort of histologically proven chronic hepatitis C: influence of HCV genotype</p>
				</title>
				<aug>
					<au>
						<snm>Roffi</snm>
						<fnm>L</fnm>
					</au>
					<au>
						<snm>Redaelli</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Colloredo</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Minola</snm>
						<fnm>E</fnm>
					</au>
					<au>
						<snm>Donada</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Picciotto</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Riboli</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Del Poggio</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Rinaldi</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Paris</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>Fornaciari</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Giusti</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Marin</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Morales</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Sangiovanni</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Belloni</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Pozzi</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Poli</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Mascoli</snm>
						<fnm>N</fnm>
					</au>
					<au>
						<snm>Corradi</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Pioltelli</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Scalori</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Mancia</snm>
						<fnm>G</fnm>
					</au>
				</aug>
				<source>Eur J Gastroenterol Hepatol</source>
				<pubdate>2001</pubdate>
				<volume>13</volume>
				<issue>5</issue>
				<fpage>501</fpage>
				<lpage>506</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1097/00042737-200105000-00007</pubid>
						<pubid idtype="pmpid" link="fulltext">11396528</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B9">
				<title>
					<p>Inferring Phylogenies</p>
				</title>
				<aug>
					<au>
						<snm>Felsenstein</snm>
						<fnm>J</fnm>
					</au>
				</aug>
				<publisher>Sunderland, MA, Sinauer Associates</publisher>
				<pubdate>2004</pubdate>
			</bibl>
			<bibl id="B10">
				<title>
					<p>Phylogenetic inference</p>
				</title>
				<aug>
					<au>
						<snm>Swofford</snm>
						<fnm>DL</fnm>
					</au>
					<au>
						<snm>Olsen</snm>
						<fnm>GJ</fnm>
					</au>
					<au>
						<snm>Waddell</snm>
						<fnm>PJ</fnm>
					</au>
					<au>
						<snm>Hillis</snm>
						<fnm>DM</fnm>
					</au>
				</aug>
				<source>Molecular Systematics</source>
				<publisher>Sunderland, MA, Sinauer Associates</publisher>
				<editor>Hillis DM, Moritz C, Mable BK</editor>
				<edition>2nd</edition>
				<pubdate>1996</pubdate>
				<fpage>407</fpage>
				<lpage>514</lpage>
			</bibl>
			<bibl id="B11">
				<title>
					<p>Variability of hepatitis C virus</p>
				</title>
				<aug>
					<au>
						<snm>Simmonds</snm>
						<fnm>P</fnm>
					</au>
				</aug>
				<source>Hepatology</source>
				<pubdate>1995</pubdate>
				<volume>21</volume>
				<issue>2</issue>
				<fpage>570</fpage>
				<lpage>583</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/0270-9139(95)90121-3</pubid>
						<pubid idtype="pmpid">7531173</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B12">
				<title>
					<p>Hepatitis C virus evolutionary patterns studied through analysis of full-genome sequences</p>
				</title>
				<aug>
					<au>
						<snm>Salemi</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Vandamme</snm>
						<fnm>AM</fnm>
					</au>
				</aug>
				<source>J Mol Evol</source>
				<pubdate>2002</pubdate>
				<volume>54</volume>
				<issue>1</issue>
				<fpage>62</fpage>
				<lpage>70</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1007/s00239-001-0018-9</pubid>
						<pubid idtype="pmpid" link="fulltext">11734899</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B13">
				<title>
					<p>Testing significance of incongruence</p>
				</title>
				<aug>
					<au>
						<snm>Farris</snm>
						<fnm>JS</fnm>
					</au>
					<au>
						<snm>K&#228;llersj&#246;</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Kluge</snm>
						<fnm>AG</fnm>
					</au>
					<au>
						<snm>Bult</snm>
						<fnm>C</fnm>
					</au>
				</aug>
				<source>Cladistics</source>
				<pubdate>1994</pubdate>
				<volume>10</volume>
				<fpage>315</fpage>
				<lpage>319</lpage>
				<xrefbib>
					<pubid idtype="doi">10.1111/j.1096-0031.1994.tb00181.x</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B14">
				<title>
					<p>Analysis of a new hepatitis C virus type and its phylogenetic relationship to existing variants</p>
				</title>
				<aug>
					<au>
						<snm>Chan</snm>
						<fnm>SW</fnm>
					</au>
					<au>
						<snm>McOmish</snm>
						<fnm>F</fnm>
					</au>
					<au>
						<snm>Holmes</snm>
						<fnm>EC</fnm>
					</au>
					<au>
						<snm>Dow</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>Peutherer</snm>
						<fnm>JF</fnm>
					</au>
					<au>
						<snm>Follett</snm>
						<fnm>E</fnm>
					</au>
					<au>
						<snm>Yap</snm>
						<fnm>PL</fnm>
					</au>
					<au>
						<snm>Simmonds</snm>
						<fnm>P</fnm>
					</au>
				</aug>
				<source>J Gen Virol</source>
				<pubdate>1992</pubdate>
				<volume>73</volume>
				<fpage>1131</fpage>
				<lpage>1141</lpage>
				<xrefbib>
					<pubid idtype="pmpid">1316939</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B15">
				<title>
					<p>Sequence variability in the 5' non-coding region of hepatitis C virus: identification of a new virus type and restrictions on sequence diversity</p>
				</title>
				<aug>
					<au>
						<snm>Simmonds</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>McOmish</snm>
						<fnm>F</fnm>
					</au>
					<au>
						<snm>Yap</snm>
						<fnm>PL</fnm>
					</au>
					<au>
						<snm>Chan</snm>
						<fnm>SW</fnm>
					</au>
					<au>
						<snm>Lin</snm>
						<fnm>CK</fnm>
					</au>
					<au>
						<snm>Dusheiko</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Saeed</snm>
						<fnm>AA</fnm>
					</au>
					<au>
						<snm>Holmes</snm>
						<fnm>EC</fnm>
					</au>
				</aug>
				<source>J Gen Virol</source>
				<pubdate>1993</pubdate>
				<volume>74</volume>
				<fpage>661</fpage>
				<lpage>668</lpage>
				<xrefbib>
					<pubid idtype="pmpid">8385694</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B16">
				<title>
					<p>The origin of hepatitis C virus genotypes</p>
				</title>
				<aug>
					<au>
						<snm>Smith</snm>
						<fnm>DB</fnm>
					</au>
					<au>
						<snm>Pathirana</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Davidson</snm>
						<fnm>F</fnm>
					</au>
					<au>
						<snm>Lawlor</snm>
						<fnm>E</fnm>
					</au>
					<au>
						<snm>Power</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Yap</snm>
						<fnm>PL</fnm>
					</au>
					<au>
						<snm>Simmonds</snm>
						<fnm>P</fnm>
					</au>
				</aug>
				<source>J Gen Virol</source>
				<pubdate>1997</pubdate>
				<volume>78</volume>
				<fpage>321</fpage>
				<lpage>328</lpage>
				<xrefbib>
					<pubid idtype="pmpid" link="fulltext">9018053</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B17">
				<title>
					<p>Molecular evolution of the hepatitis B virus genome</p>
				</title>
				<aug>
					<au>
						<snm>Yang</snm>
						<fnm>Z</fnm>
					</au>
					<au>
						<snm>Lauder</snm>
						<fnm>IJ</fnm>
					</au>
					<au>
						<snm>Lin</snm>
						<fnm>HJ</fnm>
					</au>
				</aug>
				<source>J Mol Evol</source>
				<pubdate>1995</pubdate>
				<volume>41</volume>
				<issue>5</issue>
				<fpage>587</fpage>
				<lpage>596</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1007/BF00175817</pubid>
						<pubid idtype="pmpid">7490773</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B18">
				<title>
					<p>Selecting models of nucleotide substitution: an application to human immunodeficiency virus 1 (HIV-1)</p>
				</title>
				<aug>
					<au>
						<snm>Posada</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Crandall</snm>
						<fnm>KA</fnm>
					</au>
				</aug>
				<source>Mol Biol Evol</source>
				<pubdate>2001</pubdate>
				<volume>18</volume>
				<issue>6</issue>
				<fpage>897</fpage>
				<lpage>906</lpage>
				<xrefbib>
					<pubid idtype="pmpid" link="fulltext">11371577</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B19">
				<title>
					<p>Characteristics of nucleotide substitution in the hepatitis C virus genome: Constraints on sequence change in coding regions at both ends of the genome</p>
				</title>
				<aug>
					<au>
						<snm>Smith</snm>
						<fnm>DB</fnm>
					</au>
					<au>
						<snm>Simmonds</snm>
						<fnm>P</fnm>
					</au>
				</aug>
				<source>J Mol Evol</source>
				<pubdate>1997</pubdate>
				<volume>45</volume>
				<issue>3</issue>
				<fpage>238</fpage>
				<lpage>246</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1007/PL00006226</pubid>
						<pubid idtype="pmpid" link="fulltext">9302317</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B20">
				<title>
					<p>Conserved RNA secondary structures in Flaviviridae genomes</p>
				</title>
				<aug>
					<au>
						<snm>Thurner</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Witwer</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Hofacker</snm>
						<fnm>IL</fnm>
					</au>
					<au>
						<snm>Stadler</snm>
						<fnm>PF</fnm>
					</au>
				</aug>
				<source>J Gen Virol</source>
				<pubdate>2004</pubdate>
				<volume>85</volume>
				<fpage>1113</fpage>
				<lpage>1124</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1099/vir.0.19462-0</pubid>
						<pubid idtype="pmpid" link="fulltext">15105528</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B21">
				<title>
					<p>Mutation Master: Profiles of substitutions in hepatitis C virus RNA of the core, alternate reading frame, and NS2 coding regions</p>
				</title>
				<aug>
					<au>
						<snm>Walewski</snm>
						<fnm>JL</fnm>
					</au>
					<au>
						<snm>Gutierrez</snm>
						<fnm>JA</fnm>
					</au>
					<au>
						<snm>Branch-Elliman</snm>
						<fnm>W</fnm>
					</au>
					<au>
						<snm>Stump</snm>
						<fnm>DD</fnm>
					</au>
					<au>
						<snm>Keller</snm>
						<fnm>TR</fnm>
					</au>
					<au>
						<snm>Rodriguez</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Benson</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Branch</snm>
						<fnm>AD</fnm>
					</au>
				</aug>
				<source>RNA</source>
				<pubdate>2002</pubdate>
				<volume>8</volume>
				<issue>5</issue>
				<fpage>557</fpage>
				<lpage>571</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">1370277</pubid>
						<pubid idtype="pmpid" link="fulltext">12022223</pubid>
						<pubid idtype="doi">10.1017/S1355838202029023</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B22">
				<title>
					<p>Thermodynamic and phylogenetic prediction of RNA secondary structures in the coding region of hepatitis C virus</p>
				</title>
				<aug>
					<au>
						<snm>Tuplin</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Wood</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Evans</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Patel</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Simmonds</snm>
						<fnm>P</fnm>
					</au>
				</aug>
				<source>RNA</source>
				<pubdate>2002</pubdate>
				<volume>8</volume>
				<issue>6</issue>
				<fpage>824</fpage>
				<lpage>841</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">1370300</pubid>
						<pubid idtype="pmpid" link="fulltext">12088154</pubid>
						<pubid idtype="doi">10.1017/S1355838202554066</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B23">
				<title>
					<p>Detection of genome-scale ordered RNA structure (GORS) in genomes of positive-stranded RNA viruses: implications for virus evolution and host persistence</p>
				</title>
				<aug>
					<au>
						<snm>Simmonds</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Tuplin</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Evans</snm>
						<fnm>DJ</fnm>
					</au>
				</aug>
				<source>RNA</source>
				<pubdate>2004</pubdate>
				<volume>10</volume>
				<issue>9</issue>
				<fpage>1337</fpage>
				<lpage>1351</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">1370621</pubid>
						<pubid idtype="pmpid" link="fulltext">15273323</pubid>
						<pubid idtype="doi">10.1261/rna.7640104</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B24">
				<title>
					<p>Global similarities in nucleotide base composition among disparate functional classes of single-stranded RNA imply adaptive evolutionary convergence</p>
				</title>
				<aug>
					<au>
						<snm>Schultes</snm>
						<fnm>E</fnm>
					</au>
					<au>
						<snm>Hraber</snm>
						<fnm>PT</fnm>
					</au>
					<au>
						<snm>LaBean</snm>
						<fnm>TH</fnm>
					</au>
				</aug>
				<source>RNA</source>
				<pubdate>1997</pubdate>
				<volume>3</volume>
				<issue>7</issue>
				<fpage>792</fpage>
				<lpage>806</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">1369525</pubid>
						<pubid idtype="pmpid" link="fulltext">9214661</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B25">
				<title>
					<p>Laboratory assays for diagnosis and management of hepatitis C virus infection</p>
				</title>
				<aug>
					<au>
						<snm>Richter</snm>
						<fnm>SS</fnm>
					</au>
				</aug>
				<source>J Clin Microbiol</source>
				<pubdate>2002</pubdate>
				<volume>40</volume>
				<issue>12</issue>
				<fpage>4407</fpage>
				<lpage>4412</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">154655</pubid>
						<pubid idtype="pmpid" link="fulltext">12454127</pubid>
						<pubid idtype="doi">10.1128/JCM.40.12.4407-4412.2002</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B26">
				<title>
					<p>Hepatitis C virus genotyping: interrogation of the 5' untranslated region cannot accurately distinguish genotypes 1a and 1b</p>
				</title>
				<aug>
					<au>
						<snm>Chen</snm>
						<fnm>Z</fnm>
					</au>
					<au>
						<snm>Weck</snm>
						<fnm>KE</fnm>
					</au>
				</aug>
				<source>J Clin Microbiol</source>
				<pubdate>2002</pubdate>
				<volume>40</volume>
				<issue>9</issue>
				<fpage>3127</fpage>
				<lpage>3134</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">130800</pubid>
						<pubid idtype="pmpid" link="fulltext">12202542</pubid>
						<pubid idtype="doi">10.1128/JCM.40.9.3127-3134.2002</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B27">
				<title>
					<p>Comparison of hepatitis C virus NS5b and 5' noncoding gene sequencing methods in a multicenter study</p>
				</title>
				<aug>
					<au>
						<snm>Laperche</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Lunel</snm>
						<fnm>F</fnm>
					</au>
					<au>
						<snm>Izopet</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Alain</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>D&#233;ny</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Duverlie</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Gaudy</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Pawlotsky</snm>
						<fnm>JM</fnm>
					</au>
					<au>
						<snm>Plantier</snm>
						<fnm>JC</fnm>
					</au>
					<au>
						<snm>Pozzetto</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>Thibault</snm>
						<fnm>V</fnm>
					</au>
					<au>
						<snm>Tosetti</snm>
						<fnm>F</fnm>
					</au>
					<au>
						<snm>Lefr&#232;re</snm>
						<fnm>JJ</fnm>
					</au>
				</aug>
				<source>J Clin Microbiol</source>
				<pubdate>2005</pubdate>
				<volume>43</volume>
				<issue>2</issue>
				<fpage>733</fpage>
				<lpage>739</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">548094</pubid>
						<pubid idtype="pmpid" link="fulltext">15695672</pubid>
						<pubid idtype="doi">10.1128/JCM.43.2.733-739.2005</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B28">
				<title>
					<p>Unique NS5b hepatitis C virus gene sequence consensus database is essential for standardization of genotype determinations in multicenter epidemiological studies</p>
				</title>
				<aug>
					<au>
						<snm>Laperche</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Saune</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>D&#233;ny</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Duverlie</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Alain</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Chaix</snm>
						<fnm>ML</fnm>
					</au>
					<au>
						<snm>Gaudy</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Lunel</snm>
						<fnm>F</fnm>
					</au>
					<au>
						<snm>Pawlotsky</snm>
						<fnm>JM</fnm>
					</au>
					<au>
						<snm>Payan</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Pozzetto</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>Tamalet</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Thibault</snm>
						<fnm>V</fnm>
					</au>
					<au>
						<snm>Vallet</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Bouchardeau</snm>
						<fnm>F</fnm>
					</au>
					<au>
						<snm>Izopet</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Lefr&#232;re</snm>
						<fnm>JJ</fnm>
					</au>
				</aug>
				<source>J Clin Microbiol</source>
				<pubdate>2006</pubdate>
				<volume>44</volume>
				<issue>2</issue>
				<fpage>614</fpage>
				<lpage>616</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">1392686</pubid>
						<pubid idtype="pmpid" link="fulltext">16455925</pubid>
						<pubid idtype="doi">10.1128/JCM.44.2.614-616.2006</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B29">
				<title>
					<p>Hepatitis C virus genotyping based on 5' noncoding sequence analysis (Trugene)</p>
				</title>
				<aug>
					<au>
						<snm>Halfon</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Trimoulet</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Bourliere</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Khiri</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>L&#233;dinghen</snm>
						<fnm>V</fnm>
					</au>
					<au>
						<snm>Couzigou</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Feryn</snm>
						<fnm>JM</fnm>
					</au>
					<au>
						<snm>Alcaraz</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Renou</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Fleury</snm>
						<fnm>HJA</fnm>
					</au>
					<au>
						<snm>Ouzan</snm>
						<fnm>D</fnm>
					</au>
				</aug>
				<source>J Clin Microbiol</source>
				<pubdate>2001</pubdate>
				<volume>39</volume>
				<issue>5</issue>
				<fpage>1771</fpage>
				<lpage>1773</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">88023</pubid>
						<pubid idtype="pmpid" link="fulltext">11325988</pubid>
						<pubid idtype="doi">10.1128/JCM.39.5.1771-1773.2001</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B30">
				<title>
					<p>Determining hepatitis C genotype by analyzing the sequence of the NS5b region</p>
				</title>
				<aug>
					<au>
						<snm>Sandres-Saun&#233;</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Deny</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Pasquier</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Thibaut</snm>
						<fnm>V</fnm>
					</au>
					<au>
						<snm>Duverlie</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Izopet</snm>
						<fnm>J</fnm>
					</au>
				</aug>
				<source>J Virol Methods</source>
				<pubdate>2003</pubdate>
				<volume>109</volume>
				<issue>2</issue>
				<fpage>187</fpage>
				<lpage>193</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/S0166-0934(03)00070-3</pubid>
						<pubid idtype="pmpid" link="fulltext">12711062</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B31">
				<title>
					<p>Comparison of hepatitis C virus genotyping by 5' noncoding region- and core-based reverse transcriptase PCR assay with sequencing and use of the assay for determining subtype distribution in India</p>
				</title>
				<aug>
					<au>
						<snm>Lole</snm>
						<fnm>KS</fnm>
					</au>
					<au>
						<snm>Jha</snm>
						<fnm>JA</fnm>
					</au>
					<au>
						<snm>Shrotri</snm>
						<fnm>SP</fnm>
					</au>
					<au>
						<snm>Tandon</snm>
						<fnm>BN</fnm>
					</au>
					<au>
						<snm>Prasad</snm>
						<fnm>VG</fnm>
					</au>
					<au>
						<snm>Arankalle</snm>
						<fnm>VA</fnm>
					</au>
				</aug>
				<source>J Clin Microbiol</source>
				<pubdate>2003</pubdate>
				<volume>41</volume>
				<issue>11</issue>
				<fpage>5240</fpage>
				<lpage>5244</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">262521</pubid>
						<pubid idtype="pmpid" link="fulltext">14605173</pubid>
						<pubid idtype="doi">10.1128/JCM.41.11.5240-5244.2003</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B32">
				<title>
					<p>Evaluation of complete genome sequences and sequences of individual gene products for the classification of hepatitis C viruses</p>
				</title>
				<aug>
					<au>
						<snm>Shukla</snm>
						<fnm>DD</fnm>
					</au>
					<au>
						<snm>Hoyne</snm>
						<fnm>PA</fnm>
					</au>
					<au>
						<snm>Ward</snm>
						<fnm>CW</fnm>
					</au>
				</aug>
				<source>Arch Virol</source>
				<pubdate>1995</pubdate>
				<volume>140</volume>
				<issue>10</issue>
				<fpage>1747</fpage>
				<lpage>1761</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1007/BF01384339</pubid>
						<pubid idtype="pmpid">7503676</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B33">
				<title>
					<p>Comparative study of three methods for genotyping hepatitis C virus strains in samples from Spanish patients</p>
				</title>
				<aug>
					<au>
						<snm>Forns</snm>
						<fnm>X</fnm>
					</au>
					<au>
						<snm>Maluenda</snm>
						<fnm>MD</fnm>
					</au>
					<au>
						<snm>Lopez-Labrador</snm>
						<fnm>FX</fnm>
					</au>
					<au>
						<snm>Ampurdanes</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Olmedo</snm>
						<fnm>E</fnm>
					</au>
					<au>
						<snm>Costa</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Simmonds</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Sanchez-Tapias</snm>
						<fnm>JM</fnm>
					</au>
					<au>
						<snm>Anta</snm>
						<fnm>MTJD</fnm>
					</au>
					<au>
						<snm>Rodes</snm>
						<fnm>J</fnm>
					</au>
				</aug>
				<source>J Clin Microbiol</source>
				<pubdate>1996</pubdate>
				<volume>34</volume>
				<issue>10</issue>
				<fpage>2516</fpage>
				<lpage>2521</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">229308</pubid>
						<pubid idtype="pmpid" link="fulltext">8880512</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B34">
				<title>
					<p>Hepatitis C virus infection</p>
				</title>
				<aug>
					<au>
						<snm>Lauer</snm>
						<fnm>GM</fnm>
					</au>
					<au>
						<snm>Walker</snm>
						<fnm>BD</fnm>
					</au>
				</aug>
				<source>N Engl J Med</source>
				<pubdate>2001</pubdate>
				<volume>345</volume>
				<issue>1</issue>
				<fpage>41</fpage>
				<lpage>52</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1056/NEJM200107053450107</pubid>
						<pubid idtype="pmpid" link="fulltext">11439948</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B35">
				<title>
					<p>Evidence of intratypic recombination in natural populations of hepatitis C virus</p>
				</title>
				<aug>
					<au>
						<snm>Colina</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Casane</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Vasquez</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Garc&#237;a-Aguirre</snm>
						<fnm>L</fnm>
					</au>
					<au>
						<snm>Chunga</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Romero</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Khan</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>Cristina</snm>
						<fnm>J</fnm>
					</au>
				</aug>
				<source>J Gen Virol</source>
				<pubdate>2004</pubdate>
				<volume>85</volume>
				<fpage>31</fpage>
				<lpage>37</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1099/vir.0.19472-0</pubid>
						<pubid idtype="pmpid" link="fulltext">14718617</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B36">
				<title>
					<p>Serendipitous identification of natural intergenotypic recombinants of hepatitis C in Ireland</p>
				</title>
				<aug>
					<au>
						<snm>Moreau</snm>
						<fnm>I</fnm>
					</au>
					<au>
						<snm>Hegarty</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Levis</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Sheehy</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Crosbie</snm>
						<fnm>O</fnm>
					</au>
					<au>
						<snm>Kenny-Walsh</snm>
						<fnm>E</fnm>
					</au>
					<au>
						<snm>Fanning</snm>
						<fnm>LJ</fnm>
					</au>
				</aug>
				<source>Virology J</source>
				<pubdate>2006</pubdate>
				<volume>3</volume>
				<fpage>95</fpage>
				<xrefbib>
					<pubid idtype="doi">10.1186/1743-422X-3-95</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B37">
				<title>
					<p>Sampling and repeatability in the evaluation of hepatitis C virus genetic variability</p>
				</title>
				<aug>
					<au>
						<snm>Torres-Puente</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Bracho</snm>
						<fnm>MA</fnm>
					</au>
					<au>
						<snm>Jimenez</snm>
						<fnm>N</fnm>
					</au>
					<au>
						<snm>Garcia-Robles</snm>
						<fnm>I</fnm>
					</au>
					<au>
						<snm>Moya</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Gonzalez-Candelas</snm>
						<fnm>F</fnm>
					</au>
				</aug>
				<source>J Gen Virol</source>
				<pubdate>2003</pubdate>
				<volume>84</volume>
				<fpage>2343</fpage>
				<lpage>2350</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1099/vir.0.19273-0</pubid>
						<pubid idtype="pmpid" link="fulltext">12917454</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B38">
				<title>
					<p>Intra-host evolutionary dynamics of hepatitis C virus E2 in treated patients</p>
				</title>
				<aug>
					<au>
						<snm>Alfonso</snm>
						<fnm>V</fnm>
					</au>
					<au>
						<snm>Mbayed</snm>
						<fnm>VA</fnm>
					</au>
					<au>
						<snm>Sookoian</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Campos</snm>
						<fnm>RH</fnm>
					</au>
				</aug>
				<source>J Gen Virol</source>
				<pubdate>2005</pubdate>
				<volume>86</volume>
				<fpage>2781</fpage>
				<lpage>2786</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1099/vir.0.81084-0</pubid>
						<pubid idtype="pmpid" link="fulltext">16186232</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B39">
				<title>
					<p>Compartmentalization of hepatitis C virus genotypes between plasma and peripheral blood mononuclear cells</p>
				</title>
				<aug>
					<au>
						<snm>Roque-Afonso</snm>
						<fnm>AM</fnm>
					</au>
					<au>
						<snm>Ducoulombier</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Di Liberto</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Kara</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Gigou</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Dussaix</snm>
						<fnm>E</fnm>
					</au>
					<au>
						<snm>Samuel</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Feray</snm>
						<fnm>C</fnm>
					</au>
				</aug>
				<source>J Virol</source>
				<pubdate>2005</pubdate>
				<volume>79</volume>
				<issue>10</issue>
				<fpage>6349</fpage>
				<lpage>6357</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">1091708</pubid>
						<pubid idtype="pmpid" link="fulltext">15858018</pubid>
						<pubid idtype="doi">10.1128/JVI.79.10.6349-6357.2005</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B40">
				<title>
					<p>PAUP*. Phylogenetic analysis using parsimony (* and other methods)</p>
				</title>
				<aug>
					<au>
						<snm>Swofford</snm>
						<fnm>DL</fnm>
					</au>
				</aug>
				<publisher>Sunderland, MA, Sinauer Associates</publisher>
				<edition>4th</edition>
				<pubdate>2002</pubdate>
			</bibl>
			<bibl id="B41">
				<title>
					<p>Distance methods for inferring phylogenies: a justification</p>
				</title>
				<aug>
					<au>
						<snm>Felsenstein</snm>
						<fnm>J</fnm>
					</au>
				</aug>
				<source>Evolution</source>
				<pubdate>1984</pubdate>
				<volume>38</volume>
				<fpage>16</fpage>
				<lpage>24</lpage>
				<xrefbib>
					<pubid idtype="doi">10.2307/2408542</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B42">
				<title>
					<p>BIONJ: an improved version of the NJ algorithm based on a simple model of sequence data</p>
				</title>
				<aug>
					<au>
						<snm>Gascuel</snm>
						<fnm>O</fnm>
					</au>
				</aug>
				<source>Mol Biol Evol</source>
				<pubdate>1997</pubdate>
				<volume>14</volume>
				<issue>7</issue>
				<fpage>685</fpage>
				<lpage>695</lpage>
				<xrefbib>
					<pubid idtype="pmpid" link="fulltext">9254330</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B43">
				<title>
					<p>ModelTest: testing the model of DNA substitution</p>
				</title>
				<aug>
					<au>
						<snm>Posada</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Crandall</snm>
						<fnm>KA</fnm>
					</au>
				</aug>
				<source>Bioinformatics</source>
				<pubdate>1998</pubdate>
				<volume>14</volume>
				<issue>9</issue>
				<fpage>817</fpage>
				<lpage>818</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/bioinformatics/14.9.817</pubid>
						<pubid idtype="pmpid" link="fulltext">9918953</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B44">
				<title>
					<p>Model selection and multimodel inference: a practical information-theoretic approach</p>
				</title>
				<aug>
					<au>
						<snm>Burnham</snm>
						<fnm>KP</fnm>
					</au>
					<au>
						<snm>Anderson</snm>
						<fnm>DR</fnm>
					</au>
				</aug>
				<publisher>New York, Springer-Verlag</publisher>
				<edition>2nd</edition>
				<pubdate>2002</pubdate>
			</bibl>
			<bibl id="B45">
				<title>
					<p>Selecting the best-fit model of nucleotide substitution</p>
				</title>
				<aug>
					<au>
						<snm>Posada</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Crandall</snm>
						<fnm>KA</fnm>
					</au>
				</aug>
				<source>Syst Biol</source>
				<pubdate>2001</pubdate>
				<volume>50</volume>
				<issue>4</issue>
				<fpage>580</fpage>
				<lpage>601</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1080/106351501750435121</pubid>
						<pubid idtype="pmpid">12116655</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B46">
				<title>
					<p>Model selection and the principle of minimum description length</p>
				</title>
				<aug>
					<au>
						<snm>Hansen</snm>
						<fnm>MH</fnm>
					</au>
					<au>
						<snm>Yu</snm>
						<fnm>B</fnm>
					</au>
				</aug>
				<source>J Am Stat Assoc</source>
				<pubdate>2001</pubdate>
				<volume>96</volume>
				<issue>454</issue>
				<fpage>746</fpage>
				<lpage>774</lpage>
				<xrefbib>
					<pubid idtype="doi">10.1198/016214501753168398</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B47">
				<title>
					<p>A new look at the statistical model identification</p>
				</title>
				<aug>
					<au>
						<snm>Akaike</snm>
						<fnm>H</fnm>
					</au>
				</aug>
				<source>IEEE Trans Automatic Control</source>
				<pubdate>1974</pubdate>
				<volume>19</volume>
				<issue>6</issue>
				<fpage>716</fpage>
				<lpage>723</lpage>
				<xrefbib>
					<pubid idtype="doi">10.1109/TAC.1974.1100705</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B48">
				<title>
					<p>Multiple comparisons of log-likelihoods with applications to phylogenetic inference</p>
				</title>
				<aug>
					<au>
						<snm>Shimodaira</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Hasegawa</snm>
						<fnm>M</fnm>
					</au>
				</aug>
				<source>Mol Biol Evol</source>
				<pubdate>1999</pubdate>
				<volume>16</volume>
				<issue>8</issue>
				<fpage>1114</fpage>
				<lpage>1116</lpage>
			</bibl>
			<bibl id="B49">
				<title>
					<p>Likelihood-based tests of topologies in phylogenetics</p>
				</title>
				<aug>
					<au>
						<snm>Goldman</snm>
						<fnm>N</fnm>
					</au>
					<au>
						<snm>Anderson</snm>
						<fnm>JP</fnm>
					</au>
					<au>
						<snm>Rodrigo</snm>
						<fnm>AG</fnm>
					</au>
				</aug>
				<source>Syst Biol</source>
				<pubdate>2000</pubdate>
				<volume>49</volume>
				<issue>4</issue>
				<fpage>652</fpage>
				<lpage>670</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1080/106351500750049752</pubid>
						<pubid idtype="pmpid">12116432</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B50">
				<title>
					<p>The Los Alamos hepatitis C sequence database</p>
				</title>
				<aug>
					<au>
						<snm>Kuiken</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Yusim</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Boykin</snm>
						<fnm>L</fnm>
					</au>
					<au>
						<snm>Richardson</snm>
						<fnm>R</fnm>
					</au>
				</aug>
				<source>Bioinformatics</source>
				<pubdate>2005</pubdate>
				<volume>21</volume>
				<issue>3</issue>
				<fpage>379</fpage>
				<lpage>384</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/bioinformatics/bth485</pubid>
						<pubid idtype="pmpid" link="fulltext">15377502</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
		</refgrp>
	</bm>
</art>
