Edited by Yoko Satta. Naruya Saitou: Corresponding author. Email: nsaitou@genes.nig.ac.jp. Takashi Kitano: Present address: Department of Experimental and Forensic Pathology, Yamagata University School of Medicine, Yamagata, 990-9585, Japan

Index
INTRODUCTION
MATERIALS AND METHODS
RESULTS
DISCUSSION
ELECTRONIC-DATABASE INFORMATION
References

INTRODUCTION

Comparison of human and other hominoids is definitely necessary to understand human evolution (e.g., see Saitou 2005). Chimpanzee chromosome 22, orthologous to human chromosome 21, have been fully sequenced recently for human-chimpanzee comparative genomics (The International Chimpanzee Chromosome 22 Consortium, 2004), however, we need outgroup species to infer human specific changes. There are such gene comparison studies of human and other hominoids (Chen and Li 2001; O’hUigin et al. 2002; Wildman et al. 2003; Kitano et al. 2004). Those studies mainly used protein-coding sequence data. Some studies (Chen and Li 2001; Yi et al. 2002) compared human and other hominoid non-coding sequence data. However, data from 5’ upstream non-coding regions are much less abundant than data from coding regions. 5’ upstream non-coding regions of genes have important role for controlling gene expression. Accumulation of analyses of these regions between human and other hominoids is important to understand human evolution.

The nucleotide substitutional difference between the human and chimpanzee genomes was estimated to be 1.23% based on a total of 19,813,086 bp BAC end sequ-ence comparison (Fujiyama et al. 2002), while the whole chromosome comparison between human chromosome 21 and chimpanzee chromosome 22 showed somewhat higher value, 1.44% (The International Chimpanzee Chromosome 22 Consortium, 2004). Many of those differences are probably caused by mutations occurred in so-called junk DNA (Ohno 1972), and have no effect on phenotypic difference between human and chimpanzee. Some proportion of nucleotide changes, however, must be responsible for human-specific characters, such as large brain size, bipedalism, etc. King and Wilson (1975) proposed that genetic changes at the gene expression control region are more important than changes in the protein-coding region.

We thus estimated the rate of nucleotide substitution of 5’ upstream gene region in hominoids in this study. We then compared it with corresponding synonymous substitution rates. These rates were also compared with those for mouse (Mus musculus) and rat (Rattus norvegicus). Finally, implication of human specific changes in the 5’ upstream region is discussed.


MATERIALS AND METHODS

We determined nine 5’ upstream gene regions (see Table 1) for human (Homo sapiens), chimpanzee (Pan troglodytes), gorilla (Gorilla gorilla), and orangutan (Pongo pygmaeus) in this study. Six genes (DDC, DMD, MAOA, NEF3, NGFB, and POU3F2) are brain and/or neuron related genes. Two alternative 5’ regions were sequenced for the DMD gene. The 5’ upstream regions for two non brain-neuron specific genes (INMT and PLP2) were also sequenced. INMT is known to be expressed in several human tissues such as adult thyroid, adult adrenal gland, adult lung, and fetal lung (Thompson et al. 1999), and PLP2 is expressed in colonic epithelial cell (Olivia et al. 1993).


View Details
Table 1.
List of genes whose 5’ upstream regions were compared


DNA was extracted from peripheral blood sample of Japanese individuals with informed consent for human samples. DNA samples for chimpanzee, gorilla and orangutan were also used, and those samples were the same as used by Kitano et al. (2004). We sequenced two human samples and two chimpanzee samples for each DMD-1, NGFB, and POU3F2 gene, and these two sequences were identical with each other (data not shown). Hot-start PCR was applied to each selected segment (PE GeneAmp PCR system 2400 and 9700). All PCR primers were designed based on human sequences in the database. A list of primers used in this study is available from TK. PCR products were confirmed by 1.5% agarose gel electrophoresis, and purified using Micro Spin Columns (Amersham Biosciences). The purified PCR products were sequenced by using BigDye Terminator Cycle Sequencing Kit and ABI PRISM 377/310 DNA sequencer (PE Biosystems). Both strands were read using PCR primers and inner primers when long sequences were read. Overlapping two peaks on a site was regarded as heterozygous and the site was designated by following Nomenclature Committee of the International Union of Biochemistry (NC-IUB).

CLUSTAL W ver.1.8 (Thompson et al. 1994) was used for multiple alignments. Aligned regions without gaps were used for following analyses. Numbers of nucleotide substitutions per site (d) were estimated by using one-parameter method (Jukes and Cantor 1969). Numbers of synonymous substitutions per synonymous site were estimated by using Nei and Gojobori’s (1986) method.

Fig. 1 shows 5’ upstream regions of eight genes whose sequences were determined in this study. Because the first exons of two genes (DDC and NGFB) were non-coding exons, start codons were not shown in the figure. Both alternative non-coding exons of DMD were also sequenced. DMD-1 and DMD-2 correspond to the brain type (Boyce et al. 1991) and the Purkinje cell type (Holder et al. 1996), respectively. For simplicity, we used terms ‘DMD-1’ and ‘DMD-2’ in this paper.


View Details
Fig. 1.
Genome structures of the nine sequenced regions. Coding and non-coding exons are shown by close and open boxes, respectively. Thick bars show sequenced regions. 5’ upstream region of MAOA overlaps with 3’ region of LOC352786 indicated by a grey box.



RESULTS

We determined nine 5’ upstream regions for four hominoid species (human, chimpanzee, gorilla, and orangutan), and their DDBJ/EMBL/GenBank International Nucleotide Sequence Database accession numbers are listed in Table 2. A total of 4157 nucleotide sites could be compared, and 108 variant sites were detected. Variant nucleotide sites are shown in Table 3. Two sites (120 of NEF3 and 218 of NGFB) had three character states. Therefore, a total of 110 changes are necessary to explain these variant sites.


View Details
Table 2.
List of DDBJ/EMBL/GenBank accession numbers* and NCBI gene numbers whose nucle-otide sequences were compared





View Details
Table 3.
Variant nucleotide sites of nine 5’ upstream regions of human, chimpanzee, gorilla, and orangutan


Because numbers of nucleotide substitutions on each branch were low, we added nucleotide substitutions occurred on all branches of Fig. 2. The total number of nucleotide substitutions among human, chimpanzee, gorilla, and orangutan (d) were estimated from the total number of nucleotide changes (m) among human, chimpanzee, gorilla, and orangutan. The total d for hominoids was 0.027 ± 0.003. Divergence times of human, chimpanzee, gorilla, and orangutan are shown in Fig. 2 following Chen and Li (2001). By adding divergence times of all branches, the total time was 34.8–46.6 MY (million years), and this value was used to estimate rates of nucleotide substitution. The total rate ranged from 0.52 × 10–9 to 0.85 × 10–9. There were no specific rate differences between brain-neuron expressed genes (DDC, DMD-1, DMD-2, MAOA, NEF3, NGFB, and POU3F2) and the remaining genes (INMT and PLP2).


View Details
Fig. 2.
A species tree of human, chimpanzee, gorilla, and orangutan. Numbers of each branch indicate nucleotide changes. Divergence times for three speciation events are shown on the time scale bar. Ranges of divergence times (MY, million years) are shown in parentheses.


We used BLAST homology search system to retrieve homologous sequences from rat and mouse from the genome database. However, we could retrieve mouse and rat counterparts only for DMD-1, DMD-2, and POU3F2. Because the regions around translation and/or transcription start sites of those six remaining sequences had homology between human and mouse-rat, we retrieved same length of sequences with hominoids from regions just upstream of these homologous regions, and used them as possible mouse and rat counterpart sequences (see Table 2 for list of sequence IDs used for comparison). The total d between mouse and rat was 0.100 ± 0.005 (Table 4). Since the divergence time bet-ween mouse and rat is a matter of argument (Wilson et al. 1977; Jacobs. and Pilbeam 1980), we used two kinds of divergence times between mouse ,and rat to estimate rates of nucleotide substitutions; 40.7 MYA (million years ago) (Kumar and Hedges 1998) and 30 MYA (Kitano et al. 1999). The total rate ranged from 1.17 × 10–9 to 1.76 × 10–9. The total rate of nucleotide substitution in hominoids was lower than that in rodents.


View Details
Table 4.
Nucleotide differences and substitution rate for 5’ upstream regions of nine protein coding genes for Hominoids and Rodents


Fig. 3 shows relationship of d between hominoids and rodents for each region. Hominoid d and rodent d are positively correlated (correlation coefficient is 0.63). This is mainly because three regions (DMD-1, DMD-2, and POU3F2; shown in gray circle in Fig. 3) where primates-rodents homology was found low d both in hominoids and mouse/rat, while the remaining six regions, in which no clear homology was found between hominoids and mouse/rat, showed higher d both in hominoids and mouse/rat. This suggests that relative intensity of functional constraints had been more or less constant through the mammalian evolution. One 5’ upstream region for hominoids, INMT, showed the evolutionary distance larger than that expected under pure neutral evolution (see Fig. 3). This suggests that the 5’ upstream region of this gene reduced its importance in the primate lineage.


View Details
Fig. 3.
Comparison of evolutionary distances for ten 5’ upst-ream regions between rodents and hominoids, based on data shown in Table 4. Three regions designated with gray circles are those with asterisks in Table 4. Two broken lines represent evolutionary distances for rodents (horizontal) and for hominoids (vertical) under the pure neutral evolution. The genomic divergence between mouse and rat (Table 6) was used as the pure neutral evolutionary distance for rodents. The value of genomic divergence between human and chimpanzee (0.012) multiplied by 3.77 was used as the pure neutral evolutionary distance for hominoids, because the total time of all branches (34.8–46.6 MY, mean 40.7 MY) was 3.77 times higher than the time between human and chimpanzee (10.8 MY = 5.4 × 2). The diagonal line represents equal selective constraints between rodents and hominoids. This line is through the origin and the intersection of the two broken lines.



DISCUSSION

The average rate of nucleotide substitution for human and chimpanzee non-coding DNA has been reported to be 0.99 × 10–9 (Yi et al. 2002) or 0.86 × 10–9 (Wildman et al. 2003). The total rate of the 5’ upstream region of hominoids (from 0.52 × 10–9 to 0.85 × 10–9) estimated by this study was lower than those of previous results. We estimated the rate of nucleotide substitution by using human, chimpanzee, gorilla, and orangutan data sets. When we estimate rates of nucleotide substitution by using only human and chimpanzee, the overall rate for human and chimpanzee ranged from 0.41 × 10–9 to 0.82 × 10–9 (m = 26, d = 0.006 ± 0.001). Divergence time (4.6–6.2 MYA, see Fig. 2) between human and chimpanzee was used estimate the rate of nucleotide substitutions. The overall rate of human and chimpanzee was still lower than those of previous reports (Yi et al. 2002; Wildman et al. 2003).

Rates of nucleotide substitution of various regions were summarized as nonsynonymous (0.85 × 10–9), synonymous (4.61 × 10–9), 5’ UTR (1.96 × 10–9), 3’ UTR (2.10 × 10–9), introns (3.14 × 10–9) and pseudogenes (3.52 × 10–9) in various mammalian genes (Li and Graur 1991). It is shown that the rate of nucleotide substitution in 5’ upstream regions is lower than those in other regions except for nonsynonymous sites. Studies of divergence between human and chimpanzee (Chen et al. 2001; Shi et al. 2003) also showed similar tendency. 5’ upstream regions do not code gene products, but these regions probably have functional roles to gene expression. Our result showed lower rate of nucleotide substitution in 5’ upstream regions. This suggests that 5’ upstream regions have evolved under strong functional constraints.

It has been reported that rates of coding region in hominoids are lower than those of rodents (Wu and Li 1985; Gu and Li 1992; Kitano et al. 1999). In this study, we showed that 5’ upstream regions also have lower rates of substitution in hominoids than in rodents. To assess whether the lower rate of substitution in hominoids than in rodents is due to generation time effects or stronger constraint in hominoids, we estimated rate of synonymous substitutions for the same genes. For hominoid comparisons, we used distances between human and chimpanzee, because data for other hominoid species were not available. Table 5 shows the comparison of synonymous substitutions for hominoids and rodents (mouse vs. rat). We then summarized distance differences between hominoids and rodents as shown in Table 6. In all three comparisons (5’ upstream region distance, synonymous distance, and genomic divergence), the evolutionary distance between mouse and rat is 8–17 times higher than that between human and chimpanzee. The ratio of the species divergence time between the two species pair, t(M-R)/t(H-C), should be at most 7 times, if there is no difference in evolutionary rate in primates and rodents. This indicates that somehow the evolutionary rate has been higher in rodents than in primates.


View Details
Table 5.
Nucleotide differences and substitution rate for synonymous sites of five protein coding genes for Hominoids (human vs. chimpanzee) and seven genes for Rodents (mouse vs. rat)





View Details
Table 6.
Comparison of nucleotide differences of human-chimpanzee and mouse-rat


One good characteristic of simultaneous sequence comparison of closely related species is clear deduction of species-specific changes. We thus extracted eleven changes out of total 110 changes as human lineage specific changes (see Table 3). We searched putative transcription factor binding sites by using TESS (Transcription Element Search System) under anticipation that even one nucleotide substitution at the human lineage might caused a significant phenotypic change through change of gene expression pattern. Only five possible transcription elements differences between human and other hominoids were observed, as listed in Table 7. Those site differences may affect gene expression differences between human and other hominoids, however, we need further experiments to confirm them.


View Details
Table 7.
Differences between human and other hominoid on possible transcription factor binding sites


We are grateful for R. E. Ferrell for providing us chimpanzee DNA samples and for S. Ueda for providing us gorilla and orangutan DNA samples. We thank H. Kobayakawa for technical assistance. This study was supported by a grant in aid for scientific studies from Ministry of Education, Science, Sport, and Culture, Japan to NS, Joint Research Project (Soken/K99-1) of The Graduate University for Advanced Studies, Japan to NS. TK was supported by COE research fellowship of the National Institute of Genetics and JSPS postdoctoral fellowship.


ELECTRONIC-DATABASE INFORMATION

NCBI Home Page, http://www.ncbi.nlm.nih.gov/

Project Silver Home Page, http://sayer.lab.nig.ac.jp/~silver/index.html

TESS, http://www.cbil.upenn.edu/tess/


References
Boyce, F. M. Beggs, A. H., Feener, C., and Kunkel, L. M. (1991) Dystrophin is transcribed in brain from a distant upstream promoter. Proc. Natl. Acad. Sci. USA 88, 1276–1280.
Chen, F.-C. and Li. W.-H. (2001) Genomic divergence between humans and other hominoids and the effective population size of the common ancestor of humans and chimpanzees. Am. J. Hum. Genet. 68, 444–456.
Chen, F.-C., Vallender, E. J., Wang, H., Tzeng, C. S., and Li, W.-H. (2001) Genomic divergence between human and chimpanzee estimated from large-scale alignments of genomic sequences. J. Hered. 92, 481–489.
Fujiyama, A., Watanabe, H., Toyoda, A., Taylor, T. D., Itoh, T., Tsai, S. F., Park, H. S., Yaspo, M. L., Lehrach, H., Chen, Z., Fu, G., Saitou, N., Osoegawa, K., de Jong, P. J., Suto, Y., Hattori, M., and Sakaki, Y. (2002) Construction and analysis of a human-chimpanzee comparative clone map. Science 295, 131–134.
Gu, X., and Li, W.-H. (1992) Higher rates of amino acid substitution in rodents than in humans. Mol. Phylogenet. Evol. 1, 211–214.
Holder, E., Maeda, M., and Bies, R. D. (1996) Expression and regulation of the dystrophin Purkinje promoter in human skeletal muscle, heart, and brain. Hum. Genet. 97, 232–239.
Jacobs, L. L. and Pilbeam, D. (1980) Of mice and men: fossil-based divergence dates and molecular “clocks”. J. Hum. Evol. 9, 551–555.
Jukes, T. H. and Cantor, C. R. (1969) Evolution of protein molecules. In Mammalian protein metabolism (H. N. Munro, ed.), pp. 21–132. Academic, New York.
King, M. C., and Wilson, A. C. (1975) Evolution at two levels in humans and chimpanzees. Science 188, 107–116.
Kitano, T., Liu, Y.-H., Ueda, S., and Saitou, N. (2004) Human specific amino acid changes found in 103 protein coding genes. Mol. Biol. Evol. 21, 936–944.
Kitano, T., Oota, S., and Saitou, N. (1999) Molecular evolutionary analyses of the Rh blood group genes and Rh50 genes in mammals. Zoological Studies 38, 379–386.
Kumar, S., and Hedges, S. B. (1998) A molecular timescale for vertebrate evolution. Nature 392, 917–920.
Li, W.-H. and Graur, D. (1991) Fundamentals of Molecular Evolution. Sinauer Associates, Inc., Sunderland, Mass.
Nei, M., and T. Gojobori. (1986) Simple methods for estimating the numbers of synonymous and nonsynonymous nucleotide substitutions. Mol. Biol. Evol. 3, 418–426.
Ohno, S. (1972) So much “junk” DNA in our genome. Brookhaven Symp. Biol. 23, 366–370.
O'hUigin, C., Satta, Y., Takahata, N., and Klein, J. (2002) Contribution of homoplasy and of ancestral polymorphism to the evolution of genes in anthropoid primates. Mol. Biol. Evol. 19, 1501–1513.
Oliva, M. M., Wu, T. C., and Yang, V. W. (1993) Isolation and characterization of a differentiation-dependent gene in the human colonic cell line HT29-18. Arch. Biochem. Biophys. 302, 183–192.
Saitou, N. (2005) Evolution of hominoids and the search for a genetic basis for creating humanness. Cyto. Genome Res. 108, 16–21.
Shi, J., Xi, H., Wang, Y., Zhang, C., Jiang, Z., Zhang, K., Shen, Y., Jin, L., Zhang, K., Yuan, W., Wang, Y., Lin, J., Hua, Q., Wang, F., Xu, S., Ren, S., Xu, S., Zhao, G., Chen, Z., Jin, L., Huang, W. (2003) Divergence of the genes on human chromosome 21 between human and other hominoids and variation of substitution rates among transcription units. Proc. Natl. Acad. Sci. USA 100, 8331–8336.
Thompson, J. D., Gibson, T. J., and Higgins, D. G. (1994) CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, positions-specific gap penalties and weight matrix choice. Nuc. Acids Res. 22, 4673–4680.
Thompson, M. A., Moon, E., Kim, U.-J., Xu, J., Siciliano, M. J., and Weinshilboum, R. M. (1999) Human indolethylamine N-methyltransferase: cDNA cloning and expression, gene cloning, and chromosomal localization. Genomics 61, 285–297.
The International Chimpanzee Chromosome 22 Consortium [Watanabe, H., Fujiyama, A., Hattori, M., Taylor, T. D., Toyoda, A., Kuroki, Y., Noguchi, H. BenKahla, A., Lehrach, H., Sudbrak, R., Kube, M., Taenzer, S., Galgoczy, M.Platzer, P., Scharfe, M., Nordsiek, G., Blocker, H., Hellmann, I., Khaitovich, P., Paabo, S., Reinhardt, R., Zheng, H.-J., Zhang, X.-L., Zhu, G.-F., Wang, B.-F., Fu, G., Ren, S.-X., Zhao, G.-P., Chen, Z., Lee, Y.-S., Cheong, J.-E., Choi, S.-H., Wu, K.-M., Liu, T.-T., Hsiao, K.-J., Tsai, S.-F., Kim, C.-G., Oota, S., Kitano, T., Kohara, Y., Saitou, N., Park, H.-S., Wang, S.-Y., Yaspo, M.-L., and Sakaki Y.] (2004) DNA sequence and comparative analysis of chimpanzee chromosome 22. Nature 429, 382–388.
Wildman, D. E., Uddin, M., Liu, G., Grossman, L. I., and Goodman, M. (2003) Implications of natural selection in shaping 99.4% nonsynonymous DNA identity between humans and chimpanzees: Enlarging genus Homo. Proc. Natl. Acad. Sci. USA 100, 7181–7188.
Wilson, A. C., Carlson, S. S., and White, T. J. (1977) Biochemical evolution. Annu. Rev. Biochem. 6, 573–639.
Wu, C.-I. and Li, W.-H. (1985) Evidence for higher rates of nucleotide substitution in rodents than in man. Proc. Natl. Acad. Sci. USA 82, 1741–1745.
Yi, S., Ellsworth, D. L., and Li, W.-H. (2002) Slow molecular clocks in Old World monkeys, apes, and humans. Mol. Biol. Evol. 19, 2191–2198.