2019 Volume 94 Issue 4 Pages 159-165
Gene-centromere (G-C) mapping provides insight into vertebrate genome composition, structure and evolution. Although medaka fish are important experimental animals, no genome-wide G-C map of medaka has been constructed. In this study, we used 112 interspecific triploid hybrids and 152 DNA markers to make G-C maps of all 24 linkage groups (LGs). Under the assumption of 50% interference, 24 centromeres were localized onto all corresponding medaka LGs. Comparison with 21 centromere positions deduced from putative centromeric repeats revealed that 19 were localized inside the centromeric regions of the G-C maps, whereas two were not. Based on the centromere positions indicated in the G-C maps and those of centromeric repeats on each LG, we classified chromosomes as either biarmed or monoarmed; n = 24 = 10 metacentrics/submetacentrics + 14 subtelocentrics/acrocentrics, which is consistent with the results of previous karyological reports. This study helps to elucidate genome evolution mechanisms, and integrates physical and genetic maps with karyological information of medaka.
The identification of centromere positions is essential for the integration of cytogenetic and genetic linkage maps and is also an initial step toward understanding the composition and structure of the centromeric region as well as the whole genome. Mainly due to the lack of well-defined genetic linkage maps using co-dominant markers, centromeres have been located only in very limited fish species so far. Most previous gene-centromere (G-C) map studies on fish species were based on allozyme markers and conducted decades ago. With the advantages and popularity of co-dominant DNA markers, G-C maps have recently been reported for many fish species. Zebrafish, Danio rerio, was the first fish for which all 25 centromeres were localized on genetic linkage maps (Johnson et al., 1996), followed by rainbow trout, Oncorhynchus mykiss (Sakamoto et al., 2000), loach, Misgurnus anguillicaudatus (Morishima et al., 2001), Japanese eel, Anguilla japonica (Nomura et al., 2006), turbot, Scophthalmus maximus (Martínez et al., 2008), large yellow croaker, Pseudosciaena crocea (Li et al., 2008), half-smooth tongue sole, Cynoglossus semilaevis (Ji et al., 2009), walking catfish, Clarias macrocephalus (Poompuang and Sukkorntong, 2011) and bighead carp, Hypophthalmichthys nobilis (Zhu et al., 2013).
Medaka fish (Oryzias latipes, O. sakaizumii and O. sinensis) represent one of the most important experimental animals worldwide, and have been used in fields such as genetics and developmental biology. The first medaka G-C map was made for linkage group (LG) 1, which is the sex LG (Sato et al., 2001). The distance between the marker and centromere is very important for estimating centromeric regions: the shorter the distance, the more accurate the centromeric regions. In this study, we aimed to localize centromeres onto all 24 LGs. The information obtained from this G-C map will be useful for understanding the genome structure and chromosome evolution of this species group.
All fish strains used in this study were supplied by the Faculty of Science, Niigata University, a subcenter of the National BioResource Project (medaka) supported by the Ministry of Education, Culture, Sports, Science and Technology of Japan, and were reared in 150 × 250 × 105-mm plastic vessels with 2 l of water at 25 ℃. Oryzias luzonensis was originally obtained from Luzon (Formacion and Uwa, 1985), and two inbred strains of medaka, HNI-II and Hd-rR (derived from O. sakaizumii and O. latipes, respectively), were described by Hyodo-Taguchi and Sakaizumi (1993).
CrossesHd-rR strain females were mated with HNI-II males, which resulted in F1 progeny (DNF1). DNF1 females and O. luzonensis males were isolated the day before they were mated. Fertilized eggs were immediately collected after spawning; after 2 min and 45 sec, they were exposed to 42 ℃ water for 2 min to block the second meiotic division (Naruse et al., 1985). Allodiploid embryos of O. latipes/sakaizumii and O. luzonensis almost all develop abnormally or die before hatching (Formacion and Uwa, 1985). The hatching rate of the hybrid embryos is improved by high-temperature treatment, and this improvement has been verified to be the result of triploidization (Sato et al., 2001). Thus, all or most of the hatched fry are expected to be allotriploids.
GenotypingDNA was extracted from the hatched fry by proteinase K digestion, phenol–chloroform extraction and isopropanol precipitation (Shinoyama et al., 1999). DNA samples were dissolved in TE buffer. To find DNA markers to detect polymorphisms between Hd-rR, HNI-II and O. luzonensis, we selected expressed sequence tag (EST) markers from the medaka EST database (http://mbase.nig.ac.jp/mbase/medaka_top.html), and made new markers as necessary (Supplementary Table S1). PCR amplification of genomic DNA was performed as follows: 35 cycles at 95 ℃ for 30 sec, 55 ℃ for 30 sec, and 72 ℃ for 2 min. All PCR products were electrophoresed on polyacrylamide gels as described by Kimura et al. (2004). In this study, we developed an SNP genotyping system between the HNI-II and Hd-rR inbred strains. SNP genotyping was performed using the MassARRAY iPLEX Gold Assay (Sequenom) (Takada et al., 2013). The information used for genotyping is described in Supplementary Table S2.
Centromere localizationThe second meiotic division segregation frequency (y), which is a function of the recombination rate between the marker and centromere, was estimated by counting the proportion of progeny heterozygous for Hd-rR and HNI-II alleles in allotriploid hybrids, which allowed us to estimate G-C distance (x) (Johnson et al., 1996). G-C distance may be estimated under different models of chiasma interference (Danzmann and Gharbi, 2001). Assuming 50% chiasma interference, G-C distance was calculated as follows: x = [ln (1 + y) − ln (1 − y)] × 100 / 4.
Chromosome type classificationChromosomes were classified according to the criterion of the centromeric index (CI; long arm to short arm ratio) introduced by Levan et al. (1964). Chromosome type and corresponding CI values were as follows: metacentric, 1.0–1.7; submetacentric, 1.7–3.0; subtelocentric, 3.0–7.0; and acrocentric, more than 7.0.
In this study, we used 112 triploid hybrids and 152 DNA markers (Supplementary Table S1). Hatched fry were all allotriploid hybrids, as determined by genotyping using DNA markers. The map length of each chromosome was 44.3–58.7 cM (average, 50.2 cM). The linkage map spanned a genetic length of 1,204 cM (Table 1), which is 4% shorter than that proposed in a previous study (1,257 cM in female, Kimura et al., 2005); however, we used terminal markers for the physical map of each chromosome in this study. This difference may also be partially due to long interval distances between loci (average interval distance, 9.3 vs. 5.4 cM in Kimura et al., 2005). Centromeres were successfully positioned onto all 24 LGs of the medaka genetic linkage map (Fig. 1).
LG | Map length (cM) | Recombination rate (cM/Mbp) | |
---|---|---|---|
Centromeric region* | Other region | ||
1 | 56.1 | 0.95 | 1.87 |
2 | 51.0 | 0.61 | 2.24 |
3 | 52.6 | 1.14 | 2.18 |
4 | 58.7 | 0.58 | 2.18 |
5 | 56.1 | 0.47 | 1.96 |
6 | 45.9 | 0.60 | 1.56 |
7 | 50.7 | 1.21 | 1.47 |
8 | 47.4 | 0.19 | 2.53 |
9 | 44.5 | 0.70 | 1.65 |
10 | 49.4 | 0.06 | 2.07 |
11 | 44.3 | 2.72 | 1.50 |
12 | 46.2 | 0.32 | 2.21 |
13 | 49.5 | 0.75 | 1.59 |
14 | 48.3 | 0.75 | 1.85 |
15 | 47.1 | 0.48 | 1.70 |
16 | 47.6 | 1.12 | 1.71 |
17 | 50.5 | 0.23 | 1.71 |
18 | 55.2 | 2.01 | 1.77 |
19 | 52.1 | 1.41 | 2.74 |
20 | 44.7 | 1.72 | 2.15 |
21 | 53.4 | 1.14 | 2.06 |
22 | 49.0 | 0.77 | 1.99 |
23 | 52.8 | 2.09 | 2.24 |
24 | 50.6 | 1.31 | 2.51 |
Total | 1,203.7 |
Gene-centromere maps of 24 medaka LGs. For each chromosome, the physical map (Hd-rR) is on the left, and the G-C map is on the right (n = 112). Centromere locations determined by half-tetrad analysis are shown as open circles. Centromeric regions are indicated as gray boxes in the physical maps. Centromeric repeats identified by Ichikawa et al. (2017) for Hd-rR and HNI-II are represented by black and white triangles, respectively.
In general, centromeric regions possess lower levels of recombination while telomeric regions of the chromosome experience increased crossover events (Choo, 1998; International Human Genome Sequencing Consortium, 2001). We found that almost every centromeric region of the G-C maps, except for those of LGs 11 and 18, had lower recombination frequencies (Table 1). LG11 of HNI-II has a large chromosomal inversion (7–23 Mbp) in the central region and that of Hd-rR does not (Kimura et al., 2005; Ichikawa et al., 2017), which may explain the high recombination frequency in the centromeric region of LG11. For LG18, the relatively higher level of recombination in the centromeric region may be partially due to insufficient narrowing of this region.
Comparison between centromeric regions in the G-C maps and putative centromeric repeat positionsPutative centromeric repeats have been reported in three inbred medaka strains: Hd-rR, HNI-II and HSOK (Ichikawa et al., 2017). The repeats were identified in one or multiple strains for all LGs except LGs 15 and 24. Based on these results, we predicted the centromere locations on the chromosomes (Table 2). For six LGs (2, 6, 11, 12, 19 and 21), centromeric repeats were identified in Hd-rR and HNI-II; for 17 LGs (1, 3, 4, 5, 7, 8, 9, 10, 13, 14, 15, 16, 17, 18, 20, 22 and 23), they were found in either Hd-rR or HNI-II. The repeats on LG6 were located at 0 Mbp in Hd-rR, but at the opposite end in HNI-II. Furthermore, two centromere positions were predicted for LG21 in Hd-rR.
LG | Centromeric region on G-C map (Mbp) | Chromosome type* judged from | Expected chromosome type | |||
---|---|---|---|---|---|---|
G-C map | Position of centromeric repeats in Hd-rR | Position of centromeric repeats in HNI-II | FISH and/or silver staining | |||
1 | 23.33–36.66 | M/SM/ST/A | SM | – | SM | Bi (SM) |
2 | 10.12–13.06 | M | M | M | – | Bi (M) |
3 | 0.44–14.45 | M/SM/ST/A | ST | – | – | Mono (ST) |
4 | 0.04–7.87 | ST/A | A | – | – | Mono (A) |
5 | 0.12–5.87 | ST/A | – | A | – | Mono (A) |
6 | 1.01–4.07 | A | A | A | – | Mono (A) |
7 | 33.83–34.57 | A | – | A | – | Mono (A) |
8 | 15.37–22.52 | M/SM/ST | SM | – | – | Bi (SM) |
9 | 18.24–24.72 | M/SM | SM | – | SM | Bi (SM) |
10 | 0.14–7.53 | ST/A | – | ST | – | Mono (ST/A) |
11 | 26.39–28.21 | A | A | A | A | Mono (A) |
12 | 18.25–24.10 | M/SM/ST | SM | SM | – | Bi (SM) |
13 | 29.66–32.77 | A | A | – | A | Mono (A) |
14 | 0.26–6.21 | ST/A | A | – | – | Mono (A) |
15 | 0.44–3.85 | ST/A | – | – | – | Mono (ST/A) |
16 | 0–5.90 | ST/A | A | – | – | Mono (A) |
17 | 0.56–2.50 | A | A | – | – | Mono (A) |
18 | 12.60–21.94 | M/SM | SM | – | SM | Bi (SM) |
19 | 11.33–19.13 | M/SM/ST | SM | SM | – | Bi (SM) |
20 | 0–12.14 | M/SM/ST/A | ST | – | SM (NOR) | Bi (SM) |
21 | 8.60–16.55 | M/SM | D (A or M) | A | – | Bi (M) |
22 | 23.69–28.97 | ST/A | A | – | – | Mono (A) |
23 | 9.26–15.46 | M/SM | M | – | – | Bi (M) |
24 | 19.02–23.24 | ST/A | – | – | – | Mono (ST/A) |
A comparison between the positions of putative centromeric repeats on 23 LGs and the centromeric regions in the G-C maps revealed that repeat positions on 19 LGs (1–5, 7–12, 14, 16–20, 22 and 23) were inside their respective centromeric regions in the G-C maps, whereas those of LGs 6 and 13 were not. The putative centromeric repeats on LG6 were at 0 Mbp in Hd-rR, but at the opposite end in HNI-II, and those on LG13 may not be functional. For LG 21, two positions (2.5 and 14.5 Mbp) of centromeric repeats were found in Hd-rR, whereas only one (2.5 Mbp) was found in HNI-II; additionally, the repeats at 14.5 Mbp were located inside the centromeric region in the G-C map, which indicates that these repeats have centromeric function. The repeats at 2.5 Mbp may be silenced in both Hd-rR and HNI-II. The centromere of HNI-II is assumed to be located at the same position as that of Hd-rR. Although centromeric repeats were not identified on LG24, the centromere was mapped at one end (19–23 Mbp) of the chromosome.
Chromosome classificationOryzias latipes and O. sakaizumii have 24 pairs of chromosomes (2n = 48) that were classified into two types: 10 pairs of biarmed (metacentric and submetacentric) and 14 pairs of monoarmed (subtelocentric and acrocentric) chromosomes (Uwa and Ojima, 1981). Previous fluorescence in situ hybridization (FISH) studies showed that LGs 1, 9 and 18 are biarmed (Matsuda et al., 1998; Brunner et al., 2001; Kondo et al., 2002), whereas LGs 11 and 13 are monoarmed (Myosho et al., 2012). Silver staining analysis by Uwa and Ojima (1981) revealed nucleolus organizer regions (NORs) on the tip of the short arms of a satellited chromosome pair. This result and our FISH analysis demonstrated a NOR on LG 20 (data not shown), which indicates that LG 20 is biarmed and bears a NOR.
Based on the definition of chromosome types (Levan et al., 1964) and our G-C maps, we classified LGs 2, 9, 18, 21 and 23 as biarmed chromosomes, and LGs 4–7, 10, 11, 13–17, 22 and 24 as monoarmed chromosomes. The chromosome types of the five remaining LGs were not determined from the G-C maps alone, because their centromeric regions could not be narrowed enough. Based on the position of the putative centromeric repeats, we classified Hd-rR chromosomes into two types: nine biarmed LGs (1, 2, 8, 9, 12, 18, 19, 21 and 23) and 10 monoarmed LGs (3, 4, 6, 11, 13, 14, 16, 17, 20 and 22). For HNI-II, LGs 2, 12 and 19 were biarmed, whereas LGs 5–7, 10, 11 and 21 were monoarmed (Table 2).
We classified all 24 LGs into either biarmed (1, 2, 8, 9, 12, 18–21 and 23) or monoarmed chromosomes (3–7, 10, 11, 13–17, 22 and 24). Although the chromosome type of LG 20 was determined to be monoarmed (subtelocentric) based on the centromeric repeats of Hd-rR, this chromosome was expected to be biarmed, because this NOR-bearing chromosome was classified as submetacentric (Uwa and Ojima, 1981).
The results of this study integrate the centromere map and genetic linkage map in O. latipes/O. sakaizumii, which provides valuable information that will help to consolidate the genetic and physical maps in the near future. These findings will also be useful for studies on genome structure, chromosome evolution and positional cloning of genes in this species complex.
We are grateful to Prof. T. Shiroishi and Dr. T. Takada of the National Institute of Genetics for their generous help with SNP genotyping. We are also grateful to Prof. Y. Matsuda and Dr. Y. Uno of Nagoya University, and Dr. Y. Takehana of Nagahama Institute of Bio-Science and Technology, for their invaluable advice regarding FISH analysis. We thank Dr. Mallory Eckstut from Edanz Group for editing a draft of this manuscript.