Genes & Genetic Systems
Online ISSN : 1880-5779
Print ISSN : 1341-7568
ISSN-L : 1341-7568
Methods, technology, and resources
Development of polymorphic microsatellite markers for Fagus pashanica (Fagaceae) using next-generation sequencing
Bi-Ru ZhuXiao-Ya ZhangXiao-Xi YangWan-Jin Liao
Author information
JOURNAL OPEN ACCESS FULL-TEXT HTML
Supplementary material

2023 Volume 98 Issue 5 Pages 277-281

Details
ABSTRACT

Fagus pashanica is an endangered and endemic tree species in China. To understand its genetic diversity and structure for effective conservation, we used next-generation sequencing data to develop a set of microsatellite markers. Twenty-three of the 68 designed loci were successfully amplified. Fifteen polymorphic loci with clear peaks were selected for further analyses in three F. pashanica populations sampled from Nanjiang, Wangcang and Pingwu counties in Sichuan Province, China. The number of alleles per locus ranged from two to 11. The levels of observed and expected heterozygosity ranged from 0.033–0.852 and 0.033–0.787, respectively. All 23 loci were also successfully amplified in F. longipetiolata and F. lucida, and 19 were successfully amplified in F. engleriana. These microsatellite markers will be useful for population genetic studies of F. pashanica and other Fagus species.

MAIN

Beeches (Fagus L., Fagaceae) are deciduous broadleaved trees that dominate certain temperate forests in the Northern Hemisphere (Denk, 2003). They are broadly distributed in three isolated regions: East Asia, Europe and West Asia, and North America (Fang and Lechowicz, 2006). There are four Fagus species in China (Huang et al., 1999): F. engleriana, F. longipetiolata, F. lucida and F. hayatae. All of these species mainly occur in the moist subtropical mountains in South China, between 700 m and 2,500 m altitude (Cao et al., 1995; Li and Li, 2008).

Fagus pashanica C. C. Yang is an endangered and endemic species in China that has been identified as a subspecies of F. hayatae (Huang et al., 1999; Ji et al., 2002). However, recent molecular phylogenetic analyses in Fagus suggested that F. pashanica should be a distinct species from F. hayatae in Taiwan Island (Li et al., 2003; Yang, 2020). Fagus pashanica has a more restricted and fragmented habitat than other Fagus species in China, and is distributed in isolated high mountains between 1,300 m and 1,900 m altitude in Sichuan, Hubei and Zhejiang provinces (Li and Li, 2008). Human activities and climate change have accelerated its habitat fragmentation and population decline. To effectively manage and conserve this species, we need to understand its genetic diversity and structure so that genetic conservation strategies (e.g., genetic rescue) can then be applied (Bell et al., 2019; Hoffmann et al., 2021).

Previous studies on F. pashanica are mainly about taxonomy, population characteristics and geographical distribution. Two used allozyme markers to study genetic diversity of several F. pashanica populations (Li et al., 1999; Li and Li, 2005). Because of the disadvantage of allozyme markers and the limitation of sample size, these reports have revealed little about the genetic diversity and structure of F. pashanica. Further studies with adequate sampling and sensitive molecular markers are required.

Microsatellites, also known as simple sequence repeats or short tandem repeats, are polymorphic, codominant markers that have been widely used in population genetic studies. With technical developments such as next-generation sequencing and multiplex PCR, the costs of identifying and genotyping large numbers of microsatellite loci in non-model species have been reduced greatly (Guichoux et al., 2011). In this study we identified microsatellite markers for F. pashanica based on Illumina sequencing genome data. We then used these markers to assess the genetic diversity of three F. pashanica populations and evaluated their cross-species application to F. longipetiolata, F. lucida and F. engleriana.

We used the genome sequencing data of 74 F. pashanica individuals from Yang (2020). These data were generated by an Illumina HiSeq 2500 with 150-bp paired-end sequencing. A total of 27,764,466 clean reads were used for de novo assembly using SOAPdenovo V2.04 (Luo et al., 2012). The scaffolds generated by de novo assembly were then used to detect microsatellite loci using QDD V3.1.2 (Meglécz et al., 2014) with default parameters. Primers were designed using QDD, taking into consideration the optimal PCR product size (100–200 bp), primer annealing temperature (57–63 ℃) and primer length (18–27 nucleotides).

To evaluate the microsatellite markers, samples of F. pashanica were collected from three natural populations from Nanjiang (NJ, n = 27), Wangcang (WC, n = 30) and Pingwu (PW, n = 28) counties in Sichuan Province. To evaluate the cross-species applicability of these markers, samples of three other Fagus species were collected: 30 individuals from one natural population of each species. The locality information of the sampled populations is detailed in Supplementary Table S1. Individuals were randomly selected and were at least 10 m apart from each other in one site. Fresh and healthy leaves of each individual were collected in a paper bag and dried with silica gel. Genomic DNA of all sampled individuals was extracted from the dried leaf tissues using the Plant Genomic DNA Kit (Tiangen Biotech, Beijing, China) and stored at −30 ℃.

Microsatellite locus validation had three steps. First, we used seven individuals from population WC of F. pashanica to test the success of amplification for the designed primer pairs. The 15-μl PCR reaction mixture contained 9.2 μl of deionized water, 1.5 μl of 10× buffer (Mg2+-free), 0.9 μl of MgCl2 (25 mM), 1.2 μl of dNTP mixture (2.5 mM each), 0.5 μl of each primer (10 μM), 0.2 μl of Taq polymerase (5 U/μl, Takara Bio, Dalian, China) and 1 μl of DNA. Touchdown PCRs were performed under the following conditions: an initial denaturation of 4 min at 94 ℃; 10 cycles of 94 ℃ for 45 s, 63 ℃ for 45 s with a decrease of 1.3 ℃ per cycle, 72 ℃ for 40 s; 20 further cycles of 94 ℃ for 45 s, 50 ℃ for 45 s, 72 ℃ for 40 s; and a final extension at 72 ℃ for 8 min. The success of amplification was determined by electrophoresis using 1.5% agarose gels. Second, the forward primers of the successfully amplified primer pairs in all seven WC individuals were labeled with a fluorescent dye. DNA from all individuals collected from the three F. pashanica populations and three other Fagus species populations was used for PCR reactions using the above conditions. PCR products were scanned with an ABI 3730xl DNA Analyzer (Applied Biosystems) at Tsingke Biotechnology (Beijing, China), and genotyped by GENEMAPPER V4.0 (Applied Biosystems). For each Fagus species, microsatellite loci that could be successfully amplified in more than 90% of individuals and had clear peaks, suitable fragment lengths and polymorphism were further analyzed. Third, population genetic analyses were conducted. The number of alleles (NA), Shannon’s information index (I), observed heterozygosity (HO) and expected heterozygosity (HE) were estimated using GenAlEx version 6.5 (Peakall and Smouse, 2012). Pairwise FST values were calculated using GenAlEx. Hardy–Weinberg equilibrium (HWE) and linkage disequilibrium between microsatellite loci were tested using GENEPOP4.7.2 (Rousset, 2008). Frequencies of null alleles were estimated by Micro-Checker V2.2.3 (Van Oosterhout et al., 2004).

Twenty-three of the 68 primer pairs randomly selected from the designed primer pairs using QDD successfully amplified the target loci in the first step (Table 1). For F. pashanica, 15 primer pairs were finally selected for population genetic screening. All these loci had > 95% PCR amplification success rate (Table 2). The NA per locus ranged from two to 11. The I values were from 0.085 to 1.688. The levels of HO and HE ranged from 0.033 to 0.852 and 0.033 to 0.787. Significant deviations from HWE (P < 0.05) were detected in five loci: F45 in population NJ; F66 and F68 in population WC; and F11, F34 and F45 in population PW. Potential inbreeding and genetic drift may have led to the deviations from HWE in small populations of F. pashanica. No significant linkage disequilibrium or null alleles were observed in any of the 15 loci. Among the three F. pashanica populations, NJ had the lowest mean value of NA (3.9), but the highest mean values of I (0.889), HO (0.459) and HE (0.485) (Table 2). The pairwise FST values suggested moderate differentiation between populations NJ and WC (0.069), and between populations NJ and PW (0.084), but low differentiation between populations WC and PW (0.023). The transferability of the 23 primer pairs was tested across 90 individuals of F. lucida, F. longipetiolata and F. engleriana (Table 3). All the primer pairs successfully amplified the target loci, except loci F19, F43, F59 and F66 in F. engleriana. Amplifications of these four loci may have failed because F. engleriana belongs to another subgenus. These loci performed differently in population genetic analyses for different Fagus species (Table 3).

Table 1. Characteristics of 23 microsatellite loci of Fagus pashanica

LocusPrimer sequences (5′–3′)Repeat motifGenBank accession no.
F5F: GCAACTTGGATTATGTTCTTCCAG(AAT)8OR020943
R: TGTCCGTTAATAAAGAACTCGGG
F8F: CCAGCAACTCTCGTTAACTAGC(AAT)8OR020944
R: AACGAGATGCGCGTCTCTAC
F10F: TGCCAAACAAGCGACAAACA(AAC)8OR020945
R: AGCATGTGCAGAAGTTGTGG
F11F: ACTCTCACCAGCTTTCAGGC(AGG)7OR020946
R: ACACCTACACGTTATATCACACGA
F14F: TGCCGCACTTAACCTCACAT(ATC)7OR020947
R: CTCACAGCATAAAGATCTTATACCG
F16F: GGGTGTTAGCTTCCTAATTTCCC(AC)16OR020948
R: CAACAGGATTGAGTGAGATGCC
F18F: GCATCACAGGTCTGTCCACA(AG)13OR020949
R: GAGCAACATAGCATGTGGTTGA
F19F: CTTGTATATTGCTTGGCTACTAGC(AG)12OR020950
R: TGATGGAAAGTTGAGCCACC
F20F: CTCAAACCCGACCTAACCCG(AG)12OR020951
R: TCGTAACCGATACCAACCACA
F24F: CAAGAAATGCAGAATAGGAGAGAGA(AG)11OR020952
R: ATGGCTCACTGCTTCGTGAA
F27F: ACGTCGATTTCTTTAACGGCT(AG)10OR020953
R: CAATTTCCACACTCCGATGACC
F31F: TCCAAACACCGAATAGCCCTC(AG)9OR020954
R: TGAGAGGCGTTGTGAGGTTC
F34F: AGGTGACTTGTGACGTGAATGA(AG)9OR020955
R: AGAAAGAGCCATCCCTTTCCT
F37F: ACTGAACACTTTCTAGAGCTTCGA(AG)9OR020956
R: TGCAGTCAAACAAGACAACCA
F43F: AACCCAGAGTTGCTAGTTGC(AG)8OR020957
R: ACTTCCAAAGTTCGAAGTGCT
F45F: TGAACATGTAGCTAGCCCGC(AC)8OR020958
R: ATGCCTACTCATTCTGTGCGA
F47F: GCCACGGGATGCTGAAAGAT(AC)8OR020959
R: TGACGAGGTGGTGTTTGGG
F58F: AGGCACGCCCACATACAATT(AG)7OR020960
R: CAAGTCTACTTCATGATTCCCTCT
F59F: ACTCGCCCATTGGTTAACCT(AT)7OR020961
R: TAAGAGCTGCTGCCACAACA
F62F: TCCCTTCGAGCTTGTTATGCA(AG)7OR020962
R: ACTGAGTCTGAGTAGAATCCCAT
F64F: GCCTCCTATTGGCCATCTTGA(AG)7OR020963
R: CCTCTGCTACCTCCTCCCAT
F66F: CACATGTGGAGATAAAGAGTCCA(AG)7OR020964
R: ACCTGCTCTCGATTCTTTCCA
F68F: ACCAGATGCCCTTCCATTGA(AG)5OR020965
R: TGTTGCGGTGGTGGTTTAGT

Bold loci were successfully validated in three populations of F. pashanica.

Table 2. Genetic diversity statistics of 15 polymorphic microsatellite loci in three populations of Fagus pashanica

LocusPopulationTotal (n = 85)MeanPCR amplification rate (%)
NJ (n = 27)WC (n = 30)PW (n = 28)
NAIHOHENAIHOHENAIHOHENASize range (bp)IHOHE
F841.3020.6300.71130.9920.5670.59341.1030.5000.6225164–1761.1320.5650.642100
F1040.5650.3330.28840.5860.3330.31050.6450.2860.2886110–1250.5990.3170.295100
F1151.0300.4070.50140.8440.3670.4515a0.6980.2860.3375135–1470.8580.3530.430100
F1661.1930.6670.606101.5250.7000.65681.0930.5000.48911160–1841.2700.6220.584100
F1851.2660.5930.67951.3570.6670.68161.3820.7140.6926165–1771.3350.6580.684100
F1920.5930.2400.40320.2870.1000.15320.6520.4290.4592151–1570.5110.2560.33898
F2061.5050.8520.74871.6880.8330.78751.3300.7500.6967129–1411.5080.8120.744100
F2450.6690.2960.29840.7480.4330.38750.8700.3930.4486171–1810.7620.3740.378100
F3430.6250.4070.34130.3940.2000.1852a0.0900.0360.0354189–1950.3700.2140.187100
F3731.0110.6670.60431.0200.6670.61330.9310.5000.5763121–1250.9870.6110.598100
F454a1.1420.5380.63951.0540.5670.5475a1.0180.4290.5056164–1761.0720.5110.56499
F4730.7680.4810.51440.7400.4670.41440.6130.3210.3285163–1750.7070.4230.418100
F6230.7300.3330.42640.7480.3000.38720.4410.2500.2704155–1630.6390.2940.361100
F6620.3490.1480.1983a0.5630.3000.30930.3460.1790.1654126–1420.4190.2090.224100
F6830.5920.2960.3162a0.0850.0330.03320.1540.0710.0693109–1290.2770.1340.139100
Mean3.90.8890.4590.4854.50.8420.4360.4344.10.7580.3670.399

NA = number of alleles per locus; I = Shannon’s information index; HO = observed heterozygosity; HE = expected heterozygosity; n = number of individuals.

a  Significant deviation from Hardy–Weinberg equilibrium (P < 0.05).

Table 3. Transferability of the 23 microsatellite markers developed from Fagus pashanica in F. longipetiolata, F. lucida and F. engleriana

F. longipetiolata (n = 30)F. lucida (n = 30)F. engleriana (n = 30)
LocusNASize range (bp)IHOHEPCR amplification rate (%)NASize range (bp)IHOHEPCR amplification rate (%)NASize range (bp)IHOHEPCR amplification rate (%)
F53118–1240.8860.6000.5551004121–1331.2890.7670.7041004c118–1301.0790.5670.613100
F85164–1821.1920.5330.6441003abc164–1761.0330.3450.62710011640.0000.0000.000100
F104111–1260.7810.6330.4771004c111–1231.1390.8000.64010011080.0000.0000.000100
F114135–1471.2200.8670.6831006a135–1501.5570.6550.7561006135–1531.4430.8210.71894
F143157–1630.7550.5000.5051003154–1630.4850.2670.2401003a154–1630.3430.1000.156100
F165160–1681.1700.5670.5781006160–1721.3070.6670.6481005160–1681.1700.5670.578100
F1810167–1951.8480.7240.76910012157–2012.1230.9670.8541004c157–1791.1100.6670.610100
F194147–1531.0160.7000.5961003abc149–1531.0040.3330.6021000NA0
F209c125–1511.8430.7590.8101007123–1431.6880.8670.7981007117–1431.5110.8000.697100
F244c172–1780.9670.6670.5371004172–1920.4720.2330.2141007172–2001.4560.6900.707100
F272165–1670.6660.4330.4731003165–1690.7030.3670.3841003c164–1680.8430.5330.483100
F317126–1441.5650.8330.74310010128–1542.0160.9670.8481003126–1380.3700.2000.183100
F344185–1910.8180.4330.5121007187–2071.4610.9000.7131005189–2030.8690.6670.508100
F373120–1260.2820.0670.1261005c120–1281.3910.9000.7311006151–1711.4610.7330.720100
F433116–1400.7030.5330.4551002a136–1380.0850.0330.0331000NA0
F455162–1741.1330.5330.6211004c164–1720.9860.6330.5351003166–1740.5050.3330.283100
F474c160–1700.8470.4000.4631004c166–1921.0850.7000.5931003160–1640.2300.1000.096100
F582ab137–1390.5000.0000.3201007a107–1411.5650.6670.73510011139–1771.5610.8000.659100
F597171–1891.4580.7330.6881008c173–2351.6520.8330.7811000NA0
F622abc155–1570.4510.1330.2781003ac155–1591.0380.5330.6291002c159–1610.5430.4000.358100
F643166–1760.7030.6000.4551004172–1780.9220.5000.50110011660.0000.0000.000100
F665c134–1441.2790.6070.686943b140–1500.7160.1670.4381000NA0
F682107–1270.0850.0330.03310011270.0000.0000.0009411070.0000.0000.00090

NA = number of alleles per locus; I = Shannon’s information index; HO = observed heterozygosity; HE = expected heterozygosity; n = number of individuals; NA = unsuccessful PCR amplification.

Bold loci were successfully validated in three populations of F. pashanica.

a  Significant deviation from Hardy–Weinberg equilibrium (P < 0.05).

b  Null allele present.

c  Significant linkage disequilibrium (P < 0.01).

In this study, we developed 23 microsatellite markers for F. pashanica using next-generation sequencing data and successfully validated 15 of them in three populations of F. pashanica. These markers will be useful for future genetic diversity and structure studies of F. pashanica, which are essential to effectively manage and conserve this species.

ACKNOWLEDGMENTS

This work was supported by the National Key R&D Program of China (2017YFA0605100) and the State Key Laboratory of Earth Surface Processes and Resource Ecology (2022-TS-04).

REFERENCES
 
© 2023 The Author(s).

This is an open access article distributed under the terms of the Creative Commons BY 4.0 International (Attribution) License (https://creativecommons.org/licenses/by/4.0/legalcode), which permits the unrestricted distribution, reproduction and use of the article provided the original source and authors are credited.
https://creativecommons.org/licenses/by/4.0/legalcode
feedback
Top