Genes & Genetic Systems
Online ISSN : 1880-5779
Print ISSN : 1341-7568
ISSN-L : 1341-7568
Full papers
Development of 19 universal microsatellite loci for three closely related Ficus species (Moraceae) by high-throughput sequencing
Shuqiong LiJianfeng Huang Clive T. DarwellYanqiong Peng
Author information
JOURNAL OPEN ACCESS FULL-TEXT HTML

2020 Volume 95 Issue 1 Pages 21-27

Details
ABSTRACT

The genus Ficus is striking for its species diversity, ecological significance, and its often species-specific relationship with coevolved pollinating fig wasps, which has long fascinated biologists. The three closely related and generally co-distributed dioecious species Ficus hispida, F. heterostyla and F. squamosa provide an ideal system for the study of speciation, hybridization (caused by pollinator sharing) and comparative phylogeography to infer historical biogeography. We aimed to develop microsatellite markers for these allied species to facilitate the outlined study investigations. A DNA library was constructed from one F. heterostyla sample, and 19 microsatellite loci were developed based on high-throughput sequencing. These markers showed relatively high polymorphism in all three fig species. The mean number of alleles per locus was 3.594–5.286, and the mean observed and expected heterozygosity were 0.469–0.546 and 0.467–0.528, respectively. Principal coordinate, STRUCTURE and AMOVA analyses revealed different degrees of genetic differentiation within species, and, despite some observed genetic admixture, indicated the presence of clear boundaries between different species. In summary, we successfully developed universal microsatellite markers for three closely related Ficus species. These markers will be of great value for investigating patterns of biodiversity among the species in this model system for coevolutionary studies.

INTRODUCTION

With approximately 800 species, Ficus (Moraceae) is arguably the most diverse woody plant genus and occurs globally in the tropics and subtropics. Ficus are viewed as keystone species within tropical forest ecosystems due to their provisioning of fig resources for frugivores (birds, mammals, etc.) (Herre et al., 2008). All Ficus species are involved in an obligate pollination mutualism with mostly species-specific pollinating fig wasps (Hymenoptera, Chalcidoidea, Agaonidae), and the incumbent coevolutionary dynamics between Ficus and agaonids are considered to be the main driver of fig diversity (Cruaud et al., 2012). However, numerous cases of host shift and pollinator sharing have been reported (Haine et al., 2006; Cornille et al., 2012; Liu et al., 2015), potentially leading to host fig hybridization, which has been posited as an additional mechanism generating fig diversity (Machado et al., 2005).

Ficus hispida forms the outgroup lineage to the sister species F. heterostyla and F. squamosa (Cruaud et al., 2012). Despite close phylogenetic relationships, they display differences in phenotype or habitat preference. Ficus hispida is a pioneer shrub or small tree, widely distributed over tropical Asia and Australasia. It is mainly found along roads or secondary habitats as a pioneer species. Berg and Chantarasuwan (2007) proposed F. heterostyla as distinct from F. hispida because of differences in fruiting position and hair color on leaves and stems. Ficus heterostyla grows under the forest canopy or along roadsides (Berg and Chantarasuwan, 2007; Liu et al., 2015). However, F. squamosa is a short dioecious shrub (1–3 m tall) with a creeping stem, typically found growing along riverbanks or near fast-flowing streams in tropical forests (Pothasin et al., 2016) and occurring from Nepal to Thailand. These three species co-occur in Yunnan (China) and the northern part of Mainland Southeast Asia. The complementary fruiting phenologies between F. heterostyla and F. squamosa facilitate the sharing of one pollinating fig wasp by these two figs, which may result in hybridization. Furthermore, experimental hybrid seeds have been generated (Liu et al., 2015). Therefore, F. hispida, F. heterostyla and F. squamosa comprise an ideal system to study species diversification, the maintenance of species boundaries, hybridization and comparative phylogeography.

Due to the usefulness of estimating parameters of population genetic diversity, identifying individuals or populations, and investigating phylogeographic patterns (Wei et al., 2014; Zhao et al., 2015), developing microsatellite loci for these three species is an appropriate approach for the studies mentioned above. Although Dev et al. (2011) published eight microsatellite loci for F. hispida, they often fail to display polymorphisms among these species and the number of markers is insufficient for high-resolution studies. Thus, there is a need for the development of additional microsatellite markers. In the current work, we aimed to develop a set of universal polymorphic microsatellite primers for these three Ficus species using high-throughput sequencing. These new primers will enable us to evaluate population dynamics and connectivity of these three species and will facilitate further comparative phylogeographic studies.

MATERIALS AND METHODS

Isolation of microsatellite loci

Genomic DNA was extracted from leaves of all three species collected from six natural populations in Southwestern China and Southeast Asia (Fig. 1A), using the Tiangen Plant Genomic DNA Kit (Tiangen Biotech, Beijing, China). The DNA was quantified using a NAS-99 spectrophotometer (ATCGene, Piscataway, NJ, USA) and we followed the manufacturer’s protocols for the KAPA HyperPrep kit (KAPA Biosystems, Wilmington, MA, USA) to construct a DNA library using one sample of F. heterostyla. The sequencing processes were performed by Novogene (Beijing, China) on a HiSeq 2500 (Illumina, San Diego, CA, USA), using reagents from Illumina and with paired read lengths of 2 × 150 bp. All raw data were quality-filtered by trimming adapter sequences and by removing reads with quality scores < 20. Finally, high-quality cleaned paired reads were aligned by FLASH (Magoc and Salzberg, 2011) and we obtained 10,952,088 tags that were deposited in the NCBI Sequence Read Archive (BioProject ID PRJNA542260; accession no. SRP197518). The MICROSATELLITE (MISA) identification tool (Thiel et al., 2003) was adopted to identify reads with microsatellite loci, and Primer 3.0 (Rozen and Skaletsky, 1999) was used to design microsatellite primers using default settings.

Fig. 1.

Results of clustering (K = 3) calculated by STRUCTURE indicating three Ficus species. (A) Distribution of six populations and pie charts showing the proportions of membership coefficients in clusters assignment from CLUMPP. Cluster colors correspond to those used in (C). (B) Diagram of Delta K and variance per K value, plotted by the HARVESTER online program, determining the optimal K value. (C) Bar plot displaying six populations of three Ficus species (Ficus hispida: LWX and MDA; F. heterostyla: A and TGC; F. squamosa: QSBG and MYXJ) and three clusters, represented by vertical black lines and different colors, respectively.

To develop universal polymorphic microsatellite primers for the three species, a two-step PCR amplification approach was employed. The first PCR amplification was tested on six samples randomly selected from the six populations to initially screen for primers. An M13 tag (5′-TGTAAAACGACGGCCAGT-3′) was added to the 5′ end of the primer of each locus. This amplification step used the following protocol and conditions: 15 μl of PCR amplification reaction volume containing 1.5 μl of 10×Buffer, 1.2 μl of dNTP (2.5 mM), 1.2 μl of DNA, 0.8 μl of TP-M13 (5 M), 1 μl of specific primer, 0.1 μl of Takara Taq HS polymerase and 9.2 μl of deionized water; initial denaturation at 95 ℃ for 5 min, 30 cycles at 94 ℃ for 30 s, at 56 ℃ for 30 s and at 72 ℃ for 30 s, 10 cycles at 94 ℃ for 30 s, at 53 ℃ for 30 s and at 72 ℃ for 30 s, and final extension at 60 ℃ for 30 min. Microsatellite loci that were successfully amplified and displayed polymorphisms in all three species were selected for the next PCR amplification stage. For this, we used multiplex PCR and focused on the applicability of the filtered primers in exploring genetic diversity within populations. In this step, a total of 95 samples from the six populations of the three species were analyzed using 5′ fluorophore primers labeled with 6-FAM, HEX, ROX or TMR instead of M13-tailed primers. PCR amplification was based on the following protocol and conditions: 10 μl of PCR amplification reaction volume containing 2.0 μl of 5 × Buffer, 0.3 μl of dNTP (10 mM), 1 μl of DNA, 0.4 μl of primer mixture (5 μM), 0.1 μl of Taq polymerase, and 6.2 μl of deionized water; initial denaturation at 95 ℃ for 8 min, 30 cycles at 94 ℃ for 30 s, at 58 ℃ for 40 s and at 72 ℃ for 40 s, and final extension at 60 ℃ for 30 min. All PCR products were analyzed by capillary electrophoresis on an ABI 3730XL DNA analyzer (Applied Biosystems, Foster City, CA, USA) with the GeneScan 500 ROX Size Standard. Microsatellite fragment sizes were determined using GeneMapper version 3.2.

Data analysis

First, we checked null alleles using MICRO-CHECKER 2.2 (van Oosterhout et al., 2004), employing 95% confidence intervals. The population genetic parameters of diversity, comprising the number of alleles per locus (A), the observed heterozygosity (HO), the expected heterozygosity (HE) and the inbreeding coefficient (FIS), were calculated by GenAlEx version 6.5 (Peakall and Smouse, 2012). We tested for linkage disequilibrium and Hardy–Weinberg equilibrium in the three species using GENEPOP 4.7.0 (Rousset, 2008) and used Bonferroni correction (at P < 0.05) with default parameters in MYRIADS 1.1 (Carvajal-Rodriguez and de Uña-Alvarez, 2011). We then tested for population differentiation within individual species using analysis of molecular variance (AMOVA) in ARLEQUIN 3.5 (Excoffier and Lischer, 2010) with 1,000 permutations to test for significance. STRUCTURE 2.3.4 (Pritchard et al., 2000) was used to assign individuals into clusters of different numbers of demes (K; at values between 1–6) with membership coefficients estimated under an admixture model with correlated allele frequencies. We set 10 independent runs for each value of K and 100,000 MCMC repetitions with a burn-in of 100,000 iterations. We used the HARVESTER online program (Earl and vonHoldt, 2012) to plot the values of Delta K and variance per K value, and to determine the optimal K value (Evanno et al., 2005). For the optimal value of K, CLUMPP 1.1.2 (Jakobsson and Rosenberg, 2007) was used to summarize the membership coefficients into clusters, after which the program DISTRUCT 1.1 (Rosenberg, 2004) was launched to draw the results as colored box-plots. Results were also plotted on a distribution map. Finally, a principal coordinate analysis (PCoA) was carried out in GenAlEx version 6.5 based on a pairwise, individual-by-individual codominant genotypic distance matrix.

RESULTS AND DISCUSSION

MISA found 9,265 contigs containing at least one microsatellite, along with a total of 9,616 microsatellite loci. Among these loci, 2-, 3-, 4- and 5-nucleotide motifs were found at 90.34%, 8.53%, 0.83% and 0.3%, respectively. A total of 14,505 primer pairs were recovered, and 158 primer pairs featuring different repetitions were then selected randomly for subsequent PCR tests. Across the 158 selected primer pairs, 34 microsatellite loci were successfully amplified from the first six samples of the three species, with 19 loci displaying relatively high polymorphism along with appropriate fragment size in the pre-test experiment (Table 1). For these 19 applicable loci, we tested polymorphism with the three species.

Table 1. Characteristics of 19 microsatellite loci developed for Ficus hispida, F. heterostyla and F. squamosa
LocusPrimer sequences (5′–3′)Repeat
motif
Allele size
range (bp)
Ta (℃)Fluorescent
label
GenBank
accession no.
3-N173F: AATGTCCTTCGGCTCTCAAGA(GA)30100–13760.1HEXMK937077
R: GGGTTGCCAACTTTCTCATTATG
4-N101F: AAAATGTCGGGATGTATTCAAGA(AC)2772–15559.8ROXMK937078
R: TTTCGCCCATATCTCATTTCAC
5-N108F: CCGAAGACATTTTAAACCATCAA(TG)23128–17760.7HEXMK937079
R: AGCTAGGTTTCCCTTTCATTGTC
6-N104F: AACGTGGCTGTTGGGTAACAT(TC)26126–16459.7HEXMK937080
R: GGCAACAGTTGTCCCAAAAGTA
7-N245F: CACAAGGCTGCTTTGTTTTCTTC(TC)30125–17060.4ROXMK937081
R: GGAATACAAGGATAGCATGTTCCC
10-N381F: GTCGGAGCGATTTTCCAGTG(GA)35120–16159.8FAMMK937082
R: CCGACACAACTCACGACCATA
11-N346F: GGGAAAATAGGAGCGTAGGGA(AG)39104–15561.5HEXMK937083
R: GGGCCTTTTGGTTCCTAGAAT
12-N102F: TCTCGAATCGAATCCAAGTGC(TGT)25136–30859.0FAMMK937084
R: TTATAGCCCTATTGCTCCAATGC
16-N106F: GGAAAACCAAACAGTAGAGGCA(CT)33142–23060.7FAMMK937085
R: GTGTTAAACACATGGGCCAGAA
19-N530F: CAGTGGCAACAAGATTTCACCT(GA)32163–17858.8TMRMK937086
R: TTAAAGTAAATCTGCCCCTCTCTT
20-N291F: AATCGTACCTACCGGCTGCTA(GA)38148–21061.2TMRMK937087
R: CAGCACCATCAGCACCAAAG
21-N197F: GGCAAAGATCACGAGGAGACG(TC)31165–22559.7FAMMK937088
R: CTCACTATCGCCCAACGCAT
23-N724F: AGGGTGAGGCTTTTTCGCAA(TCT)23167–19159.9TMRMK937089
R: GGTTCATTCTTCTTCTCCGACG
26-N180F: TGTTTCATGTATTCACCTTGGGA(TTC)15133–16859.9ROXMK937090
R: AATACAGTTGCAGAGACGGAAGA
27-N150F: CATTTTCGGTGCTTGGATGT(GA)20141–16160.0HEXMK937091
R: GTAACGAATCATACCCAACACCTA
28-N247F: GCTGCAAAAGCAATAACAAAGC(CA)24150–18259.6ROXMK937092
R: TATATCAAGACCTTCGGATGCTC
29-N105F: GAACTAGAAATCCCACCAAGCA(TC)19161–23060.4TMRMK937093
R: CTCTTCTTTTCATTGTGGTCCCT
30-N457F: AGGAAGCGAAAGCTCGATTC(CAA)17179–22661.5FAMMK937094
R: TCCTCTTCTTCCTCCTCTTCCA
32-N125F: GCAGGCTGAACAGGACACTGA(TG)22166–20359.3TMRMK937095
R: TAGCCGCCAAATCCTCATCT

Ta, annealing temperature.

Several loci displayed signals of null alleles (6-N104, 19-N530, 11-N346, 16-N106 and 27-N150 in F. hispida; 12-N102, 32-N125, 10-N381, 16-N106, 27-N150 and 29-N105 in F. heterostyla; 21-N197, 6-N104 and 29-N105 in F. squamosa). Over populations, per locus, the A, HO, HE and FIS ranged from 1 to 10, 0 to 0.871, 0 to 0.830, and –0.233 to 0.912, respectively (Table 2). The relatively high FIS values that were shown for some loci likely result from the presence of null alleles (Table 2). After correcting, we found a total of seven locus pairs (12-N102 vs. 21-N197, 3-N173 vs. 7-N245 and 30-N457 in F. hispida; 6-N104 vs. 21-N197, 7-N245 vs. 26-N180, 5-N108 vs. 11-N346 and 21-N197 in F. heterostyla) that significantly deviated from linkage equilibrium, but no loci that significantly deviated from Hardy–Weinberg equilibrium (P < 0.05), across all species. AMOVA tests revealed a much higher population differentiation in F. squamosa (FST = 0.43, P < 0.01) than in F. hispida (FST = 0.071, P < 0.01) and F. heterostyla (FST = 0.079, P < 0.01. STRUCTURE results indicated an optimal K value of three among three species (Fig. 1B, 1C). PCoA showed patterns consistent with those of STRUCTURE (Figs. 1C and 2), which indicated the presence of clear boundaries between the three species. STRUCTURE results also displayed some signatures of admixture between paired species (Fig. 1A, 1C), indicating interspecific gene flow.

Table 2. Genetic parameters of 19 microsatellite loci for three Ficus species
LocusFicus hispida (n = 39)
LWX (20) a, MDA (19) a
Ficus heterostyla (n = 29)
A (14) a, TGC (15) a
Ficus squamosa (n = 27)
QSBG (13) a, MYXJ (14) a
AHOHEFISAHOHEFISAHOHEFIS
3-N1734.000.6680.596−0.1205.000.6550.602−0.1026.500.8460.689−0.218
4-N1014.000.2540.228−0.1025.500.5520.5800.0425.000.7010.634−0.102
6-N1043.000.2840.4740.3208.500.7550.7950.0564.500.4840.6920.290
7-N2458.000.7960.764−0.0435.000.3790.382−0.0113.500.6320.6340.004
12-N10210.000.7960.8210.0313.500.1070.1930.2563.000.6440.575−0.116
19-N5302.000.1000.2610.6163.500.6170.6680.0604.000.5190.5420.041
30-N4575.500.6120.538−0.1243.500.2400.3010.1464.000.6040.546−0.030
32-N1253.500.4140.4370.0882.000.0360.1260.7172.500.2690.3180.153
10-N3818.000.7970.771−0.0318.500.5000.7440.3004.500.6620.6850.043
11-N3464.000.4140.6710.3702.500.3380.336−0.0325.000.4400.5220.140
16-N10610.000.6180.8300.2517.000.5810.7250.2113.000.4780.485−0.024
20-N2916.000.6960.648−0.0648.000.6930.7980.1342.500.4750.4670.012
21-N1977.500.8710.768−0.1347.500.7240.7490.0348.500.6290.7470.124
26-N1804.000.4630.4980.0452.500.1740.2510.1882.000.2500.203−0.233
27-N1502.500.0360.4250.9124.500.2690.4820.2263.000.2340.2590.055
28-N2472.000.1000.1330.2452.500.4500.455−0.0493.500.5220.440−0.187
29-N1053.500.3870.4260.07110.500.7880.8000.0077.000.4040.7350.435
5-N1087.000.7920.758−0.0425.000.5210.6150.1613.500.3300.281−0.155
23-N7241.000.0000.000NA1.000.0000.000−0.0762.000.2310.189−0.219
Mean5.2860.5460.528−0.0244.6150.4690.5020.0523.5940.4900.467−0.047
a  Locations of the populations are as follows: LWX, Laos (18°4’15.20” N, 102°40’27.37” E); MDA, Myanmar (15°8’19.24” N, 97°49’39.36” E); A, Thailand (14°28’54.18” N, 101°23’0.6” E); TGC, Thailand (12°40’27.48” N, 102°5’44.52” E); QSBG, Thailand (18°53’39.72” N, 98°51’27.72” E); MYXJ, China (21°36’52.69” N, 101°33’50.36” E).

A, number of alleles; HE, expected heterozygosity; HO, observed heterozygosity; FIS, inbreeding coefficient; n, number of individuals collected; NA, not applicable; bold FIS indicates null alleles at this locus; mean values were calculated without loci showing signals of null alleles.

Fig. 2.

PCoA plot of individuals based on a pairwise, individual-by-individual codominant genotypic distance matrix. Shapes represent different species while colors represent different populations. Axis Coord. 1 and axis Coord. 2 explained 19.08% and 17.62% of the total variance, respectively.

Pollinator sharing and interspecific hybridization caused by pollinator sharing have been documented repeatedly for Ficus species (e.g., Machado et al., 2005; Liu et al., 2015). Complementary flowering phenologies between F. heterostyla and F. squamosa have facilitated host shift of the pollinator, which may cause interspecific hybridization. Hybrid seeds have been experimentally generated between these two figs, indicating the potential for introgression (Liu et al., 2015). Additionally, there is phylogenetic evidence of a high degree of pollinator sharing based on ultra-conserved elements (unpublished results) from pollinators of these three figs. Thus, the presence of weak gene flow among them indicated in the present study is not unexpected. However, our discovery that two F. heterostyla individuals of population “A” showed identical genetic constitution with F. hispida should be explored deeply in future (Fig. 1C), including by enhanced sampling and morphological identification. Furthermore, more detailed studies are required to establish the exact genetic patterns of species interaction, mechanisms of introgression between different pairs of species, and individual species’ evolutionary histories.

Within species, F. squamosa shows the highest genetic differentiation, followed by F. heterostyla and F. hispida. Pollen dispersal of dioecious figs is often restricted, which is likely a result of high population densities, fruiting patterns and growth forms, since high densities of receptive host trees will nullify the requirement for long-distance dispersal of tiny pollinating wasps (Harrison, 2003; Wang et al., 2009; Chen et al., 2011). Among our studied dioecious figs, F. squamosa shows higher local abundances and is found near streams or rivers, which could account for the high population differentiation if water courses act as barriers to dispersal of pollinators. Despite the shorter inter-population sampling distance (F. heterostyla: 215 km; F. hispida: 661 km), F. heterostyla shows slightly higher differentiation than F. hispida. The figs of F. heterostyla are located in rooting stolons near or under the soil (Liu et al., 2015), which may affect pollinator dispersal and thus have resulted in increased genetic differentiation. This pattern was also documented for F. tikoua (Chen et al., 2011).

We successfully developed 19 polymorphic microsatellite loci for the target species. These new microsatellite markers will be highly valuable for future studies on species diversification, the maintenance of species boundaries, hybridization and comparative phylogeography.

ACKNOWLEDGMENTS

The authors are grateful to X. X. Song for technical assistance, and to D. R. Yang, B. G. Miao and P. Yang for sample collection. This research was funded by the National Natural Science Foundation of China (31800313, 31672373), the Light of West China Program of the Chinese Academy of Sciences, and the Program of the Ministry of Science and Technology of China (2018FY100406).

REFERENCES
 
© 2020 by The Genetics Society of Japan
feedback
Top