2014 年 83 巻 2 号 p. 95-107
Solanaceae, Rosaceae, and Plantaginaceae exhibit the S-RNase-based gametophytic self-incompatibility (GSI) system. This type of GSI is controlled by a single polymorphic locus (S locus) containing the pistil S determinant gene, S-ribonuclease (S-RNase), and the pollen S determinant, the S locus F-box gene (SFB/SLF). In addition to these determinant genes, non-S factors, called modifier genes, are required for the GSI reaction. Here, we conducted large-scale transcriptome analysis of unpollinated, self-pollinated, and cross-pollinated pistils of Japanese apricot (Prunus mume Sieb. et Zucc. cv. Nanko) to capture all of the molecular events induced by the GSI reaction in Prunus, using next-generation sequencing technologies. We obtained 40,061 unigenes from 77,521,310 reads from pollinated and unpollinated pistils and pollen grains. Among these unigenes, 29,985 and 27,898 unigene sequences showed at least one hit against the NCBI nr and TAIR10 protein databases, respectively, in BLASTX searches using an E-value cutoff of 1e-6. Digital expression analysis showed that 8,907 and 10,190 unigenes were expressed at significantly different levels between unpollinated (UP) and cross-pollinated (CP) pistils and between UP and self-pollinated (SP) pistils, respectively. The expression of 4,348 unigenes in both CP and SP pollination was commonly and significantly different from that in UP, while the expression of 4,559 and 5,842 unigenes in CP and SP, respectively, was specifically and significantly different from UP. The expression of 2,227 unigenes was up-regulated both in CP and SP compared with UP. Genes supposedly involved in S-RNase-based GSI were included among the unigenes up-regulated by pollination, while no unigenes homologous to the solanaceous pistil modifiers HT-B or 120K were included among the unigenes up-regulated by pollination or in the whole unpollinated/pollinated pistil transcriptome. We discuss the distinct molecular mechanism of S-RNase-based GSI in Prunus.
Self-incompatibility (SI) is a widespread mechanism in flowering plants that prevents inbreeding and promotes out-crossing. Although there are several different SI mechanisms in flowering plants, the Solanaceae, Rosaceae, and Plantaginaceae exhibit the S-RNase-based gametophytic self-incompatibility (GSI) system. This type of GSI is controlled by a single polymorphic locus (S locus) encoding pistil S and pollen S determinants. The former is the S-ribonuclease gene (S-RNase) and the latter is a pollen-expressed F-box gene called the S haplotype-specific F-box gene (SFB) in Prunus of the Rosaceae and the S locus F-box gene (SLF) in the Solanaceae and Plantaginaceae (Entani et al., 2003; Lai et al., 2002; Qiao et al., 2004; Sijacic et al., 2004; Ushijima et al., 2003, 2004; Yamane at al., 2003; Wang et al., 2004). Although the Prunus pollen S was originally referred to by two different terms, SFB (Ushijima et al., 2003) and SLF (Entani et al., 2003), we use SFB in this article because it distinguishes the Prunus system from other systems, and recent studies have all used SFB (Newbigin et al., 2008; Sassa et al., 2010; Tao and Iezzoni, 2010). The S locus F-box brothers (SFBBs) have been identified as candidates for the pollen S determinant in the subtribe Pyrinae of the Rosaceae (De Franceschi et al., 2011a, 2011b; Kakui et al., 2011; Minamikawa et al., 2010; Okada et al., 2008, 2011; Sassa et al., 2007). Variants of the S locus defined by combinations of S-RNase and pollen S F-box gene(s) alleles are called S haplotypes.
In the S-RNase-based GSI system, pollen with an S haplotype that matches either S haplotype of the diploid pistil is recognized as self and its tube growth is arrested in the pistil because of the cytotoxicity of S-RNase. Although how this cytotoxicity is exerted remains unclear, it has been suggested that the degradation of RNA in the pollen cytoplasm by self S-RNase results in the arrest of self-pollen tube growth, as indicated by the degradation of pollen rRNA in incompatible pollen tubes (McClure et al., 1990) and the essential role of ribonuclease activity for the incompatible reaction (Huang et al., 1994). It is necessary for normal pollen tube growth in compatible pollination to evade S-RNase cytotoxicity.
Two working models, the S-RNase degradation model and the S-RNase compartmentalization model, have been proposed for SI/SC reactions (McClure et al., 2011). The pollen determinant SFB/SLF is a member of the F-box proteins and contains an F-box domain at the N-terminus. F-box proteins typically form the SCF (SKP1/Cullin/F-box) E3 ubiquitin ligase complex, which regulates protein degradation in the ubiquitin/proteasome proteolytic pathway. In the degradation model, it is hypothesized that the SCFSLF E3 ligase specifically polyubiquitinates all non-self S-RNases. The polyubiquitinated non-self S-RNases are degraded by the 26S proteasome in the pollen tube, allowing normal pollen tube growth in compatible pollination. Recently, it was reported that at least three types of divergent SLF proteins, including the former SLF-like proteins, function as pollen determinants, each recognizing a subset of non-self S-RNases for polyubiquitination (Kubo et al., 2010). Since this finding, the S-RNase degradation model has been superseded by the collaborative non-self-recognition model, although the S-RNase degradation model and the collaborative non-self-recognition model are the same in that the pollen S determinant is involved in the degradation of cytotoxic non-self S-RNase.
The other model is the compartmentalization model, although this model does not necessarily contradict the S-RNase degradation and collaborative non-self-recognition models (Goldraij et al., 2006; McClure et al., 2011). In this model, S-RNases are taken up non-specifically into both self and non-self pollen tubes and then sequestered in the vacuole, and therefore do not exert cytotoxicity. In incompatible pollen tubes, the vacuole breaks down and S-RNases are released into the cytoplasm. HT-B, one of the modifiers of the SI reaction in Solanaceae, is presumed to be involved in this vacuole breakdown. Although genetic studies have indicated the presence of a pistil modifier in the Rosaceae (Fernández i Martí et al., 2009; Moriya et al., 2009), no genes orthologous to solanaceous modifiers, such as HT-B and 120K, have been identified in the Rosaceae.
Each model can partly explain some aspects of the GSI phenomenon but neither model fully explains the molecular mechanism of GSI, possibly because unidentified pollen and/or pistil factors play important roles in the SI/SC reaction. Furthermore, Tao and Iezzoni (2010) suggested the possible existence of a distinct molecular recognition mechanism in the GSI of Prunus based on the different SI/SC behaviors of pollen-part mutants and heterodiallelic pollen. We are still far from a full understanding of the molecular mechanism of S-RNase-based GSI. For further understanding of the molecular basis of GSI, it is essential to identify pollen- or pistil-part modifier genes.
Large-scale transcriptome analysis using next-generation sequencing (NGS) technologies has proven very useful to obtain candidates for such unidentified genes. NGS technologies are capable of generating high-throughput reads at a relatively low cost and have been used in various types of research, including genome sequencing, marker discovery, and especially transcriptome analysis (Ekblom and Galindo, 2011; Pareek et al., 2011; Strickler et al., 2012). Transcriptome analysis using NGS technologies can capture nearly all of the expressed sequences, including rare transcripts in a particular tissue at a specific developmental stage, due to the great depth of sequencing. Therefore, it is very useful, especially for transcriptome analysis in non-model organisms like Japanese apricot. In this study, we used this newly developed technology to dissect the complete molecular network underlying the GSI reaction to help identify pollen- or pistil-part modifier genes.
Although various NGS platforms, such as the Roche/454 Genome Sequencer FLX (GS FLX), the Illumina Genome Analyzer, and the ABI SOLiD, are available, the Roche/454 GS FLX is preferred for transcriptome analysis in non-model organisms (Table 2 in Ekblom and Galindo, 2011; Table 1 in Strickler et al., 2012) because of its longer read length and high accuracy. However, the number of reads generated per run by the Roche/454 GS FLX is much lower than the Illumina Genome Analyzer. The higher number of reads generated by the Illumina Genome Analyzer is preferable for more accurate gene expression measurement. In this study, we used both platforms, the Roche/454 GS FLX and the Illumina Genome Analyzer, to fully explore the advantages of each platform and conducted a large-scale transcriptome analysis of unpollinated, self-pollinated and cross-pollinated pistils of Japanese apricot to capture all of the molecular events induced by the GSI reaction in Japanese apricot, one of the fruit species in Prunus.

Samples and the number of reads obtained from NGS platforms.

Results of assembly by Trinity.
Twelve-year-old Japanese apricot ‘Nanko’ (S1 S7) and ‘Kairyo-Uchida-Ume’ (S3 S4) trees grown at the experimental farm of Kyoto University were used in this study. During the 2010 flowering season, pistils and anthers were sampled from unopened flowers of ‘Nanko’ at the balloon stage of development. Anthers were also sampled from ‘Kairyo-Uchida-Ume’. The anthers were dehisced and the pollen grains were collected. Forty ‘Nanko’ pistils were placed on 1% (w/v) agar in each Petri dish (90 × 20 mm) and incubated at 20°C overnight. After overnight incubation, three different pollination treatments were conducted for the pistils. Two subsets of the pistils were pollinated by ‘Nanko’ (self-pollination, SP) or ‘Kairyo-Uchida-Ume’ (cross-pollination, CP). One third of the pistils were kept unpollinated (UP). The SP and CP pistils were sampled at 3, 6, and 9 hours after pollination. The UP pistils were also sampled at the same times as the pollinated pistils. One hundred pistils, equivalent to the number of pistils incubated in two and half Petri dishes, were sampled at each time point after pollination and pooled for the SP, CP, and UP treatments. Thus, three hundred pistils were used for RNA extraction in each treatment. Two pollen samples, pollen grains and germinated pollen grains, of ‘Nanko’ and ‘Kairyo-Uchida-Ume’ were also used for RNA extraction. For the germinated pollen grain sample, pollen grains (100 mg) were germinated in 10 mL liquid pollen germination medium [50 g·L−1 sucrose, 125 g·L−1 PEG6000, 300 mg·L−1 casein, 10 mg·L−1 rifampicin, 100 mM CaCl2, 100 mM KCl, 100 mM MgSO4, 100 mM H3BO3 in 100 mM MES (pH 5.8)] at 18°C for three hours on a seesaw shaker at low speed (20 shakes·min−1). After three hours of incubation, the germinated pollen grains were sampled for RNA extraction. All collected pistils, pollen grains and germinated pollen grains were immediately frozen in liquid N2 and stored at −80°C until use. Total RNA was extracted by the cold phenol method as described previously (Tao et al., 1999).
Sequencing, sequence preprocessing, and assemblyThe 454-pyrosequencing technique for 3′-ESTs using the GS FLX (Roche 454; Life Sciences, Branford, CT, USA) was performed with cDNA synthesized from mRNA prepared from SP pistils, CP pistils, UP pistils, pollen grains, and germinated pollen grains. Sample preparation and sequencing were conducted as per the custom service provided by TaKaRa BIO Inc. Dragon Genomics Center (Mie, Japan), as described previously (Habu et al., 2012). After sequencing, the obtained reads were processed with Seqclean software (http://compbio.dfci.harvard.edu/tgi/software/, September 18, 2013) to trim low complexity sequences. Then, the reads were further processed using RepeatMasker (Smit et al., 1996–2010) (http://www.repeatmasker.org) with RepBase (Jurka et al., 2005) to mask repeat sequences to avoid misassembly. Finally, the masked reads were processed with a Perl script as follows: i) low quality regions were masked, ii) masked regions of both ends were trimmed, iii) reads that were shorter than 10 bases were removed, and iv) reads that contained more than 30% masked regions were removed.
Illumina Genome Analyzer IIx (GAIIx) (Illumina, San Diego, CA, USA) single-end sequencing was also performed for three samples, SP, CP, and UP pistils, as per the custom service provided by TaKaRa BIO Inc. Dragon Genomics Center. In short, sequencing libraries were constructed using the mRNA-Seq Sample Prep (Illumina) and Small RNA Sample Prep (Illumina) Kits following the Directional mRNA-Seq Library Prep Pre-Release Protocol (Illumina). Using the constructed libraries as templates, clonal clusters from molecule fragments were constructed on GAIIx flow cells by the Cluster Station (Illumina) and Single-Read Cluster Generation Kit v4 (Illumina). Sequencing was performed using the TruSeq SBS Kit v5 (Illumina). After sequencing, the obtained reads were processed with a Perl script as follows: i) low quality regions were masked, and ii) reads that contained more than 10% masked regions were removed.
The preprocessed reads obtained from 454- pyrosequencing and GAIIx sequencing were assembled by Trinity (Grabherr et al., 2011) with the default settings. In addition, the reads from unpollinated pistils obtained in this study (UP_454 and UP_GAIIx) and the reads from pollen (‘Nanko’ and ‘Kairyo-Uchida-Ume’) obtained from DRA (Acc. DRP000624) were individually preprocessed and assembled as described above to obtain unpollinated pistil and pollen transcriptomes.
Functional annotationTo annotate the unigenes, they were searched against the nonredundant protein database (nr) of the National Center for Biotechnology Information (NCBI) and the Arabidopsis protein database (TAIR10) of the Arabidopsis Information Resource (TAIR) by the BLASTX program (Altschul et al., 1990) using an E-value cutoff of 1e-6. Only best-hit results were extracted and hits against the TAIR database were used for functional gene ontology (GO) annotations (The Gene Ontology Consortium, 2000) using the TAIR Gene Ontology tool (http://www.arabidopsis.org/tools/bulk/go/index.jsp, September 13, 2013). Assigned GO terms were summarized based on “Plant GO Slim” provided by the TAIR website.
Digital Expression AnalysisDigital expression analysis was conducted using a number of reads mapped to each unigene. Three GAIIx read sets were individually mapped to the unigenes using Bowtie (Langmead et al., 2009). The obtained raw count data were used as inputs to edgeR (Robinson et al., 2010) for statistical analysis. Unigenes with less than a 1% false discovery rate (FDR) were taken as differentially expressed between samples. GO enrichment analyses of these differentially expressed unigenes were conducted by agriGO (Du et al., 2010).
In total, 1,314,579 raw reads from seven libraries were generated by 454-pyrosequencing and 86,127,694 raw reads from three libraries were generated by GAIIx (Table 1). All the obtained sequences from 454-pyrosequencing and GAIIx reads are available from the DDBJ Sequence Read Archive (http://trace.ddbj.nig.ac.jp/dra/index.html, September 13, 2013) under the following accession numbers (study: DRP01188, samples: DRS012257-DRS012268, experiments: DRX012514-DRX012518, and runs: DRR013973-DRR013977). After the preprocessing steps, we obtained 1,165,107 reads from 454-pyrosequencing and 76,356,203 reads from GAIIx (Table 1). We ran preliminary assemblies with the Trinity program using three different data sets (seven 454 read sets, three GAIIx read sets, and all 10 read sets) and decided to use the unigenes generated from all 10 read sets for further analyses, because they included more and longer unigenes than those generated from the other two data sets (Table 2). This unigene data set contained 40,061 contigs (referred to as unigenes hereafter) ranging from 201 to 7,477 bases with an average length of 572 bases (Table 3). A BLASTN (1e-100) search against the transcriptome data set of dormant buds in Japanese apricot (Habu et al., 2012) revealed that 14,950 of the 40,061 unigenes were almost identical to sequences in the dormant bud transcriptome.

Summary of the assembly and annotation of unigenes using all ten samples.
When the unigene sequences were queried against the NCBI non-redundant and Arabidopsis (TAIR10) protein databases using BLASTX (1e-6), 29,985 (74.8% of total unigenes) and 27,898 (69.6% of total unigenes) unigene sequences showed at least one hit against the NCBI nr and TAIR10 protein databases, respectively. The best hits from TAIR10 were used to assign gene ontology (GO) annotations. A total of 4,417 GO terms were assigned to 27,399 unique sequences, representing 68.4% of all unigene sequences and 98.2% of the BLAST-annotated sequences. GO Slim assignments for the unigene sequences and the complete Arabidopsis gene set within the three categories are summarized in Figure 1. The GO distribution indicated that genes with a wide range of functions were expressed in unpollinated and/or pollinated styles and the GO distribution trends in the P. mume pistil and pollen transcriptomes appeared to be similar to those of the complete Arabidopsis gene set.

Gene Ontology (GO) assignments for the unigenes. Proportions of the annotated unigenes and the complete Arabidopsis gene set that matched various gene ontology categories.
For digital expression analysis, the three GAIIx read sets were mapped to the unigenes by Bowtie (Table 4). In total, 22,932,076 CP_GAIIx reads, 20,154,057 SP_ GAIIx reads, and 21,536,333 UP_GAIIx reads were mapped to the unigenes, with the average number of mapped reads per unigene being 503–572. For statistical analyses, we used the R package edgeR. Differentially expressed unigenes were selected based on the FDR (cut-off FDR < 0.01). Comparisons of the number of mapped reads from the UP pistil with those of the CP or SP pistils showed that the expressions of 8,907 unigenes were significantly different between UP and CP pistils and those of 10,190 unigenes were significantly different between UP and SP pistils (Fig. 2). Among these, the changes in expression of 4,348 unigenes were significant in both pollination types, while the expressions of 4,559 and 5,842 unigenes were changed specifically in CP and SP, respectively. Among the CP-specific differentially expressed unigenes, the expressions of 2,373 unigenes were increased and those of 2,186 unigenes were decreased by cross-pollination. Among the SP-specific differentially expressed unigenes, the expressions of 2,691 unigenes were increased and those of 3,151 unigenes were decreased by self-pollination. All four subsets of genes showed no enriched GO categories in the biological process ontology. The expressions of 2,227 and 1,942 unigenes were increased and decreased commonly by both pollination types, respectively, while the expressions of 95 unigenes were increased by CP but decreased by SP and vice versa for 84 unigenes (Fig. 2). In the former two subsets that were commonly up- or down-regulated by both pollination types, the GO categories for responses to various stimuli were enriched (Table 5), while no enriched GO category in the biological process ontology was detected in the latter two subsets that were differentially regulated by self- and cross-pollinations.

Results of mapping to the unigenes.

Venn diagram showing the unigenes with significantly different numbers of ESTs among unpollinated and cross/self-pollinated samples. Differentially expressed unigenes were selected based on the FDR (cut-off FDR < 0.01).

The results of GO enrichment analysis of unigenes differentially expressed between both pollination types.
The total mRNAs from pollinated pistils consisting of both pistil and pollen transcripts should contain transcripts from genes involved in the self-(in)compatible reaction such as self-incompatibility specificity determinant genes and modifier genes. As the expression of some of these genes could be changed by pollination, we first focused on the 2,227 unigenes whose expressions were increased by both pollination types. These unigenes were searched against the unpollinated pistil and pollen transcriptome sequences by MEGABLAST (cutoff e-value < 1e-100) to determine from which organs, pistils and/or pollen grains, the transcripts originated. Assemblies of the mixed reads of UP_454 + UP_GAIIx (UP) and the mixed reads of ‘Nanko’ pollen + ‘Kairyo-Uchida-Ume’ pollen (NKP) (DRA Acc. DRP000624) yielded 32,736 and 38,797 contigs (unigenes), respectively (Table 6). Using these transcriptomes, we determined the origin of the unigenes up-regulated by both pollination types. MEGABLAST searching revealed that 1,564 and 1,695 unigenes up-regulated by both pollination types had almost identical sequences in the UP and NKP transcriptomes, respectively, indicating that the former were expressed in pistils and the latter in pollen grains. As shown in Figure 3, the expressions of 1,263 unigenes were predicted to be up-regulated commonly in pollen and pistils or up-regulated at least either in pollen or pistils because they were found in both the UP- and NKP-specific unigenes, while the expressions of 432 and 301 unigenes were up-regulated specifically in pollen and pistils, respectively. The remaining 231 unigenes up-regulated by both pollination types had no homologous sequences in either of the transcriptomes.

Results of assemblies of un-pollinated pistil and pollen data sets.

Venn diagram showing the unigenes with significantly different numbers of ESTs among organs. Differentially expressed unigenes were selected based on the FDR (cut-off FDR < 0.01).
Among the 301 pistil-specific unigenes up-regulated by both pollinations, 184 unigenes (61.1%) were annotated by TAIR10 and 199 (66.1%) were annotated by nr, including three S-RNase unigenes (Table 7). The expressions of the S-RNases were high in unpollinated pistils and were increased by both pollination types. Among 432 pollen-specific unigenes up-regulated by both pollination types, 243 unigenes (56.2%) were annotated by TAIR10 and 263 (60.9%) were annotated by nr. In contrast to the pistil determinant, S-RNase, the pollen determinant, SFB, was not included among these pollen-specific unigenes, although it was included in the combined unpollinated/pollinated pistil transcriptome. This is because the expression of SFB was not detected in all of the pistils (Table 8). Similarly, S locus F-box protein with low allelic sequence polymorphism 2, which is also called SLFL2 (Matsumoto et al., 2008), was not included among the differentially expressed unigenes but was found in the whole unpollinated/pollinated pistil transcriptome because its expression was barely detectable. Conversely, S locus F-box protein with low allelic sequence polymorphism 3 (SLFL3) (Matsumoto et al., 2008), which encodes one of the pollen-expressed F-box proteins located in the S locus but has very low allelic sequence polymorphism, was included among the pollen-specific unigenes up-regulated by both pollination types and its expression was especially increased by cross-pollination (Table 7). In addition, the pollen-specific unigenes up-regulated by both pollination types included cullin 1, a ubiquitin-like protein and ubiquitin carboxyl-terminal hydrolase16-like protein (Table 9). Among the 1,263 unigenes up-regulated in either or both organs by both pollination types, 1,079 unigenes (85.4%) were annotated by TAIR10 and 1,129 (89.4%) were annotated by nr. These unigenes included S locus F-box protein with low allelic sequence polymorphism 1 (SLFL1) (Matsumoto et al., 2008) (Table 7), the subunits of SCF complex, ASK2 (SKP1) and two types of cullins, three types of E2 ubiquitin conjugating enzymes, E3 ubiquitin-protein ligase1 (UPL1), and four types of ubiquitin or ubiquitin-like proteins (Table 9).

Differentially expressed unigenes located at the S locus.

Other F-box unigenes located at the S locus.

Ubiquitination-related unigenes up-regulated by pollination.
Since the pollen-part modifier gene of Prunus (M locus) has been recently mapped to the distal part of Chr. 3 flanked by two SSR markers within an interval of 1.8 cM, corresponding to ∼364 Kb in the peach genome (Zuriaga et al., 2012), we searched for differentially expressed unigenes located at this locus by MEGABLAST searching against the peach genome (The International Peach Genome Initiative, 2013). We found 19 unigenes up-regulated by either or both pollination types between PSG3_71 and PSG3_96, which is defined as the M locus (Table 10). Among these, only comp15915_c0_seq1 was pollen-specific and up-regulated by self-pollination. On the other hand, homologous sequences to the solanaceous pistil modifier genes, such as HT-B and 120K, were not included among either the pistil-specific differentially expressed unigenes or in the whole unpollinated/pollinated pistil transcriptome.

SP- or Cp-specific up-regulated unigenes located around the M locus.
As described above, we found that the ubiquitin carboxyl-terminal hydrolase16-like protein, a member of the deubiquitinating enzymes (DUB), was significantly up-regulated by both pollination types, suggesting that deubiquitination may be related to the SI reaction. Therefore, DUBs were searched against the SP- and CP-specific up-regulated unigenes; we found six and 12 deubiquitinating enzyme-like unigenes, respectively (Table 11). Among them, only UBP11-like unigene expressed specifically in pollen was found among the SP-specific up-regulated DUB-like unigenes.

Deubiquitination enzymes (DUBs) among the SP- or CP-specific up-regulated unigenes.
The application of NGS technologies to transcriptome analysis allows the characterization of nearly all of the transcripts in a particular tissue during a specific biological event. To date, the Roche/454 GS FLX system has been mainly used for transcriptome analyses in non-model plants because it generates longer reads. However, the number of reads generated by the Roche/454 GS FLX is much lower than is generated by short read platforms such as the Illumina Genome Analyzer. In this study, we used both platforms to fully exploit the advantages of the two different systems. We performed preliminary testing of the de novo assemblies generated using either or both platforms (Table 2). Among the three assemblies, the hybrid assembly of the Roche/454 and Illumina reads gave the longest unigene length. This result is consistent with the results of Cahais et al. (2012), which showed that higher quality assemblies were obtained when the Roche/454 and Illumina data were combined, in comparison with the use of either Roche/454 or Illumina data alone. In this study, however, the assembly results of the hybrid data set, i.e. the number and average length of the unigenes, were not much different from those of the Illumina-only reads (Table 2), indicating that the Roche/454 reads contributed much less to the assembly results than the Illumina reads. This is probably because a sufficient number of reliable long unigenes could be reconstructed from the Illumina reads alone. Previously, de novo assembly of Illumina reads without reference sequences was more difficult than for Roche/454 reads, because of the short read length (Pop and Salzbarg, 2008; Strickler et al., 2012). Recently, however, with the improvement of read length and advances in assembly algorithms and assemblers (Grabherr et al., 2011; Martin and Wang, 2011), higher quality assemblies can be obtained from Illumina reads alone compared with Roche/454 reads (Luo et al., 2012; Xiao et al., 2013). Therefore, Illumina platforms have recently been chosen routinely for transcriptome analysis in non-model organisms and model organisms (Fu et al., 2013; Liu et al., 2013; Oono et al., 2013). This study also confirms that Illumina reads alone can be used for transcriptome analysis of non-model organisms.
In this study, we conducted a large-scale transcriptome analysis of pistils from Japanese apricot, a non-model organism, to gain insight into the molecular network of the SI/SC reaction. We obtained 40,061 unigenes from 77,521,310 reads of pollinated and unpollinated pistils and pollen grains. Among these unigenes, 29,985 had homologous genes in GenBank, while the other 10,076 appeared to be novel. We previously obtained 113,629 unigenes, including contigs and singletons, from the transcriptome of Japanese apricot dormant buds (Habu et al., 2012). Among the 40,061 unigenes obtained in this study, only 14,591 unigenes (37.2%) had at least one closest hit against the unigenes from dormant buds. The low percentage of common unigenes generated could be attributed to differences in the organs used and differences in the developmental stages of the organs. In GO analyses, none of the characteristic GO categories were found in the overall GO distribution but several GO terms involved in responses to stimuli were over-represented in the GO enrichment analysis of unigenes up-regulated by both pollination types, indicating that the stimulus of pollination triggers the up-regulation of many genes.
Pistil modifiersAs the pistil modifiers for S haplotype-specific pollen rejection in the Solanaceae, the expression of HT-B and 120K in the pistil is indispensable (McClure et al., 1999; Hancock et al., 2005). In the compartmentalization model developed based on microscopic observations of pollen tube growth in Nicotiana, the pistil modifiers HT-B and 120K are thought to play a key role in vacuole breakdown in the pollen tube after SI pollination, which is supposed to release non-self S-RNase cytotoxicity (Goldraij et al., 2006; McClure et al., 2011). If the Solanaceae and Prunus share the same molecules in SI/SC recognition mechanisms, homologous sequences to HT-B and 120K should be present in the pistil transcriptome but no such sequences were found in the combined unpollinated/pollinated pistil transcriptome of Japanese apricot. It is possible that other unidentified distinct molecules are involved in the compartmentalization of S-RNases in Prunus or alternatively, the current compartmentalization model developed based on microscopic observations in the Solanaceae is not readily applicable to the Prunus GSI.
Pollen modifiersMutation in the pollen-part non-S factor, called the pollen modifier, appeared to confer SC in sweet cherry (Prunus avium) (Wünsch and Hormaza, 2004) and apricot (Prunus armeniaca) (Vilanova et al., 2006). As mutation of this pollen-part modifier gene results in self-compatibility, this modifier gene is considered to play a key role in the SI reaction and could be up- regulated upon SI pollination. Among the 19 unigenes upregulated by either or both pollination types at the M locus, only comp15915_c0_seq1 was pollen-specific and up-regulated by self-pollination (Table 10). Although this unigene was annotated as the auxin transporter BIG by TAIR10, it is thought that BIG could function as an UBR4, one of the ubiquitin-protein E3 ligase component N-recognins (Graciet and Wellmer, 2010). In fact, the putative amino acid sequence of the homologue of comp15915_c0_seq1 in peach, ppm000002m, contains the UBR4 E3 ubiquitin-protein ligase domain. Therefore, the gene from which comp15915_c0_seq1 is derived probably functions as a UBR4. Although the pollen S determinant is an F-box protein in the S-RNase-based GSI system and is thought to be involved in ubiquitination, comp15915_c0_seq1 is unlikely to interact with the pollen S F-box proteins, because the ubiquitin-protein E3 ligase component N-recognins do not interact with F-box proteins and recognize the N-terminus of substrates directly. It will be interesting to test whether comp15915_c0_seq1 is involved in the GSI reaction in Prunus.
General inhibitor in Prunus SIA series of molecular and genetic analyses of the S-RNase-based SI mechanism in Prunus indicated the presence of a distinct recognition mechanism (Tao and Iezzoni, 2010). The main difference between the GSI mechanism in Prunus and those in Solanaceae and Plantaginaceae is considered to be the role of the pollen S determinant. It has been proposed that multiple pollen-specific F-box genes at the S locus collaboratively function to detoxify all but self S-RNase in the Solanaceae (Kubo et al., 2010). Mutations that disrupt pollen S (SLF) function in the Solanaceae and Plantaginaceae have yet to be found and therefore are thought to confer either SI or lethality. However, mutations that disrupt pollen S (SFB) function found in Prunus result in SC (Tao and Iezzoni, 2010; Ushijima et al., 2004; Yamane and Tao, 2009), indicating that the Prunus pollen S determinant may act to prohibit unknown mechanisms that inactivate the cytotoxic effects of the S-RNase. In other words, it is thought that multiple pollen S determinants (SLFs) would collaboratively detoxify the non-self S-RNases in the Solanaceae, while a single pollen S determinant (SFB) would release the cytotoxicity of the self S-RNase in Prunus. The role of the pollen S determinant of Pyrinae (SFBBs) is still unclear but it is suggested that the GSI mechanism of Pyrinae would be similar to that of Solanaceae (De Franceschi et al., 2012). On the other hand, three other types of pollen-expressed F-box genes, SLFL1, SLFL2, and SLFL3 (synonymous with the S locus F-box protein with low allelic sequence polymorphism 1, 2, and 3, respectively), were also found on the S locus of Prunus (Entani et al., 2003; Matsumoto et al., 2008; Ushijima et al., 2003, 2004). These Prunus SLFLs are thought to be putative orthologs of the Pyrinae SFBBs based on phylogenetic analysis (Matsumoto et al., 2008). Although the distinct S-RNase detoxification mechanism in Prunus is still unknown, one possible explanation could be that Prunus SLFLs have lost their self-recognition function and have begun to function to detoxify all S-RNases including self S-RNase as the general inhibitor (GI) of the S-RNase. In this study, we found SLFL1, SLFL2, and SLFL3 in the pistil transcriptome. Among these, SLFL3 showed transcriptional pattern changes appropriate for GI. Namely, the expression of SLFL3 was pollen-specific and highly up-regulated by pollination. It is interesting that the comp2390_c0_seq1 annotated as ASK2 by TAIR10 is almost identical to PavSSK1, which is a gene for the SKP1-like1 protein interacting with SLFLs and SFB to form the SCF complex with Cullin1-likes (CUL1s) (Matsumoto et al., 2012), and was also up-regulated upon pollination, especially after compatible pollination (Table 9). We also found ubiquitin carboxyl-terminal hydrolase16-like protein, which is a member of the deubiquitinating enzymes (DUBs), among the pollen-specific unigenes up-regulated by pollination. Furthermore, an ubiquitin-specific protease, UBP11, was also found as an SP-specific up-regulated unigene expressed in pollen (Table 11). Although we have yet to clarify the SI/SC recognition mechanism of the S-RNase-based GSI in Prunus, it is likely that the ubiquitin/proteasome proteolytic pathway is involved in the system. Deubiquitination processes may also be involved in the S haplotype-dependent protection and degradation of S-RNase.
In conclusion, our large scale transcriptome analyses support the presence of a distinct molecular mechanism for S-RNase-based GSI in Prunus. Further detailed studies of the genes identified in this study should lead to elucidation of the Prunus GSI mechanism.
Important notes: During the preparation of this paper, an article describing the whole genome sequencing of P. mume was published (Zhang et al., 2012). In the future, whole genome sequence data incorporated into developing bioinformatics programs will strongly promote omics studies in Japanese apricot and provide us with new insights for understanding agronomically important traits of this species.