Pigmentation of soybean seed coats via a mutation that abolishes production of multiple-phased siRNAs of chalcone synthase genes

Mashiro Yuhazu; Shun Mikuriya; Ayumi Mori; Maria Stefanie Dwiyanti; Mineo Senda; Akira Kanazawa

doi:10.1266/ggs.23-00260

ABSTRACT

Lack of pigmentation in seed coats of soybean is caused by natural RNA silencing of chalcone synthase (CHS) genes. This phenomenon is an evolutionary consequence of structural changes in DNA that resulted in the production of double-stranded RNAs (dsRNAs) that trigger RNA degradation. Here we determined that a mutant with pigmented seed coats derived from a cultivar that lacked the pigmentation had a deletion between DNA regions ICHS1 and a cytochrome P450 gene; the deletion included GmIRCHS, a candidate gene that triggers CHS RNA silencing via production of CHS dsRNAs. We also characterized CHS short interfering RNAs (siRNAs) produced in the wild-type seed coats that had CHS RNA silencing. Phased 21-nt CHS siRNAs were detected in all 21 phases and were widely distributed in exon 2 of CHS7, which indicates commonality in the pattern of RNA degradation in natural CHS RNA silencing between distantly related species. These results with the similarities in the rearrangements found in spontaneous mutants suggest that the structural organization that generates dsRNAs that trigger phased siRNA production is vulnerable to further structural changes, which eventually abolish the induction of RNA silencing.

INTRODUCTION

RNA silencing refers comprehensively to gene silencing phenomena that are induced by nucleotide sequence-specific interactions involving RNA (Voinnet, 2002; Matzke et al., 2004). RNA silencing was discovered first in transgenic petunia plants, in which both a transgene and its homologous endogenous gene are downregulated (Napoli et al., 1990; van der Krol et al., 1990). Later studies, including the discovery of RNA interference (Fire et al., 1998), demonstrated that double-stranded RNA (dsRNA) is a trigger for reactions responsible for RNA silencing. The reactions involve processing of dsRNA into short interfering RNAs (siRNAs) that are 21–24 nt long by RNaseIII-type dsRNA endonuclease, called Dicer or Dicer-like, and cleavage of target RNA by the RNA-induced silencing complex that contains a member of the Argonaute proteins. In addition, RNA-dependent RNA polymerase forms dsRNA using single-stranded RNA to trigger or amplify the reactions (Baulcombe, 2004). Pathways of RNA silencing also include induction of epigenetic changes in nuclei via cytosine methylation and histone modification, and downregulation of gene expression mediated by microRNAs (Bologna and Voinnet, 2014; Matzke et al., 2015). Although first discovered in a transgenic plant, RNA silencing has been detected in endogenous genes in non-transgenic plants. Some altered phenotypes of plants, including those manifested as visibly altered phenotypes, are ascribed to such natural RNA silencing (Kanazawa, 2008). The earliest known phenomena of natural RNA silencing were those manifested in the presence or absence of pigmentation in soybean seed coats (Senda et al., 2004; Tuteja et al., 2004), in various parts of maize plants (Della Vedova et al., 2005) and in specific portions of petunia petals (Koseki et al., 2005).

Seed coat color of soybean is controlled by three loci, I, R and T. While the R and T loci determine the types of anthocyanin pigments and proanthocyanidins in seed coats, the I locus determines the spatial distribution and presence or absence of the pigments (Senda et al., 2012). Four alleles have been found for the I locus, namely I, iⁱ, i^k and i. The presence and absence of pigments in the entire seed coat are conferred by the i and I alleles, respectively. The iⁱ and i^k alleles are responsible for lack of pigmentation in specific portions of seed coats, namely portions other than the hilum and a saddle-shaped region, respectively. The inhibition of pigmentation by the I and iⁱ alleles is caused by RNA silencing of chalcone synthase (CHS) genes (Senda et al., 2004; Tuteja et al., 2004), which encode a key enzyme in the biosynthesis of anthocyanins and proanthocyanidins. A wild ancestor of cultivated soybean produces brown or black seed coats, and the phenotypes caused by natural CHS RNA silencing were generated during or after domestication of soybean. The CHS genes in soybean constitute a multigene family. Early studies identified nine family members (CHS1–CHS9) (Akada and Dube, 1995; Tuteja and Vodkin, 2008). A phylogenetic analysis of the nucleotide sequences of these genes indicated that they were classified into two subfamilies, one comprising CHS7 and CHS8 and the other CHS1–CHS6 and CHS9 (Kurauchi et al., 2009). A recent in silico analysis detected five additional genes, CHS10–CHS14. A phylogenetic analysis indicated that CHS10–CHS12 were grouped with CHS1–CHS6 and CHS9, while CHS13 and CHS14 were grouped into distinct clades (Anguraj Vadivel et al., 2018). Among the multiple CHS genes, CHS7/CHS8 transcripts constitute most CHS transcripts in pigmented seed coats (Kasai et al., 2004; Tuteja et al., 2004).

On the basis of structural analysis of the I locus, models that explain the induction of CHS RNA silencing in seed coats of soybean, and in particular the production of CHS dsRNAs, have been proposed. The iⁱ allele of the I locus contains a 10.4-kb inverted repeat (IR) comprising a CHS4-CHS3-CHS1 gene cluster (Clough et al., 2004). In a proposed model, the promoter of a subtilisin gene adjacent to the CHS gene cluster produces antisense transcripts of CHS1, which then form dsRNA with CHS1 sense transcripts in the iⁱ allele (Xie et al., 2019). Similarly, in another proposed model, chimeric transcripts comprising the subtilisin gene, antisense-CHS1 and sense-CHS3 fragments form dsRNA through base pairing between the CHS1 and CHS3 regions of the transcripts in the iⁱ allele (Jia et al., 2020).

On the other hand, the I allele of the I locus contains a gene called GmIRCHS comprising a 1,087-bp IR of a pseudoCHS gene sequence, ΔCHS3, and the 5′ portion of GmJ1 encoding a type III DnaJ-like protein (Kasai et al., 2007). An RNase protection assay demonstrated that the IR of ΔCHS3 is transcribed and intramolecular dsRNA is formed (Kurauchi et al., 2011). In a model for the induction of RNA silencing by the I allele, dsRNA that is formed by GmIRCHS transcripts provides primary siRNAs via processing with a Dicer-like protein(s), which subsequently induce secondary siRNA production from multiple CHS transcripts, whereby CHS mRNAs are extensively degraded (Senda et al., 2012). Subsequent analysis of spontaneous mutants that had pigmented seed coats derived from cultivars that produce nonpigmented seed coats revealed that these plants underwent structural changes that involve complete or partial loss of GmIRCHS, which indicates the plausibility of the model (Senda et al., 2013).

In the present study, we newly analyzed a mutant that has pigmented seed coats and was derived from a population of mutagenized cv. Suzuyutaka, a cultivar that has nonpigmented seed coats characteristic of the I allele. We characterized the reverted, pigmented phenotype and found that CHS mRNA is degraded in the original cultivar but not in the mutant. Our data indicate that a structural change at the I locus eliminates IR structure and causes the reversion to pigmented seed coats, and that the change occurs at a position close to, but different from, those previously identified in the spontaneous mutants discussed above. We also profiled small RNAs, revealing a commonality in RNA degradation of natural CHS RNA silencing between different plant species.

RESULTS

Changes in the expression of CHS genes in a mutant with pigmented seed coats

The mRNA levels of CHS genes were analyzed in the seed coat tissues of the wild type and the seed coat-pigmented mutant (Fig. 1A). Among the CHS family members, CHS7 and CHS8 are known to be highly expressed in pigmented seed coats of soybean and are silenced in nonpigmented seed coats (Kasai et al., 2004; Tuteja et al., 2004); we thus focused our analysis on these genes. Quantitative reverse transcription-PCR (qRT-PCR) analyses using primers that specifically amplify the CHS7 gene and those that amplify both the CHS7 and CHS8 genes were done (Fig. 1B, 1C). In both experiments, the mRNA level was significantly higher in the mutant than in the wild type, which indicates that restoration of pigmentation in seed coats is associated with restoration of the CHS mRNA level from the state caused by RNA silencing. Next, we analyzed structural changes involving the CHS genes that prevented induction of RNA silencing of the CHS genes in seed coats.

Fig. 1. Seed coat phenotypes and CHS expression in wild-type and mutant cv. Suzuyutaka. (A) Seed coats. (B, C) mRNA levels in seed coats for CHS7 (B) and CHS7 and CHS8 (C). mRNA levels of the CHS genes are relative to that of the F-box gene in the wild type, which was set as 1. Data are means and standard errors obtained from five biological replicates. * P < 0.05, ** P < 0.01, Mann–Whitney U test. W, wild type; M, mutant.

Structural changes in the genomic DNA regions that contain the CHS genes in the mutant

Structural changes in DNA regions that contain CHS genes were examined by DNA gel blot analysis (Fig. 2). Using probes that can detect CHS1–CHS9 of the CHS gene family (Senda et al., 2002, 2013), we found a 1.2-kb BclI-digested fragment in the mutant instead of the 6.1-kb fragment that characterizes the wild type, suggesting structural changes around a CHS gene located in this fragment.

Fig. 2. Structural changes involving CHS genes assessed by DNA gel blot analysis. Total DNA from the wild type and mutant was digested with BclI, and after electrophoresis the gel blot was probed with labeled DNA fragments that hybridize with CHS1–CHS9. Estimated sizes of hybridized fragments are indicated. Note that the 1.2-kb fragment was specific to the mutant DNA. W, wild type; M, mutant.

Because of the obvious phenotypic change, we assumed that the structural changes detected by the CHS probe involve the GmIRCHS-ICHS1 cluster, which includes a candidate causal gene of the CHS RNA silencing by allele I. We focused on specific structural changes that were previously identified during the generation of spontaneous mutant plants that produce pigmented seed coats (Senda et al., 2013) and tested whether the mutant here had similar changes. A structural change previously identified was a deletion of the DNA region that contains the GmIRCHS–ICHS1 cluster. One end of the deleted region was within the GmIRCHS–ICHS1 cluster and the other was either within or flanking a cytochrome P450 gene (Senda et al., 2013). Although the boundaries of the deletion vary, the deletion always results in the loss of the IR in GmIRCHS.

We tested for the deletion in this region using a primer set that was designed to anneal to ICHS1 and the flanking region of the cytochrome P450 gene (Fig. 3A). We obtained a PCR product from the mutant but not from the wild type, which suggests that the mutant did indeed have the deletion in this region (Fig. 3B). Nucleotide sequence analysis of the PCR-amplified product showed that the DNA sequence comprises the 5′ portion of ICHS1 and the flanking sequence of the cytochrome P450 gene (Fig. 3D). The positions in ICHS1 and in the flanking region of cytochrome P450 gene differed from those previously identified, but were near them (Supplementary Fig. S1).

Fig. 3. Structural changes involving the GmIRCHS-ICHS1 cluster in the mutant. (A) Origin of the DNA fragment specifically present in the mutant. Regions derived from the GmIRCHS-ICHS1 cluster and the cytochrome P450 gene are indicated by black and gray lines, respectively. The dotted gray line indicates an unknown DNA region between the cytochrome P450 gene and GmIRCHS-ICHS1 cluster. The positions of rearrangement are indicated by red arrowheads. J, the 5′ portion of GmJ1 located adjacent to ΔCHS3 (Kasai et al., 2007). DNA regions amplified by PCR and corresponding products are indicated by horizontal blue lines with an arrowhead at each end. These PCR products correspond to those shown in B and C. For the 2.1-kb PCR product, DNA fragments generated by BclI digestion are shown. (B) Gel electrophoresis of the product of PCR that amplified the rearranged DNA from the mutant. The size of the PCR product analyzed by DNA sequencing is shown to the right. W, wild type; M, mutant. (C) Gel electrophoresis of the BclI-digested 2.1-kb PCR product amplified from the rearranged DNA of the mutant. DNA fragment sizes estimated from DNA size markers are shown to the right; + and – indicate that the DNA was treated with BclI and untreated, respectively. (D) Comparison of the nucleotide sequences of the DNA fragments involved in the rearrangement. Identical bases are indicated by asterisks. Nucleotide sequences of the regions derived from the ICHS1 and cytochrome P450 genes are indicated by black and gray backgrounds, respectively. The position of rearrangement is indicated by a red arrowhead, which corresponds to the position indicated by the red arrowhead in the ICHS1 gene in A.

Amplification of a DNA region encompassing ICHS1 and its surrounding regions from the mutant and subsequent digestion by BclI generated DNA fragments including a 1.2-kb fragment (Fig. 3C). This result explained the generation of the hybridization signal of the same size in the DNA gel blot analysis of the mutant DNA (Fig. 2).

Using PCR with primers designed to anneal to the interior or regions adjacent to ΔCHS3 of GmIRCHS, we also confirmed that this structural change accompanied the lack of GmIRCHS in the mutant (Fig. 4). Amplification using all primer sets, except for the combination of primers B and 6 used for a negative control, generated a product from the wild type but not from the mutant, which indicated that GmIRCHS was missing in the mutant.

Fig. 4. PCR amplification of the GmIRCHS region. Image of PCR products in gel after electrophoresis. “A–D”, “3” and “6” are primers for PCR and are identical to those reported by Kasai et al. (2007). Positions of these primers in and around the ΔCHS3 of GmIRCHS are schematically indicated above the gel image. Combinations of primers for PCR are indicated above lanes. Note that amplification using primers B and 6 resulted in no amplification from either wild type or mutant, which provides a negative control. W, wild type; M, mutant.

Characterization of CHS RNA degradation in seed coat tissues in terms of siRNA production

Because production of siRNA is a hallmark of RNA silencing, we analyzed siRNAs in seed coats of the wild type and mutant by deep sequencing. Of 41,297,244 reads, 63,427 reads matched the CHS7 gene region in the wild type (Supplementary Table S1). The size distribution of siRNAs mapped to the CHS7 gene revealed the predominance of siRNA of 21 nt for both sense and antisense strands among 21–24-nt size classes of siRNAs (Fig. 5). siRNAs were mostly mapped to exon 2 (Fig. 6). The level of siRNAs corresponding to CHS7 was very low in the mutant, consistent with the restoration of pigmentation from the silenced phenotype (Figs. 5, 6). We also mapped siRNAs on ΔCHS3 of GmIRCHS. While abundant 21-nt siRNAs were mapped on the ΔCHS3 sequence, the level of 22-nt siRNA mapped on the ΔCHS3 sequence was also high in the wild type (Fig. 7; discussed later in detail). As expected from the IR structure, siRNAs were mapped widely on the ΔCHS3 sequence in the wild type. In parallel with the lack of GmIRCHS, few siRNAs were mapped on the ΔCHS3 sequence in the mutant (Figs. 7, 8); these are likely derived from CHS3 and/or other CHS genes, given the presence of nucleotide sequence similarity between CHS3 and these genes (Kurauchi et al., 2009).

Fig. 5. Frequency of siRNAs between 21 and 24 nt mapped on the CHS7 gene. Numbers of siRNAs mapped on the sense (s) and antisense (as) strands are indicated. An enlargement of the region from 0–1,000 reads is shown below. W, wild type; M, mutant.

Fig. 6. Position and abundance of siRNAs mapped on the CHS7 gene region. Data for 21-nt, 22-nt, 23-nt and 24-nt siRNAs in the wild type and mutant are shown. The x-axis indicates positions in the CHS7 gene. Bars above (with plus value) and below (with minus value) the x-axis indicate siRNAs mapped on the sense and antisense strands, respectively. A detail of the siRNA data is inserted when the level was very low. Nucleotide positions are numbered relative to the first nucleotide of the ATG codon. e1, exon 1; e2, exon 2; int, intron.

Fig. 7. Frequency of siRNAs between 21 and 24 nt mapped on GmIRCHS. Numbers of siRNAs mapped are indicated. An enlargement of the region from 0–1,000 reads is shown below. Only sense strand data are shown because of the inverted repeat organization of ΔCHS3. W, wild type; M, mutant.

Fig. 8. Position and abundance of siRNAs mapped on GmIRCHS. Data for 21-nt, 22-nt, 23-nt and 24-nt siRNAs in the wild type and mutant are shown. The x-axis indicates positions in the ΔCHS3 inverted repeat. A detail of the siRNA data is inserted when the level was very low. Only sense strand data are shown because of the inverted repeat organization of ΔCHS3. Positions of 22-nt siRNAs mapped to the terminal positions of the regions producing phased siRNAs of the CHS7 gene are indicated by red arrowheads. Nucleotide positions are numbered relative to the first nucleotide of the ΔCHS3 inverted repeat.

In Arabidopsis, cleavage of transcripts by a small RNA can result in in-phase generation of 21-nt secondary siRNAs by DICER-LIKE (DCL) 4 after production of dsRNA by RNA-DEPENDENT RNA POLYMERASE (RDR) 6 (Vazquez et al., 2004; Allen et al., 2005). Such in-phase generation of 21-nt siRNAs was also detected in natural RNA silencing and co-suppression of the CHS-A genes in petunia (Kasai et al., 2013). To detect phased siRNAs, we mapped siRNAs of the CHS7 gene in 21 different phases. Figure 9 shows the distribution of 21-nt phased siRNAs that are contiguous for three or more units in each phase in the CHS7 gene. These phased siRNAs were detected in all 21 phases in wild-type plants for both sense and antisense strands (Fig. 9).

Fig. 9. Phased siRNAs from the CHS7 gene. Phased siRNAs of the sense strand (A) and antisense strand (B) are shown. Presence/absence of 21-nt siRNAs was analyzed in 21 phases independently. The results of each phase are marked from 1 to 21: the first nucleotide of phase 1 corresponds to the first nucleotide of the ATG codon of the CHS7 gene. Lines indicate regions producing phased siRNAs in three or more contiguous units. Triangles indicate the 5′-terminal positions of the regions producing five or more contiguous phased siRNAs where 22-nt siRNAs of the opposite strand are mapped with their central positions corresponding to the terminal positions of the regions (for details, see Fig. 11). Black and red triangles indicate siRNAs mapped on CHS7 and both CHS7 and ΔCHS3, respectively.

The phased siRNA-producing region encompassed a large portion of exon 2 of the CHS7 coding region. The maximum number of contiguous units was 20, covering a 420-nt region, for the sense strand (phase No. 18) and 12, covering a 252-nt region, for the antisense strand (phase No. 12) (Fig. 9). Overall, these data indicate that phased siRNAs were produced in multiple phases at multiple sites over exon 2 in the wild-type plants. We calculated the phasing score of Howell et al. (2007), which reflects both siRNA abundance and number of positions occupied by siRNA reads in a given phase. The values were as high as 40 (Fig. 10A–10D), comparable to the scores found in natural RNA silencing and co-suppression in petunia (30–40; Kasai et al., 2013).

Fig. 10. Characterization of siRNA phasing. (A–D) Phasing scores of selected phases. Phasing scores of two phases of the sense strand, phase 13 (A) and phase 19 (B), and of the antisense strand, phase 11 (C) and phase 12 (D), are shown. These phases had the highest and second highest phasing scores in each strand. Cycle 1 corresponds to the phased siRNA mapped at phase 1. Phasing scores are calculated according to Howell et al. (2007). (E, F) Phasing registers. (E) Sense strand; (F) antisense strand. The percentage of 21-nt siRNAs mapped to each of 21 registers is radially plotted as distance from the center. The percentages of siRNAs from each register are shown.

A decrease in the levels of 22-nt siRNAs after knockout of DCL2 orthologs caused the seed coats to change from yellow to brown in a soybean line carrying the iⁱ allele (Jia et al., 2020). Considering that 22-nt microRNAs trigger the production of phased, secondary siRNAs (Chen et al., 2010), Jia et al. (2020) suggested that the 22-nt CHS siRNAs induce the production of 21-nt CHS siRNAs. Prompted by this finding, we investigated whether 22-nt siRNAs were mapped to the terminal positions of the regions producing phased siRNAs.

When phased siRNA production is induced via RNA cleavage by 22-nt small RNAs (microRNAs or trans-acting siRNAs), the RNA cleavage normally occurs at the opposite position between the 10th and 11th nucleotides from the 5′ end of the small RNAs (Chen et al., 2010). Meanwhile, RNA cleavage by siRNAs often appeared to occur at multiple nucleotide positions (e.g., Yoshikawa et al., 2005). Taking into account these possibilities, we examined the presence or absence of 22-nt siRNAs whose central or adjacent positions correspond to potential RNA cleavage sites that were located at the terminal positions of phased siRNA arrays. Such 22-nt siRNAs were indeed detected (shown by triangles in Fig. 9) and, moreover, among the siRNAs that were mapped at the ΔCHS3 sequence of GmIRCHS (shown by red triangles in Fig. 9), consistent with the notion that ΔCHS3 RNA provides siRNAs to trigger massive production of 21-nt secondary siRNAs of the CHS7 gene. These siRNAs, together with their potential target sequences, are shown in Figure 11; an example of a phased siRNA array is shown in Supplementary Figure S2. The data shown in the 1st, 2nd and 4th examples in Figure 11 suggest that phased siRNAs were produced from neighboring phases, the 5′ end of which was mapped at positions that differed by one nucleotide, while the same 22-nt siRNA was mapped at the terminal positions of phased siRNA arrays. These observations may indicate that phase setting via RNA cleavage occurs at multiple nucleotide positions by the same siRNA. Similar production of phased siRNAs from neighboring phases was suggested in natural RNA silencing and co-suppression in petunia (Kasai et al., 2013). These observations may also be relevant to a 1-nt shift of siRNA production relative to the phase set by a trans-acting siRNA detected in Arabidopsis (Chen et al., 2007). Taken together, these data suggest commonality in the pattern of siRNA production in natural CHS RNA silencing between distantly related species and that the altered DNA structure abolished phased siRNA production.

Fig. 11. Examples of the 5′-termini of phased siRNA arrays. Twenty-two-nucleotide siRNAs (shown in orange letters) are mapped in the opposite strand of the 5′-termini of the phased CHS7 siRNA arrays. These 22-nt siRNAs are also mapped on ΔCHS3, which implies that they trigger the production of phased siRNAs in trans. Phase numbers correspond to those shown in Figure 9. CHS7 RNA sequences are shown in black letters, which correspond to the reference genome sequence. Observed numbers of contiguous siRNA units are indicated above the RNA sequences. Read numbers of the siRNAs mapped on ΔCHS3 are shown in parentheses. Nucleotide positions of CHS7 and those of the 22-nt siRNAs are numbered from the first nucleotide of the ATG codon and that of the ΔCHS3 inverted repeat, respectively.

DISCUSSION

Through a comparative analysis of genomic DNA regions that contain the CHS genes between the seed coat-pigmented mutant and wild type, we found a deletion in the mutant and that one end of the deletion is located in the ICHS1 gene and the other end in an upstream portion of the cytochrome P450 gene. Although the mutant was found in a plant population obtained by mutagenesis, whether the mutation is a direct consequence of the mutagenesis is not known. Considering that the sites of structural change were close to those found in spontaneous mutants, the structural change is likely related to an intrinsic feature associated with the chromosomal regions. We found that these regions have two notable properties. First, most of the end positions of the deletions were present in regions that can form secondary structures (Supplementary Figs. S3, S4). Second, these DNA regions are AT-rich: the 300-bp DNA regions shown in Supplementary Figures S3 and S4 have an AT content of 76% in the upstream portion of the cytochrome P450 gene and 66% in the ICHS1 gene. Although these features may be neither necessary nor sufficient for inducing the structural changes, it is likely that these regions are less stable and may be vulnerable to rearrangements.

We have not been able to determine the internal region of the deletion using a PCR-based approach, being hindered by the presence of duplicated segments in the genome. A recent pan-genome analysis of soybean indeed indicated the presence of repeated DNA segments as a consequence of multiple structural rearrangements at the I locus that could have occurred during evolution (Liu et al., 2020). Our ongoing analysis of advanced resequencing data of a cultivar that confers the I allele will reveal the entire picture of the structural changes that occurred in the process of reversion to the pigmented seed coat phenotype.

We found that siRNAs were mostly produced from exon 2 of the CHS7 gene. This feature was also detected previously in CHS RNA silencing in soybean (Kurauchi et al., 2009; Tuteja et al., 2009). We also found that 21-nt siRNAs were predominant and produced in all 21 phases. Both these features are essentially common to those detected in natural RNA silencing and co-suppression in petunia (Kasai et al., 2013) and differ from those detected in trans-acting siRNA production triggered by microRNAs, in which production of phased siRNAs is confined to one or a small number of phases (Axtell et al., 2006). The predominance of 21-nt siRNA in these RNA silencing systems also differs from the feature of siRNAs produced from transposable elements, in which 24-nt siRNA is predominant (Kasschau et al., 2007). A unique observation in the case of the CHS RNA silencing in soybean is that more siRNAs were produced in a particular phase (phase 11) for antisense RNA, which was evident in the phasing registers (Fig. 10E, 10F). We found that the three most abundant siRNAs of the antisense strand were produced in this phase. These data indicate that phased siRNAs of the antisense strand were somehow produced unequally between different phases.

We also found that 22-nt siRNAs were mapped to the terminus of the region that produced phased siRNAs. Moreover, those 22-nt siRNAs contained siRNAs that were also mapped to ΔCHS3 (Figs. 9, 11). Jia et al. (2020) suggested that the 22-nt primary siRNAs that trigger massive production of phased 21-nt siRNAs are produced from a long IR that contains the CHS1 and CHS3 genes in the iⁱ allele. Similarly, the locations of siRNAs in our data are consistent with the notion that 22-nt siRNAs produced from ΔCHS3 can trigger phased 21-nt siRNA production in trans in the I allele. The nucleotide sequence of ΔCHS3 contains a large portion of exon 2 and the 3′ untranslated region of the CHS3 gene (Kasai et al., 2007; Fig. 12A). Transcripts of GmIRCHS can thus form dsRNAs that serve as a substrate of DCL protein(s) to produce siRNAs (Fig. 12B). The exon 2 sequence of ΔCHS3 and the corresponding region of CHS7 share extensive sequence identity (82%) (Kurauchi et al., 2009; Fig. 12A), so that the exon 2 region of CHS7 transcripts could be a target of ΔCHS3-derived siRNAs. Such structural correspondence between ΔCHS3 and CHS7 may explain the preferential production of 21-nt siRNAs from exon 2 of CHS7 as suggested previously (Senda et al., 2012; Fig. 12B). Furthermore, the ratio of 22-nt siRNAs to 21-nt siRNAs was much higher in ΔCHS3 (Fig. 7) than in CHS7 (Fig. 5). These data are reminiscent of the substrate preference for DCL proteins associated with IR size, which was recently suggested in soybean (Jia et al., 2020). The sizes of siRNAs generated from long IRs (e.g., 1.3 kb) and short IRs (e.g., 0.3 kb) tend to be 22 nt and 21 nt, respectively, which suggests that long and short IR transcripts are favored substrates of DCL2 and DCL4, respectively (Jia et al., 2020). The size of the identical sequence of ΔCHS3 IR is 1,087 bp (Kasai et al., 2007), which is consistent with the production of 22-nt siRNAs at a high level via DCL2-mediated cleavage. Overall, GmIRCHS thus has features to trigger phased siRNA production from exon 2 of CHS7 and consequent degradation of CHS7 transcripts in trans.

Fig. 12. Mechanisms of phased siRNA production from CHS7 transcripts by ΔCHS3-derived siRNAs. (A) Structural correspondence between ΔCHS3 and CHS7. The IR of ΔCHS3 is 1,087 bp long and comprises the 955-bp coding sequence of exon 2 and the 3′ untranslated region of CHS3 (Kasai et al., 2007). The 955-bp sequence shares 82% nucleotide identity with the corresponding region of CHS7 exon 2 (Kurauchi et al., 2009). (B) Schematic diagram of the production of 21-nt phased siRNAs from CHS7 exon 2. The IR structure of ΔCHS3 allows its transcripts to form a dsRNA structure, from which siRNAs are produced via DCL-mediated cleavage. CHS7 exon 2 is targeted by 22-nt siRNAs derived from ΔCHS3, where Argonaute (AGO)-mediated cleavage occurs. The RNA fragments serve as RDR substrates and the resulting dsRNAs are processed via DCL-mediated cleavage to produce 21-nt phased siRNAs. Although not shown explicitly, phased siRNA production triggered by multiple siRNAs and/or by siRNA(s) without cleavage (de Felippes et al., 2017) can also be postulated.

One triggering mechanism of natural RNA silencing involves the production of dsRNA either by read-through transcription of duplicated and rearranged genes (Melquist and Bender, 2003; Kasai et al., 2007) or by convergent transcription of an overlapping gene pair (Borsani et al., 2005). RNA silencing mediated by transcription of rearranged genes was also detected in a mutant generated by mutagenesis (Kusaba et al., 2003). Soybean plants that had structural rearrangements that allowed the production of CHS dsRNA and its concomitant RNA silencing in seed coats were generated during or after domestication, and these plants have been maintained by humans. Independent and repeated generation of revertant mutants from cultivated soybeans that had CHS RNA silencing indicates that the altered DNA structures that induce CHS RNA silencing could be prone to further structural changes, leading to the absence of the IR structure that produces dsRNA upon transcription. This vulnerability of CHS RNA silencing is consistent with the notion that there is no obvious advantage in seed coats that lack flavonoids produced downstream of the process catalyzed by CHS in their biosynthetic pathway; rather, this trait may be neutral or confer lower fitness to the plant.

MATERIALS AND METHODS

Plant materials

The M1-2-3 mutant line, derived from soybean cv. Suzuyutaka, and wild-type plants were used. This mutant line was originally produced in Shonai Regional Center for Biotechnology, Yamagata, Japan, and was obtained from Professor Hikoyuki Yamaguchi. Plants were grown in the experimental field of Hokkaido University, Sapporo, Japan. Seeds were sown in paper pots, and after seedlings had grown for one week in a greenhouse they were transplanted to a field and grown as described previously (Mikuriya et al., 2017).

DNA gel blot analysis

DNA was isolated from leaf tissue essentially as described by Yamada et al. (2002), digested with BclI, and then fractionated by agarose gel electrophoresis. The DNA was transferred to nylon membranes and allowed to hybridize with labeled probes. Labeling of probes and hybridization were done using AlkPhos Direct (GE Healthcare). A 530-bp DNA fragment was amplified by PCR as described previously (Senda et al., 2002, 2013) and used as a probe to detect CHS1–CHS9. Hybridization signals were detected by chemiluminescence on X-ray film.

Analysis of gene expression by qRT-PCR

RNA was isolated from seed coat tissues of developing seeds by the method of Nakashima et al. (2018) but without repeating the DNaseI treatment of the nucleic acids. cDNA synthesis and subsequent qRT-PCR were done as described previously (Kasai et al., 2012). The F-box gene was used as an internal control as described previously (Shiroshita et al., 2021). Primers for qRT-PCR are listed in Supplementary Table S2.

Small RNA deep sequencing and mapping

Low-molecular-weight RNA was isolated from seed coat tissues for deep sequencing of siRNAs essentially as described previously (Kasai et al., 2012, 2013). Ligation of adapters to the RNA, reverse transcription followed by PCR amplification, and analysis of the nucleotide sequence of amplified cDNA were done using a Small RNA Sample Prep Kit and Illumina Genome Analyzer (Illumina). From the sequence reads obtained, adapter sequences and low-quality reads were removed using Trimmomatic (v. 0.39) (Bolger et al., 2014) with default settings, requiring a minimum length of 14 nt. Quality control was performed with FastQC (v. 0.11.9) (Wingett and Andrews, 2018) before and after trimming. Reads were then mapped to the nucleotide sequence of CHS7 (Glyma.01G228700) obtained from the cv. Williams 82 reference genome sequence (Wm82.a2.v1; Schmutz et al., 2010) or the ΔCHS3 sequence of GmIRCHS (AB264311; Kasai et al., 2007) using Bowtie 2 (v. 2.4.5) (Langmead and Salzberg, 2012).

ACKNOWLEDGMENTS

We dedicate this article to the late Dr. Hikoyuki Yamaguchi, former Director of Shonai Regional Center for Biotechnology and Professor Emeritus of the University of Tokyo. We are grateful to him for providing soybean lines and sharing relevant information and regret that he was not able to collaborate on this research project with us for longer. We also thank Dr. Tetsuya Yamada and Dr. Jun Abe of Hokkaido University for their help with field experiments and for insightful discussions.

REFERENCES

Akada, S., and Dube, S. K. (1995) Organization of soybean chalcone synthase gene clusters and characterization of a new member of the family. Plant Mol. Biol. 29, 189–199.
Allen, E., Xie, Z., Gustafson, A. M., and Carrington, J. C. (2005) microRNA-directed phasing during trans-acting siRNA biogenesis in plants. Cell 121, 207–221.
Anguraj Vadivel, A. K., Krysiak, K., Tian, G., and Dhaubhadel, S. (2018) Genome-wide identification and localization of chalcone synthase family in soybean (Glycine max [L] Merr). BMC Plant Biol. 18, 325.
Axtell, M. J., Jan, C., Rajagopalan, R., and Bartel, D. P. (2006) A two-hit trigger for siRNA biogenesis in plants. Cell 127, 565–577.
Baulcombe, D. (2004) RNA silencing in plants. Nature 431, 356–363.
Bolger, A. M., Lohse, M., and Usadel, B. (2014) Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30, 2114–2120.
Bologna, N. G., and Voinnet, O. (2014) The diversity, biogenesis, and activities of endogenous silencing small RNAs in Arabidopsis. Annu. Rev. Plant Biol. 65, 473–503.
Borsani, O., Zhu, J., Verslues, P. E., Sunkar, R., and Zhu, J.-K. (2005) Endogenous siRNAs derived from a pair of natural cis-antisense transcripts regulate salt tolerance in Arabidopsis. Cell 123, 1279–1291.
Chen, H.-M., Chen, L.-T., Patel, K., Li, Y.-H., Baulcombe, D. C., and Wu, S.-H. (2010) 22-nucleotide RNAs trigger secondary siRNA biogenesis in plants. Proc. Natl. Acad. Sci. USA 107, 15269–15274.
Chen, H.-M., Li, Y.-H., and Wu, S.-H. (2007) Bioinformatic prediction and experimental validation of a microRNA-directed tandem trans-acting siRNA cascade in Arabidopsis. Proc. Natl. Acad. Sci. USA 104, 3318–3323.
Clough, S. J., Tuteja, J. H., Li, M., Marek, L. F., Shoemaker, R. C., and Vodkin, L. O. (2004) Features of a 103-kb gene-rich region in soybean include an inverted perfect repeat cluster of CHS genes comprising the I locus. Genome 47, 819–831.
de Felippes, F. F., Marchais, A., Sarazin, A., Oberlin, S., and Voinnet, O. (2017) A single miR390 targeting event is sufficient for triggering TAS3-tasiRNA biogenesis in Arabidopsis. Nucleic Acids Res. 45, 5539–5554.
Della Vedova, C. B., Lorbiecke, R., Kirsch, H., Schulte, M. B., Scheets, K., Borchert, L. M., Scheffler, B. E., Wienand, U., Cone, K. C., and Birchler, J. A. (2005) The dominant inhibitory chalcone synthase allele C2-Idf (inhibitor diffuse) from Zea mays (L.) acts via an endogenous RNA silencing mechanism. Genetics 170, 1989–2002.
Fire, A., Xu, S., Montgomery, M. K., Kostas, S. A., Driver, S. E., and Mello, C. C. (1998) Potent and specific genetic interference by double-stranded RNA in Caenorhabditis elegans. Nature 391, 806–811.
Howell, M. D., Fahlgren, N., Chapman, E. J., Cumbie, J. S., Sullivan, C. M., Givan, S. A., Kasschau, K. D., and Carrington, J. C. (2007) Genome-wide analysis of the RNA-DEPENDENT RNA POLYMERASE6/DICER-LIKE4 pathway in Arabidopsis reveals dependency on miRNA- and tasiRNA-directed targeting. Plant Cell 19, 926–942.
Jia, J., Ji, R., Li, Z., Yu, Y., Nakano, M., Long, Y., Feng, L., Qin, C., Lu, D., Zhan, J., et al. (2020) Soybean DICER-LIKE2 regulates seed coat color via production of primary 22-nucleotide small interfering RNAs from long inverted repeats. Plant Cell 32, 3662–3673.
Kanazawa, A. (2008) RNA silencing manifested as visibly altered phenotypes in plants. Plant Biotechnol. 25, 423–435.
Kasai, A., Kasai, K., Yumoto, S., and Senda, M. (2007) Structural features of GmIRCHS, candidate of the I gene inhibiting seed coat pigmen0tation in soybean: implications for inducing endogenous RNA silencing of chalcone synthase genes. Plant Mol. Biol. 64, 467–479.
Kasai, A., Watarai, M., Yumoto, S., Akada, S., Ishikawa, R., Harada, T., Niizeki, M., and Senda, M. (2004) Influence of PTGS on chalcone synthase gene family in yellow soybean seed coat. Breed. Sci. 54, 355–360.
Kasai, M., Koseki, M., Goto, K., Masuta, C., Ishii, S., Hellens, R. P., Taneda, A., and Kanazawa, A. (2012) Coincident sequence-specific RNA degradation of linked transgenes in the plant genome. Plant Mol. Biol. 78, 259–273.
Kasai, M., Matsumura, H., Yoshida, K., Terauchi, R., Taneda, A., and Kanazawa, A. (2013) Deep sequencing uncovers commonality in small RNA profiles between transgene-induced and naturally occurring RNA silencing of chalcone synthase-A gene in petunia. BMC Genomics 14, 63.
Kasschau, K. D., Fahlgren, N., Chapman, E. J., Sullivan, C. M., Cumbie, J. S., Givan, S. A., and Carrington, J. C. (2007) Genome-wide profiling and analysis of Arabidopsis siRNAs. PLoS Biol. 5, e57.
Koseki, M., Goto, K., Masuta, C., and Kanazawa, A. (2005) The star-type color pattern in Petunia hybrida 'Red Star' flowers is induced by sequence-specific degradation of chalcone synthase RNA. Plant Cell Physiol. 46, 1879–1883.
Kurauchi, T., Kasai, A., Tougou, M., and Senda, M. (2011) Endogenous RNA interference of chalcone synthase genes in soybean: formation of double-stranded RNA of GmIRCHS transcripts and structure of the 5' and 3' ends of short interfering RNAs. J. Plant Physiol. 168, 1264–1270.
Kurauchi, T., Matsumoto, T., Taneda, A., Sano, T., and Senda, M. (2009) Endogenous short interfering RNAs of chalcone synthase genes associated with inhibition of seed coat pigmentation in soybean. Breed. Sci. 59, 419–426.
Kusaba, M., Miyahara, K., Iida, S., Fukuoka, H., Takano, T., Sassa, H., Nishimura, M., and Nishio, T. (2003) Low glutelin content1: a dominant mutation that suppresses the glutelin multigene family via RNA silencing in rice. Plant Cell 15, 1455–1467.
Langmead, B., and Salzberg, S. L. (2012) Fast gapped-read alignment with Bowtie 2. Nat. Methods 9, 357–359.
Liu, Y., Du, H., Li, P., Shen, Y., Peng, H., Liu, S., Zhou, G.-A., Zhang, H., Liu, Z., Shi, M., et al. (2020) Pan-genome of wild and cultivated soybeans. Cell 182, 162–176.e13.
Matzke, M., Aufsatz, W., Kanno, T., Daxinger, L., Papp, I., Mette, M. F., and Matzke, A. J. M. (2004) Genetic analysis of RNA-mediated transcriptional gene silencing. Biochim. Biophys. Acta. 1677, 129–141.
Matzke, M. A., Kanno, T., and Matzke, A. J. M. (2015) RNA-directed DNA methylation: the evolution of a complex epigenetic pathway in flowering plants. Annu. Rev. Plant Biol. 66, 243–267.
Melquist, S., and Bender, J. (2003) Transcription from an upstream promoter controls methylation signaling from an inverted repeat of endogenous genes in Arabidopsis. Genes Dev. 17, 2036–2047.
Mikuriya, S., Kasai, M., Nakashima, K., Natasia, Hase, Y., Yamada, T., Abe, J., and Kanazawa, A. (2017) Frequent generation of mutants with coincidental changes in multiple traits via ion-beam irradiation in soybean. Genes Genet. Syst. 92, 153–161.
Nakashima, K., Tsuchiya, M., Fukushima, S., Abe, J., and Kanazawa, A. (2018) Transcription of soybean retrotransposon SORE-1 is temporally upregulated in developing ovules. Planta 248, 1331–1337.
Napoli, C., Lemieux, C., and Jorgensen, R. (1990) Introduction of a chimeric chalcone synthase gene into petunia results in reversible co-suppression of homologous genes in trans. Plant Cell 2, 279–289.
Schmutz, J., Cannon, S. B., Schlueter, J., Ma, J., Mitros, T., Nelson, W., Hyten, D. L., Song, Q., Thelen, J. J., Cheng, J., et al. (2010) Genome sequence of the palaeopolyploid soybean. Nature 463, 178–183.
Senda, M., Kasai, A., Yumoto, S., Akada, S., Ishikawa, R., Harada, T., and Niizeki, M. (2002) Sequence divergence at chalcone synthase gene in pigmented seed coat mutants of the Inhibitor locus. Genes Genet. Syst. 77, 341–350.
Senda, M., Kurauchi, T., Kasai, A., and Ohnishi, S. (2012) Suppressive mechanism of seed coat pigmentation in yellow soybean. Breed. Sci. 61, 523–530.
Senda, M., Masuta, C., Ohnishi, S., Goto, K., Kasai, A., Sano, T., Hong, J.-S., and MacFarlane, S. (2004) Patterning of virus-infected Glycine max seed coat is associated with suppression of endogenous silencing of chalcone synthase genes. Plant Cell 16, 807–818.
Senda, M., Nishimura, S., Kasai, A., Yumoto, S., Takada, Y., Tanaka, Y., Ohnishi, S., and Kuroda, T. (2013) Comparative analysis of the inverted repeat of a chalcone synthase pseudogene between yellow soybean and seed coat pigmented mutants. Breed. Sci. 63, 384–392.
Shiroshita, Y., Yuhazu, M., Hase, Y., Yamada, T., Abe, J., and Kanazawa, A. (2021) Characterization of chlorophyll-deficient soybean [Glycine max (L.) Merr.] mutants obtained by ion-beam irradiation reveals concomitant reduction in isoflavone levels. Genetic Resour. Crop Evol. 68, 1213–1223.
Tuteja, J. H., Clough, S. J., Chan, W.-C., and Vodkin, L. O. (2004) Tissue-specific gene silencing mediated by a naturally occurring chalcone synthase gene cluster in Glycine max. Plant Cell 16, 819–835.
Tuteja, J. H., and Vodkin, L. O (2008) Structural features of the endogenous CHS silencing and target loci in the soybean genome. Crop Sci. 48, S49–S68.
Tuteja, J. H., Zabala, G., Varala, K., Hudson, M., and Vodkin, L. O. (2009) Endogenous, tissue-specific short interfering RNAs silence the chalcone synthase gene family in Glycine max seed coats. Plant Cell 21, 3063–3077.
van der Krol, A. R., Mur, L. A., Beld, M., Mol, J. N., and Stuitje, A. R. (1990) Flavonoid genes in petunia: addition of a limited number of gene copies may lead to a suppression of gene expression. Plant Cell 2, 291–299.
Vazquez, F., Vaucheret, H., Rajagopalan, R., Lepers, C., Gasciolli, V., Mallory, A. C., Hilbert, J.-L., Bartel, D. P., and Crété, P. (2004) Endogenous trans-acting siRNAs regulate the accumulation of Arabidopsis mRNAs. Mol. Cell 16, 69–79.
Voinnet, O. (2002) RNA silencing: small RNAs as ubiquitous regulators of gene expression. Curr. Opin. Plant Biol. 5, 444–451.
Wingett, S. W., and Andrews, S. (2018) FastQ Screen: a tool for multi-genome mapping and quality control. F1000Res. 7, 1338.
Xie, M., Chung, C. Y.-L., Li, M.-W., Wong, F.-L., Wang, X., Liu, A., Wang, Z., Leung, A. K.-Y., Wong, T.-H., Tong, S.-W., et al. (2019) A reference-grade wild soybean genome. Nat. Commun. 10, 1216.
Yamada, T., Ohashi, Y., Ohshima, M., Inui, H., Shiota, N., Ohkawa, H., and Ohkawa, Y. (2002) Inducible cross-tolerance to herbicides in transgenic potato plants with the rat CYP1A1 gene. Theor. Appl. Genet. 104, 308–314.
Yoshikawa, M., Peragine, A., Park, M. Y., and Poethig, R. S. (2005) A pathway for the biogenesis of trans-acting siRNAs in Arabidopsis. Genes Dev. 19, 2164–2175.

責任著者(Corresponding author)

訂正情報

J-STAGEへの登録はこちら（無料）