Genes & Genetic Systems
Online ISSN : 1880-5779
Print ISSN : 1341-7568
ISSN-L : 1341-7568
Mini reviews
Interactions among eukaryotes, retrotransposons and riboviruses: endogenous riboviral elements in eukaryotic genomes
Masayuki Horie
著者情報
ジャーナル オープンアクセス HTML

2019 年 94 巻 6 号 p. 253-267

詳細
ABSTRACT

Riboviruses are viruses that have RNA genomes and replicate only via RNA intermediates. Although they do not require a DNA phase for replication and do not encode reverse transcriptase, the presence of DNA forms of riboviral sequences in ribovirus-infected cells has been reported since the 1970s. Additionally, heritable ribovirus-derived sequences, called riboviral endogenous viral elements (EVEs), have been found in the genomes of many eukaryotes. These are now thought to be formed by the reverse transcription machineries of retrotransposons within eukaryotic genomes sometimes referred to as selfish elements. Surprisingly, some reverse-transcribed riboviral DNAs (including EVEs) provide physiological functions for their hosts, suggesting the occurrence of novel interactions among eukaryotic genomes, retrotransposons and riboviruses, and opening the door to new avenues of investigation. Here I review current knowledge on these triangular interactions, and discuss future directions in this field.

INTRODUCTION

Viruses are obligate parasites that infect many types of non-viral organisms and even several giant viruses (La Scola et al., 2008; Claverie and Abergel, 2009). Because virus infections can burden their hosts in several ways, non-viral organisms have gained antiviral immune systems to counteract viral infections (Daugherty and Malik, 2012; tenOever, 2016). Viruses have overcome this countermeasure by acquiring anti-immune systems (Daugherty and Malik, 2012; tenOever, 2016). These battles are thought to have been repeated numerous times, resulting in a so-called evolutionary arms race between viruses and hosts (Gammon and Mello, 2015; Hoffmann et al., 2015; tenOever, 2016). Thus, viruses are believed to have contributed to the evolution of immune systems in eukaryotes and vice versa.

Viruses are divided into four groups based on the genomic material in the virus particles (virions) and their replication modes: DNA viruses, RNA viruses (riboviruses), reverse-transcribing DNA viruses and reverse-transcribing RNA viruses. DNA viruses and reverse-transcribing DNA viruses, and RNA viruses and reverse-transcribing RNA viruses, have DNA and RNA genomes in their respective virus particles. Although all viruses produce mRNA, the genomes of DNA viruses and RNA viruses replicate only via DNA and RNA intermediates, respectively. In contrast, reverse-transcribing DNA viruses and reverse-transcribing RNA viruses encode reverse transcriptase (RTase), which enables reverse transcription (RT). Thus, reverse-transcribing DNA and RNA viruses possess DNA and RNA as genomic material in their virions, but replicate via RNA and DNA intermediates, respectively. In this review, I focus on RNA viruses that do not encode an RTase. Because some informal definitions of “RNA viruses” include reverse-transcribing RNA viruses, I have used the term “riboviruses,” which explicitly excludes reverse-transcribing RNA viruses.

As described above, ribovirus particles have RNA genomes and replicate only via RNA intermediates, such as viral genomic RNA, anti-genomic RNA and mRNA. Based on the polarity of the RNA genomes in the virus particles, riboviruses are subdivided into positive-strand, negative-strand, or double-stranded RNA viruses (Flint et al., 2015). Riboviruses encode an RNA-dependent RNA polymerase that makes RNA copies from RNA templates, which are necessary for their replication and transcription (the typical replication cycle of riboviruses is shown in Fig. 1).

Fig. 1.

Replication cycle of a ribovirus with negative polarity. A schematic representation of the replication of a ribovirus with a negative-strand RNA genome.

Although riboviruses do not require DNA intermediates for their replication, DNA forms of riboviruses have been reported to be present in ribovirus-infected cells, beginning in 1975 (Zhdanov, 1975). Since then, several sporadic reports suggested the presence of DNA forms of riboviral sequences (Klenerman et al., 1997; Crochu et al., 2004; Maori et al., 2007), but it was unclear how the viral DNA was generated. In 2009, Geuking et al. revealed that riboviral RNA is reverse transcribed into DNA by RTase encoded by a retrotransposon (also referred to as a selfish element) present in eukaryotic genomes (Geuking et al., 2009). Additionally, we and others discovered that heritable ribovirus-derived elements exist in the genomes of eukaryotes, at least some of which were apparently reverse transcribed and integrated by retrotransposons (Belyi et al., 2010a; Horie et al., 2010; Katzourakis and Gifford, 2010). These ribovirus-derived sequences are believed to have originated from reverse-transcribed and integrated riboviral sequences in the hosts’ germline cells. Such heritable virus-derived sequences are now known as endogenous viral elements (EVEs) (Katzourakis and Gifford, 2010). After these findings were reported, many other studies showed the presence of riboviral EVEs derived from diverse viruses in eukaryotic genomes (Horie and Tomonaga, 2011; Aiewsakun and Katzourakis, 2015; Parrish and Tomonaga, 2016).

Surprisingly, ribovirus-derived DNA is not just a byproduct of chance interactions between riboviruses and retrotransposons: at least in insects, these interactions play important roles in antiviral immunity (Goic et al., 2013, 2016; Tassetto et al., 2017; Poirier et al., 2018). Additionally, some ribovirus-derived EVEs also play roles in their hosts (Fujino et al., 2014; Myers et al., 2016). These findings provide support for a novel concept regarding biological interactions among riboviruses, retrotransposons and eukaryotic genomes. In this review, I focus on such ribovirus–retrotransposon interactions in animals, and summarize and discuss the modes of their interactions and biological functions in host animals.

INTERACTIONS BETWEEN RIBOVIRUSES AND RETROTRANSPOSONS

As mentioned above, riboviral RNA can be reverse transcribed into DNA in some ribovirus-infected cells (Zhdanov, 1975; Klenerman et al., 1997; Geuking et al., 2009; Horie et al., 2010; Goic et al., 2013, 2016; Tassetto et al., 2017; Poirier et al., 2018). Because riboviruses only encode an RNA-dependent RNA polymerase, an RTase must be provided in trans for the RT reactions of riboviruses. Several elements, such as retroelements, and telomerase are known to have RTase activity in eukaryotic cells. Additionally, coinfection with retroviruses, which are reverse-transcribing RNA viruses encoding RTase and capable of integration, can also serve as sources of RTase. Among them, retroelements and possible co-infection with a retrovirus are reported to be involved in the RT reactions and, in some cases, the integration of riboviral sequences.

Retrotransposons are transposable elements that can expand in a copy-and-paste manner within genomes: retrotransposons are transcribed as RNA, which is reverse transcribed into cDNA, and then integrated into the genome (Fig. 2A). Based on the presence or absence of long terminal repeats (LTRs), retrotransposons are classified as LTR and non-LTR retrotransposons, both of which are further divided into autonomous and non-autonomous classes (Havecker et al., 2004; Eickbush and Jamburuthugoda, 2008). While autonomous retrotransposons encode RTase, which enables retrotransposition in cis, non-autonomous retrotransposons do not encode RTase, and thus must exploit the machineries of autonomous retrotransposons for their retrotransposition (Beauregard et al., 2008).

Fig. 2.

Modes of retrotransposition of non-LTR and LTR retrotransposons. (A) Schematic representations of retrotranspositions are shown. PIC, pre-integration complex; RNP, ribonucleoprotein; RT, reverse transcription; VLP, virus-like particle; TPRT, target-primed reverse transcription. (B) A schematic representation of a LINE-1 retrotransposition is shown. RT, reverse transcription; TSD, target site duplication.

LTR retrotransposons have, as the name suggests, LTRs at both ends, which usually contain several elements essential for retrotransposition, such as a promoter and primer-binding sites (Havecker et al., 2004). LTR retrotransposons harbor at least two genes, namely a gag and pol gene. The gag gene encodes structural proteins that form virus-like particles together with retrotransposon RNAs. The pol gene encodes a protease, integrase and RTase, which catalyze retrotransposition (Havecker et al., 2004). Some LTR retrotransposons, including endogenous retroviruses, also have an env gene that enables the formation of infectious particles and subsequent transmission to other cells (Malik et al., 2000; Havecker et al., 2004). As mentioned above, LTR retrotransposons form virus-like particles in cells where retrotransposon RNAs are reverse-transcribed into DNA using tRNA as primers (Havecker et al., 2004). Next, a retrotransposon DNA–integrase complex (also called the pre-integration complex) is formed, which is translocated into the nucleus and is integrated into a chromosome via integrase activity (Fig. 2A) (Havecker et al., 2004). Therefore, as suggested by their RT, LTR retrotransposons are also known as extrachromosomally primed retrotransposons (Beauregard et al., 2008). LTR retrotransposons comprise a large part of animal genomes; for example, they account for 8% of the human genome (IHGSC, 2001). Some LTR retrotransposons are non-autonomous. These non-autonomous LTR retrotransposons lack essential genes for retrotransposition, and thus they exploit the machineries of autonomous retrotransposons (Havecker et al., 2004).

Non-LTR retrotransposons usually do not have LTRs (Beauregard et al., 2008). As described above, non-LTR retrotransposons are also further divided into autonomous and non-autonomous classes. Autonomous non-LTR retrotransposons encode two proteins: ORF1p has RNA chaperone activity, and ORF2p possesses RTase and endonuclease activities (Feng et al., 1996; Martin, 2006; Beauregard et al., 2008). ORF1p and ORF2p interact with retrotransposon RNA to form a ribonucleoprotein (RNP) complex (Beauregard et al., 2008). The RNP complex is translocated into the nucleus, after which the encoded endonuclease nicks the chromosomal DNA and initiates RT using the nicked DNA as a primer (Beauregard et al., 2008). This mode of RT is known as target-primed reverse transcription (TPRT), and thus non-LTR retrotransposons are also referred to as target-primed retrotransposons (Beauregard et al., 2008). In addition to ORF1p and ORF2p, long interspersed nuclear element 1 (LINE-1) in primates, an autonomous non-LTR retrotransposon that is still active in humans, encodes ORF0p in the antisense orientation, which affects the mobility of LINE-1 (Denli et al., 2015). In contrast, non-autonomous non-LTR retrotransposons are retrotransposed via the machineries of autonomous non-LTR retrotransposons. Non-LTR retrotransposons represent a major component of eukaryotic genomes; for example, approximately 30% of the human genome is comprised of non-LTR retrotransposons (IHGSC, 2001).

Retrotransposons are still active in the genomes of many eukaryotes, including humans (Mills et al., 2007; Hancks and Kazazian, 2016). Indeed, several genetic disorders are caused by de novo insertions of retrotransposons (Hancks and Kazazian, 2012, 2016). Thus, retrotransposons are also referred to as selfish genetic elements. However, they are also considered as driving forces of genome evolution, because retrotransposition can cause genome rearrangement, provide gene regulators, and generate chimeric genes (Cordaux and Batzer, 2009). Some retrotransposon genes may be co-opted by the host cell. For example, the Peg10, Peg11 and Syncytin genes, which are derived from LTR retrotransposons (endogenous retroviruses), are essential for placental formation in mammals (Mi et al., 2000; Dupressoir et al., 2005; Ono et al., 2006; Sekita et al., 2008). Additionally, non-LTR retrotransposons have been implicated in the formation of processed pseudogenes and retrocopies of cellular mRNAs (Esnault et al., 2000). Namely, cellular mRNAs are occasionally incorporated into non-LTR retrotransposon machineries, and then reverse transcribed and integrated into a different position within the genome. Thus, retrotransposons have contributed to the evolution of genomes in various ways.

Although RT and integration of riboviral sequences were first reported in the 1970s (Zhdanov, 1975), the underlying mechanisms have remained mysterious because riboviruses themselves do not encode RTase. However, a series of studies revealed that retrotransposons are involved in RT of riboviral RNAs; furthermore, in some cases, the retrotransposons play a role in capturing foreign viral genetic elements (Geuking et al., 2009; Belyi et al., 2010a; Horie et al., 2010, 2013; Horie and Tomonaga, 2011; Goic et al., 2013; Poirier et al., 2018). In this section, I focus on the RT/integration of riboviral sequences by retrotransposable elements.

RT of riboviral sequences by non-LTR retrotransposons in somatic cells

Some riboviral RNAs have been suggested to be reverse transcribed and integrated into host genomes by the RTase activities of the non-LTR retrotransposon LINE-1 in somatic cells (Horie et al., 2010, 2013; Shimizu et al., 2014).

LINE-1 retrotransposons are autonomous non-LTR retrotransposons, and some of them are active in many species of animals, including humans (Sassaman et al., 1997; Brouha et al., 2003). As described in the above section, RT and LINE-1 insertion are initiated by cleavage of single-stranded DNA by LINE-1 ORF2p (Fig. 2B). The 3′ end of the cleaved DNA is used as a primer for RT. The opposite DNA strand is also cleaved, but the cleavage site is usually located downstream of the first DNA strand, which eventually generates a short direct repeat, referred to as a target site duplication (TSD), on both sides of the inserted LINE-1 (the mode of LINE-1 retrotransposition is reviewed by Beck et al., 2011). Importantly, LINE-1 is involved in the formation of processed pseudogenes, and the resultant integrants have features similar to those observed in retrotransposed LINE-1 (Esnault et al., 2000).

We previously reported that the mRNA of Borna disease virus (BoDV), a ribovirus, was reverse transcribed into DNA, at least a portion of which was integrated into the host chromosome by the machinery of LINE-1 (Horie et al., 2010). BoDV is a negative-strand RNA virus, belonging to the family Bornaviridae, which establishes non-cytolytic persistent infections in the host cell nucleus (Tomonaga et al., 2002). Previously, we detected virus-derived DNA (vDNA) in BoDV-infected cultured cells from diverse mammalian species (such as humans, rabbits, dogs and mink), as well as in mouse brains in vivo (Horie et al., 2010). We also demonstrated, using Alu-PCR and inverse PCR, that BoDV vDNA was integrated into host chromosomes (Horie et al., 2010, 2013). We identified the integrated BoDV DNAs and their flanking sequences, revealing that many of the integrants have structures similar to cellular processed pseudogenes: the integrants had poly(A) tails, short tandem repeats derived from host sequences (e.g., TSDs observed upstream and downstream of the integrants), and occasional 5′ truncations of mRNA (Gilbert et al., 2002). These observations strongly suggested that BoDV vDNA was derived from mRNA following RT by the LINE-1 machinery. In addition to such typical sequences, we also observed integrants that lacked a poly(A) tail, and displayed deletions of flanking sequences and reorganization of flanking sequences (Horie et al., 2010, 2013). These differences may reflect non-canonical integration or endonuclease-independent retrotransposition, as observed for LINE-1 retrotransposition (Gilbert et al., 2002; Morrish et al., 2002). We further showed that the integration efficiency of BoDV mRNA was very low in two human cell lines: less than 0.1% (below the detection limit) of BoDV-infected cells have integrants originating from BoDV mRNA (Horie et al., 2010, 2013). However, it is unclear whether BoDV mRNA is preferentially recognized by the LINE-1 machinery.

Vesicular stomatitis virus (VSV), a ribovirus belonging to the family Rhabdoviridae (Amarasinghe et al., 2018), is also reverse transcribed by the LINE-1 machinery (Shimizu et al., 2014). VSV replicates in the cytoplasm and causes cytopathic effects in infected cells (Shimizu et al., 2014). Shimizu et al. (2014) detected VSV-derived DNA in VSV-infected human cells. The VSV vDNA was suggested to be present as single-stranded DNA molecules of antisense orientation in the cytoplasm, as determined by fractionation and PCR. The authors attempted to determine the sequence of VSV vDNA by inverse PCR and sequencing analyses, using essentially the same method as for our BoDV study. However, they could only detect virus-derived sequences, and no host-derived sequences: all the identified sequences corresponded to a full-length viral mRNA sequence including a poly(A) tail (Shimizu et al., 2014). Based on these observations, they proposed that in the cytoplasm, VSV mRNA alone is reverse transcribed and that the vDNA is present as extrachromosomal DNA (Shimizu et al., 2014). However, the source of primers for RT is unknown in this model. Shimizu et al. (2014) suggested that special DNA primers are not needed for the RT, or that oligo U or T are used as the primers. An alternative explanation is that the detected vDNAs were derived from first-strand cDNA fragments whose second-strand DNA was not yet synthesized. Several cellular factors are involved in LINE-1-mediated TPRT and integration (Pizarro and Cristofari, 2016). Thus, although first-strand VSV cDNA is synthesized in VSV-infected cells, the viral infection induces apoptosis, which may disrupt the cellular machineries for synthesizing second-strand DNA; the cDNA may then be digested by a cellular DNase and leak into the cytoplasm. Indeed, VSV infection causes fragmentation of the host chromosomal DNA (Koyama, 1995). Furthermore, the relative amounts of VSV vDNA and leaked nuclear genomic DNA in the cytoplasm were very similar (Shimizu et al., 2014). Further studies are needed to understand the mode of RT of VSV. For example, several host factors were involved in apoptosis induced by VSV infection (Pearce and Lyles, 2009). Importantly, knockdown of some of these factors inhibited VSV-induced apoptosis. Thus, such systems may be useful to pursue this issue.

Shimizu et al. (2014) also detected vDNA derived from respiratory syncytial virus and echovirus type 35, which are riboviruses with negative- and positive-strand RNA genomes, respectively. These observations suggest that many riboviruses are potentially reverse transcribed by the LINE-1 machinery. However, the biological significance of the RT of riboviruses is unclear. Additionally, it is unclear whether the RT of these riboviruses occurs in vivo. In somatic cells, retrotransposons are usually inactive except in several types of somas, such as neuronal progenitor cells (Muotri et al., 2005; Yang and Kazazian, 2006; Coufal et al., 2009; Garcia-Perez et al., 2010). Therefore, riboviruses that do not infect such cells may not be reverse transcribed by the LINE-1 machinery. In this regard, in vivo animal experiments, as we performed for BoDV in mice (Horie et al., 2010), are also important for obtaining a deeper understanding of the authentic interactions between riboviruses and retrotransposons.

Germline-integrated riboviral sequences in eukaryotic genomes, presumably reverse transcribed and integrated by non-LTR retrotransposons

In addition to the RT of extant viruses, we and others revealed that sequences homologous to riboviruses, such as bornaviruses and filoviruses, are present in eukaryotic genomes (Belyi et al., 2010b; Horie et al., 2010). These sequences are believed to be derived from reverse-transcribed and germline-integrated sequences of riboviruses, known as riboviral EVEs. The RT and integration steps are likely mediated by the LINE-1 machinery because some of the bornaviral and filoviral EVEs have poly(A) tails and TSDs, as observed for the integrants of a modern bornavirus, BoDV (Belyi et al., 2010b; Horie et al., 2010). Thus, retrotransposons may have contributed to capturing foreign genetic material from ancient riboviruses.

Structures of integrants should provide insights into the modes of RT and the characteristics of ancient viruses (Horie and Tomonaga, 2019). Among the above-mentioned riboviral EVEs, many are thought to have been integrated by TPRT of viral mRNAs alone. However, intriguingly, an endogenous bornavirus-like element (EBL) in humans (named hsEBLN-2) seems to have been reverse transcribed and integrated as a recombinant between viral mRNA and a non-autonomous retrotransposon, the Alu element (Belyi et al., 2010b; Horie et al., 2010). An Alu element was identified immediately downstream of the poly(A) tail of hsEBLN-2. Importantly, the hsEBLN-2/Alu element is flanked by a 9-bp TSD, and the Alu element itself is not flanked by a TSD (Fig. 3). This may have been caused by a template switch from the Alu RNA to mRNA of an ancient bornavirus during TPRT of the Alu. This observation suggests that there are two modes of LINE-1-mediated integration of riboviral sequences, as follows. First, viral mRNA is directly reverse transcribed and integrated into a chromosome by the LINE-1 machinery. In this case, viral mRNA may be accidentally incorporated into RNP in the cytoplasm, and then translocated into the nucleus followed by TPRT of viral mRNA. Second, a template switch from a retrotransposon RNA to a viral mRNA occurs during a TPRT reaction of the retrotransposon, leading to integration of a chimeric retrotransposon–viral mRNA sequence (Fig. 3). This scenario is interesting in that it may yield novel insights into the replication sites of ancient viruses. LINE-1-mediated TPRT initiates in the cell nucleus; thus, in theory, only viruses being transcribed in the nucleus have the potential to be integrated by the template-switching mechanism. Indeed, a substantial portion of reported chimeric retrocopies, which were suggested to be formed by template switches between retrotransposon RNAs and other species of RNAs, are composed of the following elements: 5′ parts derived from RNA localizing in the nucleus or in both nucleus and cytoplasm, and 3′ parts originating from cytoplasmic mRNAs of cellular genes or of retrotransposons (Buzdin et al., 2002; Buzdin et al., 2003; Buzdin, 2004). Generally, few lineages of riboviruses (such as bornaviruses) are known to replicate and undergo transcription in the nucleus (Horie and Tomonaga, 2019). Currently, we have only a limited understanding regarding the replication sites of ancient riboviruses: only one lineage of bornavirus was suggested to have replicated in the cell nucleus (Horie and Tomonaga, 2019). However, hsEBLN-2 seems to have been endogenized by the template-switching mechanism described above, suggesting that an ancient bornavirus replicated in the host cell nucleus. By comparing orthologs of hsEBLN-2 in platyrrhines, their integration time was estimated to be more than 40 million years ago (MYA) (Horie et al., 2010). Therefore, this lineage may also have replicated and been transcribed in the nucleus more than 40 MYA. Although many ribovirus-derived EVEs have been reported, only a few studies focused on biological characteristics of ancient riboviruses (Horie and Tomonaga, 2019). Thus, further detailed sequence analyses of such ancient riboviruses should contribute to a better understanding of the replication mechanisms of ancient viruses. An increase in the available genome data would also be helpful to pursue this open question. Furthermore, a combination of reconstruction of ancient viral sequences, as reported for dependoparvovirus-like elements (Smith et al., 2016), and biochemical and molecular biological experiments would be useful to understand the biological features of ancient riboviruses.

Fig. 3.

Possible reverse transcription and integration mechanism of an endogenous bornavirus-like element. (A) Schematic representation of the organization of a target site duplication (TSD). hsEBLN-2, Homo sapiens endogenous bornavirus-like element 2; Poly(A): polyadenine stretch. (B) Schematic representation of possible reverse transcription and integration mechanisms of the chimeric hsEBLN-2-Alu-Sx sequence. TPRT, target-primed reverse transcription.

RT of riboviral sequences by LTR retrotransposons in somatic cells

Many riboviral sequences have also been shown to be reverse transcribed and (in some cases) integrated into the host genome by LTR retrotransposons. Lymphocytic choriomeningitis virus (LCMV) is an ambisense RNA virus belonging to the genus Mammarenavirus of the family Arenaviridae. In 1997, Klenerman et al. reported the presence of DNA forms of LCMV in LCMV-infected cells (Klenerman et al., 1997). Later, Geuking et al. (2009) reported that LCMV RNA was reverse transcribed and integrated into host chromosomes by a murine LTR retrotransposon, namely an intracisternal A-particle (IAP) retrotransposon.

Over 1,000 IAP retrotransposons are present in the mouse genome, some of which are actively retrotransposed (Kuff and Lueders, 1988). The IAP encodes three genes (gag, prt and pol genes), all of which are required for retrotransposition (Dewannieux et al., 2004). The gag gene encodes structural proteins, which form virus-like particles within cells. The prt gene encodes a protease that is involved in proteolytic cleavage of the IAP gene products. The pol gene encodes RTase and integrase. The mode of IAP retrotransposition is summarized in the section “2. Interactions between riboviruses and retrotransposons” and in Fig. 2A.

LCMV DNA was detected as recombinants of LCMV and IAP, and at least a portion of the LCMV/IAP chimeric DNA was present as a chromosomally integrated sequence. This recombination is, at least in IAP-expressing HeLa cells (a human cell line), likely to be mediated by a copy-choice mechanism (Geuking et al., 2009). Interestingly, integration of the LCMV/IAP DNA seemed to be achieved through an unusual process, because the integrants lacked TSDs in flanking regions, which are usually observed when integration into the genome occurs through the canonical integration mechanisms of LTR retrotransposons, and deletion of integration sites was observed (Geuking et al., 2009). Based on these observations, the authors proposed that integration of LCMV/IAP recombinants had occurred at sites of DNA damage (Geuking et al., 2009). Thus, some portion of LCMV/IAP DNA that is present as extrachromosomal vDNA may be coincidentally integrated into chromosomes at sites of DNA damage. Interestingly, LCMV vDNA formation could also be mediated by another RTase: human immunodeficiency virus RTase also supported the formation of LCMV vDNA, although less efficiently than IAP (Geuking et al., 2009). In contrast, no LCMV DNA was detected in the guinea pig cell line CCL158, which has very high RTase activity (Geuking et al., 2009). Thus, some riboviruses and RTase complexes may be compatible (Horie and Tomonaga, 2011). Sequence specificities of RTase and riboviral sequences and/or spaces for the replication of riboviruses and the formation of LTR retrotransposon virus-like particles are probably involved in such compatibility.

Many other riboviruses are also reverse transcribed into vDNA in insect cells. For example, several reports showed that riboviruses with positive- or double-stranded RNA, such as Flock House virus (FHV), Drosophila C virus, Drosophila X virus, and Sindbis virus, are reverse transcribed into DNA in flies and mosquitoes (Goic et al., 2013, 2016; Tassetto et al., 2017; Poirier et al., 2018). Goic et al. (2013) determined the nucleotide sequences of FHV vDNAs in FHV-infected Drosophila cells, showing that vDNAs were present as chimeric sequences with diverse LTR retrotransposons. This finding suggests that the RT reactions in flies were also catalyzed by the RTase activity of LTR retrotransposons (Goic et al., 2013). Importantly, Tassetto et al. (2017) showed that vDNA is produced in hemocytes in Drosophila. Furthermore, Poirier et al. (2018) revealed that defective viral genomes, which are derived from discontinuous viral genomic sequences, are preferentially reverse transcribed into vDNA by LTR retrotransposons. Interestingly, some of the reverse-transcribed vDNA was present as circular forms (circular vDNA: cvDNA). Other than RTases encoded by retrotransposons, two cellular factors (Ago2 and Dicer-2) were shown to be involved in vDNA formation in fruit flies (Tassetto et al., 2017; Poirier et al., 2018). The roles of both factors in RNA interference (RNAi) are well documented: Ago2 is the core effector of the RNA-induced silencing complex, and the RNase activity of Dicer-2 cleaves dsRNA into small interfering RNA (siRNA) (Wilson and Doudna, 2013). Tassetto et al. (2017) showed that Ago2 is involved in the formation of vDNA. As described above, vDNA was shown to be generated in hemocytes of Drosophila. This was demonstrated by generating a mutant lacking Ago2 in hemocytes, leading to no vDNA production during a virus infection. Poirier et al. (2018) showed that Dicer-2 was important for the formation of vDNA. They also identified the helicase domain of Dicer-2 as being important for the generation of vDNA. The helicase domain of Dicer-2 interacted with viral RNA, and mutations in Dicer-2 modified the amount of vDNA in infected cells. However, for both factors, the detailed mechanisms remain unclear.

For mosquito cells, detailed sequencing analysis has not yet been performed. Interestingly, however, at least a portion of vDNA also existed as extrachromosomal cvDNA in mosquito cells, and seemed to be transcribed from defective viral genomes (Poirier et al., 2018). In all of the chimeric virus–retrotransposon cvDNA sequences identified, the virus-derived sequences consisted of non-continuous viral sequences. Additionally, deep sequencing analysis showed that approximately half of the cvDNA-derived reads were non-continuous viral sequences. However, the detailed mechanisms underlying the formation of cvDNA are also unclear in mosquitoes.

Germline-integrated riboviral sequences in eukaryotic genomes, presumably reverse transcribed and integrated by LTR retrotransposons

Some riboviral EVEs seem to have been endogenized by LTR retrotransposons (Ballinger et al., 2012; Cotton et al., 2016; Whitfield et al., 2017). This is especially obvious in insects. Ballinger et al. (2012) found that EVEs derived from sigmavirus, a ribovirus belonging to the family Rhabdoviridae, are present in the Drosophila genome, and appear to have been integrated by a blastopia-like LTR retrotransposon. Several copies of sigmaviral EVEs in Drosophila were detected as sigmavirus–LTR retrotransposon hybrid sequences. The sigmaviral EVEs have poly(A) tails, indicating that they were derived from mRNA. Although the detailed mechanism is unclear, these observations suggest that viral mRNA was reverse transcribed by a template-switching mechanism of an LTR retrotransposon. Interestingly, two sigmaviral EVEs were found to be embedded in LTR retrotransposons, suggesting that template switching occurred twice. Such an event seems to be rare, but Ballinger et al. (2012) gave a plausible explanation: elements with two LTRs may tend to be integrated into chromosomes.

Whitfield et al. (2017) also analyzed riboviral EVEs in the genome of Aedes mosquito-derived cultured cells. They utilized long-read sequencing technology to determine the genomic sequence of Aga2 cells, which are cultured cells derived from Ae. aegypti (Whitfield et al., 2017). Using the resultant high-quality contigs, they identified EVEs derived from eight families of riboviruses having positive- and negative-strand RNA genomes. Among them, flavivirus-, rhabdovirus- and chuvirus-derived EVEs accounted for a large proportion of the detected EVEs. They also analyzed elements flanking the EVEs, revealing that many of the flaviviral and rhabdoviral EVEs are flanked by LTR retrotransposons, especially by Ty3/Gypsy retrotransposons. The deep relationship between viruses and Ty3/Gypsy retrotransposons raises the possibility that compatibility exists between viruses and retrotransposons, as observed for LCMV and the IAP element (Geuking et al., 2009) (see section 2-3). However, this might also be a product of chance: Ty3/Gypsy elements may simply have been very active when the EVEs were integrated (Whitfield et al., 2017). Chuviral EVEs are flanked by a Pao Bel element (a type of LTR retrotransposon). Intriguingly, chuviral EVEs are mainly surrounded by one specific lineage of Pao Bel element, known as Pao Bel element 179 (Whitfield et al., 2017). This finding suggests that chuviral EVEs exist as a part of Pao Bel element 179 (Whitfield et al., 2017). Indeed, some LTR retrotransposons were proposed to have acquired envelope-like genes from viral glycoprotein genes (Malik et al., 2000). Unfortunately, Whitfield et al. did not perform detailed analyses for the Pao Bel element 179 or the chuviral EVEs. Further research will lead to a better understanding of the relationship between retrotransposons and riboviruses in insects.

A nodavirus-like EVE was also suggested to have been endogenized into the nematode Bursaphelenchus xylophilus genome by an LTR retrotransposon (Cotton et al., 2016). Nodaviruses belong to the family Nodaviridae and are positive-strand RNA viruses that replicate in the cytoplasm. Cotton et al. (2016) found a nodaviral RNA-dependent RNA polymerase-like element, named eBxnv-1, in the genome of B. xylophilus. Similar to the above example of sigmaviral EVEs (Ballinger et al., 2012), eBxnv-1 was also detected as a nodavirus–LTR retrotransposon hybrid sequence: eBxnv-1 was embedded in an LTR retrotransposon (Cotton et al., 2016). Thus, this EVE was also suggested to be integrated via recombination between viral RNA and an LTR retrotransposon.

In addition to the above findings, a bornavirus-derived EVE (named itEBLN) in the thirteen-lined ground squirrel may have been endogenized by an LTR retrotransposon. We previously estimated that itEBLN was integrated within 0.3 MYA (Suzuki et al., 2014). Importantly, LINE-1 activity became extinct approximately 4–5 MYA in this lineage (Platt and Ray, 2012). Thus, the endogenization of itEBLN probably occurred after the extinction of LINE-1, suggesting that RTase was provided by a retrotransposon other than LINE-1; the candidate element is an LTR retrotransposon. Although no obvious recombination was found, two LTR retrotransposons and a SINE (short interspersed nuclear element; a non-autonomous non-LTR retrotransposon) are located immediately upstream of itEBLN (Suzuki et al., 2014). Because retrotransposition of an LTR retrotransposon occurred 0.3 MYA in this lineage, itEBLN might have been integrated as recombinants of the LTR retrotransposons, the SINE and a bornaviral transcript (Suzuki et al., 2014). Further studies are needed to understand how itEBLN was endogenized. Since itEBLN was estimated to be integrated very recently, sequencing analyses of syntenic loci in closely related species should provide clues about the endogenization mechanism of itEBLN.

BIOLOGICAL FUNCTIONS OF RIBOVIRAL RNA RT IN SOMATIC CELLS

As described above in section 2, ribovirus-derived RNA can be reverse transcribed into vDNA in somatic cells. Such reverse-transcribed riboviral DNA may be a byproduct resulting from coincidental interactions between riboviruses and retrotransposons. However, at least in several insects, such as fruit flies and mosquitoes, RT of riboviral RNA during infection seems to be very important for antiviral defenses (Fig. 4).

Fig. 4.

Speculated and identified biological functions of reverse-transcribed riboviral sequences in their hosts. Current knowledge of speculated and identified biological functions of reverse-transcribed riboviral sequences and riboviral EVEs is shown. Question marks indicate speculative pathways or functions. Blue and orange lines show RNA and DNA molecules, respectively. RT, reverse transcription.

In Drosophila, reverse-transcribed vDNA is important for small RNA-mediated antiviral immunity (Goic et al., 2013; Tassetto et al., 2017; Poirier et al., 2018). Goic et al. (2013) showed that reverse-transcribed DNA is important for resistance against exogenous virus infections. As described above in sections 2 and 3, FHV-derived vDNA was detected in FHV-infected fruit flies as highly reconstructed DNA: the sequences originated from non-continuous viral sequences (Goic et al., 2013). Further analyses showed that vDNAs are present as chimeras with LTR retrotransposons. Small RNA-seq analyses revealed that many siRNAs could be mapped to the chimeric sequences, suggesting that siRNAs are expressed from the vDNA–retrotransposon chimeras. Treatment with the RTase inhibitor azidothymidine (AZT) decreased vDNA formation and increased viral titers as well as mortality of infected flies. Thus, Goic et al. (2013) concluded that vDNA formation is important for resistance against exogenous virus infection in Drosophila, and proposed that vDNAs can serve as templates for antiviral siRNA production.

Tassetto et al. (2017) also showed the importance of vDNA formation for systemic antiviral defense in Drosophila. They found that viral RNA in hemocytes is reverse transcribed into vDNA, which can act as a template for antiviral siRNA synthesis. The produced siRNA was released into extracellular exosome-like vesicles to promote systemic antiviral RNAi in infected individuals. Interestingly, the vDNA was present for more than 21 days, which corresponds to one-third of the lifespan of a fly. Thus, Tassetto et al. (2017) proposed that vDNA formation is involved in immune memory for systemic immunity in Drosophila. Intriguingly, hemocytes themselves did not seem to be efficiently infected by the viruses, and thus virus-derived dsRNA was proposed to be provided from other infected cells. Indeed, a dsRNA-uptake system that may contribute to this type of immunity has been reported (Saleh et al., 2006, 2009).

Poirier et al. (2018) further showed that in Drosophila, defective riboviral genomes tend to be reverse transcribed and the resultant cvDNA can serve as a template for antiviral siRNA production. Furthermore, they found that Dicer-2 played a dual role in this antiviral system: the well-known function of Dicer-2 is to cleave dsRNA into siRNA, but it also functions as a pattern recognition receptor (PRR) that senses foreign nucleic acids. As described above, the helicase domain of Dicer-2 is involved in the formation of cvDNA and interacts with viral RNA. Thus, the authors proposed that Dicer-2 acts as a PRR analogous to RIG-I-like receptors in mammals, which bind viral dsRNAs and activate innate immune signaling pathways. Taken together, these findings suggest that in Drosophila, vDNA formation is involved in small RNA-mediated anti-riboviral immunity (Poirier et al., 2018).

In Aedes mosquitoes, vDNA formation is involved in tolerance to virus infections. This is different from Drosophila, in which vDNA formation was shown to be important for resistance against virus infection. Goic et al. (2016) showed that vDNA was generated in Chikungunya virus- or Dengue virus-infected mosquito cells. Treatment with the RTase inhibitor AZT blocked vDNA formation, leading to reduced small RNA production and increased mortality from viral infections. Importantly, AZT treatment did not affect viral titers, and thus Goic et al. concluded that vDNA biogenesis in mosquitoes is involved in tolerance to riboviral infections. The mechanism of vDNA formation and subsequent small RNA production seemed to be at least partially similar to that in Drosophila. Poirier et al. (2018) showed that cvDNA was produced in ribovirus-infected Aedes mosquitoes. Furthermore, high defective viral genome (DVG)-containing virus stock showed lower infection rates in mosquitoes and treatment with AZT increased the infection rates of high DVG-containing viruses in mosquitoes. Thus, Poirier et al. proposed that DVGs can serve as templates for vDNA formation, which contributes to siRNA-mediated antiviral immune responses in insects.

Taken together, these data suggest that vDNA-mediated antiviral siRNA is an important antiviral pathway in fruit flies and mosquitoes. A series of findings regarding vDNA and small antiviral RNA formation opened the door for investigations into antiviral defenses in insects, which lack the RNA-dependent RNA polymerase that is involved in RNAi machineries in other eukaryotes such as nematodes and plants (Goic et al., 2013; Tassetto et al., 2017). Further studies in such insects should provide mechanistic insights into lineage-specific RNAi machineries.

BIOLOGICAL FUNCTIONS OF GERMLINE-INTEGRATED RIBOVIRAL SEQUENCES

In the genomes of many animals exist germline-integrated riboviral sequences, called EVEs. Riboviral EVEs may function at the DNA, RNA or protein level (Fig. 4). As DNA elements, EVEs may act as regulators such as enhancers, promoters and insulators. EVEs may also express long non-coding RNAs or short RNAs, such as microRNAs, siRNA and piwi-interacting RNA (piRNA). Furthermore, EVEs may encode proteins with various functions. Below, I introduce several reported functions or possible functions of riboviral EVEs.

EVEs can encode proteins

Genetic mutations and/or transposable element insertions can disrupt open reading frames (ORFs) within EVEs. Indeed, a large proportion of riboviral EVEs are thought to have been endogenized as far back as more than tens of MYA, and thus many have been pseudogenized. However, some EVEs still retain intact or relatively large ORFs. Although the presence of ORFs does not always mean that they encode proteins, such observations suggest that it is likely that these EVEs encode proteins in their hosts. Indeed, several riboviral EVEs have been shown to express proteins (Fujino et al., 2014; Kobayashi et al., 2016; Myers et al., 2016; Horie, 2017). Below, I summarize known and possible protein-encoding riboviral EVEs in animals.

Bornavirus-derived elements encoding proteins.

Here, I will briefly consider the known and possible functions of bornavirus-derived EVEs (endogenous bornavirus-like elements, EBLs), which have been well described in several review papers (Honda and Tomonaga, 2016; Horie, 2017). Some EBLs were shown to express protein products in mammals, such as humans, ground squirrels and elephants (Ewing et al., 2007; Fujino et al., 2014; Kobayashi et al., 2016; Myers et al., 2016). Some of these seem to be functional in the cells: an EBL in humans is apparently involved in the regulation of genome stability (Myers et al., 2016), and another in thirteen-lined ground squirrels may function as an anti-bornaviral gene (Fujino et al., 2014). Several EBLs in bats of the genus Eptesicus and afrotherian animals (e.g., elephant) have also been strongly suggested to encode functional proteins, although their functions remain unclear (Horie et al., 2016; Kobayashi et al., 2016). These possibly functional EBLs are found in the genomes of non-model organisms, which makes it difficult to systematically study them. However, recent progress in sequencing technologies has made it easier to study such organisms than previously thought. Further work is needed to understand the functions of bornavirus-derived genes in vertebrates.

Filovirus-derived elements.

Filoviruses belong to the family Filoviridae (Amarasinghe et al., 2018). Representative viruses in this family include Ebola virus (EBOV) and Marburg virus, which are highly pathogenic in humans and non-human primates (Amarasinghe et al., 2018). EVEs derived from filoviruses are present in bats of the genus Myotis (Belyi et al., 2010b; Katzourakis and Gifford, 2010; Taylor et al., 2010, 2011). Among them, an element homologous to the VP35 gene of filoviruses, named endogenous filovirus-like VP35 (EFL35), potentially has interferon-antagonist activity. EFL35 was probably integrated into the bat genome more than 20 MYA and retains a relatively long ORF, which has evolved and been maintained under natural selection (Belyi et al., 2010b; Katzourakis and Gifford, 2010; Taylor et al., 2010, 2011). Kondoh et al. (2017) and Edwards et al. (2018) showed that EFL35 in Myotis bats (mEFL35) suppressed innate immune responses, as does the original protein VP35 of exogenous filoviruses. Kondoh et al. (2017) showed that exogenous expression of mEFL35 from M. lucifugus suppressed the antiviral signaling pathway in human cells. Edwards et al. (2018) further expanded this finding. They used mEFL35 proteins from 16 Myotis species, and revealed that many of them showed similar activities in human and Myotis bat cells. The biological significance of this interferon-antagonist activity is unclear, although one possible explanation is that mEFL35 is involved in modulating IFN and inflammatory responses (Edwards et al., 2018). However, the interferon-antagonist activity is not conserved in all mEFL35s: mEFL35 from several Myotis species lacked the activity (Edwards et al., 2018). Furthermore, transcripts from the mEFL35 locus have not yet even been detected. Further studies are needed to assess the biological impact of the interferon-antagonist activity of mEFL35 in bats.

In addition to the above effect on immune signaling, intriguingly, mEFL35 expression slightly decreased the polymerase activity of EBOV in a mini-replicon assay in human cells (Kondoh et al., 2017), although this effect was not observed in another study (Edwards et al., 2018). Because mEFL35 forms a hetero-oligomer with the VP35 protein of modern filoviruses, this interaction might contribute to the inhibition of polymerase activity: namely, mEFL35 might function as a dominant-negative mutant against the viral VP35 protein. Note that mEFL35 is distantly related to modern exogenous filoviruses including EBOV (the amino acid identity between EBOV VP35 and EFL35 is 32%). Thus, this EFL35 might act as an antiviral gene against yet-to-be-discovered filoviruses that are genetically closer. Recently, two novel lineages of filoviruses, named Wenling frogfish filovirus and Wenling Thamnaconus septentrionalis filovirus, and Měnglà virus were detected in fishes and are distantly related to the three previously identified lineages of filoviruses (the genera Cuevavirus, Ebolavirus and Marburgvirus) (Shi et al., 2018). Although these two novel filoviruses are not genetically close to mEFL35, their discovery implies that additional filovirus lineages occur in nature. Thus, extensive surveillance of exogenous viruses will be important for understanding the function of riboviral EVEs, as previously discussed (Horie, 2017).

EVEs may function as small non-coding RNAs

Several EVEs encode piRNA, a type of small non-coding RNA that is mainly involved in epigenetic and post-transcriptional suppression of retrotransposable elements (Siomi et al., 2011). piRNA precursors are usually expressed from clusters in genomes, called piRNA clusters (Lau et al., 2006; Aravin et al., 2007). Interestingly, some riboviral EVEs are present in such piRNA clusters, and piRNAs are expressed from these riboviral EVE loci in animals (Parrish et al., 2015; Suzuki et al., 2017; Whitfield et al., 2017).

piRNAs are expressed from bornavirus-derived EVE loci, at least in several primates and rodents (Parrish et al., 2015). In insects, piRNAs are expressed from EVEs derived from many riboviruses, such as flaviviruses, rhabdoviruses, bunyaviruses and reoviruses (Suzuki et al., 2017; Whitfield et al., 2017). These EVE-derived piRNAs were suggested to be involved in antiviral mechanisms. Parrish et al. (2015) showed that bornaviral EVEs accumulated in piRNA clusters independently in two different lineages of mammals (primates and rodents), prompting them to propose that there may have been survival advantages in animals with such EVEs in the piRNA cluster, probably attributable to antiviral piRNA expression. However, the antiviral activity of these EVE-derived piRNAs against modern viruses has not been shown thus far. Further virological studies are needed to understand the biological significance of piRNA-encoding riboviral EVEs.

CONCLUSION AND FUTURE DIRECTIONS

As described above, recent studies on the interactions among eukaryotes, retrotransposons and riboviruses have contributed to a deeper understanding of viral biology. However, many open questions remain. Although many riboviral EVEs have been detected in diverse eukaryotes, their integration mechanisms are not well studied, except for several examples described here. Thus, more detailed investigations, such as sequence-based and molecular biological analyses, are needed to clarify the integration mechanisms. Also, several hypothetical mechanisms of action of EVE-derived genes, such as those underlying the antiviral effect of EVE-derived piRNAs, have not yet been tested (Parrish et al., 2015; Miesen et al., 2016; Parrish and Tomonaga, 2016; Suzuki et al., 2017; Whitfield et al., 2017). One can also imagine that some viruses may counteract such antiviral mechanisms, as many mammalian viruses encode innate immunity-antagonist proteins. Thus, studies from such a viewpoint will be important for understanding the arms race between riboviruses and their hosts. Considering the diversity of eukaryotes, the findings presented here are just the tip of iceberg: many other interesting interactions probably occur among eukaryotes, retrotransposons and riboviruses in other eukaryotes, or even in the animals introduced here. Further studies will contribute to a deeper understanding of these tripartite interactions.

CONFLICTS OF INTEREST

None.

ACKNOWLEDGMENTS

I thank ENDEAVR (the Endogenous Elements from Ancient Viruses Research Club) in Japan, which is partially supported by Cross-sectoral Research Platform Development Program 2016 of Kyoto University, for providing the opportunity to publish this review article. This work was supported by the Hakubi Project of Kyoto University; a Grant-in-Aid for Scientific Research on Innovative Areas from the Ministry of Education, Culture, Sports, Science and Technology (MEXT) of Japan (grant number 17H05821); and a JSPS KAKENHI for Challenging Research (Exploratory) (grant number 18K19443).

REFERENCES
 
© 2019 by The Genetics Society of Japan
feedback
Top