Theoretical insights into the DNA repair function of Arabidopsis thaliana cryptochrome-DASH

Ryuma Sato; Yoshiharu Mori; Risa Matsui; Noriaki Okimoto; Junpei Yamamoto; Makoto Taiji

doi:10.2142/biophysico.BSJ-2020010

Abstract

Following the discovery of cryptochrome-DASH (CRYD) as a new type of blue-light receptor cryptochrome, theoretical and experimental findings on CRYD have been reported. Early studies identified CRYD as highly homologous to the DNA repair enzyme photolyases (PLs), suggesting the involvement of CRYD in DNA repair. However, an experimental study reported that CRYD does not exhibit DNA repair activity in vivo. Successful PL-mediated DNA repair requires: (i) the recognition of UV-induced DNA lesions and (ii) an electron transfer reaction. If either of them is inefficient, the DNA repair activity will be low.

To elucidate the functional differences between CRYD and PL, we theoretically investigated the electron transfer reactivity and DNA binding affinity of CRYD and also performed supplementary experiments. The average electronic coupling matrix elements value for Arabidopsis thaliana CRYD (AtCRYD) was estimated to be 5.3 meV, comparable to that of Anacystis nidulans cyclobutane pyrimidine dimer PLs (AnPL) at 4.5 meV, indicating similar electron transfer reactivities. We also confirmed the DNA repair activity of AtCRYD for UV-damaged single-stranded DNA by the experimental analysis. In addition, we investigated the dynamic behavior of AtCRYD and AnPL in complex with double-stranded DNA using molecular dynamics simulations and observed the formation of a transient salt bridge between protein and DNA in AtCRYD, in contrast to AnPL in which it was formed stably. We suggested that the instability of the salt bridge between protein and DNA will lead to reduced DNA binding affinity for AtCRYD.

Significance

Although cryptochrome-DASH (CRYD) is high sequence homology with photolyases (PLs), CRYD cannot repair UV-damaged DNA. We investigated the electron transfer reactivity in terms of the electronic coupling matrix elements T_DA, using theoretical methods, indicated that T_DA value of CRYD is comparable with PLs. Hence CRYD will exhibit the comparable electron transfer reactivity with PLs. We furthermore explored DNA binding activity, elucidated that CRYD does not form the stable salt bridges among the specific amino acid residues and DNA, whereas PLs form their stable salt bridges. Therefore, we indicated that the stable salt bridge is important in order to DNA repair.

Introduction

DNA damage by UV radiation (290–400 nm) results in the formation of two major types of UV-induced DNA lesions at adjacent pyrimidine bases: cis-syn cyclobutane pyrimidine dimers (CPDs) and pyrimidine-pyrimidone (6-4) photoproducts ((6-4)PPs) [1–3]. These lesions block cellular replication and transcription, leading to growth inhibition in plants and skin cancer in mammals. Photolyases (PLs) have been identified as enzymes responsible for the repair of these lesions [1,4,5]. Cryptochromes (CRYs) [6–8], which are blue light receptors, play key roles in various biological functions, such as entrainment of circadian rhythms in plants and magnetic receptors in birds [8–14]. Both PLs and CRYs belong to the PLs/CRYs superfamily, which includes proteins with diverse functions. PLs and CRYs share homologies in amino acid sequence and tertiary structure [1], besides sharing a flavin adenine dinucleotide (FAD) as a common cofactor that absorbs blue light in sunlight to exert their functions [2,4]. However, CRYs do not exhibit DNA repair activity in response to UV-induced DNA damage because the FAD in CRYs is hardly converted to the fully reduced form of FAD (FADH^–) [15,16], which is the active redox state of FAD in PLs. In 2000, Brudler et al. identified a new type cryptochrome, cryptochrome-DASH (CRYD), in bacteria and plants [17]. The structure and the function of CRYD were investigated experimentally [17–21], and it was reported that Arabidopsis thaliana CRYD (AtCRYD) can selectively repair CPD on single-stranded DNA (ssDNA) but not on double-stranded DNA (dsDNA) [20]. The reason why CRYD does not exhibit this distinct DNA repair activity remains obscure.

The DNA repair process of PLs consists of the following steps: (1) recognition of UV-induced DNA lesions by PLs, accommodating the lesions in the active site; (2) light-excitation of the fully reduced state of FAD (FADH^–*); (3) forward electron transfer (FET) from FADH^–* to the UV-induced DNA lesions, forming a neutral semiquinoid FAD (FADH^•) and an anion radical of the lesions; (4) rearrangement of the UV-induced DNA lesions in the anion radical state; and (5) a back electron transfer from the repaired pyrimidine bases to FADH^• [1,22–26]. The FET reaction, a pivotal process for the light-driven DNA repair mechanism by PLs, has been explored in detail by theoretical and experimental analyses. Currently, FET is considered to occur using the superexchange mechanism via the adenine moiety of FAD. Stuchebrukhov et al. reported pioneering results regarding the superexchange mechanism via the adenine moiety by investigating the electron transfer (ET) pathway of CPD photolyases (CPD-PLs), using a tunneling current approach [27–30]. Liu et al. investigated the ET reaction of CPD-PLs using femtosecond spectroscopy and demonstrated that the adenine-mediated superexchange mechanism was dominant [25]. Recently, Miyazawa et al. and Sato et al. suggested the possibility of amino acid residue-mediated superexchange mechanism via a methionine and/or an asparagine residue in the active site [31,32] and Prytkova et al. suggested the direct ET from FADH^–* to CPD [33]. Although the outline of the DNA repair mechanism for CPD-PLs has been revealed during the last two decades by theoretical [29–39] and experimental studies [23–26,40–45], there is room for further research on the DNA repair mechanism because the primary process in the DNA repair reaction, i.e., the recognition of UV-induced DNA lesions by PLs, is not clearly understood. An understanding of the whole DNA repair process would shed light on the origin of the functional differences between PLs and CRYD.

Here, we investigated the FET reaction and DNA binding of Anacystis nidulans CPD-PL (AnPL) and AtCRYD to identify the differences in the DNA repair activities of PLs and CRYs. In this study, the electronic coupling matrix elements T_DA was estimated for AtCRYD using molecular dynamics (MD) simulations and quantum chemical calculations. In addition, the T_DA value of an AtCRYD mutant, in which Gln395 was replaced with methionine residue (AtCRYD^Q395M), was also estimated. Previous studies reported that the methionine (Met353) and the asparagine (Asn349) residues contributed to FET in AnPL [31,32]. The amino acid residue which is same position as Met353 of AnPL in AtCRYD replaced the glutamine residue (Gln395), whereas the amino acid residue which is same position of Asn349 of AnPL in AtCRYD was not replaced (Asn391) (the Supplementary Fig. S1 and S2). Therefore, comparing the T_DA values of wild-type AtCRYD and mutant AtCRYD, we considered that the differences in the amino acid residues might affect the FET reactivity. The result indicated that the T_DA value of AtCRYD was comparable to that of AtCRYD^Q395M and AnPL, which were estimated in the previous study [32]. Our experimental analysis confirmed that AtCRYD exhibited DNA repair activity for ssDNA. Then, we measured the DNA repair activity for wild-type AtCRYD and mutant AtCRYD by mutating the three amino acid residues in AtCRYD (Arg443Lys/Glu444Pro/Asp445Leu) and found that both proteins cannot repair dsDNA. To elucidate the underlying reason, we compared the behaviors of AnPL and AtCRYD in complex with dsDNA containing the CPD, using MD simulations. The MD simulations indicated that the stable salt bridges between the two positively charged amino acid residues (Lys401 and Arg404 for AnPL, Arg443 and Arg446 for AtCRYD) and DNA were stable in AnPL, in contrast to those in AtCRYD, which were transient. We proposed that this instability of the salt bridges might be one factor that leads to lower DNA binding activity of AtCRYD for dsDNA. In addition, our theoretical and experimental analyses further indicated that the two negatively charged residues, Glu444 and Asp445, are not be the cause of the transient salt bridge.

Methods

In this study, we investigated two major issues: (1) the electron transfer (ET) reactivity of AtCRYD in complex with UV-damaged single-stranded DNA (ssDNA) containing a CPD (AtCRYD-ssDNA) and (2) the dynamical behavior of AnPL in complex with UV-damaged dsDNA containing a CPD (AnPL-dsDNA) and of AtCRYD in complex with UV-damaged dsDNA containing a CPD (AtCRYD-dsDNA) complex.

Setup for the initial structures

To estimate the electronic coupling matrix element T_DA for evaluation of ET reactivity, theoretical calculations were performed using either the coordinates after geometry optimization of the X-ray crystal structure (PDB ID: 2VTB [21]) by quantum mechanics/molecular mechanics (QM/MM) calculation or the equilibrated structures of MD simulations, depending on the purpose. We used either i) each optimized structure of AtCRYD in complex with ssDNA containing a CPD, and AtCRYD mutant in which Gln395 of 2VTB was replaced with methionine (AtCRYD^Q395M) by QM/MM calculation, or ii) many snapshots during 500 ns MD simulations for AtCRYD-ssDNA and AtCRYD^Q395M-ssDNA to evaluate the contribution of the protein environment on the T_DA value. In both cases, the artificial CPD-analog in 2VTB was converted to the CPD with the natural phosphate by applying a force field upon the tLeap procedure described below. These model structures were used as the initial structures for the QM/MM calculation and the 500 ns MD simulations.

For the observation of the dynamical behavior of AnPL-dsDNA and AtCRYD-dsDNA complexes, a complex structure of AtCRYD with dsDNA containing CPD is required, but it has not been solved to date. We, therefore, prepared a model structure of the AtCRYD-dsDNA complex. The structures of AnPL in complex with dsDNA containing a CPD-analog with a formacetal linkage in its internucleoside moiety (PDB ID: 1TEZ [46]) and AtCRYD (PDB ID: 2VTB) were aligned by TM-align [47], and the AnPL was replaced with AtCRYD (Supplementary Fig. S2). We used these model structures as the initial structures for the 1000ns MD simulations. Note that the CPD-analog in AnPL-dsDNA and AtCRYD-dsDNA was also converted to the natural CPD in the manner described above.

Amber force fields for each cofactor and cyclobutane pyrimidine dimer

In order to obtain the force fields of CPD, FADH^–, 5,10-methenyltetrahydrofolate (MTHF), and 8-hydroxy-7,8-didemethyl-5-deazariboflavin (8-HDF), we performed a geometry optimization and electron structure calculation for each molecule with a B3LYP/6-31G(d) and MP2/6-31+G(d,p) level of theory using Gaussian 16 [48]. Here, the geometric coordinate of CPD was obtained from PDB ID: 1SNH [49], and FADH^–, MTHF, and 8-HDF were obtained from PDB ID: 2VTB and 1TEZ, respectively. Each cofactor and CPD was parameterized using a general amber force field [50], and the partial charges were calculated using the restrained electrostatic potential method using the antechamber module of the AMBER16 program package [51].

Setup for quantum mechanics/molecular mechanics calculations and molecular dynamics simulations

For the QM/MM calculation and MD simulations to estimate the T_DA value, we prepared initial structures using the tLeap module of the AMBER program package by assigning the force fields for AtCRYD-ssDNA and AtCRYD^Q395M-ssDNA. Here, we adopted the AMBER ff14SB for protein [52], the AMBER OL15 for DNA [53], and the created force fields for CPD, FADH^–, and MTHF. Since it has been supposed by theoretical and experimental studies that the protonation of Glu283 in AnPL affects the DNA repair reaction [35,46,54], Glu325 was protonated in AtCRYD-ssDNA and AtCRYD^Q395M-ssDNA, and other amino acid residues maintained the standard state in physiological pH. We adopted a TIP3P model for water molecules [55] and constructed square simulation boxes with a margin of at least 12 Å from proteins to the box boundaries. Finally, to neutralize each system, counterions (Na⁺) were added.

To investigate the dsDNA binding mechanism for CPD-PL and CRYD, we used the modified X-ray crystal structure of AnPL-dsDNA and the modeled structure for AtCRYD-dsDNA. Each solvated system was created according to the procedures described above, and we adopted the created force field for 8-HDF in AnPL-dsDNA. Simulation boxes with a margin of at least 30 Å from proteins to the box boundaries were used.

The procedures of molecular dynamics simulations

In this study, we carried out the MD simulations as follows: (1) the obtained AMBER topology file was converted to a GROMACS topology file for each system using the ACPYPE tool [56], and then energy minimization was performed for 5000 steps. (2) The temperature of each system was heated from 0 K to 298 K during 100 ps. First, the temperature was heated from 0 K to 5 K for 10 ps under the NVT ensemble (T=5 K). Then, the temperature was gradually heated for nine steps under the NPT ensemble (P=1 bar; T=5 K, 50 K, 100 K, 150 K, 200 K, 225 K, 250 K, 275 K, and 298 K), where each step was carried out for 10 ps. (3) The MD simulations were carried out for 500 ns in the T_DA values estimation or 1000 ns in the DNA binding study under the NPT ensemble (P=1 bar and T=298 K). Here a restriction for the 5'- and 3'-edge nucleobases of dsDNA (i.e., a force constant of 4184 kJ/(mol·nm²)) was applied during temperature heating and production run procedures. In all MD simulations, the linear constraint solver algorithm was used for constraints [57], the Nose-Hoover thermostat for temperature regulation [58–60], and the Berendsen barostat for pressure regulation [61]. The periodic boundary conditions were applied with the particle mesh Ewald method [62], and the simulation time step was 2 fs. A snapshot was recorded every 10 ps. In this study, all MD simulations were performed by the GROMACS version 5.0.4 program package [63].

Quantum mechanics/molecular mechanics calculation

We carried out the geometry optimization by the QM/MM calculation for the energy minimized structures of AtCRYD-ssDNA and AtCRYD^Q395M-ssDNA by MM level using GROMACS. In the QM/MM calculation of AtCRYD-ssDNA (AtCRYD^Q395M-ssDNA), the QM region included CPD, FADH^–, Glu325, Asn391, and Gln395 (Met395), whereas the other molecules were in the MM region (Supplementary Fig. S3). We adopted our own N-layered integrated molecular orbital and molecular mechanics (ONIOM) method [64] and performed the geometry optimization with B3LYP/6-31G(d):AMBER94 levels at Gaussian 09 [65], where the electronic embedding is adopted to include the MM region charge distribution in the QM evaluations [66].

Electronic coupling matrix elements

To investigate the ET reaction in this study, the electronic coupling matrix elements T_DA were estimated. The T_DA value is usually used to evaluate the ET rate constants K_ET. K_ET is expressed as the following equation by the Marcus theory [67]:

KET=2πℏTDA214πλkBTexp-∆G+λ24λkBT

(1)

where ΔG and λ are the driving force and the reorganization energy, respectively; ħ, K_B and T are the Planck constant divided by 2π, Boltzmann constant, and temperature, respectively. To estimate the T_DA value, we adopted the generalized Mulliken–Hush (GMH) method [68–70] with the configuration interaction singles (CIS) method [71]. The GMH method is expressed as follows:

TDA=μ12∆E12μ1-μ22+4μ122

(2)

where μ₁ and μ₂ are the corresponding state dipole moment vectors of initial state 1 and final state 2, respectively, and μ₁₂ is the transition dipole moment from the CIS wave functions ψ₁ to ψ₂. ΔE₁₂ is the difference in the excited state energies between ψ₁ and ψ₂. Here, we performed the CIS calculation using the GAMESS program [72].

We applied this method for the QM/MM optimized structure of each system (i.e., AtCRYD-ssDNA and AtCRYD^Q395M-ssDNA), where the isoalloxazine ring of FADH^–, the adenine moiety of FADH^–, CPD, Glu325, Asn391, and Glu395 or Met395 for AtCRYD-ssDNA and AtCRYD^Q395M-ssDNA were included in the QM region to calculate CIS, whereas the other atoms were treated as the point charge to simulate the surrounding environment of the QM region. To simplify the system further, the ribitol part of FADH^– and the main chain of Glu, Asn, Gln, and Met were replaced with methyl group. Also, the internucleoside linkages and its neighboring oxygen atom of the 5'-side and the oxygen atom adjacent the internucleoside linkages of 3'-side in CPD replaced with H atom and the ribose part of the adenine moiety of FADH^– were replaced with H atom (Supplementary Fig. S4). We also applied the same procedure for 200 MD snapshots of AtCRYD-ssDNA and AtCRYD^Q395M-ssDNA.

Results and Discussion

Electron transfer reactivity for cryptochrome-DASH

To investigate the ET reaction of AtCRYD-ssDNA and AtCRYD^Q395M-ssDNA, we evaluated the electronic coupling matrix elements T_DA with the molecular configurations shown in Figure 1a. As shown in Figure 2, the relevant molecular orbitals (MOs) of each system were found to be similar. In all cases, MOs for the ground state S₀ were populated in the distal side of the isoalloxazine ring toward the CPD, whereas those for the lowest singlet excited state S₁ and the second singlet excited state S₂ were populated in the proximal side of the isoalloxazine ring. MOs for the third singlet excited state S₃ were populated in the 3'-side of CPD. These results indicated that S₀→S₁ and S₀→S₂ transitions were assigned as π→π* excitations of flavin (locally excited (LE) state), and S₀→S₃ indicated the charge transfer (CT) from FADH^– to CPD. The excitation energies to S₁, S₂, and S₃ for AtCRYD-ssDNA were 4.4643, 4.8752, and 5.2237 eV, respectively, while those for AtCRYD^Q395M-ssDNA were 4.4432, 4.8812, and 4.9730 eV, respectively. And the oscillator strengths to S₁, S₂, and S₃ for AtCRYD-ssDNA were 0.00, 0.25, and 0.00, respectively, while those for AtCRYD^Q395M-ssDNA were 0.00, 0.25, and 0.00, respectively, (Table 1). These results indicate that the displacement of the electron amplitude from the distal side to the proximal side of the isoalloxazine ring occurs upon light-excitation (S₀→S₂) and subsequently being relaxes to the S₁ state by internal conversion. The internal conversion rate of flavin determined by time-resolved absorption and fluorescence experiments showed that the relaxation S₂→S₁ would occur on an ultrafast time scale (<1 ps) [73,74]. These results indicate that ET occurs from FADH^–* to CPD by the thermal excitation from the S₁ state to the S₃ state after relaxation.

Figure 1

A calculation system of CIS for QM/MM optimized structures and MD snapshots. (a) The molecules in the QM region; isoalloxazine ring of FADH^–, adenine moiety of FADH^–, CPD, Glu325, Asn391, and Gln395 or Met395 for AtCRYD or AtCRYD^Q395M. The other atoms are treated as point charges, which are shown as black dots. Notably, water molecules were taken into account as point charges, although they are not shown in this figure. (b) Schematic view of the relative position of the adenine moiety and the CPD. d₁ and d₂ are the distances between the adenine moiety (N6 atom) of FADH^– and CPD (two O atoms).

Figure 2

Molecular orbitals for each excited state calculated for the QM/MM optimized structures. S₀ is the ground state, and S_n is n^th singlet excited state. Here, the molecular orbitals of the largest CI coefficient of each excited state for QM region (Fig. 1a) were indicated.

Table 1 Excited state, CI vector, and excitation energies [eV] for QM/MM optimized structure

AtCRYD Excited state^a	CI vector (top CI coefficient and transition are shown)^b	Excitation energy (oscillator strength)^b
S₁ (LE state (I))	–0.64 (HOMO→LUMO+13)	4.4632 (0.00)
S₂ (LE state (I))	–0.56 (HOMO→LUMO+14)	4.8752 (0.25)
S₃ (CT state (C))	–0.99 (HOMO→LUMO)	5.2237 (0.00)
S₅ (CT state (A))	–0.86 (HOMO→LUMO+5)	5.8203 (0.02)
AtCRYD^Q395M Excited state^a	CI vector (top CI coefficient and transition are shown)^b	Excitation energy (oscillator strength)^b
S₁ (LE state (I))	–0.64 (HOMO→LUMO+14)	4.4432 (0.00)
S₂ (LE state (I))	–0.51 (HOMO→LUMO+16)	4.8812 (0.25)
S₃ (CT state (C))	–0.99 (HOMO→LUMO)	4.9730 (0.00)
S₅ (CT state (A))	0.70 (HOMO→LUMO+4)	5.8830 (0.05)

^a I, A, and C represent isoalloxazine ring, adenine moiety, CPD, respectively. ^b The calculation level of CIS/6-31G(d).

We then estimated the T_DA value for each system using the obtained MOs. The T_DA value was 6.5 meV for AtCRYD-ssDNA and was 3.1 meV for AtCRYD^Q395M-ssDNA. To verify the structures obtained by QM/MM optimization, the distances between CPD and the adenine moiety of FADH^– in AtCRD-ssDNA and AtCRD^Q395M-ssDNA were evaluated. The measured distances d₁ and d₂ represent the relative position of adenine toward the CPD (i.e., the distances between the N6 of the adenine moiety and the two oxygens O4 of CPD; 5'-side of CPD for d₁ and 3'-side for d₂; Fig. 1b). The obtained distances are summarized in the Supplementary Table S1. In the comparison between d₁ and d₂, the relative position of the adenine moiety for AtCRYD-ssDNA was more biased to the 5'-side of the CPD than that of AnCRYD^Q395M-ssDNA, with a difference of 0.48 Å compared to AtCRYD^Q395M-ssDNA. Thus, it is suggested that the position of the adenine moiety gave a major contribution to the T_DA value. Notably, ET reaction is generally reported to be sensitive to the protein environment [75]; hence, the contribution of the protein environments should be taken into account. To investigate the influence of the protein environments, we performed CIS calculation for the QM/MM optimized structure of AtCRYD-ssDNA, excluding the all point charges; however, we could not observe the CT state from FADH^– to CPD within ten excitation states (S₁–S₁₀). Then we performed a CIS calculation for the system, excluding the point charge of ssDNA. As a result, the estimated excitation energy of the CT state (S₃) from FADH^– to CPD was 4.87 eV, which was smaller than that of the full system (5.22 eV). In contrast, the excitation energies of the LE states of FADH^– (S₁ and S₂) were almost unchanged (<0.05 eV). The T_DA value (7.0 meV) changed slightly as compared to that of the full system (6.5 meV). Thus, it is shown that the presence of the surrounding amino acid residues of active site and nucleotides have a vast influence on the electronic state of the active site.

Based on the above results, we estimated the T_DA value to gauge the structural fluctuation using MD snapshots. First, we performed 500 ns MD simulations for AtCRYD-ssDNA and AtCRYD^Q395M-ssDNA and estimated the root-mean-square deviation (RMSD) value (Supplementary Fig. S5). Each RMSD value was stable after 300 ns, and the average RMSD values for the last 200 ns were 2.3±0.2 and 2.4±0.1 Å in AtCRYD-ssDNA and AtCRYD^Q395M-ssDNA, respectively. We, therefore, randomly chose 200 snapshots from the last 200 ns MD simulations and calculated the MOs for each snapshot in the same way as for the QM/MM optimized structures. A representative set of MOs selected from these 200 calculations is depicted in Figure 3, and the excitation energies and the oscillator strengths are summarized in Table 2. The S₁ and S₂ excited states were found to be in the same states (LE state of the isoalloxazine ring) as those for the QM/MM optimized structure, with excitation energies of ~4.0 eV and ~4.6 eV for AtCRYD-ssDNA and AtCRYD^Q395M-ssDNA, respectively. In contrast, the S₃ excited state was in the CT state from the isoalloxazine ring to the adenine moiety (CT-A), which was distinct from the result for the QM/MM optimized structures (Table 1). Finally, the S₄ excited state was found to be the CT state from the isoalloxazine ring to CPD (CT-C), indicating that the CT-A state had decreased in energy due to the protein environment. In some of the MD snapshots, we also observed the S₃ excited state as CT-C, but the rate of its occurrence was rather low (<10%). Indeed, such behavior was previously reported for AnCPD-PL by Lee et al. [38], who calculated the excited states for 20 structures containing FADH^– and CPD with an ADC(2)/def2-SVP level of theory and reported that the orders of the CT-A and CT-C states depending on the structure. Although a hopping mechanism through the adenine moiety was considered, this mechanism was ruled out by a later experimental study [25]. Therefore, we assumed that the ET reaction from the isoalloxazine ring to CPD always occurs. Taking into account the similarity between the energy levels, we concluded that the S₄–S₆ excited states were dominant in the ET reaction for AtCRYD-ssDNA and AtCRYD^Q395M-ssDNA (53% and 59%, respectively). Notably, the orders of the excited states higher than S₄ for the MD snapshots were diverse, with similar excitation energies within a range of ~0.2 eV (Supplementary Table S2).

Figure 3

Representative molecular orbitals calculated for MD snapshot. S₀ is the ground state, and S_n is n^th singlet excited state. Here, the molecular orbitals of the largest CI coefficient of each excited state for QM region (Fig. 1a) were indicated.

Table 2 Excited state, CI vector, and excitation energies [eV] for representative MD snapshot

AtCRYD Excited state^a	CI vector (top CI coefficient and transition are shown)^b	Excitation energy (oscillator strength)^b
S₁ (LE state (I))	0.66 (HOMO→LUMO+14)	4.1277 (0.03)
S₂ (LE state (I))	0.80 (HOMO→LUMO+10)	4.6373 (0.20)
S₃ (CT state (A))	0.86 (HOMO→LUMO+2)	5.2078 (0.02)
S₄ (CT state (C))	–0.92 (HOMO→LUMO)	5.3534 (0.00)
AtCRYD^Q395M Excited state^a	CI vector (top CI coefficient and transition are shown)^b	Excitation energy (oscillator strength)^b
S₁ (LE state (I))	–0.75 (HOMO→LUMO+14)	4.0096 (0.03)
S₂ (LE state (I))	0.58 (HOMO→LUMO+13)	4.6743 (0.17)
S₃ (CT state (A))	0.91 (HOMO→LUMO+1)	5.2596 (0.00)
S₄ (CT state (C))	0.93 (HOMO→LUMO)	5.4777 (0.00)

^a I, A and C represent isoalloxazine ring, adenine moiety and CPD, respectively. ^b The calculation level of CIS/6-31G(d).

The average T_DA values for the MD snapshots were 5.3±4.6 meV for AtCRYD-ssDNA and 5.1±4.8 meV for AtCRYD^Q395M-ssDNA, whereas these values for the QM/MM optimized structures were ~6.0 and ~3.0 meV, respectively. These changes indicated that the T_DA value was remarkably affected by the protein environment. In the previous study using the same method, the T_DA value of AnPL-dsDNA was estimated to be 4.5±3.7 meV [32], which is in a good agreement with the estimated values for AtCRYD-ssDNA and AtCRYD^Q395M-ssDNA. In the previous experimental study, the ET rate constant of Escherichia coli CPD-PL was identified as 250 ps, with –0.44 eV for the driving force ΔG and 1.21 eV for the reorientation energy λ, corresponding to 3.0 meV of the estimated electronic coupling constant [25]. This result indicates that the ET rate constant of AtCRYD may be comparable to that of AnPL. As mentioned above, a difference in the lesion-binding site could be responsible for the reduction in the DNA repair activity in AtCRYD; however, the T_DA value of AtCRYD^Q395M, which a binding site resembling that of AnPL, was found to be identical to that of AtCRYD. This fact excludes the possibility that the methionine in the active site critically affects the ET rate constant, although the methionine is important for the stabilization of the active site [54] and the CPD-PL activity [46,76].

In the MD simulation for AtCRYD-ssDNA, the average distances d₁ and d₂ were 2.9±0.2 Å and 3.3±0.4 Å, respectively, whereas those for AtCRYD^Q395M-ssDNA were 3.1±0.2 Å and 3.1±0.3 Å, respectively. To investigate the relationship between the relative position of the adenine–CPD pair and the T_DA value, the T_DA value for each snapshot was evaluated and mapped as functions of the distances d₁ and d₂ (Fig. 4). The distances d₁ and d₂ were distributed in the range of 2.6–4.0 Å; however, large T_DA values (>15 meV) were mostly observed only when either d₁ or d₂ was shorter than 3.0 Å. This result indicates that the T_DA value might decrease when the adenine moiety is located away from CPD. Rousseau et al. also reported that the contribution of the superexchange mechanism decreased when the adenine moiety was separated from CPD [39]. Therefore, the adenine moiety is very important for the ET reaction.

Figure 4

Correlation between distances, d₁ and d₂, and the T_DA value for AtCRYD and AtCRYD^Q395. Here blue, green, yellow, red and black dots express T_DA<5.0 meV, 5.0≤T_DA<10.0 meV, 10.0≤T_DA<15.0 meV, 15.0≤T_DA<20.0 meV, and T_DA≥20.0 meV, respectively.

Molecular dynamics simulations for dsDNA binding state

Because the electron transfer efficiency of AtCRYD was estimated to be similar to that of AnPL, the functional differences between CRYD and PL have been speculated to originate from an upstream DNA repair process. Indeed, a previous experimental study demonstrated that AtCRYD is able to bind specifically to the CPD located in the bubble of dsDNA, indicating that the CPD binds weakly to the dsDNA [21]. A previous theoretical study suggested that the two negatively charged amino acid residues, Glu444 and Asp445, in AtCRYD would give rise to the repulsion of dsDNA [77]. The same sequence positions in AnPL are occupied by nonpolar amino acid residues (Pro402 and Leu403) (Supplementary Fig. S1). Therefore, we first considered the possibility that these negatively charged residues affect the DNA repair activity of AtCRYD. To confirm this, we prepared mutant AtCRYD (Supplementary Texts S1, S2 and Figs. S6 and S7) by mutating the three amino acid residues in AtCRYD to those of AnPL, namely Arg443Lys/Glu444Pro/Asp445Leu, and examining their DNA repair activities in vitro. Although the ssDNA containing the CPD was successfully repaired in both proteins, as expected from our results on ET reactivity, the repair of the CPD in the dsDNA was unsuccessful even for the triple mutant (Supplementary Text S3 and Fig. S8). This result indicated that the repulsion among two negatively charged residues and DNA will not be a dominant factor for the inhibition of DNA repair activity in AtCRYD.

To elucidate the other factors inhibiting DNA repair activity, we compared the behavior of the AnPL-dsDNA complex with that of the AtCRYD-dsDNA complex using MD simulations. We focused the formation of the salt bridge between Lys401, Arg404, and DNA in AnPL because it was reported that the salt bridge between protein and DNA relates to the DNA repair yield of PLs [78]. Although no salt bridge between the positively charged residues (Lys401 and Arg404) and the phosphate backbone of the DNA was found in the AnPL crystal structure [46], the stable salt bridges were observed in our MD simulations. The average value of the distances between Lys401 and the phosphate backbone of 5'-side of CPD (d_NZ-O1 and d_NZ-O2: the distances between the nitrogen atom NZ of Lys401 and the two oxygen atoms O1 or O2 of the phosphate) were close enough to form the salt bridge, with 4.2±1.0 Å for d_NZ-O1 and 3.0±0.5 Å for d_NZ-O2. The same held true for the Arg404 and the internucleoside phosphate of the CPD pair, and d_NH1-O11, d_NH1-O12, d_NH2-O11, and d_NH2-O12 (i.e., the distances between the two nitrogen atoms NH1 and NH2 of Arg404 and two oxygen atoms O11 and O12 of the internucleoside phosphate of CPD) were 3.0±0.6, 4.1±0.5, 4.4±0.9, and 4.5±0.7 Å, respectively, during the last 500 ns of 1000 ns MD simulations (Figs. 5a–c). The above conformational difference between the crystal structure and the MD simulation would be attributable to the presence of the natural phosphate at the internucleoside linkage in the CPD, instead of the nonpolar formacetal moiety. In the structural analysis of class II CPD-PL from Methanosarcina mazei, the replacement of the formacetal by the phosphate induced a rearrangement at the active site that allowed the CPD lesion to be accommodated more tightly than with the formacetal linkage [79]. Based on these results, formation of the salt bridge between DNA and amino acid residues is important for DNA binding, hence implying necessity to flip the CPD out of the DNA duplex at the beginning of the DNA repair process.

Figure 5

The distance between an amino acid residue and CPD that form a salt bridge during 1000 ns MD simulations. For AnPL, d_NZ-O1 and d_NZ-O2 are the distances between nitrogen atom NZ of Lys401 and oxygen atoms O1 and O2 of CPD, respectively. d_NH1-O11, d_NH1-O12, d_NH2-O11, and d_NH2-O12 are the distance between nitrogen atoms NH1 and NH2 of Arg404 and oxygen atoms O11 and O12 of CPD, as indicated by the subscripts. For AtCRYD, d1_NH1-O11, d1_NH1-O12, d1_NH2-O11, and d1_NH2-O12 are the distances between nitrogen atoms NH1 and NH2 of Arg443 and oxygen atoms O11 and O12 of CPD, as indicated by the subscripts, while d2_NH1-O11, d2_NH1-O12, d2_NH2-O11, and d2_NH2-O12 are the distances between nitrogen atoms NH1 and NH2 of Arg446 and oxygen atoms O11 and O12 of CPD, as indicated by the subscripts. (a), (b) and (c) are each distance of AnPL. (a) the distances d_NZ-O1 and d_NZ-O2 are displayed, (b) the distances d_NH1-O11 and d_NH1-O12 are displayed, and (c) the distances d_NH2-O11 and d_NH2-O12 are displayed, respectively. On the other hand, (d), (e), (f), and (g) are each distance of AtCRYD. (d) the distances d1_NH1-O11 and d1_NH1-O12 are displayed, (e) the distances d1_NH2-O11 and d1_NH2-O12 are displayed, (f) the distances d2_NH1-O11 and d2_NH1-O12 are displayed, and (g) the distances d2_NH2-O11 and d2_NH2-O12 are displayed, respectively.

In contrast with AnPL-dsDNA, the two positively charged residues in AtCRYD (Arg443 and Arg446) did not participate in the electrostatic interactions during the MD simulations (Fig. 5), despite their location at the same position as AnPL (Supplementary Fig. S1). The Arg443 residue, corresponding to Lys401 in AnPL, was located away from the phosphate backbone of 5'-side of CPD and distal to the internucleoside phosphate moiety during the 900 ns MD simulations (10.9±3.1 Å for d1_NH1-O11, 11.9±3.4 Å for d1_NH1-O12, 10.4±3.5 Å for d1_NH2-O11, and 11.3±3.8 Å for d1_NH2-O12). However, we observed that Arg443 swung closer to the internucleoside phosphate at a distance of <4 Å, forming a salt bridge during the last 100 ns of MD simulations (Figs. 5d, e). The electrostatic interaction between Arg446, corresponding to Arg404 in AnPL, and the internucleoside phosphate of CPD seemed to be weak, with slightly longer distances compared to those observed for AnPL (5.9±2.3 Å for d2_NH1-O11, 7.4±2.2 Å for d2_NH1-O12, 5.4±1.7 Å for d2_NH2-O11, and 6.5±1.8 Å for d2_NH2-O12; Figs. 5f, g). Therefore, the salt bridge would be transiently formed upon DNA binding in the case of AtCRYD. These results indicate that the stability of the salt bridge in AtCRYD is much weaker than that of AnPL. The transient salt bridge formation observed here would inhibit proper DNA binding of AtCRYD. In addition, we attempted the MD simulations for the model structure of triple mutant AtCRYD, but the salt bridges were not improved. In particular, the salt bridge of Arg446, which was considered more important for DNA repair activity [78], was not definitely formed (Supplementary Fig. S9).

As we suggested above, the salt bridge formation would play a key role in lesion flipping. The formation of a transient salt bridge would be incompatible with accommodation of the CPD lesion in the binding site, presumably preventing the repair of dsDNA for AtCRYD. Therefore, we suggested that the formation of a transient salt bridge is one factor determining the DNA binding inhibition of AtCRYD by generating attraction between the positively charged residues (Arg443 and Arg446) and DNA rather than the repulsion by the negatively charged residues (Glu444 and Asp445). In addition, it was also indicated that the presence of the negatively charged residues is not responsible for the instability of the salt bridge.

Conclusion

In this study, we searched for the intrinsic origin of the functional differences between AtCRYD and AnPL by using theoretical calculations with supplementary experiments. Regarding their ET reactivities, we found that i) the energy levels of the molecular orbitals in excited states S₃ or above varied depending on the protein environment, ii) the ET reactivity, described by electronic coupling matrix element T_DA, for AtCRYD was in good agreement with that for AnPL, and iii) an introduction of a minor mutation, such as replacement of methionine with glutamine in the active site did not influence the ET reactivity. Comprehensively, these findings indicated that AtCRYD should be able to repair UV-damaged DNA if the lesion is properly accommodated in the active site. Therefore, it is suggested that the differences between AtCRYD and AnPL in biological function would be ascribed to the DNA binding properties. The MD simulations of AnPL-dsDNA and AtCRYD-dsDNA complexes indicated that DNA binding would be inhibited by transient salt bridge formation in AtCRYD and the repulsive interaction with DNA due to negatively charged residues (Glu444 and Asp445 in AtCRYD) would not be the cause of this instability. The proper formation of the salt bridge will be a key factor that leads the DNA repair function in AtCRYD. This finding of the transient salt bridge formation in AtCRYD will give further insights into the functional differences between AtCRYD and AnPL in future studies.

Acknowledgements

This work was supported by RIKEN’s special postdoctoral researcher program. The calculations were performed using Research Center for Computational Science (RCCS), Okazaki, Japan and RIKEN HOKUSAI System.

Conflicts of Interest

All the authors declare that they have no conflict of interest.

Author Contributions

R.S. and M.T. directed this project. R.S. and Y.M. performed the theoretical calculations and data analysis. R.M. and J.Y. performed the AtCRYD production and in vitro assays. R.S., N.O., J.Y. and M.T. wrote the manuscript.

References

Corresponding author

Register with J-STAGE for free!