2017 Volume 40 Issue 3 Pages 345-351
The new emerging Middle East Respiratory Syndrome Coronavirus (MERS CoV) encodes several resistance proteins against the innate immune response of the host, including interferon (IFN) resistance. Monitoring of the status of such proteins will be important to track viral pathogenicity. In this study, molecular dynamics approaches were used to investigate MERS CoV Non-Structural Protein 3 (NSP3) specific proteins that resist host innate immunity. MERS CoV papain-like protease (Plpro) was more conformationally flexible than Severe Acute Respiratory Syndrome CoV (SARS) CoV Plpro. This flexibility was evident in either the free form or when bound with ubiquitin. There were marked changes in the root-mean-square deviation (RMSD) in the ubiquitin like domain (Ubl) and the fingers subdomain of the catalytic domain of Plpro. An interesting feature is the dynamic change in Ubl, which shows a rigid conformation in the free form of Plpro but is fully flexible upon the binding of ubiquitin. This increased flexibility could be important for the downstream effects of the interaction with other proteins and the inhibition of the innate immunity. Four major residues involved in deubiquitination, L106, P163, R168 and F265, were conserved in all MERS CoVs and differed from other Beta CoVs. These conserved CoV residues were associated with lower deubiquitinating activity and render MERS CoV Plpro with less potent deubiquitinating potential. The number of residues and total interactions with ubiquitin were lower for the MERS CoV Plpro than for the SARS CoV. These factors contribute to the lower deubiquitinating actions of MERS CoV NSP3 and its subsequently lower interaction with the host immune system.
Coronavirus (CoV) was first discovered in Saudi Arabia in 2012 in a patient hospitalized with severe respiratory distress followed by pneumonia and renal failure.1) Early work indicated the relationship of the new virus with the Beta lineage of CoV. The study group of the CoV later termed this new infection Middle East Respiratory Syndrome Coronavirus (MERS CoV). Soon after its initial discovery, the virus was discovered in several countries, including the U.S.A., U.K., France, Jordan, Tunisia and other countries.2) Recently, MERS CoV took a dangerous turn by causing a serious outbreak in Korea and a marked number of cases in China.3,4) The case fatality rate of MERS CoV exceeds that of the known Severe Acute Respiratory Syndrome CoV (SARS). This might be due to the severe respiratory and renal failure found during MERS CoV infection. Both SARS and MERS CoVs are similar in many structural and epidemiological aspects. While SARS CoV infection is more acute in nature, and the number of cases decreased after the first episode, MERS CoV has become more endemic in the Arabian Peninsula, with the continuous discovery of new cases over several years from 2012 until the present.
The CoV genome is approximately 30 kilobases and is composed of 10 open reading frames (ORFs).5) At the 5′ end of the genome, there are two large ORFs, ORF1a and ORF1b, which are translated to polyprotein 1a and polyprotein 1b (in MERS CoV, one large polyprotein, AB, is expressed due to a frameshift mutation). The other 8 ORFs translate to structural and non-structural proteins. There are 4 structural proteins, including the spike protein (S), which directly follows ORF1b, and the other 3 proteins are encoded at the end of the genome and include the envelope (E), membrane (M) and nucleocapsid (N) proteins. Between the E and S proteins, there are 4 nonstructural proteins, including ORF3, ORF4a, ORF4b, and ORF5 (Fig. 1).
ORF1ab encodes the non-structural proteins required for virus replication. ORF1ab encodes 16 non-structural proteins (NSP1–16).
The MERS CoV polyprotein AB is processed by two virus-encoded proteases, a 3-C-like protease (3CLpro, main protease) and a papain-like protease (Plpro). Both MERS and SARS CoVs encode only one Plpro, while in some other CoVs, multiple copies are present.6) The MERS CoV Non-Structural Protein 3 (NSP3) is composed of several proteins, including the Plpro, Ubiquitin like (Ubl), ADP ribose (Rib) phosphatase and transmembrane domains. The main protease (3CLpro) cleaves the last 11 sites, while Plpro cleaves the N-terminal three sites, releasing 16 different nonstructural proteins of MERS CoV. The main protease of MERS CoV, 3CLpro, has strictly protease activity. In contrast, Plpro is a multifunctional protein that shows protease, deubiquitination, de-ISGylation and anti-host innate immunity activities.7,8) MERS CoV Plpro is composed of 2 independent domains, a C-terminal palm-shaped protease composed of the thumb, palm and finger subdomains, and the Ubiquitin like domain (Ubl), which comprises the N-terminal 62 residues9) These two domains are functionally distinct, with various downstream activities and molecular interactions. The Ubl domain is conserved in most CoVs including MERS CoV and SARS CoV, while the transmissible gastroenteritis CoV lacks the Ubl domain.10,11) In studies related to the SARS CoV, Ubl was found to interfere with the IRF3 and nuclear factor-kappaB (NF-κB) antiviral pathways.12) Therefore, Ubl is a viral protein that shares in antagonizing the host innate immune response.
Molecular modeling and molecular dynamics (MD) simulations are widely used techniques to understand the molecular motion, conformational changes and other molecular changes during protein–protein interactions.13–17) In this work, we performed MD analysis of the interaction of NSP3 Plpro with ubiquitin and Ubl among important viruses that pose serious health risks, such as SARS and MERS CoV. Investigation of the detailed interaction at the protein–protein interface will contribute to our knowledge about the interaction of viral proteins with host innate immunity proteins.
Several crystal structures were obtained from the Protein Data Bank (PDB) database. The structures included SARS or MERS Plpro either in its free form or bound with ubiquitin (Table 1). After browsing of the PDB database, Plpro from the coronaviridae were only evident with SARS and MERS CoV and Infectious Bronchitis virus (IBV). 2X2Z, the structure of the IB virus Plpro, was also used for comparison with SARS and MERS CoV Plpro. The structures were inspected, and the missing lobes were modeled.
PDB ID | Description |
---|---|
3E9S | Crystal structure of SARS Plpro |
4P16 | Crystal structure of MERS CoV Plpro |
4MM3 | Crystal structure of SARS Plpro bound with ubiquitin |
4WUR | Crystal structure of MERS CoV Plpro bound with ubiquitin |
4X2Z | Crystal structure of IB virus Plpro |
MD simulation was implemented for 10 ns in YASARA (version 14.12.2) using an AMBER99 force field.18) Simulations were applied to different crystal structures, including MERS CoV, SARS CoV and IB virus (Table 1). These structures represent the available coronaviridae structures in the database. For all of the used PDB files, the same protocol was repeated. In the first simulation, a cell was constructed with a real space cut-off of 7.9 A. The pKa values of ionizable groups were predicted, and protonation states were assigned at a default pH of 7.2. The simulated cell was filled with water to a density of 0.997 g/L. The initial structures were optimized by short steepest descent minimization to remove bumps and correct covalent geometry errors, followed by 500 steps of simulated annealing at 298 K as an equilibrium phase. Atom velocities were rescaled every 25 simulation steps to maintain the time average temperature at 298 K. The solvated structures were finally subjected to MD simulation for 10 ns. The final snapshots were selected from the lowest energy snapshots, which were saved every 250 ps. The root-mean-square deviation (RMSD) was calculated after least squares fitting using Cα atoms.
Sequence Alignment and Molecular ModelingThe sequences of Plpro from SARS CoV, MERS CoV and IBV were imported from the gene databases by Geneious software ver. 7. The sequences were aligned by the ClustalW module using the BLOSUM cost matrix, the gap open cost was set to 10 and the gap extension cost was set to 0.1. The Ubl domain and fingers subdomain (selected after MD simulation results) were directly extracted from the alignment output file and realigned, and homology parameters were calculated based on rates of similarities or differences (Figs. 2A–E). The crystal structures of Plpros were visualized and annotated, and figures were generated by CLC drug discovery or Molsoft ICM browser software.
A colour scale from black to white according the degree of similarity between residues where black highlighted regions indicate identical residues. Pairwise identity matrices are shown for Plpro (C), Ubl (C) and the fingers subdomain in (E). Stereo view of the domain contents of Plpro are represented in (F). The fingers subdomain and Ubl domain are highlighted. Geneious software was used for the generation of alignment by ClustalW and the generation of the identity matrix.
The protein complexes were submitted to the bioCOmplexes COntact MAPS server (https://www.molnac.unisa.it/BioTools/cocomaps/index.psp) using the default parameters.19) The output parameters included Plpro–ubiquitin interface analysis, interactions map, distance range analysis, intermolecular hydrogen bonds and accessible surface area parameters (Table 2).
4MM3 | 4WUR | |
---|---|---|
Number of interacting residues in ubiquitin | 40 | 30 |
Number of interacting residues in Plpro | 68 | 67 |
Number of hydrophilic–hydrophobic interaction | 194 | 175 |
Number of hydrophobic–hydrophobic interaction | 66 | 53 |
Buried area upon the complex formation (Å2) | 1881.7 | 1634.6 |
Buried area upon the complex formation (%) | 9.12 | 8.2 |
Interface area (Å2) | 940.85 | 817.3 |
Interface area MOL1 (%) | 20.03 | 5.41 |
Full genome sequences of the MERS CoV were imported into the CLC genomics workbench. The sequence of the MERS CoV Plpro, including Ubl, was used to blast either the local database of MERS CoV genomes or shorter sequences at the NCBI database. The markers for innate immunity resistance were compared among different MERS CoV and other coronaviridae sequences. These markers were based on previous experimental evidence of the role of some residues or subdomains and domains in resistance to innate immunity or from our MD simulation results. Attention was directed to 1) the Ubl domain, 2) the fingers subdomain of the catalytic domain, and 3) the important residues for deubiquitinating activity, such as L106, P163, E168 and Y265.
The sequences of MERS CoV, SARS CoV and IBV Plpros were aligned by the ClustalW method (Figs. 2A–E). Comparison of the amino acid composition of the three Plpros was dependent on the percent identity matrix. The % identity among all Plpros was less than 30% (Fig. 2C). Comparison of the MERS CoV Plpro with the SARS CoV Plpro showed 29.9% identity, which is higher than the 22.3% identity with IBV. During MD simulations, marked differences in RMSD residues were observed in two segments of Plpro. The first segment is the N-terminal Ubl domain, and the second segment is the fingers subdomain of the catalytic domain of MERS CoV Plpro (highlighted in Fig. 2F). In agreement with the previous result, the fingers subdomain of Plpro showed low % identity among the examined Plpros. In contrast, the CoV Ubl domain showed the lowest % identity among the examined Plpros, which was in the range of 15.8–28.8%. The low identity among Plpros and its domains and subdomain components indicates subtle differences in the corresponding biological activities and interactions with other proteins. We compared the sequences of MERS CoV and SARS CoV Plpros. There was a putative weaker deubiquitinating activity of MERS CoV. In the catalytic domains of MERS CoV Plpro, leucine, proline, arginine and phenylalanine were present at position numbers 106, 163, 168 and 256, respectively. In contrast, SARS CoV has tryptophan, leucine, glutamate, and tyrosine in these respective sites. In this context, the L106W mutation of the MERS CoV Plpro resulted in a drastic increase in deubiquitinating activity.10) In addition, the mutation of SARS Plpro to residues similar to MERS CoV amino acids resulted in the profound loss of deubiquitinating activity.10,20) Therefore, MERS CoV Plpro naturally has lower deubiquitinating activity compared with SARS CoV Plpro.
The RMSD of the Cα atoms during MD simulation relative to the original structure is represented in Fig. 3. In the first 100 ps, there was a prompt increase in RMSD values due to the relaxation of the dissolved structure, and there were more changes in comparison with the original structure. It is also noticeable that the magnitude of changes in RMSD was higher in the MERS CoV Plpro than in the SARS or IBV Plpros (Figs. 3A, C). The RMSD average of MERS CoV was 2.5 Å compared with 1.5 Å in the SARS CoV and IBV Plpros (Fig. 3A). The dynamic behavior thus confirms that the SARS and IBV Plpros are more conformationally rigid, as indicated by the RMSD values. Analysis of the changes in individual amino acids of Apo Plpro reveals more generalized, higher RMSD in the structures of MERS Plpro (Fig. 3B), with a clear, high peak in the maximum changes in 2 segments of the MERS CoV Plpro. The range of the greatest changes in RMSD are A181-C195 and L218-C232. These 2 segments represent 3 of the 4 beta sheets of the fingers subdomain of the catalytic domain of the MERS CoV Plpro. These two segments are not in direct contact with ubiquitin, except for one residue, N227. The N-terminal region comprising the Ubl domain was almost stable and showed little or no change among the SARS, MERS and IBV structures (Figs. 2B, 3A). This may indicate that the conserved features and functions of the Ubl domain among all coronaviruses are rigid, with low rates of conformational change in the absence of ubiquitin.
MD simulation was run to 10 ns and snapshots were taken every 250 ps. The changes in RMSD per residue are represented in (B). Time dependence of RMSD for Plpros from MERS CoV and SARS CoV bound with ubiquitin (C). The changes in RMSD per residue in Plpro bound with ubiquitin (D). Comparison of the free form of Plpro in (C) with ubiquitin-bound Plpro reveals changes in the N-terminal (first 50 residues) Ubl domain and the fingers subdomain (residues 180–240).
The MD simulation of the CoVs Plpro bound with ubiquitin is shown in Figs. 3C and 4D. RMSD changes were more evident in the Ubl domain and fingers subdomain (Fig. 3D). Comparison of the free form of Plpro with those structures bound with ubiquitin reveals interesting features that are summarized as follows: 1) the SARS Plpro was more rapidly stabilized than the MERS CoV Plpro (Fig. 3A). The SARS Plpro stabilized at 300 ps, while the MERS CoV Plpro stabilized after 1.2 ns. 2) After stabilization, the average RMSD values of SARS CoV Plpro and MERS CoV Plpro were 2 and 2.6, respectively. This indicates that more structural rearrangements are expected with the MERS CoV Plpro. 3) There were marked changes in the RMSD of the N-terminal Ubl domain, especially in the Apo MERS CoV Plpro (Figs. 4A, B). The Ubl domain therefore changes from a rigid to a flexible state in the presence of ubiquitin. Ubl is thought to share in protein–protein interactions and in the regulation of downstream interactions, especially in the modulation of the host innate immune response of the host.
Noticeable changes were recorded with UBl domain and fingers subdomain. MERS CoV structure (4P16) showed the largest increase in RMSD of residues at the fingers subdomain. The explanation of PDB ID contents is shown in Table 1.
Analysis of residues at the Plpro–ubiquitin interface was performed by submission of the corresponding structures to the bioCOmplexes COntact MAPS server. The obtained results were imported into an Excel file and inspected for interface residues, energy factors and other analytical data. In the MERS CoV Plpro, there were 9 hydrogen bonds between the Plpro and ubiquitin, which was lower than the number in the SARS CoV Plpro complex (10 hydrogen bonds). In addition, the MERS CoV Plpro–ubiquitin interaction showed a lower number of hydrophobic interactions and total number of interacting residues (Table 2).
A representation of the MERS CoV Plpro–ubiquitin interface is shown in Fig. 5 and Table 2. The interface residues are distributed within the palm and finger subdomains of the catalytic domain of Plpro. MD analysis of the interface residues in both the MERS CoV and SARS CoV Plpros revealed variable degrees of changes among different residues (Fig. 6). While most of the residues showed RMSD values of approximately 1 Å, some residues showed marked changes. In the MERS CoV Plpro, Q227 was the only residue in the fingers subdomain that interacted with ubiquitin and showed marked increases in RMSD approaching 4.6 Å (Figs. 5, 6). In contrast, 5 residues in the SARS CoV Plpro fingers subdomain interacted with ubiquitin (Figs. 5, 6). These residues showed a moderate increase in RMSD up to 2 Å (Fig. 6). Therefore, the MERS CoV Plpro fingers subdomain interacts with ubiquitin with 1 residue, which showed drastic changes in RMSD. In comparison, the SARS CoV Plpro interacts with 5 residues with ubiquitin, with a moderate increase in RMSD during MD simulation.
A) Residues in the MERS CoV Plpro at the interface of Plpro–ubiquitin interactions are shown in light gray, Plpro is represented in white skin surface, and ubiquitin is shown in the 3D cartoon. B) The fingers subdomain of the MERS CoV Plpro with residues interacting with ubiquitin (light gray). C) The fingers subdomain of the SARS CoV Plpro with residues interacting with ubiquitin (dark gray).
Comparison of the Ubl domain from 300 different full genome sequences of the coronaviridae indicates restricted conservation of Ubl among MERS CoV genomes and marked differences within the coronaviridae. The matrix of pairwise comparison reveals that only 1 residue differs within the Ubl of MERS CoV genomes, constituting 98.4–100% of the identity. Comparison with other Ubl domains from SARS CoV or CoVs isolated from bats reveals an identity decreased by ca. 40%. Similarly, the fingers subdomain showed 0–1 differences among MERS CoV genomes, while the number of differences increased to 22 residues in comparison with other Beta CoVs (62%). For residues with proven importance in Plpro activity, among 300 full genomes of CoVs, L106 was conserved in all MERS CoV and bat CoVs, and R168 was conserved in all MERS CoV Plpros and was replaced with lysine or methionine in other beta CoVs. Similarly, the conserved residue F265 was replaced with tyrosine in the other viruses.
In conclusion, both previous experimental work and the present modeling studies revealed low deubiquitinating activity of MERS CoV. This is supported by the mutation of essential residues for Plpro actions. MERS CoV Plpro showed lower surface area of interaction and lower number of interface residues. Being more flexible and movable, the Ubl domain indicates different interaction styles with other host immune proteins that require more experimental evidence. Ubiquitin induces noticeable changes in the entire structure of the MERS CoV Plpro, especially at the Ubl domain. This finding indicates that the changes in Ubl domain is affected by the presence of ubiquitin.
This work was supported by a research grant from King Faisal University, Deanship of Scientific Research under Grant number 160046.
The authors declare no conflict of interest.