Biological and Pharmaceutical Bulletin
Online ISSN : 1347-5215
Print ISSN : 0918-6158
ISSN-L : 0918-6158
Review
Structural Perspective of NR4A Nuclear Receptor Family and Their Potential Endogenous Ligands
Ryoichi Hashida Takeshi Kawabata
著者情報
ジャーナル フリー HTML
電子付録

2024 年 47 巻 3 号 p. 580-590

詳細
Abstract

There are 48 nuclear receptors in the human genome, and many members of this superfamily have been implicated in human diseases. The NR4A nuclear receptor family consisting of three members, NR4A1, NR4A2, and NR4A3 (formerly annotated as Nur77, Nurr1, and NOR1, respectively), are still orphan receptors but exert pathological effects on immune-related and neurological diseases. We previously reported that prostaglandin A1 (PGA1) and prostaglandin A2 (PGA2) are potent activators of NR4A3, which bind directly to the ligand-binding domain (LBD) of the receptor. Recently, the co-crystallographic structures of NR4A2-LBD bound to PGA1 and PGA2 were reported, followed by reports of the neuroprotective effects of these possible endogenous ligands in mouse models of Parkinson’s disease. Based on these structures, we modeled the binding structures of the other two members (NR4A1 and NR4A3) with these potential endogenous ligands using a template-based modeling method, and reviewed the similarity and diversity of ligand-binding mechanisms in the nuclear receptor family.

1. INTRODUCTION

Nuclear receptors comprise a superfamily of hydrophobic low molecular ligand-activated transcription factors that regulate gene expression in many physiological and pathological events. Endogenous ligands have only been identified for about half of 48 nuclear receptors in the human genome.1,2) The remaining receptors, called orphan nuclear receptors, form promising pharmaceutical targets for research and development25) as well as orphan G-protein coupled receptors. Although the nuclear receptor family, nuclear receptor subfamily 4 group A (NR4A), consisting of three members,6) NR4A1, NR4A2, and NR4A3 (old annotations are Nur77, TR3, or NGFI-B,7) Nurr1,8) and NOR19)) are orphan receptors, they are promising therapeutic targets in immune diseases,1013) neurological diseases,1419) and cancer.2024) They have been classified as immediate-early genes whose expressions are tightly regulated by extracellular signals. In addition, as shown in Supplementary Fig. S1, NR4A1 and NR4A2, but not NR4A3, can heterodimerize with retinoid X receptor alpha (RXRα) and mediate efficient trans-activation through a direct repeat-5 (DR-5) element in response to the RXR-specific ligand, 9-cis retinoic acid.2528) Although the NR4A family shares features of typical steroid receptor organization comprising an N-terminal transactivation domain, a central DNA-binding domain and a C-terminal putative ligand-binding domain (LBD) as shown in Supplementary Fig. S1A, there were few reports about specific ligands that regulate NR4A-dependent transcriptional activity. Moreover, no function has yet been inclusively ascribed to the LBDs of the NR4A superfamily.

Dysfunction of NR4A nuclear receptors is associated with human diseases similar to other nuclear receptor families. NR4A3 is implicated in oncogenesis as part of the EWS fusion protein, resulting from a chromosomal translocation found in human extraskeletal myxoid chondrosarcoma tumors.29) In addition, NR4A3, as well as NR4A1, play a key role in the apoptosis of T lymphocytes, eosinophils, and various other cell types,1013,3032) and thus may be implicated in disorders related to genetically- or environmentally-induced defects of activation-induced apoptosis, including autoimmune diseases and allergic diseases. Recently, NR4A3 was also reported to be closely associated with cardiovascular diseases such as atherosclerosis, pulmonary arterial hypertension, and abdominal aortic aneurysm through vascular remodeling.33) Regarding NR4A2, there have been many reports about its relationship with neurological diseases especially Parkinson’s disease, which is related to the differentiation of midbrain dopaminergic neurons.1419)

We examined a large number of lipid mediator metabolites, including those produced by the arachidonate and glycerolipid pathways, which might be induced in various cells following pathological stimuli such as antigen-induced apoptotic signaling. We found that only prostaglandin (PG) A1 and PGA2 have the ability to activate NR4A3-dependent transcription, and could bind directly to NR4A3-LBD. Furthermore, primary cultured spleen cells derived from transgenic mice overexpressing NR4A3 showed PGA2 specific apoptosis in comparison with those derived from wild-type mice.31,32)

Recently, Rajan et al. reported that PGA1, PGA2, and PGE1 interacted directly with the LBD of NR4A2 to stimulate and NR4A2 nuclear receptor functions. They also reported the co-crystallographic structures of NR4A2-LBD bound to PGA134) and PGA2.35) Furthermore, these PGs exhibited neuroprotective effects in an NR4A2-dependent manner. Based on these results, they suggested these arachidonate metabolites may function as bona fide endogenous ligands of the NR4A2 receptor and have beneficial effects in Parkinson’s disease. However, the crystallographic structure of NR4A3 have not been clarified yet. Regarding NR4A1, the crystallographic structure of the LBD has been reported,36) but there is no information about the transcriptional functions of these arachidonate metabolites or the co-crystallographic structure of NR4A1-LBD bound to these candidate endogenous ligands.

The three-dimensional (3D) structures of molecular complexes assessed by co-crystallographic structures reveal the atomic details of molecular interactions between compound-protein pairs, and thus provide important information to help us understand their biological functions. Computer modeling approaches are frequently performed to extend the data of these 3D complex structures. Both template-based modeling and de novo modeling are performed to model 3D complex structures. Usually, template-based modeling is performed first because it has advantages in terms of accuracy and computation costs, if proper templates are available. Template-based modeling has also been applied to the modeling of small compound-protein complexes. For this approach, a target chemical compound is superimposed onto the template compound using the 3D structural alignment programs of chemical compounds.37) Standard chemical compound-protein docking calculations are often improved using known compound-protein complexes as templates.3843) Because the accuracy of template-based modeling largely depends on the choice of the template structure, using a database containing the latest 3D structures with molecular similarity search is essential. The Homology Modeling of Complex Structure (HOMCOS) server (http://homcos.pdbj.org), constructed by Kawabata and Fukuhara in 2008,44,45) was significantly updated for the template-based modeling of 3D complexes for proteins and small compounds in the Protein Data Bank (PDB) in 2016.46)

Then, we tried to apply this in silico approach to analyze interactions between the LBDs of NR4A in the nuclear receptor family and compounds including candidate endogenous ligands.

2. THE NR4A NUCLEAR RECEPTOR FAMILY HAS MANY POSSIBLE PHARMACOLOGICAL FUNCTIONS

2.1. NR4A Family Members as Therapeutic Targets for Allergic and Neurologic Diseases

To identify novel genes related to the clinical signs of allergic diseases such as atopic dermatitis (AD), differentially expressed genes were comprehensively evaluated by comparing peripheral blood leukocytes from patients and healthy volunteers.47,48) We found all three genes, NR4A1, NR4A2, and NR4A3, members of the NR4A nuclear family, had significantly higher levels in AD patients than in healthy volunteers.30) Differential expressions of NR4A1 and NR4A2 were shown in whole blood leukocytes. However, NR4A3 was selected as a peripheral blood eosinophil-specific gene. NR4A3 expression in peripheral blood eosinophils of AD patients was enhanced with exacerbation and decreased with remission of the disease. The NR4A3 gene was also highly expressed in surgically removed nasal polyps containing large eosinophil accumulations.32) We also reported that the NR4A1 and NR4A3 genes were markedly induced in correlation with eosinophil-specific apoptosis.31) In contrast, the coincidental downregulation of NR4A1 and NR4A3 was reported in myeloid leukemogenesis.24) Such information suggests that these 2 receptors of the NR4A family are related to each other, with similar pathological roles.

Regarding NR4A2, there are many reports that this nuclear receptor is indispensable for the differentiation, maturation, and maintenance of midbrain dopaminergic neuron clusters.1419) NR4A2-deficient mice failed to generate midbrain dopaminergic neurons.49) Then, it was reported that NR4A2 expression in the brains of Parkinson’s disease patients and rodent models of Parkinson’s disease was downregulated.1719) To date, no mechanistic hypothesis has been put forth to explain this reduced expression. Therefore, the development of selective NR4A2 agonists may provide target-based therapeutic interventions for Parkinson’s disease and other related diseases.50)

2.2. Activation of NR4A3 by PGA1 and PGA2

We thought that the NR4A orphan nuclear receptors may not be genuinely orphan in nature, and that some endogenous ligands may be present close to where these receptors are expressed at high levels. It is the similar strategy that chenodeoxycholic acid was proved to be an farnesoid X receptor (FXR) ligand in the liver, an organ with a high expression of the nuclear receptor subfamily 1, group H (NR1H) family member FXR.51,52) Thus lipid mediators such as arachidonate and glycerolipid cascade metabolites in the mammalian lipid metabolic pathways, which might be induced in activated leukocytes following pathological stimuli, were screened. We need a mammalian hybrid system for this screening, in which GAL4-NR4A receptor-linked chimera plasmids and the GAL4 binding domain-firefly luciferase reporter plasmid were co-transfected into cultured cells, and luciferase activity after ligand candidate addition was assayed. PGA1 and PGA2 were found to have NR4A3 nuclear receptor-LBD dependent transactivation activity in this system (13,14-dihydro-15-keto-PGA2 and PGE1 were negative in this system.).31)

The putative LBD of the nuclear receptor was also shown to bind directly to PGA1 and PGA2 using high-resolution Biacore S51.31) Evaluation of the effects of PGA1 and PGA2-like cyclopentenone derivatives on NR4A3 LBD-dependent transcription demonstrated subtype-specific activators and clear structure–activity relationships, as assessed by minor modification of the functional groups in these similar PG structures.

We also established NR4A3 transgenic mice and surprisingly found that PGA2 had a strong stimulatory effect on apoptosis induction in spleen cells derived from these mice compared with wild-type mice.31) These findings also suggest specific molecular linkage between PGA and NR4A3.

PGA2 is the non-enzymatic breakdown product of PGE2, and PGE2 is one of the main products of arachidonate cascade highly produced by cyclooxygenase in inflammatory stimulated leukocytes including eosinophils. It means that our endogenous ligand screening strategy might be successful as in case of chenodeoxycholic acid and FXR.51,52) Peroxisome proliferator-activated receptor γ (PPARγ) is the reference as a more familiar nuclear receptor in this review. Other arachidonate metabolites, PGJ2 and 15d-PGJ2, the non-enzymatic products of PGD2, were proved to be endogenous ligands of PPARγ in 1995.53) These evidences might be correlated well with our findings. However, PGJ2 and 15d-PGJ2 were not active in our system.

2.3. Interaction of NR4A2 with PGs and Binding Structures between NR4A2 and Its Endogenous Ligand Candidates by X-Ray Co-crystallography

After more than 15 years of our publications,3032) two exciting papers were published in 202034) and 202235) by the same group. Rajan et al. showed that PGE1 and its dehydrated metabolites, PGA1 interacts directly with the LBD of NR4A2 and stimulates its transcriptional function.34) They additionally reported that PGA2 has the same activities.35) They also proved the X-ray crystallographic structures of NR4A2 bound to PGA1 and PGA2. Each co-crystal structure of NR4A2-LBD bound to PGA1 and PGA2 revealed the covalent couplings of PGA1 and PGA2 with the LBD through Cys566 by forming Michael adducts. Both PGA1 and PGA2 bindings also induced notable conformational changes, including a 21° shift of the activation function 2 region (AF-2) in the C-terminal helix H12 away from the protein core. Both PGA1 and PGA2 physical bindings with NR4A2-LBD were also supported by NMR titration. PGE1 was interacted with NR4A2-LBD, but was not covalently bound to the LBD.

The pharmacological functions of PGA1, PGA2, and also PGE1 were demonstrated in the 1-methyl-4-pheny-1,2,3,6-tetrahydropyridine (MPTP)-induced lesioned mouse model34) or leucine-rich repeat kinase2 (LRRK2) G2019S transgenic fly model,35,54) both models of Parkinson’s disease, in which these arachidonate metabolites rescued the locomotor deficits and neuronal degeneration. These findings by the Rajan group suggest these arachidonate metabolites may represent bona fide endogenous ligands of NR4A2. Together with our previous findings, PGA1/PGA2 might be the endogenous ligands of NR4A nuclear receptor superfamily members, although there have been no reported bindings of the compounds PGA1 and PGA2 to the protein NR4A1.

There have been few structural studies of the NR4A receptor family with endogenous ligand candidates, except NR4A2. Concerning NR4A1, there were some reports about 3D structures and synthetic low molecular agonists,5558) but no wet data related with arachidonate metabolites. Therefore, we modeled 3D structures between these ligand candidates and this nuclear receptor family using the template-based modeling methods described in the following chapters.

3. INTERACTIONS OF NR4A NUCLEAR RECEPTOR FAMILY MEMBERS WITH ENDOGENOUS LIGAND CANDIDATES REVEALED BY EXPERIMENTAL AND MODELED 3D STRUCTURES

3.1. 3D Structures of Ligand Binding Domains of NR4A

Several X-ray crystallographic structures of the LBD of NR4A1 and NR4A2 have been solved, although no structures are available for the LBD of NR4A3 to date. The PDB IDs of these structures are summarized in Table 1, and three of them are shown in Fig. 1. Between similar families of nuclear receptors, PPARγ was one of the earliest factors to have its 3D structure elucidated, revealing it possessed a canonical ligand-binding site. We demonstrate PPARγ as a positive control of the most similar and successful pharmaceutical target among nuclear receptor families. Because there were many references about 3D structure and the natural ligands were proved to be arachidonate cascade products, PGJ2 and 15d-PGJ2.53) Some synthetic ligand compounds are also clinically successful in improvement of insulin resistance. Figure 1C shows that the canonical binding site of PPARγ holds the compound 15d-PGJ2 by a covalently-bonded cysteine. Many agonistic and antagonistic compounds for PPARγ were reported to bind to the canonical binding site.59,60)

Table 1. PDB IDs of 3D Structures of the Ligand Binding Domain of the NR4A Family and PPARγ

ApoWith synthetic liganda)With natural liganda)
NR4A13v3e3v3q[TMY](TMPA), 4rzg[ZHN](PDNPA), 4kzi[ZJ0], 4ref[3N0], 4whg[3NB],4jgv[T94](THPN), 4ree[3MZ], 4whf[3MX], 4re8[3MJ]NA
NR4A21ovl8cyo[OBJ]5y41[RPG](PGA1), 5yd6[8SU](PGA2), 6dda[G7J](dopamine metabolite)
NR4A3NANANA
PPARγ1prg2xkw[P1B](pioglitazone)2zk1 [PTG](15d-PGJ2)

a) In square brackets [ ], the compound IDs (comp_id) of the PDB are indicated, and in parentheses ( ), the common names of the compounds are shown.

Fig. 1. 3D Structures of the NR4A2 Family and PPARγ

Molecular graphics were created using UCSF Chimera.81) Using the program MATRAS,82) orientations of these protein structures are aligned to match the NR4A2 structure with PGA1 (PDB ID: 5y41). (A) The holo structure of NR4A2 bound to PGA1. The PDB ID of the structure is 5y41.34) The helix H12 has an open conformation. The orange region is the I-box region (K554-L562), which is important for NR4A2-RXRα heterodimerization.77) The cyan region is the AF-2 (activation function 2) region (A586-L593), which is important for a transcription activation function.83) (B) The holo and the apo structure of NR4A2. The apo structure is taken from PDB ID 1ovl,61) and is colored dark slate blue. Upon the binding of the ligand PGA1, the helix H12 (AF-2 region) rotates 19.3 degrees outward from the protein’s core. The angle change of Helix H12 (19.3 degrees) was measured by the bond angle between the Cα atoms at the C-terminus, with the Cα atom at the N-terminus as the center. (C) The structure of NR4A1 with nine compounds.5558) The protein structure is taken from PDB ID:3v3q. The helix H12 has a closed conformation, whereas the helix H10 is slightly open. Compound names are 3-letter comp_id of the Protein Data Bank. Three compounds (TMY, ZHN, and ZJ0) bound to the binding site of PGA1 are marked by a dotted red ellipse. The other six compounds (T94, 3 MJ, 3MZ, 3MZ, 3NB and 3N0) are bound to other sites of NR4A1. Compound structures (shown in Supplementary Fig. S2) are taken from the PDB structures 3v3q for TMY, 4rzg for ZHN, 4kzi for ZJ0, 4jgv for T94, 4re8 for 3MJ, 4ree for 3MZ, 4whf for 3MX, 4whg for 3NB, and 4ref for 3N0. (D) The structure of PPARγ bound to 15d-PGJ2. The PDB ID of the structure is 2zk1.65) The binding site of PPARγ is called “canonical binding site.” The helices H10 and H12 have closed conformations.

In 2003, the first X-ray crystal structure of NR4A2 in the apo state was reported by Wang et al. They reported that NR4A2 lacked the canonical binding pocket due to the tight packing of hydrophobic side chains61) (Figs. 2A, B). Therefore, it was hypothesized that the NR4A superfamily might be ligand-independent orphan nuclear receptors. However, more than fifteen years later, Bruning et al. reported a complex structure of NR4A2 with a dopamine metabolite IQ (an oxidized DHI, as shown in Supplementary Fig. S3) (PDB ID:6dda),62) and Rajan et al. solved the structures of NR4A2 with PGA1 and PGA2 (PDB ID:5y41 and 5yd6).34,35) These three compounds were placed in a non-canonical pocket (PGA1: Figs. 1A, 5A; PGA2: Fig. 5B; IQ: Fig. 5F). This pocket did not exist in the apo structure; the H12 helix was reoriented away from the apo structure by an angle of 21° to form a non-canonical binding pocket, as shown in Figs. 1B, 2C, and D. These three ligands were covalently bound to the Cys566 of NR4A2, as shown in Figs. 1A, 2E, 5A, B, and F. Figure 2E shows a 2D ligand–protein interaction diagram for PGA1 and NR4A2 drawn using LigPlot+.63) Figures 2D and E indicate that the binding site of PGA1 is mainly hydrophobic, except for the carbonyl group (O9) on the five-membered ring. The carboxyl group (C1, O1, and O2 atoms) of PGA1 was exposed to bulk water and did not form any bonds with specific protein atoms.

Fig. 2. Binding Sites of NR4A2 and PPARγ

(A) A cross section of the “canonical binding site” of PPARγ with 15d-PGJ2. (PDB_ID: 2zk1). (B) A cross section of the region of NR4A2 corresponding to the “canonical binding site” (PDB_ID: 1ovl). The binding site is occupied by the tightly packed hydrophobic side chains. (C) The binding site of the apo structure of NR4A2 (PDB_ID: 1ovl).61) (D) Binding site of the holo structure of NR4A2 bound to PGA1 (PDB_ID: 5y41).34) Molecular surfaces are colored using the Kyte–Doolittle hydrophobic scale. Hydrophobic residues are colored in red, whereas hydrophilic residues are colored in blue. (C) 2D ligand–protein interaction diagram for PGA1 and NR4A2 (PDB ID:5y41) drawn by LigPlot+.63)

The binding structures of NR4A1 for nine synthetic chemical compounds have been solved, as shown in Fig. 1C. These artificial low molecular compounds were reported to be agonists and antagonists of NR4A1.5558) TMPA [TMY] antagonizes the NR4A1-LKB1 interaction,55) THPN [T94] induces autophagy with the participation of NR4A1,56) and PDNPA [ZHN] inhibits the interaction between NR4A1 and p38α.57) The chemical structures of these compounds are shown in Supplementary Fig. S2. These nine compounds can be classified into two groups by the locations of their binding sites. The three compounds (shown in the dotted red circle in Fig. 1B) were bound to the PGA-binding site, whereas the other six compounds (shown in dotted blue circles in Fig. 1B) were bound to a site that is far from the canonical binding site and the PGA-binding site. The chemical structures of the six compounds share a common substructure (3,4,5-trihydroxyphenyl group) and their bound 3D structures overlapped compared to the 3D structure of the three compounds. Thus, these three compounds might inhibit PGA1, PGA2, and IQ from binding to NR4A proteins because they bind close to the PGA-binding site.

3.2. Multiple Sequence Alignment of the Ligand Binding Domains of NR4A

The multiple alignment of the proteins NR4A1, NR4A2, NR4A3, and PPARγ was generated using the program ClustalW2,64) and shown in Fig. 3 with the binding sites and secondary structures. The binding sites of PGA1 for NR4A2 are shown in green boxes, and the canonical ligand binding sites for PPARγ are shown in magenta boxes.

Fig. 3. Multiple Sequence Alignment (MSA) of the Ligand Binding Domain (LBD) of the NR4A Family with Structural Information

MSA was calculated by ClustalW2.64) Secondary structures of NR4A2 (PDB_ID: 5y41) are shown in the upper MSA. The alignment between PPARγ and NR4A2 was obtained by 3D structural alignment (between PDB IDs 1prg and 5y41) using the protein structure comparison program MATRAS.82) The amino acid sequence of PPARγ (PPARG_HUMAN) is shown at the bottom of MSA with its secondary structures. Green boxes on MSA are the binding sites of PGA1 obtained from NR4A2 (PDB_ID:5y41). Binding sites for three compounds (TMY, ZHN, and ZJ0) are indicated by red boxes on the sequence NR4A1, whereas those for six compounds (T94, 3 MJ, 3MZ, 3MZ, 3NB and 3N0) are shown by blue boxes. The bound structures of these compounds are shown in Fig. 1B. Binding sites of PPARγ for 15d-PGJ2 (canonical binding sites) are shown in purple boxes on the sequence PPARG_HUMAN. The I-box region (K554-L562) in NR4A2 is colored orange, which was reported to be important for NR4A2-RXRα heterodimerization.77) The AF-2 (activation function 2) region (A586-L593) is colored cyan, which was reported to be important for a transcription activation function.83)

A comparison of the two binding sites indicates the following three points. First, the canonical binding sites and the PGA1-NR4A2 binding sites have few common sites. This is consistent with the spatial difference between the two binding sites shown in Figs. 1A and C. Second, the PGA1/PGA2-NR4A2 binding sites are conserved with nearly the same amino acids in the proteins NR4A1 and NR4A3, but not in PPARγ. This suggests that NR4A1 and NR4A3 are likely to bind to PGA1 and PGA2 molecules in a similar way. Third, the Cys566 involved in the covalent binding of PGA1 to NR4A2 (at “TLCTQ” in NR4A2) is replaced by valine (V) in PPARγ, whereas the Cys285 (Cys313 in the UniProt PPARG_HUMAN) involved in the covalent binding of 15d-PGJ2 to PPARγ (at “QGCQF” in PPARγ) is replaced by phenylalanine (F) in NR4A. These findings suggest that PGA1 and PGA2 cannot bind to PPARγ in the same way as NR4A2, and that 15d-PGJ2 cannot bind to NR4A proteins in the same way as PPARγ.

3.3. Chemical Structures of Endogenous Ligand Candidates and Their Binding to Proteins

The chemical structures of the endogenous ligand candidates are shown in Fig. 4, which are related to their ability to form covalent bonds with protein cysteine residues. The complex structures of NR4A2 with PGA1 and PGA2 reveal that a covalent bond was formed between the C11 carbon atom of the cyclopentenone ring (red circles in Figs. 4A, B) of the ligand and the thiol group of Cys566 in NR4A2.34,35) This suggests that PGE1, PGJ2, and 15d-PGJ2 cannot form a covalent bond because the C11 carbons of these three ligands (dotted red circles in Figs. 4C–E) have additional hydroxyl or carbonyl groups.

Fig. 4. Chemical Structures of Endogenous Ligand Candidates

(A) Prostaglandin A1 (PGA1). (B) Prostaglandin A2 (PGA2). (C) Prostaglandin E1 (PGE1). (D) Prostaglandin J2 (PGJ2). (E) 15-Deoxy-Delta12,14-prostaglandin J2 (15d-PGJ2). (F) Dopamine metabolite 5,6-indole-quinone (IQ). Note that the molecule 5,6-dihydroxyindole (DHI) is self-oxidized into the molecule 5,6-indole-quinone (IQ), and IQ binds to NR4A2 as Cysteine IQ adduct as shown in Supplementary Fig. S3. (G) Docosahexaenoic acid (DHA). The solid red circles are carbon atoms (the C11 atoms of the rings in PGA1 and PGA2, the C2 atom in the indole ling of DHI), which form covalent bonds between the thiol group and Cys566 of NR4A2. The dotted red circles are the corresponding carbon atoms of PGE1, PGJ2, and 15d-PGJ2. These carbons cannot form covalent bonds with proteins because –OH or = O groups are attached. The solid green circle is the unsaturated carbon atom of 15d-PGJ2, which form a covalent bond with the thiol group of Cys285 of PPARγ.

However, in the complex structure of PPARγ with 15d-PGJ2, a covalent bond was formed between the C13 carbon atom of the hydrophobic tail (a green circle in Fig. 4E; described as C8 in PTG of PDB ID: 2zk1) and the Cys285 in PPARγ.65) The C13 carbon of 15d-PGJ2 has a double bond with the five-membered ring. This double bond does not exist in the other four prostaglandins. Therefore, these four molecules are unlikely to form covalent bonds with the Cys313 in PPARγ similar to 15d-PGJ2. Additionally, the two molecules PGJ2 and 15d-PGJ2 lack a carbonyl group (O9) in the five-membered ring (purple circles in Figs. 4A–C), which can form a hydrogen bond with Thr567 or Arg563 (Fig. 2E). The lack of the carbonyl group may weaken the interaction between PGJ2 and 15d-PGJ2 for the NR4A proteins.

3.4. Models of the Bound Structures of Candidate Endogenous Ligands on NR4A

As explained previously, experimental complex structures are only available for NR4A2 with the PGA1, PGA2, and IQ molecules. To investigate experimentally undetermined binding structures, we conducted a computer modeling to determine the binding structures of candidate endogenous ligands and proteins from the NR4A subfamily. Using the three binding structures as templates, a template-based modeling approach can be applied to model the complex structures of similar compounds with similar proteins. We generated the 3D structural models of seven molecules (PGA1, PGA2, PGE1, PGJ2, 15d-PGJ2, IQ, and docosahexaenoic acid [DHA]) on the three NR4A proteins (NR4A1, NR4A2, and NR4A3). The cysteine IQ adduct was employed as an oxidized binding form of IQ as shown in Supplementary Fig. S3. Protein sequences were taken from UniProt,66) chemical structures were obtained from the KEGG database,67) and their initial 3D conformations were generated by the program OpenBabel.68) We used the HOMCOS server46) to prepare the template structures, the program MODELLER69) for the homology modeling of proteins, the program fkcombu41,70) for the flexible alignment of chemical structures, and the program AMBER71) to refine conformations. The details of computational modeling procedures are described in the Supplementary materials. The model structures are available from the Biological Structure Model Archive (BSM-Arc) under BSM-ID BSM00054 (https://bsma.pdbj.org/entry/54).72)

The compounds and proteins had sufficient similarities with the template complex structures; the five prostaglandin molecules share a common chemical structure where a five-membered ring is attached to a hydrophilic tail and hydrophobic tail (Fig. 4), and the binding sites of PGA1 on NR4A2 are well conserved in NR4A1 and NR4A3 (Fig. 3). In the template structures of the ligands PGA1 and PGA2, the atom C11 forms a covalent bond with the proteins’ cysteine residue (Cys566). When superimposing the target ligands PGE1, PGJ2, and 15d-PGJ2 onto the template ligand of PGA1 or PGA2 using the program fkcombu,41,70) the functional group attached to C11 (–OH or = O) clashes with the side chain of the cysteine. These atomic clashes were avoided by performing energy minimization calculations using the AMBER program,71) which allows for the reorientation of the compound and deformation of the side chain of the cysteine.

Modeled structures for NR4A2 and NR4A3 are shown in Figs. 5 and 6, respectively, and those for NR4A1 are shown in Supplementary Fig. S5. No atomic clashes or unnatural torsions existed in the created model structures. Due to the amino acid conservation of the binding sites, the interactions in the modeled structures NR4A1 and NR4A3 should be similar to those of the template structure NR4A2. Observation of the binding structures of the template structures PGA1 and NR4A2 demonstrated that the interactions between the ligand and protein primarily involved hydrophobic interactions and covalent bonding with cysteine (Fig. 2E). The most significant difference in binding among the seven modeled compounds was the presence or absence of a covalent bond with cysteine. PGA1, PGA2, and IQ (an oxidized DHI, as shown in Supplementary Fig. S3) can form a covalent bond with cysteine, whereas PGE1, PGJ2, 15d-PGJ2, and DHA cannot. The absence of covalent bonds does not necessarily mean weaker binding. For example, Rajan et al. reported that the binding affinities of the NR4A2 protein for PGA1 and PGE1 did not differ significantly; NMR titration experiments showed that the binding affinity of PGA1 with NR4A2 was slightly higher than that with PGE1, whereas competitive inhibition experiments for [3H]-PGE1 and [3H]-PGA1 showed that the affinity of PGA1 was slightly lower.34) However, the nature of the binding for covalent-bonded compounds may differ from non-covalent-bonded compounds. The 'covalent advantage' was coined to describe the unique feature of covalent modification.73) Since irreversible modifications of proteins can accumulate over time and amplify a signal, even low levels of ligands may be able to initiate signaling.

Fig. 5. Experimental and Model Complex Structures of NR4A2 and Endogenous Ligand Candidates

Bound compounds are drawn as a ball-and-stick model, and Cys566, Arg563 and Thr567 are drawn as a stick model. (A) Experimental structure with PGA1 (PDB ID:5y41).34) (B) Experimental structure with PGA2 (PDB ID:5yd6).35) (C) Model structure with PGE1, (D) PGJ2, and (E) 15d-PGJ2. (F) Experimental structure with Cysteine IQ adduct (Supplementary Fig. S3) (PDB ID:6dda).62) (G) Model structure with DHA.

Fig. 6. Model Complex Structures of NR4A3 and Endogenous Ligand Candidates

Bound compounds are drawn as a ball-and-stick model, Cys594, Arg591, and Thr595 are drawn as a stick model. (A) Model structure with PGA1, (B) PGA2. (C) PGE1, (D) PGJ2, (E) 15d-PGJ2, (F) Cysteine IQ adduct (Supplementary Fig. S3), and (G) DHA.

The two molecules PGJ2 and 15d-PGJ2 lack a carbonyl group (O9) on the five-membered ring, which can form a hydrogen bond with Thr567 or Arg563 (Figs. 2E, 4D, and 4E). Note that the molecule PGE1 cannot make a cysteine-covalent bond but has the O9 carbonyl group to form a hydrogen bond with the protein. Rajan et al. reported that PGJ2 and 15d-PGJ2 had no effect on the transcriptional activity of NR4A2-LBD, whereas PGE1 prominently stimulated NR4A2-LBD reporter activity, and NMR spectroscopy revealed the same binding region of NR4A2 was recognized by PGE1 and PGA1.34)

A metabolomic study reported interactions between DHA and the NR4A1 protein.74) Although a crystal structure of bound DHA is not available to date, de Vera et al. reported several interacting sites of NR4A2 for DHA using chemical shift perturbation by solution NMR spectroscopy.75,76) They identified more than 20 perturbed residues by DHA-binding, covering both the canonical binding site and the PGA1-binding site of NR4A2. DHA binding also affected the methyl groups of Leu410, Ile483, and Ile486. The residue Leu410 is placed on the PGA1-binding site of NR4A2. Because our 3D structure model of DHA was built by fitting on the bound-PGA1 molecule, it has contact with Leu410 (Fig. 5G). However, the residues, Ile483 and Ile486 are far from the position of the modeled DHA, rather than close to the canonical binding site. In that sense, our NR4A2-DHA model is hypothetical and cannot explain all the perturbed residues reported by de Vera et al.75,76) We suggest that multiple binding sites for DHA can exist in the NR4A proteins, and our model should be one of them.

3.5. Heterodimerization and Mechanism of Transactivation

A detailed mechanism of how LBD activates DNA binding is not well understood; however, most researchers assume that the binding of an agonist to LBD leads to a conformational change of LDB, which enhances the dimerization of the LBD of NR4A proteins, and finally, the dimerization affects DNA-binding (Supplementary Fig. S1). When NR4A1 or NR4A2 form heterodimers with RXRα, their DNA-binding ability is enhanced, whereas NR4A3 does not form heterodimers.2528) Experimental 3D structures for the LBD heterodimer of NR4A and RXRα are not available to date, but limited structural information has been reported. Aarnisalo et al. reported that the sequence motif I-box (K554-L562; KLLGKLPEL) in NR4A2 is essential for NR4A2-RXRα interactions.77) The location of the I-box motif is shown in the structure in Fig. 1A and in the alignment in Fig. 3. A conformational change of the I-box by ligand binding might be necessary for dimerization. Interestingly, the sequences of the I-box are not entirely conserved within the NR4A family, namely, NR4A1: RLLGKLPEL, NR4A2: KLLGKLPEL, and NR4A3: KVLGALVEL, as shown in Fig. 3. The difference in these I-box sequences might explain the different heterodimerization between NR4A1, NR4A2, and NR4A3. Recently, Zhao et al. modeled the structure of the NR4A2-RXRα heterodimer with the help of cross-linking mass spectrometry and molecular dynamics.78) Their 3D model of the heterodimer is highly flexible, especially in the DBD. Estimated interacting residue pairs from their model are consistent with the I-box reported previously.

Such precise analysis between conserved amino acid sequences and conformational changes of these receptors by various ligands might be important to clarify the similarity and diversity of activation mechanisms in nuclear receptor families. For the purpose of drug discovery, relationships between ligand binding and pharmaceutical activity caused by activation of each receptor will also be important.2,79)

4. CONCLUSION

Our previous group in the Genox Research Institute published an original paper in this Journal, Biol. Pharm. Bull., entitled as “Prostaglandin (PG) A2 acts as a trans-activator for NOR1 (NR4A3) within the nuclear receptor superfamily.”31) After more than 15 years, the citation rates and inquiries of this paper have increased dramatically, which might be related to the recent publication of two papers by another group.34,35)

As a type of de novo hydrophobic ligands, steroid compounds have important roles in health and have contributed to the clinical environment, working through various nuclear receptors. Besides steroids, there have been some successful nuclear receptor–ligand relationships such as retinoic receptors vs. retinoids and PPARγ vs. thiazolidine derivatives. There are still some orphan receptor families in the 48 nuclear receptor genome, whose endogenous ligands have not been identified yet.2) The NR4A superfamily, which consists of 3 receptors, is a typical example. There are no clinically successful cases of NR4A superfamily members yet, although there have been many preclinical studies on the pathological importance of this family.

This might be because of an X-ray structural report in 2003 by Wang et al. in which the Nurr1 (NR4A2) LBD contained no cavity and Nurr1 lacked a ‘classical’ binding site for coactivators.61) At that time, technology related to in silico ligand analysis or ligand design approaches was limited. We previously reported specific molecular linkage between PGA1/PGA2 and NR4A3 and suggested that PGA1/PGA2 might be endogenous ligands of the NR4A nuclear receptor family,31,32) but these ideas were not widely accepted.

After recent two reports about the co-crystallographic structures of NR4A2-LBD bound to PGA1 and PGA2,34,35) we focus on the further possibilities of the NR4A family as promising pharmaceutical molecular targets for immunological or neurological diseases. It is especially because information of pathological importance related to these receptors has increased and in silico simulation technologies between receptors and ligands have progressed greatly during the last 15 years.

Recent reports on ligand-bound 3D structures of NR4A proteins have enabled us to perform template-based comparative modeling to build 3D complex structures of many similar proteins and compounds, as shown in this paper. For a broader range of compounds, molecular dynamics, and covalent ligand docking will be necessary to predict their binding structures and affinities, because there are two challenging aspects for modeling and simulating NR4A-LBD and ligands. First, the conformation of NR4A-LBD is highly dynamic and flexible. This has been demonstrated through NMR studies75,76) and the significant structural differences between the apo and holo forms (Figs. 1B, 2C, D). The binding of unknown ligands may lead to novel binding pockets that have not been identified yet. Long-time molecular dynamics or coarse-grained simulation may be required to capture the flexibility of NR4A-LBD. Second, covalent bonding with the cysteine of the NR4A is necessary for the binding of some natural ligands (PGA1, PGA2, and IQ). Because classical molecular mechanics cannot represent the formation of covalent bonds, most docking programs struggle to calculate conformational potential energy and binding free energy for covalently bound ligands. Recently, several programs for covalent ligand docking have been developed, which might help model the LBD-ligand system.80)

The NR4A nuclear receptor family is among the most promising pharmaceutical targets as well as other nuclear receptor families although they are still recognized as an orphan receptor family. To analyze the binding and activation mechanisms of these receptor family members with ligand candidates such as the PGs discussed here, will be very informative. The design of novel compounds that function as agonists and activate these receptors will help develop future therapeutic strategies for NR4A-related human disorders. For this purpose, using in silico simulation tools for template-based modeling will be very useful.

Acknowledgments

This research was supported by the Research Support Project for Life Science and Drug Discovery (Basis for Supporting Innovative Drug Discovery and Life Science Research (BINDS)) from AMED under Grant Number: JP23ama121019. The model structures have been submitted to the Biological Structure Model Archive (BSM-Arc) under BSM-ID BSM00054 (https://bsma.pdbj.org/entry/54).72)

Conflict of Interest

The authors declare no conflict of interest.

Supplementary Materials

This article contains supplementary materials.

REFERENCES
 
© 2024 The Pharmaceutical Society of Japan
feedback
Top