Electronic Similarity among Protein-Ligand Complexes: Development of the          Interaction-Energy Projection Method

Manabu SUGIMOTO; Takafumi INOUE

doi:10.2477/jccj.2016-0061

Abstract

A numerical model, called the interaction energy projection method (IEPM), is suggested to evaluate electronic similarity among protein-ligand complexes. Herein we apply the method by referring to the "inter-fragment interaction energy (IFIE)," calculated using the fragment molecular orbital (FMO) method, in two human estrogen receptor complexes.

1 Introduction

Search for functional molecules is one of the most important and challenging issues in applied chemistry. It is also true in pharmaceutical science which demands the discovery of molecules bound to target proteins [1]. For systematic and efficient molecular search, development of quantitative measures of protein-ligand complexes seems demanded.

In this paper, we propose a method describing the amino acid residue (AAR)–ligand interaction, which is called "Interaction Energy Projection Method (IEPM)." By using this method, we can quantitatively compare similarity/dissimilarity among protein-ligand complexes. In the present implementation, we refer to the "inter-fragment interaction energy (IFIE)" between AAR-ligand pairs calculated using the fragment molecular orbital (FMO) method developed by Kitaura et al. [2]. Therefore, the similarity evaluated in this paper can be called "electronic similarity" reflecting the electronic interaction in the complexes.

Herein we use the IFIE data at the MP2/6-31G* level recently calculated by Anzaki et al. on the human estrogen receptor α with the Y537S mutation (hERα/Y537S in Figure 1 (a)) [3]. We evaluate the electronic similarity between two complexes of hERα, which are called 2QA6 [4] and 3UU7 [5]. Their ligands (KN2 and 2OH ligands, respectively) are shown in Figure 1.

Figure 1.

(a) Estrogen Receptor (hERα/Y537S), and the ligands of (b) KN2 and (c) 2OH bound to hERα/Y537S. Their complexes are named 2QA6 and 3UU7, respectively. The circled region in (a) indicates the ligand-binding site.

2 3D Description of Protein-Ligand Interactions

The IEPM method is decomposed into three steps: (A) three-dimensional description of the reference and target complexes, (B) orientation matching, and (C) similarity measurement. The atomic coordinates of the target protein-ligand complexes are taken from the Protein Data Bank [4,5].

The algorithm in Step (A) mentioned above is as follows:

(A1) the origin of the coordinate is taken as the geometric center of the ligand in the ER-ligand complex.

(A2) The IFIE is projected onto a sphere with a radius of R (5000 Å in the present study).

(A3) The position of each AAR is represented using the polar coordinate (r, θ, ϕ) of its C_α atom.

(A4) A Gaussian function in the (θ, ϕ) plane is defined to represent the AAR-ligand interaction. Overlap of the Gaussian functions defined to all the ligand-amino-acid residue pairs gives the projection sphere. We call this sphere "projected interaction pattern (PIP)." The resultant function P = P(θ, ϕ) is given as

P = ∑ I a l l A A R E L I exp [ − α { ( θ − θ I ) 2 + ( ϕ − ϕ I ) 2 } ]

where E_LI is the IFIE value for the ligand L and the summation is taken over all AARs. α is defined so that the half-width of the Gaussian is equal to 0.01r_la where r_la is the AAR-ligand distance. The topologies of the PIPs for 2QA6 and 3UU7 are shown in Figure 2 (a).

Figure 2.

(a) PIPs of 2QA6 (reference) and 3UU7, (b) their GPFs (dot and triangles for 2AQ6 and 3UU7, respectively) overlapped with PIPs, (c) GPFs only, (d) the overlap of GPFs after the reorientation, and (e) the reoriented PIPs with GPFs.

3 Orientation Adjustment for Similarity Analysis

In order to have maximum overlap between two PIPs, we adjust their mutual orientations using the following algorithm:

(B1) Initially, the 10⁴ sampling points are randomly generated on the mapped surface.

(B2) The number of sampling points near a peak is determined to reflect its height: when the IFIE value is x, we determine the number (n) as ceiling[|x|] (Iverson's ceiling function for the absolute value of x). The n points near the peak axis in the (θ, ϕ) space are chosen among the 10⁴ points. The points selected in this process are called "geometric-feature points" (GFPs).

(B3) Rotation of the target complex is carried out by minimizing the averaged distance between the pairs of the GFPs of the reference and target complexes shown in Figure 2. In this process one GFP of the target is chosen as the nearest neighbor of each GFP of the reference complex.

It is noted that translational matching is not necessary because the origin of the coordinate is taken as the geometric center of the ligand.

We show the overlap of GPFs of 2QA6 and 3UU7 after orientation adjustment in Figure 2 (d). The reoriented PIP of 3UU7 is shown in Figure 2 (e). These two figures indicate that the present procedure is reasonably good.

4 Electronic Similarity

The electronic similarity among the protein-ligand complexes is evaluated using the following algorithm:

(C1) The IFIE value is plotted with respect to θ and ϕ where the reoriented PIP is referred to. This process generates figures as shown in Figure 3.

Figure 3.

Mapping of the IFIE data onto the (θ, ϕ) plane: (a) 2QA6 and (b) 3UU7.

(C2) The electronic similarity S is calculated by using the following integral:

S = ∬ P r P t d θ d ϕ / [ ∬ | P r | 2 d θ d ϕ ∬ | P t | 2 d θ d ϕ ] 1 / 2

where the denominator is to normalize both P_r and P_t of the reference and target complexes, respectively.

The calculated value of S between 2QA6 and 3UU7 was 63.0% in the present method. This is not so high although the complexes are known to exist experimentally. The unexpectedly small value may be due to the fact that, although the AAR giving the most intense peak in 2QA6 and 3UU7 is common, the IFIE in 3UU7 is about 1/2 of that in 2QA6. This implies that, although a sufficiently strong binding site is necessary, the multi-site interaction should also play an important role.

5 Summary and Conclusions

We have developed a numerical method called the interaction energy projection method (IEPM) in order to compactly describe electronic AAR-ligand interactions in protein-ligand complexes. The present method facilitates numerical evaluation of electronic similarity in the complexes. It is expected that important patterns in the complexes will be found through systematic applications of the present method.

Acknowledgment

A part of this work was financially supported by MEXT KAKENHI Grant Number 26102015. This research was also done in activities of the FMO drug design consortium (FMODD). We are grateful to Prof. K. Fukuzawa, Prof. S. Tanaka, and Mr. S. Anzaki for providing the IFIE data used in this paper.

References

[1] R. B. Silverman, “The Organic Chemistry of Drug Design and Drug Action (2nd Ed.)”, Elsevier, Amsterdam, 2004.
[2] K. Kitaura, E. Ikeo, T. Asada, T. Nakano, M. Uebayasi, Chem. Phys. Lett., 313, 701 (1999).
[3] S. Anzaki, C. Watanabe, Y. Okiyama, T. Honma, K. Fukuzawa, S. Tanaka, "Comprehensive Analysis of Protein-Ligand Interactions in Estrogen Receptor α Using Fragment Molecular Orbital Method" (P1–25), presented at the Chem-Bio Informatics (CBI) Society Annual Meeting 2015.
[4] http://www.rcsb.org/pdb/explore/explore.do?pdbId=2QA6
[5] http://www.rcsb.org/pdb/explore/explore.do?structureId=3UU7

Corresponding author

Correction information

Register with J-STAGE for free!