Journal of Computer Chemistry, Japan -International Edition
Online ISSN : 2189-048X
ISSN-L : 2189-048X
General Paper
Development of PolyParGenv2 Software for Determination of Molecular Dynamics Simulation Parameters for Molecules with Crosslinked or Condensed Ring Structures
Makoto YABEKazuki MORIJun KOYANAGI
Author information
JOURNAL FREE ACCESS FULL-TEXT HTML

2020 Volume 6 Article ID: 2019-0031

Details
Abstract

We had previously developed o2p, a parameter conversion semi-automation program for polymers with repeating structures, and PolyParGen, a parameter acquisition program primarily meant for chainlike macromolecules. These programs use the parameters obtained via the automatic parameter generation tool. However, with these programs, it is difficult to acquire parameters of molecules with crosslinked structures or large condensed ring structures. To acquire these parameters, herein we improved the algorithm to develop PolyParGen v2. Further, this program adds Antechamber as the automatic parameter generation tool and can obtain the Amber force field parameter for polymers. This program was evaluated via simulations using graphene, carbon nanotubes, and fullerene. Our results agree with those of previous studies and verify the effectiveness of the proposed program.

1 Introduction

Obtaining parameters corresponding to polymers with crosslinked structures, such as an epoxy resin; carbon compounds such as graphene, carbon nanotubes (CNTs), fullerenes; and derived compounds having random defects or modifications in their structures is difficult. These compounds have been extensively studied as composite and electronic materials, and there are many reports on Molecular Dynamics (MD) simulations [1,2,3,4,5,6,7,8,9,10], and the number is only expected to increase in the future. Previously, we had developed o2p [11] and PolyParGen [12] to easily obtain the files necessary for the calculation of MD for polymers with repeating structures, randomly polymerized polymers, and randomly modified polymers. However, these programs are difficult to apply to molecules with crosslinked structures or condensed ring structures. Further, in the cases of materials like derivatives of fullerene, it may be impossible to acquire parameters via LigParGen [13], even if the number of atoms is within the limit that can be analyzed by the program. To address this problem, we aimed to improve the algorithm of PolyParGen by dividing the molecule at the cyclic structural sites and making it possible to obtain the files for MD simulations of these molecules by combining the parameters of the divided molecules. This study discusses the aforementioned improvements made to PolyParGen.

2 Development environment

Like PolyParGen, the program proposed in this paper is also a web application. We created a user interface using PHP [14], HTML, and JavaScript, while Python [15] was used to compose the body of the program. In addition to LigParGen, Antechamber [16] can also be used as an automatic parameter generation tool to obtain OPLS and Amber force field parameters. ACPYPE [17] was used to convert the Antechamber output to the Gromacs format. To obtain the OPLS force field, Gromacs and LAMMPS [18] formats were used.

3 System overview

3.1 Input file format

While dividing a molecule having a conjugated double bond, such as a graphene molecule, it is impossible to execute the division properly and add hydrogen to the terminal unless the bonding order is accurate. Additionally, because the file input into LigParGen has no bonding order, such as PDB format, it sometimes becomes impossible to acquire the parameter because the automatically determined bonding order is different from the intended bonding order. PolyParGen assumes that a molecule does not contain a large condensed ring structure. In the case of double/triple bonds and ring structures, the process can be carried out without any problems because the molecules are not divided, .As these structures are also subject to division in this program, the input file format is limited to CML to ensure the accurate acquisition of the bonding order.

3.2 Division of the molecule into partial structures

3.2.1 Selecting split atoms

As PolyParGen was primarily intended for linear polymers, the molecules were divided along the main chain. In this program, to correspond to a spherical molecule such as fullerene or a crosslinked structure (possibly three dimensional), the division site is determined as follows. First, one atom is randomly selected from the input molecule (Figure 1 (a)). Based on the selected atom, the connected atoms are traced and added to the same group as the previous atom as a split atom. At this time, the maximum number of atoms in one group is set so that the whole molecule does not constitute one group. Atoms are randomly selected from the remaining ones, and the aforementioned operation is repeated to divide the entire molecule into split atoms (Figure 1 (b)). Although the initial value of the maximum number of atoms is 150 (the same as the number of atoms input to the automatic parameter generation tool), it can be set arbitrarily to any value in the range of 1 to 150. The automatic parameter generation tool can acquire the parameters of molecules such as fullerenes, whose parameters cannot be acquired by intentionally dividing the molecule into small molecules even within the number of permissible atoms corresponding to the parameter automatic generation tool.

Figure 1.

 As an example of molecular division, small molecules are represented as planes merely for explanation, but they are present in three dimensions. (a) Input molecule. (b) Partitioned molecule. Atoms depicted in the same color are assigned to the same part of the divided molecule. (c) In order to obtain the MD parameters of the color-coded partial parts in (b), partial molecules are created. A contains B as a partial structure and C contains D and E as partial structures, so B, D and E are discarded. From the partial molecules, A and C, the parameters of the whole molecule are obtained.

3.2.2 Adding overlap atoms and hydrogen

Next, the atoms attached to the split atoms are traced, and five overlap atoms are added. These atoms are selected to be able to maintain several uncondensed benzene rings. Further, hydrogen (or two hydrogen atoms if it is a double bond carbon) is added to the end depending on the bond order to form a partial molecule (Figure 1 (c)). At this time, if the number of atoms that can be input to the parameter automatic generation tool (this time it is also set to 150) is exceeded, the maximum number of split atoms is reduced and the process is once again repeated from split atom creation onwards. This operation is repeated until all partial molecules are within 150 atoms or the maximum number of split atoms is 1. Unless these conditions are met, it might still not be possible to obtain the parameters. Further, if one partial molecule is completely contained in another partial molecule (Figure 1 (c)), the former molecule is deleted because it is unnecessary.

3.2.3 Maintaining the ring structure

In PolyParGen v2, the ring structure is also targeted for division, but it seems that, in some cases, the ring structure on the polymer main chain need not be targeted for division (Figure 2). Therefore, a large ring structure, such as a crosslinked structure, is used as a target site for division, and in order to suspend division at the ring structure of the polymer main chain, the size of the ring not to be divided can be set. However, if it is set by graphene, CNTs, or fullerenes, a divisible site cannot be found, and it becomes impossible to acquire the parameters.

Figure 2.

 If the user wants to split the part depicted in blue instead of the benzene ring, the option, “Maintain ring structure” should be selected.

3.3 Automatic parameter generation

In this program, automatic parameter generation tools, Antechamber and LigParGen, are executed on the same server, and the parameters of the partial structure are acquired. Automatic parameter generation tools sometimes fail to execute the process depending on the structure, but if it fails in this program, it is altered to try to acquire the parameters again after further structure optimization. Although this increases the success rate, similar to the case of PolyParGen, the parameters cannot be acquired if there is even one partial molecule that the automatic parameter generation tool fails for, even after structure optimization.

3.4 Combining parameters

One partial molecule is selected, and a partial molecule having a split atom which binds to a split atom of the selected partial molecule is identified. Then, both are joined. Further, unbound partial molecules with a split atom bonded to a split atom of a partial molecule are retrieved and combined. This operation is repeated until all the partial molecules are combined. Finally, the charge is adjusted in a manner similar to that in PolyParGen.

3.5 Estimation of atomic charge

When users obtain the molecular parameters of Amber or OPLS, they have the option of obtaining the calculated atomic charge of all split fragments via ab initio calculations, too. The ab initio calculation software used was NWChem [19]. Users can select ESP or Mulliken atomic charges except the original atomic charges of OPLS and Amber. Previously, we had simulated our PolyParGen using graphene sheets [20]. We had reported good agreement of our results with the experimental data.

4 Application examples

Based on three application examples, we evaluated the molecular parameters created by PolyParGen. The first example comprised MD simulations of CNTs and graphene sheets in water. CNT and graphene sheets have been reported to have a wrapped structure in water [21]. A second set of MD simulations for [6,6]-phenyl-C61-butyric acid methyl ester (PCBM) in chlorobenzene, 1-chloronaphthalene, and 1,8-diiodooctane was also conducted. The last simulation was the estimation of the interface energy between graphene sheet and resin. The steps for each of these simulations are given below.

In the first example, using Avogadro [22], 6 nm × 2 nm graphene sheets and 2-nm long CNTs (16, 0) were prepared and saved in the CML format. The obtained CML structure file was input into PolyParGen v2, and as with the preceding example, the itp and gro files were downloaded from this program. LigParGen was used as the automatic parameter generation tool. The obtained MD molecular parameter files were implemented in the 2019 version of Gromacs to perform the 10-ns MD simulation. The simulation conditions were set to the values suggested by Gade et al. [21]. In other words, a 10 × 6 × 6 nm3 box, in which graphene sheets and CNT were arranged at a distance of 1.4 nm, was added with 12000 water molecules as an initial structure. The initial structures have been depicted in Figure 3 (a). As is apparent from Figure 3 (b) and (c), the MD simulation yielded a wrapped structure.

Figure 3.

 Simulation snapshots at (a) 0 ns, (b) 5 ns and (c) 10 ns.

In the second example, the PCBM structure was downloaded from jasonlarkin/pcbm [23]. Structures of chlorobenzene, 1-chloronaphthalene, and 1,8-diiodooctane were created in Avogadro. These structure files were obtained and input into PolyParGen v2, and as with the preceding example, the itp and gro files were downloaded from this program. PCBM yields the number of atoms that can acquire parameters via LigParGen, but the web version of LigParGen timed out for both pdb format and mol format, and so the parameters could not be acquired. The parameter files for chlorobenzene, 1-chloronaphthalene and 1,8-diiodooctane were separately obtained via LigParGen. In Antechamber, an error occurred in a partial structure containing a three-membered ring structure, and, as a result, the PCBM parameters could not be obtained. Then, ten PCBM molecules were set in a simulation box, and none or 40 1-chloronaphthalene molecules or 40 1,8-diiodooctane molecules were inserted. Finally, 800 chlorobenzene molecules were inserted. After NVT ensemble and NPT ensemble, under conditions of 1 atm pressure, 0.1 fs time step, 300 K temperature, 100-ns simulation was performed. The cluster size distribution for PCBM molecules (Figure 4) confirmed that the results agree with those previous reports [24].

Figure 4.

 Cluster size distribution for 10 PCBM molecules

In the last example, we investigated the interface energy between graphene and resin to estimate the mechanical properties of the composite material. The simulated resins were TriA-X, bisphenol A diglycidyl ether (DGEBA), triethylenetetramine (TETA), vinyl ester, and polyamide 6 (PA6). Their chemical structures have been depicted in Figure 5. The simulated epoxy resin was made to react with DGEBA and TETA. In addition, the dimensions of the graphene sheets used were 11.92 nm × 10.39 nm. The number of carbon atoms in this graphene sheet is 4900. Three layers of graphene sheet were set in the simulation model. To estimate the surface energy between graphene and resin, we constructed a model in which the resins were distributed on the graphene model as depicted in Figure 6. The interface energy is calculated by equation (1). E denotes Energy and is calculated via molecular simulation. S denotes the area of the graphene sheet.    E(interface energy) = (E(total) − E(resin) + E(graphene))/2S    (1)   

Figure 5.

 Chemical structures of (a) TriA-X; (b) DGEBA; (c) TETA; (d) Vinyl ester; (e) PA6

Figure 6.

 Model of graphene and resin for estimating the interface energy between graphene and resin.

Each interface energy has been tabulated in Table 1. The absolute values of obtained interface energies were in the order of TriA-X polyimide> epoxy> vinyl ester> PA6. The order of the absolute values was the same compared to that of the interfacial strengths obtained via FEM and micro bond test. Based on the results, we could conclude that this interface energy was the one corresponding to interfacial peeling factors caused by the inputted atomic charges and molecular parameters of each graphene carbon atom appropriately [25].

Table 1.  The interface energy between graphene sheets and resin.
Resin Interface energy[J/m2] Interface strength [MPa]
TriA-X −0.200 130
Epoxy −0.174 74
Vinyl ester −0.074 43
PA6 −0.021 26

5 Conclusions

The program proposed in this paper enables us to easily prepare the files necessary for the MD simulation of macromolecules with crosslinked and condensed ring structures. Moreover, it makes it possible to acquire parameters for molecules for which LigParGen fails to acquire parameters. This works even with small molecules, by dividing them into smaller molecules. Further, even if LigParGen failed, the success rate was improved by retrying after structure optimization. By randomizing the division of molecules, it is sometimes possible to obtain parameters by inputting the data again into PolyParGen v2, even for molecules for which the program once failed, and as a whole, the parameter acquisition rate was improved. However, as with PolyParGen v2, this program is only applicable in the case of neutral molecule correspondence, and correspondence with charged molecules is delegated to a future task. Further, for an input molecule with a condensed aromatic ring structure, such as graphene, five overlapping atoms are provided from the atom at the origin, and the parameters are created by integrating those of many partial molecules. Therefore, even in the case of condensed benzene rings, such as graphene and CNT, the carbon of the aromatic ring retains sp2 hybridization. However, although parameters can be accurately obtained via this method, it takes a long time to calculate them because there are many partial molecules. For molecules in which the fused benzene ring is not as continuous as in graphene or CNT, selecting “Maintain ring structure” can divide the molecule to protect large aromatic rings without breaking them. If splitting is not possible while maintaining the ring, the user is notified automatically. This program is free and open to all at the homepage of ITOCHU Techno-Solutions Corporation, http://polypargen.com/.

Acknowledgements

This study was partially supported by JST MIRAI Grant number 19215408.

References
 
© 2020 Society of Computer Chemistry, Japan
feedback
Top