Extraction of Local Structure Information from X-ray Absorption Near-Edge Structure: A Machine Learning Approach

Megumi Higashi; Hidekazu Ikeno

doi:10.2320/matertrans.MT-MG2022028

Abstract

In this work, we constructed machine learning models that predict structural descriptors, which numerically represent three-dimensional atomic structures, from X-ray absorption near-edge structure (XANES) spectra. Neural network models were constructed to predict radial distribution functions (RDFs) and the orbital-field matrix (OFM), a descriptor that captures the anisotropy of the local structure, the valence electron number of the ligand, and orbital information. We used more than 120,000 O K-edge XANES spectra from the Materials Project database as the data set for training and testing. The RDF model roughly reproduced the RDFs for 74% of the test data, and the OFM model captured the overall features of the OFM for 97% of the test data. These results demonstrate that atomic structural information can be extracted directly from XANES spectra using neural network models.

1. Introduction

Core-electron spectroscopies, a group of spectroscopic techniques that probe the local electronic structure around a particular element in a material, are powerful tools for investigating nanoscale structures.¹⁻³⁾ Among them, X-ray absorption near-edge structure (XANES) is the most common spectroscopic technique used in a wide range of fields in physics and chemistry.⁴,⁵⁾ In XANES, a core electron is excited into unoccupied orbitals. XANES therefore provides information on local geometric structure, such as coordination numbers, interatomic distances, and symmetries, and on electronic structure, such as chemical states, spin states, and chemical bonding around the absorbing atoms. Recent developments in experimental equipment and measurement techniques have made it possible to obtain large amounts of XANES spectral data quickly.⁶,⁷⁾ For example, time-resolved or operando measurements make it possible to record XANES while tracing chemical reaction processes occurring on picosecond and nanosecond scales.⁸⁻¹¹⁾ Electron energy loss near-edge structure (ELNES) measured with a transmission electron microscope (TEM) provides information similar to XANES, as both correspond to the same electronic transitions. The advantage of ELNES over XANES is its high spatial resolution. With a modern scanning TEM (STEM) equipped with a probe aberration corrector, the spatial resolution of ELNES reaches the sub-ångström level, which makes it possible to obtain spectrum images from single atomic columns in crystals.¹²⁻¹⁴⁾

Analyzing local electronic structures from the numerous XANES/ELNES spectra obtained by the state-of-the-art techniques described above is not an easy task. Various first-principles approaches for calculating XANES spectra have been proposed and have successfully reproduced a wide variety of experimental spectra.¹⁵⁻¹⁸⁾ The local atomic arrangement and electronic structure can be determined by enumerating the possible atomic structures of the target system and fingerprinting the theoretical XANES obtained for these structures against the experimental spectra. Although this approach is the most reliable, the computational cost can become enormous when the number of possible atomic structures is large, e.g., when considering the spectra from impurity atoms in solids or adsorbates on surfaces. Therefore, a novel approach that can simulate and predict XANES spectra from given atomic structures with much less computational time than first-principles calculations is highly desirable.

Recently, machine learning has spread rapidly in materials science and has been applied to various problems, such as high-throughput materials exploration and virtual screening of atomic structures.¹⁹⁻²²⁾ Machine learning algorithms are also used to accelerate quantum chemical and density functional calculations.²³⁻²⁶⁾ Several applications of machine learning to the analysis of XANES spectra have also been reported. Rankine et al. succeeded in constructing deep neural network models that predict XANES spectra directly from atomic structures.²⁷⁾ Data-driven approaches for interpreting XANES spectra based on tree-based algorithms have also been reported.²⁸,²⁹⁾

Another aspect of XANES analysis using machine learning is the extraction of local structural information around the excited atom. Timoshenko et al. developed a neural network (NN) model to extract three-dimensional structures of Pt nanoparticles from Pt-L₃ XANES spectra.³⁰⁾ Attempts to extract radial distribution functions (RDFs) from XANES spectra have also been made by several groups.³¹,³²⁾ However, no machine learning model that predicts three-dimensional local atomic structures, including the anisotropy around the central atom, from XANES spectra has been reported. In this study, we constructed machine learning models to predict structural descriptors that numerically represent atomic structures in three dimensions from XANES. We focused on two types of structural descriptors: one is the RDF, which reflects the interatomic distances, and the other is the orbital-field matrix (OFM), which contains information on interatomic distances, bond angles among neighboring atoms, and electronic configurations of ligand atoms.³³⁾ The performance of deep NN models in predicting atomic structure information is demonstrated.

2. Methods

2.1 Data collection

Theoretical XANES and crystal structure data used for machine learning were collected from the Materials Project, a database of electronic structure calculations.³⁴⁾ This database contains theoretical XANES spectra calculated using the FEFF code based on multiple scattering theory.³⁵,³⁶⁾ In this study, atomic structures and theoretical O-K edge XANES spectra were collected for binary, ternary, and quaternary compounds containing oxygen, which have rich variations in chemical compositions and coordination structures. The database provides site-dependent XANES spectra for materials with multiple O sites, and the data set of local structures and spectra was created with this site dependence taken into account. The constituent elements of the compounds range from H to Pu, and the frequency of occurrence varies considerably from element to element. If the number of local structures containing a particular element is small, learning may be insufficient and the generalization performance may degrade. We therefore removed the data containing a particular element if the number of local structures containing that element was fewer than 2000. In the end, we obtained a data set containing 122,160 local structures and O-K XANES spectra from 16,285 compounds.
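The element-frequency filter described above can be sketched as follows. This is a minimal illustration, not the authors' code: the record layout (a dict with an "elements" set) and the function name are hypothetical, and the filter is applied in a single pass, which is an assumption because the text does not state whether the counts were recomputed after removal.

```python
from collections import Counter

def filter_rare_elements(records, min_count=2000):
    """Drop local structures containing any element that occurs in
    fewer than `min_count` local structures (single pass).

    `records` is a list of dicts with an "elements" key holding a set
    of element symbols; this layout is assumed for illustration only.
    """
    counts = Counter()
    for rec in records:
        counts.update(set(rec["elements"]))
    rare = {el for el, n in counts.items() if n < min_count}
    return [rec for rec in records if not (set(rec["elements"]) & rare)]

# Toy data with a tiny threshold so that the effect is visible.
toy = [
    {"id": "a", "elements": {"O", "Ti"}},
    {"id": "b", "elements": {"O", "Ti"}},
    {"id": "c", "elements": {"O", "Pu"}},
]
kept = filter_rare_elements(toy, min_count=2)
```

In the toy data, Pu appears in only one local structure, so record "c" is dropped while "a" and "b" are kept.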

Theoretical XANES spectra were discretized in the range of 520 eV to 580 eV in 0.01 eV increments to produce vector data, which were used as the explanatory variables of machine learning models. The RDFs and OFMs were calculated from atomic structures.
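As an illustration of this discretization, the sketch below resamples a spectrum onto a fixed energy grid by linear interpolation. The interpolation scheme, the clamping outside the tabulated range, and the inclusion of both end points (which gives 6001 grid points) are assumptions; the text specifies only the energy range and the increment.

```python
from bisect import bisect_right

def make_grid(e_min=520.0, e_max=580.0, step=0.01):
    """Energy grid in eV, both end points included (an assumption)."""
    n = int(round((e_max - e_min) / step)) + 1
    return [e_min + i * step for i in range(n)]

def resample(energies, intensities, grid):
    """Linearly interpolate a spectrum onto `grid`.

    `energies` must be sorted in ascending order. Grid points outside
    the tabulated range take the nearest end-point value.
    """
    out = []
    for e in grid:
        if e <= energies[0]:
            out.append(intensities[0])
        elif e >= energies[-1]:
            out.append(intensities[-1])
        else:
            j = bisect_right(energies, e)
            e0, e1 = energies[j - 1], energies[j]
            t = (e - e0) / (e1 - e0)
            out.append((1.0 - t) * intensities[j - 1] + t * intensities[j])
    return out

# Toy triangular "spectrum" peaking at 550 eV.
grid = make_grid()
vec = resample([520.0, 550.0, 580.0], [0.0, 1.0, 0.0], grid)
```

The resulting list `vec` plays the role of the input vector of explanatory variables.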

We randomly selected 80% of the 16,285 compounds in the data set and used their site-dependent spectra as training data, while the site-dependent spectra of the remaining 20% were used as test data to verify the generalization performance of the trained model. The numbers of training and test data are 98,528 and 23,632, respectively.
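Because the split is made over compounds rather than over individual sites, all O sites of one compound fall on the same side of the split. A minimal sketch of such a compound-level split is given below; the record layout, the identifiers, and the random seed are hypothetical.

```python
import random

def split_by_compound(site_records, test_fraction=0.2, seed=0):
    """Split site-level records so that every site of a given compound
    ends up in the same subset (training or test)."""
    compounds = sorted({rec["compound"] for rec in site_records})
    rng = random.Random(seed)
    rng.shuffle(compounds)
    n_test = int(round(test_fraction * len(compounds)))
    test_ids = set(compounds[:n_test])
    train = [r for r in site_records if r["compound"] not in test_ids]
    test = [r for r in site_records if r["compound"] in test_ids]
    return train, test

# Toy data: 10 compounds with 3 O sites each.
toy = [{"compound": f"mp-{c}", "site": s} for c in range(10) for s in range(3)]
train, test = split_by_compound(toy)
```

With this scheme the training and test fractions by site count need not be exactly 80% and 20%, since compounds differ in their number of O sites.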

2.2 Structure descriptors

The coordinates and atomic numbers of constituent atoms can describe the local atomic structure around an absorption O-atom. However, this information cannot be used directly for machine learning because the number of atoms varies depending on the material and site. Therefore, it is necessary to convert the local atomic structure into descriptors. Various descriptors for molecules and crystals have been proposed to predict the properties of materials. The descriptors used in this study are as follows.

(1) Radial distribution functions (RDFs)

The RDF g(r) is formally defined as

\begin{equation} g(r) = \frac{dn(r)}{4\pi r^{2}\rho(r)dr} \tag{1} \end{equation}

where r is the distance from an absorption O atom and dn(r) is the number of atoms located between [r, r + dr]. ρ(r) is the local atomic density around an absorption O atom. RDF reflects the information on the coordination number and bond distance around the absorption O site.
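A discretized version of eq. (1) can be sketched as a histogram over neighbor distances. The 500 bins and the 5 Å cutoff follow values used elsewhere in this paper; treating ρ as a constant number density and using no broadening are simplifying assumptions, and the neighbor distances in the example are made up.

```python
import math

def rdf_histogram(distances, r_max=5.0, n_bins=500, density=1.0):
    """Histogram estimate of g(r) around one absorbing atom.

    `distances` are neighbor distances in angstrom. `density` stands in
    for rho(r) and is taken as a constant here (an assumption).
    """
    dr = r_max / n_bins
    counts = [0] * n_bins
    for d in distances:
        if 0.0 < d < r_max:
            counts[min(int(d / dr), n_bins - 1)] += 1
    g = []
    for k, n in enumerate(counts):
        r = (k + 0.5) * dr  # bin center
        g.append(n / (4.0 * math.pi * r * r * density * dr))
    return g

# Hypothetical neighbor shells: 4 atoms at 1.955, 2 at 2.055, 8 at 2.905.
g = rdf_histogram([1.955] * 4 + [2.055] * 2 + [2.905] * 8)
```

Each occupied bin gives one peak whose height, multiplied by 4πr²ρ dr, recovers the number of atoms in that shell.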

(2) Orbital-field matrices (OFMs)

Pham et al. proposed an OFM as a representation of a local atomic structure.³³⁾ The OFM, X^c, is described as,

\begin{equation} X^{c} = \overrightarrow{O^{c}}^{T} + \sum_{n = 1}^{M}\overrightarrow{O^{c}}^{T}\overrightarrow{O^{n}}\theta_{cn}\zeta(r_{cn}) \tag{2} \end{equation}

where $\overrightarrow{O^{c}}$ and $\overrightarrow{O^{n}}$ are vectors representing the electron configurations of the outermost shells of the central O atom and a neighboring atom, respectively. θ_cn is a weighted coordination number defined using the solid angles determined by the faces of the Voronoi polyhedron between atoms c and n,³⁷⁾ and ζ(r_cn) is a non-negative function of the interatomic distance between these atoms. In this work, $\zeta (r_{cn}) = r_{cn}^{ - 4}$ is adopted. The OFM reflects the distortion of the coordination polyhedron together with the ligand type and coordination number.
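The neighbor sum in eq. (2) can be sketched as a weighted sum of outer products of one-hot configuration vectors. The 32-entry orbital dictionary (s1 to s2, p1 to p6, d1 to d10, f1 to f14) follows the OFM definition of Pham et al.; the θ values and distances below are made up, and the Voronoi construction that would supply θ_cn is not reproduced here.

```python
# Orbital dictionary of the OFM: s1-s2, p1-p6, d1-d10, f1-f14.
ORBITALS = ([f"s{i}" for i in range(1, 3)] + [f"p{i}" for i in range(1, 7)]
            + [f"d{i}" for i in range(1, 11)] + [f"f{i}" for i in range(1, 15)])

def one_hot(config):
    """One-hot vector of the valence sub-shell occupations in `config`."""
    return [1.0 if name in config else 0.0 for name in ORBITALS]

def ofm_neighbor_term(center_config, neighbors):
    """Neighbor sum of eq. (2) with zeta(r) = r**-4.

    `neighbors` is a list of (config, theta, r) tuples, where theta is
    the Voronoi solid-angle weight (supplied by the caller here).
    """
    oc = one_hot(center_config)
    n = len(ORBITALS)
    X = [[0.0] * n for _ in range(n)]
    for config, theta, r in neighbors:
        on = one_hot(config)
        w = theta * r ** -4
        for i in range(n):
            if oc[i] == 0.0:
                continue  # rows of unoccupied center orbitals stay zero
            for j in range(n):
                X[i][j] += oc[i] * on[j] * w
    return X

# O center (2s2 2p4) with two hypothetical Ti neighbors (3d2 4s2).
X = ofm_neighbor_term({"s2", "p4"},
                      [({"s2", "d2"}, 1.0, 2.0), ({"s2", "d2"}, 0.5, 2.0)])
```

For an O center, only the s2 and p4 rows of the resulting matrix are non-zero, and the non-zero columns identify the valence configuration of the ligands.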

2.3 Deep neural network model

We created fully connected NN models to predict RDFs and OFMs separately from XANES spectra, as shown in Fig. 1. These NN models take the discretized XANES spectrum as the input layer and output either the RDF discretized at 500 points or the OFM (X^c in eq. (2)). Hidden layers lie between the input and output layers. Each hidden node and output node receives inputs {x_i} from the nodes in the previous layer, applies an activation function ϕ to each input, and outputs the sum ∑_i w_iϕ(x_i), where {w_i} are the weights assigned to the edges connecting the nodes. The weights are optimized on the training data by backpropagation with the Adam optimizer.³⁸⁾ We used scaled exponential linear units (SELUs) as the activation function.³⁹⁾ The loss function used in this work was the mean squared error with an L₂ regularization term,

\begin{equation*} E = \sum_{i}(y_{i} - f_{i}(\boldsymbol{x},\boldsymbol{w}))^{2} + \frac{\alpha}{2}\|\boldsymbol{w}\|_{2}^{2} \end{equation*}

where y_i and f_i represent an element of the objective variables and its prediction by the NN model, respectively, and α > 0 is the regularization parameter. The hyperparameters, such as the number of hidden layers and the number of nodes in each hidden layer, were determined using Bayesian optimization to maximize the prediction accuracy on the validation data. The hyperparameters used for training are shown in Table 1.
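The node operation and the loss function described in this section can be sketched in plain Python as follows. This is a toy illustration rather than the trained model: the weights are arbitrary, no bias terms are included because none are mentioned in the text, and the L₂ penalty is taken as the squared norm of the weights, which is the usual form and is an assumption here. The SELU constants are those of Klambauer et al.

```python
import math

SELU_ALPHA = 1.6732632423543772
SELU_SCALE = 1.0507009873554805

def selu(x):
    """Scaled exponential linear unit."""
    return SELU_SCALE * (x if x > 0.0 else SELU_ALPHA * (math.exp(x) - 1.0))

def layer(inputs, weights):
    """Node j outputs sum_i weights[j][i] * selu(inputs[i])."""
    act = [selu(x) for x in inputs]
    return [sum(w * a for w, a in zip(row, act)) for row in weights]

def forward(x, weight_matrices):
    """Pass the input vector through all layers in sequence."""
    h = x
    for W in weight_matrices:
        h = layer(h, W)
    return h

def loss(y, f, weight_matrices, alpha):
    """Sum of squared errors plus (alpha / 2) * squared L2 norm of weights."""
    sq = sum((yi - fi) ** 2 for yi, fi in zip(y, f))
    l2 = sum(w * w for W in weight_matrices for row in W for w in row)
    return sq + 0.5 * alpha * l2

# Toy network: 2 inputs -> 2 hidden nodes -> 1 output, arbitrary weights.
weights = [[[0.5, -0.5], [1.0, 0.0]], [[1.0, 1.0]]]
pred = forward([1.0, 0.0], weights)
E = loss([0.0], pred, weights, alpha=0.1)
```

In practice the weights would be updated by an optimizer such as Adam using the gradient of E, which is omitted from this sketch.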

Fig. 1 Schematic drawing of the neural network model that predicts structure descriptors from XANES spectra.

Table 1 Hyperparameters used for training NN models.

The coefficient of determination R² was used in this study as an index to evaluate the accuracy of NN models. R² is defined as,

\begin{equation} R^{2} = 1 - \frac{\sum_{k}(y_{k} - f_{k})^{2}}{\sum_{k}(y_{k} - \bar{y})^{2}}, \tag{3} \end{equation}

where $\bar{y}$ is the average value of {y_k}. The closer the value of R² is to 1, the more similar the predicted values are to the original ones.
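Equation (3) can be evaluated directly, as in the short sketch below. The function name and the toy values are illustrative only.

```python
def r_squared(y_true, y_pred):
    """Coefficient of determination, eq. (3)."""
    mean = sum(y_true) / len(y_true)
    ss_res = sum((y - f) ** 2 for y, f in zip(y_true, y_pred))
    ss_tot = sum((y - mean) ** 2 for y in y_true)
    return 1.0 - ss_res / ss_tot

y = [0.0, 1.0, 2.0, 3.0]
perfect = r_squared(y, y)                      # exact prediction
baseline = r_squared(y, [1.5] * 4)             # always predicting the mean
partial = r_squared(y, [0.0, 1.0, 2.0, 4.0])   # one element off by 1
```

An exact prediction gives R² = 1, while a model that always outputs the mean of {y_k} gives R² = 0, so R² measures the improvement over that trivial baseline.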

3. Results and Discussion

3.1 Neural network model for RDFs

The NN model for predicting RDFs centered at an absorption O-atom from O-K XANES spectra with R_max = 5 Å was constructed following the procedure described in Sec. 2.

Figure 2 shows the predicted RDFs obtained using the NN model for selected compounds in the test data. The NN model accurately reproduces the theoretical RDFs for SnO₂ (R² = 0.990), Sr₂MnO₄ (R² = 0.988), and TiO₂ (R² = 0.986). Small deviations between the predicted and theoretical RDFs are found for SrFeO₃ (R² = 0.916), CaFe(SiO₃)₂ (R² = 0.881), and LiTiO₂ (R² = 0.858), but the main features of the RDFs are reproduced. For LaTiCrO₆ (R² = 0.831), Cu₅Bi₂(B₂O₇)₂ (R² = 0.812), and CrCoO₄ (R² = 0.811), good agreement between the predicted and theoretical RDFs is found within 3 Å, while significant deviations are found beyond 3 Å. In addition, some of the predicted RDFs do not capture the general shape of the theoretical ones, such as those for FeO₂ (R² = 0.716), USbO₅ (R² = 0.613), and V₂O₃ (R² = 0.508).

Fig. 2 RDFs around the O site predicted by the NN model (red lines) for selected test data, compared with the theoretical RDFs calculated from the crystal structures (blue lines). The R² values calculated by eq. (3) are also shown.

Figure 3(a) shows the histogram of the distribution of the R² values for all test data. As shown in Fig. 2, the predicted RDFs reproduce the main features of the theoretical ones for the data with R² values of 0.85 or higher. These data accounted for 56.4% of the total. In addition, as mentioned earlier, many of the data with R² values less than 0.85 showed a significant discrepancy only beyond 3 Å, and this discrepancy reduces the R² values. We therefore constructed another NN model for RDFs with R_max = 3 Å. This model exhibits higher accuracy than the NN model with R_max = 5 Å: 73.9% of the test data had R² values of 0.85 or higher, as shown in Fig. 3(b).
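The percentages quoted above are shares of test data whose R² reaches the threshold. The sketch below shows this bookkeeping, applied only to the twelve R² values quoted for the selected compounds of Fig. 2; these are not a representative sample, so the resulting share is not comparable to the 56.4% obtained for the full test set.

```python
def fraction_at_or_above(r2_values, threshold):
    """Share of samples whose R^2 is at or above `threshold`."""
    return sum(1 for v in r2_values if v >= threshold) / len(r2_values)

# R^2 values quoted in the text for the compounds shown in Fig. 2.
fig2_r2 = [0.990, 0.988, 0.986, 0.916, 0.881, 0.858,
           0.831, 0.812, 0.811, 0.716, 0.613, 0.508]
share = fraction_at_or_above(fig2_r2, 0.85)
```

Six of the twelve selected compounds meet the 0.85 criterion.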

Fig. 3 Histograms of the distribution of R² values for RDFs in the test data. (a) and (b) are the results for the models that predict RDFs within 5 Å and 3 Å of the central O atom, respectively.

The results demonstrate that the RDFs close to the absorption atom can be extracted directly from XANES spectra using machine learning. Theoretically, XANES reflects the information of unoccupied orbitals localized by the core-hole effects on excited atoms. The results are consistent with the physical picture that XANES strongly reflects the information near the excited atoms.

3.2 Neural network model for OFMs

The NN model for predicting OFMs around the absorption O-atoms from O-K XANES spectra was constructed separately from the NN models for RDFs. Figure 4 compares the OFMs predicted by the NN model with the theoretical ones calculated from the crystal structures. The rows of the heatmaps reflect the electronic configuration of the central O-atom, while the columns reflect those of the neighboring atoms. The OFM is expressed by eq. (2). However, only the second term on the right-hand side is plotted, because the first term is determined by the outermost electronic configuration of the absorption O-atom and is common to all systems. Considering the outermost electron configuration of the O atom, the elements of the OFM outside the rows corresponding to the s² and p⁴ configurations are, in principle, zero. Therefore, only the rows near the corresponding electronic configurations are shown in Fig. 4. Figure 4 shows that for systems with R² values of 0.94 or higher, including Ta₂O₅, Mn(FeO₂)₂, KAlSiO₄, Li(CoO₂)₂, and Zn(MoO₂)₂, the heatmaps of the theoretical OFMs and the NN predictions are almost identical. Clear deviations between the theoretical and predicted heatmaps were found when the R² values were below 0.94, including CaAl₂O₄, Mo(H₂O)₂, and Mg(AgO₂)₂, where some elements that should be zero have non-zero values in the predicted OFM.

Fig. 4 Heatmaps of OFMs around the O site predicted by the NN model (lower panels), compared with the original values calculated from the crystal structures (upper panels).

Figure 5 shows the histogram of the distribution of R² values for all test data. As mentioned above, good agreement between predicted and theoretical OFMs was found for the systems with R² values of 0.94 or higher. These data accounted for 97.1% of the test data, meaning the OFMs can be predicted accurately from XANES spectra using the NN model. The OFM is a descriptor consisting of a vector representing the outermost electronic configuration and a weighted coordination number defined using the solid angles determined by the Voronoi polyhedral faces. The latter contains angular-dependent information on local atomic structures. The results suggest that three-dimensional local atomic structure information, including interatomic distances and anisotropy, can be directly extracted from XANES spectra using machine learning techniques.

Fig. 5 Histogram of the distribution of R² values for OFMs in the test data.

4. Conclusions

In this study, we constructed machine learning models that predict descriptors representing three-dimensional local atomic structure information from O-K XANES spectra. Separate NN models were constructed that take discretized XANES spectra as input and output the RDFs and OFMs around the absorption O-atoms. The NN model for RDFs reproduces the theoretical RDF near the central O-atom (within 3 Å) for 74% of the test data, and the NN model for OFMs reproduces the theoretical OFM for 97% of the test data. The results demonstrate that machine learning techniques can extract local atomic structure information, such as interatomic distances and anisotropy, directly from XANES spectra.

Extracting local atomic arrangements directly from XANES spectra would be extremely useful. For this purpose, a method to recover atomic arrangements from structural descriptors needs to be developed, which remains future work.

Acknowledgments

This work was supported by JSPS KAKENHI, grant numbers JP20H05192 and JP22H04512.

REFERENCES

1) J. Als-Nielsen and D. McMorrow: Elements of Modern X-ray Physics, 2nd edn., (Wiley, Hoboken, 2011).
2) F. de Groot and A. Kotani: Core Level Spectroscopy of Solids, (CRC Press, Boca Raton, 2008).
3) J.A. van Bokhoven and C. Lamberti: X-Ray Absorption and X-Ray Emission Spectroscopy: Theory and Applications, 1st edition, (Wiley, Chichester, West Sussex, 2016).
4) G. Bunker: Introduction to XAFS: A Practical Guide to X-ray Absorption Fine Structure Spectroscopy, 1st edn., (Cambridge University Press, Cambridge, 2010).
5) S. Calvin: XAFS for Everyone, 1st edition, (CRC Press, Boca Raton, Florence, 2018).
6) J. Evans: X-ray Absorption Spectroscopy for the Chemical and Materials Sciences, 1st edition, (Wiley, Hoboken, NJ, 2018).
7) Y. Iwasawa, K. Asakura and M. Tada (ed.): XAFS Techniques for Catalysts, Nanomaterials, and Surfaces, 1st ed. 2017 edition, (Springer, New York, NY, 2016).
8) F. Ráksi, K.R. Wilson, Z. Jiang, A. Ikhlef, C.Y. Côté and J. Kieffer: J. Chem. Phys. 104 (1996) 6066–6069. doi:10.1063/1.471305
9) P.M. Kraus, M. Zürch, S.K. Cushing, D.M. Neumark and S.R. Leone: Nat. Rev. Chem. 2 (2018) 82–94. doi:10.1038/s41570-018-0008-8
10) S.-C. Lin, C.-C. Chang, S.-Y. Chiu, H.-T. Pai, T.-Y. Liao, C.-S. Hsu, W.-H. Chiang, M.-K. Tsai and H.M. Chen: Nat. Commun. 11 (2020) 3525. doi:10.1038/s41467-020-17231-3
11) T. Tanabe, Y. Nagai, K. Dohmae, N. Takagi, N. Takahashi, S. Matsumoto and H. Shinjoh: Appl. Catal. B 105 (2011) 41–49. doi:10.1016/j.apcatb.2011.03.033
12) K. Kimoto, T. Asaka, T. Nagai, M. Saito, Y. Matsui and K. Ishizuka: Nature 450 (2007) 702–704. doi:10.1038/nature06352
13) D.A. Muller, L.F. Kourkoutis, M. Murfitt, J.H. Song, H.Y. Hwang, J. Silcox, N. Dellby and O.L. Krivanek: Science 319 (2008) 1073–1076. doi:10.1126/science.1148820
14) T. Mizoguchi, H. Ohta, H.-S. Lee, N. Takahashi and Y. Ikuhara: Adv. Funct. Mater. 21 (2011) 2258–2263. doi:10.1002/adfm.201100230
15) J.J. Rehr and R.C. Albers: Rev. Mod. Phys. 72 (2000) 621–654. doi:10.1103/RevModPhys.72.621
16) I. Tanaka, T. Mizoguchi and T. Yamamoto: J. Am. Ceram. Soc. 88 (2005) 2013–2029. doi:10.1111/j.1551-2916.2005.00547.x
17) H. Ikeno and T. Mizoguchi: Microscopy 66 (2017) 305–327. doi:10.1093/jmicro/dfx033
18) F.M.F. de Groot et al.: J. Electron Spectrosc. Relat. Phenom. 249 (2021) 147061. doi:10.1016/j.elspec.2021.147061
19) A. Seko, T. Maekawa, K. Tsuda and I. Tanaka: Phys. Rev. B 89 (2014) 054303. doi:10.1103/PhysRevB.89.054303
20) A. Seko, A. Togo, H. Hayashi, K. Tsuda, L. Chaput and I. Tanaka: Phys. Rev. Lett. 115 (2015) 205901. doi:10.1103/PhysRevLett.115.205901
21) S. Kiyohara, H. Oda, T. Miyata and T. Mizoguchi: Sci. Adv. 2 (2016) e1600746. doi:10.1126/sciadv.1600746
22) S. Curtarolo, G.L.W. Hart, N.M. Buongiorno, N. Mingo, S. Sanvito and O. Levy: Nat. Mater. 12 (2013) 191–201. doi:10.1038/nmat3568
23) K.T. Schütt, M. Gastegger, A. Tkatchenko, K.-R. Müller and R.J. Maurer: Nat. Commun. 10 (2019) 5024. doi:10.1038/s41467-019-12875-2
24) P.O. Dral: J. Phys. Chem. Lett. 11 (2020) 2336–2347. doi:10.1021/acs.jpclett.9b03664
25) R. Nagai, R. Akashi and O. Sugino: Npj Comput. Mater. 6 (2020) 43. doi:10.1038/s41524-020-0310-0
26) M. Tsubaki and T. Mizoguchi: Phys. Rev. Lett. 125 (2020) 206401. doi:10.1103/PhysRevLett.125.206401
27) C.D. Rankine, M.M.M. Madkhali and T.J. Penfold: J. Phys. Chem. A 124 (2020) 4263–4270. doi:10.1021/acs.jpca.0c03723
28) S. Kiyohara, T. Miyata, K. Tsuda and T. Mizoguchi: Sci. Rep. 8 (2018) 13548. doi:10.1038/s41598-018-30994-6
29) S.B. Torrisi, M.R. Carbone, B.A. Rohr, J.H. Montoya, Y. Ha, J. Yano, S.K. Suram and L. Hung: Npj Comput. Mater. 6 (2020) 109. doi:10.1038/s41524-020-00376-6
30) J. Timoshenko, D. Lu, Y. Lin and A.I. Frenkel: J. Phys. Chem. Lett. 8 (2017) 5091–5098. doi:10.1021/acs.jpclett.7b02364
31) J. Timoshenko, A. Anspoks, A. Cintins, A. Kuzmin, J. Purans and A.I. Frenkel: Phys. Rev. Lett. 120 (2018) 225502. doi:10.1103/PhysRevLett.120.225502
32) S. Kiyohara and T. Mizoguchi: J. Phys. Soc. Jpn. 89 (2020) 103001. doi:10.7566/JPSJ.89.103001
33) T.L. Pham, H. Kino, K. Terakura, T. Miyake, K. Tsuda, I. Takigawa and H.C. Dam: Sci. Technol. Adv. Mater. 18 (2017) 756–765. doi:10.1080/14686996.2017.1378060
34) A. Jain, S.P. Ong, G. Hautier, W. Chen, W.D. Richards, S. Dacek, S. Cholia, D. Gunter, D. Skinner, G. Ceder and K.A. Persson: APL Mater. 1 (2013) 011002. doi:10.1063/1.4812323
35) K. Mathew, C. Zheng, D. Winston, C. Chen, A. Dozier, J.J. Rehr, S.P. Ong and K.A. Persson: Sci. Data 5 (2018) 180151. doi:10.1038/sdata.2018.151
36) C. Zheng, K. Mathew, C. Chen, Y. Chen, H. Tang, A. Dozier, J.J. Kas, F.D. Vila, J.J. Rehr, L.F.J. Piper, K.A. Persson and S.P. Ong: Npj Comput. Mater. 4 (2018) 12. doi:10.1038/s41524-018-0067-x
37) M. O’Keeffe: Acta Crystallogr. Sec. A 35 (1979) 772–775. doi:10.1107/S0567739479001765
38) D.P. Kingma and J. Ba: Adam: A Method for Stochastic Optimization, arXiv:1412.6980 [cs].
39) G. Klambauer, T. Unterthiner, A. Mayr and S. Hochreiter: Self-Normalizing Neural Networks, arXiv:1706.02515 [cs, stat].
