Host: Division of Chemoinformatics, The Chemical Society of Japan
Name : Symposium on Chemoinformatics
Number : 42
Location : [in Japanese]
Date : October 28, 2019 - October 29, 2019
Pages 1B03-
Applications of machine learning methods to chemistry and materials science have been attracting much attention in recent studies. In these fields, only limited number of experimental data are usually available for supervised machine learning (SML). Herein, we performed a model study on the accuracy of SML in obtaining regression models for electron-transfer rate using a small data set of reference data. The model data was prepared by applying the Marcus theory on electron transfer. Three parameters that reflect the characteristics of the reaction substrate in the formula were generated using random numbers, and 1000 pieces of training and test data sets were created. Arbitrary numbers were chosen from the training set, 0-30% error was added, and the performance was compared by performing prediction using a support vector machine (SVR). As a result, when there was no error in the training data, at least 30 pieces of data were required for the R2 value of the test set to be 0.8 or more.