化合物探索のためのiterative screeningにおける機械学習手法の比較

宮尾 知幸; 船津 公人

42th Symposium on Chemoinformatics, Tokyo

Conference information

Host: Division of Chemoinformatics, The Chemical Society of Japan

Name : Symposium on Chemoinformatics

Number : 42

Location : [in Japanese]

Date : October 28, 2019 - October 29, 2019

Oral Session (B)

Comparison of machine learning methods for iterative screening of compound database

*Tomoyuki Miyao, Kimito Funatsu

Author information

Keywords: Iterative screening, Bayesian Optimization, Compound search, Gaussian process regression

CONFERENCE PROCEEDINGS FREE ACCESS

Pages 1B02-

Details

Abstract

Iterative screening surveys a small set of (virtual) compounds, during which their property values are determined by experiments and used as feedback for updating quantitative structure-property (activity) relationship models. This cycle is repeated several times until identifying the compounds exhibiting desired property values or better property (activity) values. In the present work, we have conducted a series of virtual experiments to assess the characteristics of different iterative screening methods using compounds from ZINC and ChEMBL databases. Overall, batch-based Bayesian optimization with Gaussian process, which impose penalty on the acquisition function for compounds proximal to already sampled compounds in a batch, performed better in terms of the number of iterations to identify one of the goal compounds. Linear regression models without taking into account the domain of applicability to the regression model also worked consistently for the property for which a key factor was present in the set of molecular descriptors.

Corresponding author

Register with J-STAGE for free!