Proceedings of the Symposium on Chemoinformatics
42th Symposium on Chemoinformatics, Tokyo
Conference information

Oral Session (B)
Comparison of machine learning methods for iterative screening of compound database
*Tomoyuki MiyaoKimito Funatsu
Author information
CONFERENCE PROCEEDINGS FREE ACCESS

Pages 1B02-

Details
Abstract

Iterative screening surveys a small set of (virtual) compounds, during which their property values are determined by experiments and used as feedback for updating quantitative structure-property (activity) relationship models. This cycle is repeated several times until identifying the compounds exhibiting desired property values or better property (activity) values. In the present work, we have conducted a series of virtual experiments to assess the characteristics of different iterative screening methods using compounds from ZINC and ChEMBL databases. Overall, batch-based Bayesian optimization with Gaussian process, which impose penalty on the acquisition function for compounds proximal to already sampled compounds in a batch, performed better in terms of the number of iterations to identify one of the goal compounds. Linear regression models without taking into account the domain of applicability to the regression model also worked consistently for the property for which a key factor was present in the set of molecular descriptors.

Content from these authors
Previous article Next article
feedback
Top