Chem-Bio Informatics Journal
Online ISSN : 1347-0442
Print ISSN : 1347-6297
ISSN-L : 1347-0442
Original
Application of Rough Set Theory to High Throughput Screening Data for Rational Selection of Lead Compounds
Michio KoyamaKiyoshi HasegawaMasamoto ArakawaKimito Funatsu
Author information
JOURNAL FREE ACCESS

2008 Volume 8 Issue 3 Pages 85-95

Details
Abstract
In the field of drug discovery, high-throughput screening (HTS) is widely used to identify new lead compounds. A considerable number of hit compounds, however, will subsequently be found to have low activities when their inhibitory activities are measured more precisely. Such compounds are called false positives. For a more efficient selection of lead compounds, virtual screening methods with QSAR models have been investigated, but no definitive solutions have been found. In this study, we propose an effective method to identify lead compounds. The proposed method is based on rough set theory (RST), which is a mathematical tool for depicting the uncertainty and vagueness of knowledge. The essential parts of RST are the construction of reducts, which are minimal subsets of variables to distinguish samples, and the extraction of rules using their reducts. By applying RST to the QSAR study of monoamine oxidase (MAO) inhibitors, we extracted several rules for identifying lead compounds. First, 3D-structures of MAO inhibitors were generated uniformly by CORINA, and chemical descriptors were calculated by the Volsurf method. Finally, three unique rules were extracted by using RST. It is found that the each rule is chemically reasonable and compatible with previous studies. Furthermore, the predictive power of RST was also proved by comparison with partial least squares (PLS) and decision tree (DT). These results demonstrate the usefulness of our method.
Content from these authors
2008 Chem-Bio Informatics Society
Previous article Next article
feedback
Top