Journal of Computer Chemistry, Japan
Online ISSN : 1347-3824
Print ISSN : 1347-1767
ISSN-L : 1347-1767
General Papers
Development of Neural Network for Incomplete Data Set, CQSAR: Compensation Quantitative Structure-Activity Relationships
Tomoo AOYAMAJunko KAMBEUmpei NAGASHIMAHidenori UMENO
Author information
JOURNAL FREE ACCESS

2007 Volume 6 Issue 5 Pages 263-274

Details
Abstract
Multi-layer neural networks are used for the multi regression analysis of many kinds of phenomena whose expressions are unknown. The application fields are medicine designs and environmental problems. We often find incomplete parts in descriptors, which make the precision of the analysis lower. Moreover, the incomplete parts make the linked parts of other descriptors invalid. We often cannot calculate the multi regression analysis, therefore, we wish to eliminate the erroneous effects. In the paper, we discuss some approaches to eliminate such effects, and derive a method based on neural networks, which compensates defect descriptors. We call the method the compensation quantitative structure-activity relationships method (CQSAR).
The first step of CQSAR is to interpolate defective parts of descriptors. There are various interpolation methods. The selection depends on the number of data and characters of the data set. We introduce 3 kinds of methods; these are, variable absolute-function fitting, parametric observed-vector method, and interpolation by using a 3-layer neural network and an arithmetic progression. The first 2 methods are for small data set. The neural network approach is for general purpose, which requires over 7 data. If we can get over 11 data, the approach also gives partial derivative coefficients also. We confirm the effects in numerical calculations.
The second step of CQSAR is to calculate the multi regression analysis based on the non-linear fitting functions of neural networks. We evaluate propagations of error caused by interpolations in the first step, and show that the error does not increase. In the evaluation, we give a new reconstruction learning, and show the effectiveness of the CQSAR method under a defect ratio of 50%.
Content from these authors
© 2007 Society of Computer Chemistry, Japan
Previous article Next article
feedback
Top