Journal of the Japanese Society of Computational Statistics
Online ISSN : 1881-1337
Print ISSN : 0915-2350
ISSN-L : 0915-2350
Theory and Applications
GARROTE TREES AS TREE STRUCTURED REGRESSION ANALYSIS
Masatoshi Nakamura, Yoshimichi Ochi, Hiroki Motogaito, Masashi Goto
JOURNAL FREE ACCESS

2017 Volume 30 Issue 1 Pages 65-80

Abstract

In regression analysis, stochastic models are often constructed to describe relationships between outcomes and explanatory variables, and statistical interpretations of the underlying structure of the data are derived from these models. When a linear regression model fits the data well, the relationship is straightforward to interpret. However, there are cases where it is difficult to formulate a linear model that reflects the actual characteristics of the data in detail. In such cases, a tree-structured approach such as classification and regression trees (CART) is recommended; CART grows a tree and provides an interpretation of the data based on the model derived from that tree. Random Forest (RF) is an ensemble learning method built on such trees and can predict outcomes more precisely. However, RF cannot provide a tree-structured model for interpreting the data. We examine the nonnegative garrote (NNG), a shrinkage estimator, and propose Garrote Trees (GT) as an adjustment of RF based on NNG. In addition, GT can yield trees that are useful for interpreting the data. Two case studies, on diabetes and prostate cancer data, illustrate the predictive accuracy and descriptive features of GT. Finally, our simulation studies show that the proposed method achieves high predictive accuracy and has the potential to support interpretation of the data from new, meaningful standpoints.
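
The abstract does not spell out how GT combines NNG with RF, so the following is only a minimal sketch under stated assumptions, not the authors' algorithm. It assumes that each tree in a fitted forest contributes one column of predictions, and that NNG assigns each tree a nonnegative shrinkage weight by minimizing the residual sum of squares plus a penalty on the sum of the weights (the penalized form of the garrote constraint). Trees whose weight shrinks to zero drop out of the ensemble. The function name `nonnegative_garrote`, the matrix `Z` of per-tree predictions, and the penalty `lam` are hypothetical names introduced for illustration.

```python
def nonnegative_garrote(Z, y, lam, n_iter=200):
    """Penalized nonnegative garrote solved by coordinate descent (illustrative sketch).

    Z   : list of n rows; Z[i][k] is the prediction of base learner k
          (assumed here to be one tree) for observation i.
    y   : list of n observed outcomes.
    lam : nonnegative penalty; larger values shrink more weights to zero.

    Returns weights c >= 0 that approximately minimize
        0.5 * sum_i (y[i] - sum_k c[k] * Z[i][k])**2 + lam * sum_k c[k].
    """
    n, m = len(Z), len(Z[0])
    c = [0.0] * m
    resid = list(y)  # residual y - Z c, with c = 0 at the start
    col_sq = [sum(Z[i][k] ** 2 for i in range(n)) for k in range(m)]

    for _ in range(n_iter):
        for k in range(m):
            if col_sq[k] == 0.0:
                continue  # a column of zeros cannot explain anything
            # Inner product of column k with the partial residual,
            # i.e. the residual with learner k's contribution added back.
            rho = sum(Z[i][k] * resid[i] for i in range(n)) + col_sq[k] * c[k]
            # Soft-threshold, then truncate at zero (nonnegativity constraint).
            new_ck = max(0.0, (rho - lam) / col_sq[k])
            delta = new_ck - c[k]
            if delta != 0.0:
                for i in range(n):
                    resid[i] -= delta * Z[i][k]
                c[k] = new_ck
    return c


if __name__ == "__main__":
    # Toy data (made up for illustration): learner 0 reproduces y exactly,
    # learner 1 is uninformative.
    y = [1.0, 2.0, 3.0]
    Z = [[1.0, 1.0], [2.0, -1.0], [3.0, 0.0]]
    for lam in (0.0, 7.0, 20.0):
        print(lam, nonnegative_garrote(Z, y, lam))
```

In this sketch, increasing `lam` first shrinks the weight of the informative learner and eventually removes every learner, which is the shrink-and-select behavior that makes a small set of surviving trees available for interpretation. In practice `lam` would be chosen by cross-validation, which is omitted here.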
