To mine data set for an optimal model taking into account both linear and non-linear parts of variation of response variable, a strategy of hybrid modeling for combining CART with linear regression analysis is proposed after the features are extracted from a POS data set. To valid the strategy, the POS data set is applied, and useful results are obtained as follows.
(1)Hybrid modeling can improve accuracy of the sole modeling of CART and/or regression model.
(2)It can handle any case with surrogate variable instead of any predictor, which has a missing value.
(3)Autocorrelation of residual time series and its residual mean squared error after hybrid can be improved by application of autoregressive model.
View full abstract