人工知能学会論文誌
Online ISSN : 1346-8030
Print ISSN : 1346-0714
ISSN-L : 1346-0714
論文
二段階回帰木による損害保険の純保険料推定
西 久美子竹内 一郎
著者情報
ジャーナル フリー

2007 年 22 巻 2 号 p. 183-190

詳細
抄録

We study a regression tree algorithm tailored to casualty insurance pure premium estimation problems. Casualty insurance premium is mainly determined by the expected amount that the insurance companies have to pay for the contract. Therefore, casualy insurance companies have to estimate the expected insurance amount on the basis of insurance risk factors. This problem is formulated as a regression problem, i.e. estimation of conditional mean E[Y|x], where Y is insurance amounts and x is risk factors. In this paper, we aim to implement the regression problem in regression tree framework. The difficulty of the problem lies in the fact that the distribution of insurance amount P(Y|x) is highly skewed and exhibits a long-tail toward positive direction. Conventional least-square-error regression tree algorithm is notoriously unstable under such long-tailed error distribution. On the other hand, several types of robust regression trees, such as least-absolute-error regression tree, are neither appropriate in this situation because they yields significant bias to conditional mean E[Y|x]. In this paper, we propose a two-stage tree fitting algorithm. In the first stage, the algorithm constructs a quantile tree, a kind of robust regression tree, which is stable but biased to conditional mean E[Y|x]. In the second stage, the algorithm corrects the bias using least-square error regression tree. We discuss the theoretical background of the algorithm and empirically investigate the performances. We applied the proposed algorithm to a car insurance data set of 318,564 records provided from a north-american insurance company and obtained significantly better results than conventional regression tree algorithm.

著者関連情報
© 2007 JSAI (The Japanese Society for Artificial Intelligence)
前の記事 次の記事
feedback
Top