Journal of the Remote Sensing Society of Japan
Online ISSN : 1883-1184
Print ISSN : 0289-7911
ISSN-L : 0289-7911
Improving the Representativeness of Training Data Using the EM Algorithm
Yoshikazu Iikura (飯倉 善和), Yoshifumi Yasuoka (安岡 善文)

1989, Vol. 9, No. 4, pp. 341-349

Abstract

For supervised classification of multispectral images, it is of primary importance to select an appropriate training data set for the categories to be classified. However, because the training data set is not selected by a statistical procedure such as random sampling, the distribution parameters estimated for each category are often biased.
This paper discusses the correction of biased estimates obtained from the training data set by means of the EM algorithm, an iterative procedure for obtaining maximum-likelihood estimates in incomplete-data problems. For this purpose, the correction of biased estimates is formulated mathematically as a mixture density problem, in which the training data and the non-training data are regarded as the complete data and the incomplete data, respectively. In the iterative procedure, the incomplete data are treated as pseudo-complete data weighted by their posterior probabilities (E step), which in turn are used to estimate the distribution parameters by the maximum-likelihood method (M step).
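The E step and M step described above can be illustrated with a minimal sketch. It assumes Gaussian class-conditional densities, which the abstract does not state explicitly, and all names (`semi_supervised_em`, `m_step`, `gaussian_pdf`) are hypothetical; it is a generic mixture-density EM with labeled and unlabeled samples, not a reproduction of the authors' implementation.

```python
import numpy as np

def gaussian_pdf(x, mean, cov):
    """Multivariate normal density evaluated at each row of x."""
    d = x.shape[1]
    diff = x - mean
    inv = np.linalg.inv(cov)
    expo = -0.5 * np.einsum("ij,jk,ik->i", diff, inv, diff)
    norm = np.sqrt((2.0 * np.pi) ** d * np.linalg.det(cov))
    return np.exp(expo) / norm

def m_step(x, resp):
    """Maximum-likelihood update from samples x and class memberships resp."""
    nk = resp.sum(axis=0)
    priors = nk / nk.sum()
    means = (resp.T @ x) / nk[:, None]
    covs = []
    for k in range(resp.shape[1]):
        diff = x - means[k]
        cov = (resp[:, k, None] * diff).T @ diff / nk[k]
        # Small ridge keeps the covariance invertible (an assumption of this sketch).
        covs.append(cov + 1e-6 * np.eye(x.shape[1]))
    return means, np.array(covs), priors

def semi_supervised_em(x_train, y_train, x_unlabeled, n_classes, n_iter=50):
    """EM for a Gaussian mixture with labeled (complete) and unlabeled (incomplete) data."""
    # Training pixels keep fixed one-hot memberships throughout.
    resp_train = np.eye(n_classes)[y_train]
    x_all = np.vstack([x_train, x_unlabeled])
    # Initial (possibly biased) estimates come from the training data alone.
    means, covs, priors = m_step(x_train, resp_train)
    for _ in range(n_iter):
        # E step: posterior class probabilities for the unlabeled pixels.
        dens = np.column_stack([
            priors[k] * gaussian_pdf(x_unlabeled, means[k], covs[k])
            for k in range(n_classes)
        ]) + 1e-300  # floor avoids division by zero for very distant pixels
        resp_unl = dens / dens.sum(axis=1, keepdims=True)
        # M step: re-estimate parameters from labeled and pseudo-labeled data.
        means, covs, priors = m_step(x_all, np.vstack([resp_train, resp_unl]))
    return means, covs, priors
```

Here `x_train` and `x_unlabeled` are arrays of shape (pixels, bands) and `y_train` holds integer class labels. Evaluating densities in the log domain would be more robust for many spectral bands; the direct form is kept for readability.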
Applying the EM algorithm to multispectral images is found to give rise to the following two problems: (1) the algorithm becomes markedly inefficient if all pixels in the image are used as the incomplete data, and (2) the results depend on the number of training samples selected.
The algorithm is modified by introducing a reliability index for the training data so that it gives stable estimates even when only a small number of pixels are used. The modified algorithm is shown to be applied successfully to the classification of Landsat TM data without losing the good properties of the original EM algorithm.
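The abstract does not define the reliability index, so the following is only one plausible reading, sketched under that assumption: a scalar weight (here the hypothetical parameter `reliability`) that scales how much each training pixel counts relative to a non-training pixel when a class mean is updated. The function name and signature are illustrative and are not taken from the paper.

```python
import numpy as np

def weighted_mean_update(x_train_k, x_unlabeled, post_k, reliability):
    """Mean update for one class with training pixels weighted by `reliability`.

    x_train_k   : (n_train, bands) training pixels of this class
    x_unlabeled : (n_unl, bands) non-training pixels
    post_k      : (n_unl,) posterior probability of this class for each pixel
    reliability : hypothetical scalar weight; 1.0 gives an unweighted
                  mixture-EM mean update
    """
    numerator = reliability * x_train_k.sum(axis=0) + post_k @ x_unlabeled
    denominator = reliability * len(x_train_k) + post_k.sum()
    return numerator / denominator
```

With `reliability` near zero the estimate is driven by the non-training pixels alone, and with a large value it approaches the training-data mean, so under this reading the weight decouples the result from the raw count of training pixels.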

© The Remote Sensing Society of Japan