日本リモートセンシング学会誌
Online ISSN : 1883-1184
Print ISSN : 0289-7911
ISSN-L : 0289-7911

この記事には本公開記事があります。本公開記事を参照してください。
引用する場合も本公開記事を引用してください。

時系列ALOS-2 PALSAR-2データを用いたRandom Forestによる水稲作付分類における訓練データの数と品質の評価
中元 経史朗大吉 慶
著者情報
ジャーナル フリー 早期公開

論文ID: 2023.004

この記事には本公開記事があります。
詳細
抄録

Machine learning has recently come into widespread use for the highly accurate classification of cultivated land and land cover using satellite data. Accurate classification requires a sufficient amount and quality of training data, but the collection of data for training is very costly. Therefore, to evaluate the relationship between the required amount and quality of training data and classification accuracy, this study evaluated paddy rice discrimination in California, US, using ALOS-2 PALSAR-2 data with random forest in a case study.

The US Department of Agriculture (USDA) Cropland Data Layer (CDL) made considerable training data available on land cover distribution in 2021. The amount of training data was evaluated after the data volume increased from 100 to 100,000 samples. The quality of the training data was determined by randomly replacing a certain percentage of paddy/non-paddy labels in the training data with incorrect labels. This case study then evaluated the correlation between the amount of training data and the accuracy (ACC) of classification. We found that at least 1,000 training samples are necessary to achieve 0.95 ACC stably under the condition of this study. Next, the study evaluated the correlation between the quality of training data and classification accuracy and found that ACC can be maintained above 0.95 for an error ratio of up to 20 % if there are more than 1,000 samples.

著者関連情報
© 2023 社団法人 日本リモートセンシング学会
feedback
Top