Computer Software
Print ISSN : 0289-6540
An Effect of Data Size on Performance of Effort Estimation with Missing Data Techniques
Koichi TAMURAAkito MONDENKen-ichi MATSUMOTO
Author information
JOURNAL FREE ACCESS

2010 Volume 27 Issue 2 Pages 2_100-2_105

Details
Abstract
To deal with missing data in historical project data sets is an important issue for constructing effort estimation models. Past researches have showed that the similarity-based imputation showed high estimation performance. However, it is unclear if it is still effective for small data sets. In this paper, using multiple data sets with different project cases each extracted from ISBSG data set, we present an experimental evaluation among four methods: mean imputation, similarity-based imputation, row-column deletion and pairwise deletion. The result showed that the row-column deletion showed better performance than similarity-based imputation for data sets not exceeding 220 cases.
Content from these authors
© Japan Society for Software Science and Technology 2010
Previous article Next article
feedback
Top