2020 年 37 巻 2 号 p. 2_120-2_126
This paper proposes a method to remove extraneous words in defect logs collected in software reviews to support for creating effective review checklists and scenarios used in future reviews. Extraneous words are commonly used and not specific words to capture defects to be detected in future reviews. The proposed method removes extraneous words in defect logs by excluding words in common vocabulary among defect logs collected in different types of software development projects because extraneous words are included in defect logs of any types of software development projects. This paper empirically evaluates the effectiveness of the proposed method with defect logs collected in industry. In the evaluation, words in common vocabulary among defect logs collected in reviews of a transportation system and a development support tool are excluded from those of public institutions. The results show that words contributing to creating checklists and scenarios, including “medical examination,” “fiscal year,” and “dependent in tax law,” are not excluded while extraneous words, which general stop word removal methods cannot exclude, such as “update,” “result,” and “process” are excluded.