看護ケアテキストデータ分類のためのGAを用いた特徴抽出

安藤 滋; 新居 学; 高橋 豊; 内布 敦子; 坂下 玲子

doi:10.14864/fss.24.0.92.0

Abstract

Currently, nursing-care freestyle texts (nursing-care data) are collected from many hospitals in Japan via the Internet and stored into the database. For improving nursing care quality, experts need to read all freestyle texts carefully. However, it is a hard task for each expert to evaluate the data because of huge number of nursing-care data in the database. In order to reduce workloads to evaluate nursingcare data, we have proposed a support vector machine (SVM) based classification system. In this paper, to improve the classification performance, we propose a genetic algorithm (GA) based feature selection method for generating numerical data from collected nursing-care texts. First, we extract nouns and verbs from nursing-care texts using the morphological analysis software "MeCab" and store the extracted terms into a "term list". Some combinations of terms in the term list are selected by GA with two objectives; (1) maximization of the number of correctly classified texts and (2) minimization of the number of selected terms. And then, we classify the nursing-care numerical data using the SVM. From computer simulation results, we show the effectiveness of our proposed method.

Content from these authors

Favorites & Alerts

Corresponding author

Register with J-STAGE for free!