2022 年 142 巻 10 号 p. 1144-1155
Tackling diabetes, an increasingly common lifestyle-related illness, requires information on patients’ lifestyle and habits. Blogs maintained by patients afflicted with the incurable illness may be useful for analyzing how lifestyle affects health-status. In this paper, we propose an intrinsic expression extractor that extracts keywords related to lifestyle and health-status from blogs of patients diagnosed with type-2 diabetes. To counter class imbalance and add to the corpus, the proposed method tags each keyword with information based on cue-words extracted from manually tagged data. The named-entity recognition (NER) for the extracted keywords uses a bidirectional gated recurrent unit neural network (BiGRU) and was evaluated for accuracy by cross-validation. We obtained F1-score of approximately 0.76. Although the accuracy of extraction can further be improved, the novel approach has applications in analyzing and improving the lifestyle of diabetes-afflicted patients.
J-STAGEがリニューアルされました! https://www.jstage.jst.go.jp/browse/-char/ja/