Journal of Advanced Computational Intelligence and Intelligent Informatics
Online ISSN : 1883-8014
Print ISSN : 1343-0130
ISSN-L : 1883-8014
Regular Papers
Ancient Chinese Sentence Segmentation Based on Bidirectional LSTM+CRF Model
Hongbin WangHaibing WeiJianyi GuoLiang Cheng
著者情報
ジャーナル オープンアクセス

2019 年 23 巻 4 号 p. 719-725

詳細
抄録

This study proposes a novel method for the segmentation of Archaic Chinese sentences based on a bidirectional long short-term memory (LSTM) + conditional random field (CRF) model. The method added a layer of linear statistical model to the traditional bidirectional LSTM neural network; it can be used for sequence annotation from the sentence level. In addition, this model introduced the stochastic gradient descent (SGD) to prevent excessive fitting, and the viterbi algorithm was used to calculate the optimal sequence of the sentences. In the experiment, this study tests the performance of the proposed method using the History of the Han Dynasty, the History of the later Han Dynasty, Three Kingdoms, and the Book of Jin, amongst others. The results show that the precision value, recall value, and F1 value are 0.77, 0.75, and 0.76, respectively, in the open test, and 0.90, 0.88, and 0.76, respectively, in the closed test.

著者関連情報

この記事は最新の被引用情報を取得できません。

© 2019 Fuji Technology Press Ltd.

This article is licensed under a Creative Commons [Attribution-NoDerivatives 4.0 International] license (https://creativecommons.org/licenses/by-nd/4.0/).
The journal is fully Open Access under Creative Commons licenses and all articles are free to access at JACIII Official Site.
https://www.fujipress.jp/jaciii/jc-about/
前の記事 次の記事
feedback
Top