Phrase-Based Statistical Model for Korean Morpheme Segmentation and POS Tagging

Seung-Hoon NA; Young-Kil KIM

doi:10.1587/transinf.2017EDP7085

Abstract

In this paper, we propose a novel phrase-based model for Korean morphological analysis by considering a phrase as the basic processing unit, which generalizes all the other existing processing units. The impetus for using phrases this way is largely motivated by the success of phrase-based statistical machine translation (SMT), which convincingly shows that the larger the processing unit, the better the performance. Experimental results using the SEJONG dataset show that the proposed phrase-based models outperform the morpheme-based models used as baselines. In particular, when combined with the conditional random field (CRF) model, our model leads to statistically significant improvements over the state-of-the-art CRF method.

