Journal of Natural Language Processing
Online ISSN : 2185-8314
Print ISSN : 1340-7619
ISSN-L : 1340-7619
Paper
Improvement in Domain Specific Word Segmentation by Symbol Grounding
Suzushi TomoriHirotaka KamekoTakashi NinomiyaShinsuke MoriYoshimasa Tsuruoka
Author information
JOURNAL FREE ACCESS

2017 Volume 24 Issue 3 Pages 447-461

Details
Abstract

We propose a novel framework for improving a word segmenter using information acquired from symbol grounding. The framework uses a dataset consisting of pairs of non-textual information and a commentary. We generate a pseudo-stochastically segmented corpus from the commentaries, and then build a neural network to predict relationships between non-textual information and the words. We generate a domain specific term dictionary by using the neural network for word segmenter. We applied our method to game records of Japanese chess with commentaries. The experimental results show that the accuracy of a word segmenter can be improved by incorporating the generated dictionary.

Content from these authors
© 2017 The Association for Natural Language Processing
Previous article Next article
feedback
Top