Proceedings of the Annual Conference of JSAI
Online ISSN : 2758-7347
33rd (2019)
Session ID : 1N4-J-9-02
Conference information

Adding Multiple Subword Sequences to BiLSTM-CRF Model for Compound Name Extraction
*Hiroto SEKINEGo URASAWATakashi INUITomoya IWAKURA
Author information
CONFERENCE PROCEEDINGS FREE ACCESS

Details
Abstract

In this paper, we propose a BiLSTM-CRF model for extracting compound names from documents in chemical domain. The proposed model can be taken multiple subword sequences as input in order to obtain sufficient features for long span or unknown tokens. Subword LSTM units with contextual information are introduced in the input layer of the model. We conducted experiments based on CHEMDNER challenge to investigate the effectiveness of the model. As a result, the extraction accuracy outperformed the normal BiLSTM-CRF model, and experimental results on unknown words showed that the proposed method works better.

Content from these authors
© 2019 The Japanese Society for Artificial Intelligence
Previous article Next article
feedback
Top