Information and Media Technologies
Online ISSN : 1881-0896
ISSN-L : 1881-0896
Computing
A Survey on Large Scale Corpora and Emotion Corpora
Michal Ptaszynski, Rafal Rzepka, Satoshi Oyama, Masahito Kurihara, Kenji Araki
Journal, free access

2014, Volume 9, Issue 4, pp. 429-445

Abstract

In this paper, we present a survey of natural language corpora, with particular focus on large-scale corpora and those applicable to sentiment analysis. Natural language corpora are crucial for training various Natural Language Processing applications, from part-of-speech taggers and dependency parsers to dialog systems and sentiment analysis software. We compare several natural language corpora created for different languages and analyze their distinctive features and the amount of additional annotation provided by their developers.

© 2014 Japan Society for Software Science and Technology