The Gunma-Kosen Review
Online ISSN : 2433-9776
Print ISSN : 0288-6936
ISSN-L : 0288-6936
Statistical Analyses on the Literary Works at the Use Situation of the vocabulary
Takuma InoueKazuko Tagai
Author information
RESEARCH REPORT / TECHNICAL REPORT OPEN ACCESS

2019 Volume 38 Pages 65-70

Details
Abstract
In this study, I aim to clarify the feature of the genre by analyzing statistically the use of vocabulary of the Japanese modern literatures. It seems that the juvenile and the fantasy are different genre, have different readers, and therefore have different style. I select as juveniles Gongitsune 'Gon, the little fox',Tebukuro wo Kaini' Buying Mittens' both by NIIMI Nankichi, Kumonoito 'The Spider's Thread' by AKUTAGAWA Ryunosuke, and Tyumonno-oi Ryoriten 'The Restaurant of Many Orders' by MIYAZAWA Kenji, and as fantasies Koya-hijiri 'The Saint of Mt. Koya', Yashaga-ike 'Demon Lake' both by IZUMI Kyoka, Ningen-isu 'The Human Chair' by EDOGAWA Ranpo, and Sangetsuki 'The Moon above the Mountains' by NAKAJIMA Atsushi. 300 pieces quoted from beginning of each target works in Aozora-bunnko library texts are analyzed by morphological analysis tool Web-chamame, and from the result got by the analysis above, the indicator which represent the usages of vocabulary is calculated and the basic tendency of the target works can be grasped.
I compose a new parameter which can sum up the information of data, perform the principal component analysis (PCA) to grasp the characteristic of the target works. Furthermore, I apply the cluster analysis using the new variable provided by PCA and by classifying target works in some unities (cluster),try to consider what characteristic of the target works can be seen, or what kind of difference can be studied by a genre from a quantitative point of view.
As a result of basic analysis, I can point out that the rate of Japanese words or that of Sino-Japanese written by hiragana characters show the statistical significant difference by a genre. By the cluster analysis, I recognized the similarity between the same genre, but also found that there are some works which have the different character from which genre is generally said.
Content from these authors
© 2019 Author
Previous article Next article
feedback
Top