Transactions of the Japanese Society for Artificial Intelligence
Online ISSN : 1346-8030
Print ISSN : 1346-0714
ISSN-L : 1346-0714
Original Paper
Characterization and Similarity Analysis of Japanese Writers’ Syntactic Structures by Kernel Method
Eriko KanagawaTakeshi Okadome
Author information
JOURNAL FREE ACCESS

2017 Volume 32 Issue 3 Pages F-G94_1-14

Details
Abstract

The subtree kernel and the information tree kernel defined here permit us to measure the syntactic characteristics and similarity of sentences. The subtree kernel is the total number of the common subtrees in two trees and the information tree kernel is defined as the total Shannon information contents contained in the common subtrees. The information tree kernel enables us to capture such structural characteristics peculiar to the styles of writers. The analyses using by these kernels reveal some syntactic characteristics and similarities of the Japanese 31 authors’ writing styles. In particular, the results of the analyses for the great five authors, Soseki Natume, Ryunosuke Akutagawa, Osamu Dazai, Nankiti Niimi, and Kenzi Miyazawa, show that, for example, (1) Natume more often writes a sentence of the dependency structure in which the same subtree structure occurs multiple times in the sentence. (2) Akutagawa more often uses the dependency structures for extra or detailed expressions that modifies a noun phrase than the others do. (3) Dazai often uses the dependency structures that consist of many shallow subtrees arranged in parallel, but the others seldom write sentences of the parallel subtree structures. (4) Niimi uses simpler dependency structures than Miyazawa does and Miyazawa writes short sentences in more various dependency structures.

Content from these authors
© The Japanese Society for Artificial Intelligence 2017
Previous article
feedback
Top