Proceedings of Annual Conference, Japan Society of Information and Knowledge
Online ISSN : 2432-9908
ISSN-L : 2432-9908
Proceedings of the 2nd Workshop on the Japan Society of Information and Knowledge
Session ID : A-4
Conference information

Creating a Vocabulary Index for Genji Monogatari Taisei
*H. UedaY. UedaY. ImanishiT. KabashimaT. NakagawaM. Murakami
Author information
CONFERENCE PROCEEDINGS FREE ACCESS

Details
Abstract

 Indexes of the words in Japanese literary classics are generally arranged in a-i-u-e-o order, under which headings the words are given with various prefixes, suffixes, inflections, and so forth. Thereafter the volume, page, and line numbers of each entry are given. Very occasionally, particles, auxiliary verbs, and other associated words are also listed.
 Such indexes are useful in defining where and in what context each word of a classic is used. However, if you want to know whether some particular word is always used in the same context, you must check every page on which it appeares and note with what other words it appears in conjunction. This can become a very cumbersome process for words frequently used.
 The current study aimed at creating a software program that could list up words (in boldface type) in a-i-u-e-o order, together with five to six preceding and the same number of following words that revealed the context. This was achieved using our previously completed full-text database of Genji Monogatari Taisei in which all words had been identified by parts of speech.

Content from these authors
© 1994 Japan Society of Information and Knowledge
Previous article Next article
feedback
Top