Journal of Natural Language Processing
Online ISSN : 2185-8314
Print ISSN : 1340-7619
ISSN-L : 1340-7619
Volume 16, Issue 3
Displaying 1-5 of 5 articles from this issue
Preface
Paper
  • Asuka Sumida, Naoki Yoshinaga, Kentaro Torisawa
    2009 Volume 16 Issue 3 Pages 3_3-3_24
    Published: 2009
    Released on J-STAGE: September 01, 2011
    JOURNAL FREE ACCESS
    This paper describes a method of extracting a large set of hyponymy relations with a high precision from hierarchical layouts in Wikipedia articles. Hyponymy relation has been studied as one of the principal knowledge for information retrieval and web directory, which helps users to access the growing web. Various methods have been proposed to automatically acquire hyponymy relations. In this article, we first extract hyponymy relation candidates from sections and itemizations in hierarchical layouts of Wikipedia articles, and then filter out irrelevant candidates by using a machine learning technique. In experiments, we successfully extracted more than 1.35 million relations from the hierarchical layouts in the Japanese version of Wikipedia, with a precision of 90%.
    Download PDF (766K)
  • Yugo Fujie, Hirokazu Watabe, Tsukasa Kawaoka
    2009 Volume 16 Issue 3 Pages 3_25-3_49
    Published: 2009
    Released on J-STAGE: September 01, 2011
    JOURNAL FREE ACCESS
    Recently the development of computers and networks makes amount of information huge. It is very difficult to find necessary information in the huge information. The existing retrieval system uses not the meaning of input words but the notation of them. Therefore, different words bring a defferent result of retreieval even if they have the same meaning. A user of the system has to consider the input words to search the necessary information. This paper proposes the quantification technique of the semantic distance between documents based on relevance of the word to realize the search that captured the meaning of the document. Concretely the related degree between words is calculated by concept-base and the resemblance degree between documents is calculated by Earth Mover’s Distance. Besides this paper proposes method that no existence word on concept-base is defined as a concept based on Web information to expand concept-base automatically. Retrieval experiments using the NTCIR3-WEB in comparison with the other method have shown that our method is effective than other method.
    Download PDF (1951K)
  • Tomoko Ohkuma, Hiroshi Umemoto, Yasuhide Miura, Hiroshi Masuichi
    2009 Volume 16 Issue 3 Pages 3_51-3_80
    Published: 2009
    Released on J-STAGE: September 01, 2011
    JOURNAL FREE ACCESS
    Japanese numeral classifier (NC) expresses quantificational expressions by following number nouns. The parser based on phrase structure grammar like Lexical-Functional Grammar (LFG) or Head-driven Phrase Structure Grammar (HPSG) has to define the relationships between noun and NC or number and NC by the grammatical rules. NC has various relationships syntactically because of its’ various characteristics. The treatment of NCs in LFG formalism should take account of their lexical and syntactical characteristics. This report proposed LFG rules for NCs and examined their validity and accuracy. The comparison between Japanese f-structure inducted by the rules proposed in this report and English f-structure and Chinese f-structure which correspond to the Japanese f-structures show that the rules make f-structures parallel between same phrases in Japanese and in other languages. The experiment of evaluations shows the rules improve F-score of NCs for currency and unit from 52.10% to 77.90% and F-score of NCs for quantity from 81.92% to 87.32% and F-score of NCs for Order from 59.62% to 81.26%.
    Download PDF (1285K)
  • Tomoya Noro, Hozumi Tanaka, Taiichi Hashimoto, Kiyoaki Shirai
    2009 Volume 16 Issue 3 Pages 3_81-3_101
    Published: 2009
    Released on J-STAGE: September 01, 2011
    JOURNAL FREE ACCESS
    Adjacent symbol connection constraints (ASCCs) are very useful for not only morphological analysis of non-segmenting language such as Japanese language, but also for continuous speech recognition of any language. By incorporating ASCCs into an LR parsing table, it is possible to reduce the size of the table, as well as reject any locally implausible parsing results. Although several algorithms have been proposed, they cannot remove all of the unnecessary actions because they consider only local context. This paper proposes a new algorithm and show some evaluation results. The proposed algorithm incorporates ASCCs by searching for global action chains from the initial state to the final state. According to the results, the proposed algorithm can remove about 1.2% more actions than a conventional algorithm, and the parsing time can be reduced by about 2.4%. Lastly, we show the completeness of our algorithm.
    Download PDF (665K)
feedback
Top