Journal of Natural Language Processing
Online ISSN : 2185-8314
Print ISSN : 1340-7619
ISSN-L : 1340-7619
Building a Paraphrase Corpus Based on Class-oriented Candidate Generation
ATSUSHI FUJITA, KENTARO INUI

2006 Volume 13 Issue 3 Pages 133-150

Abstract

Several classes of paraphrases can potentially be explained compositionally by referring to the syntactic and semantic properties of their constituent words: e.g., composing/decomposing compounds, voice/case alternation, various verb alternations, and lexical derivation. Toward analyzing the compositionality underlying these paraphrase classes, we have examined a class-oriented framework for collecting paraphrase examples, in which sentential paraphrases are collected for each paraphrase class separately by means of automatic candidate generation based on morpho-syntactic paraphrasing patterns, followed by manual judgement. Our preliminary experiments on building two paraphrase sub-corpora have so far produced promising results with regard to cost-efficiency, exhaustiveness, and reliability.

© The Association for Natural Language Processing