自然言語処理

巻頭言

第15回年次大会を終えて

村上仁一

2009 年 16 巻 5 号 p. 5_1-5_3
発行日: 2009年
公開日: 2011/07/28

DOIhttps://doi.org/10.5715/jnlp.16.5_1

ジャーナルフリー

PDF形式でダウンロード (170K)

追悼

追悼　田中穂積先生

長尾真

2009 年 16 巻 5 号 p. 5_5-5_6
発行日: 2009年
公開日: 2011/07/28

DOIhttps://doi.org/10.5715/jnlp.16.5_5

ジャーナルフリー

PDF形式でダウンロード (129K)

論文

擬似確率的単語分割コーパスによる言語モデルの改良

森信介, 小田裕樹

2009 年 16 巻 5 号 p. 5_7-5_21
発行日: 2009年
公開日: 2011/07/28

DOIhttps://doi.org/10.5715/jnlp.16.5_7

ジャーナルフリー

抄録を表示する抄録を非表示にする

言語モデルの分野適応において，適応対象の分野の単語境界情報のない生コーパスの有効な利用方法として，確率的単語分割コーパスとしての利用が提案されている．この枠組では，生コーパス中の各文字間に単語境界が存在する確率を付与し，それを用いて単語 n-gram 確率などが計算される．本論文では，この単語境界確率を最大エントロピー法に基づくモデルによって推定することを提案する．さらに，確率的単語分割コーパスを従来の決定的に単語に分割されたコーパスで模擬する方法を提案し，言語モデルの能力を下げることなく計算コストが大幅に削減できることを示す．

抄録全体を表示

PDF形式でダウンロード (372K)
半教師有りクラスタリングを用いた Web 検索結果における人名の曖昧性解消

杉山一成, 奥村学

2009 年 16 巻 5 号 p. 5_23-5_49
発行日: 2009年
公開日: 2011/07/28

DOIhttps://doi.org/10.5715/jnlp.16.5_23

ジャーナルフリー

抄録を表示する抄録を非表示にする

人名は検索語として，しばしば検索エンジンに入力される．しかし，この入力された人名に対して，検索エンジンは，いくつかの同姓同名人物についての Web ページを含む長い検索結果のリストを返すだけである．この問題を解決するために，Web 検索結果における人名の曖昧性解消を目的とした従来研究の多くは，凝集型クラスタリングを適用している．一方，本研究では，ある種文書に類似した文書をマージする半教師有りクラスタリングを用いる．我々の提案する半教師有りクラスタリングは，種文書を含むクラスタの重心の変動を抑えるという点において，新規性がある．

抄録全体を表示

PDF形式でダウンロード (779K)
On Contribution of Sense Dependencies to Word Sense Disambiguation

Jun Hatori, Yusuke Miyao, Jun’ichi Tsujii

2009 年 16 巻 5 号 p. 5_51-5_77
発行日: 2009年
公開日: 2011/07/28

DOIhttps://doi.org/10.5715/jnlp.16.5_51

ジャーナルフリー

抄録を表示する抄録を非表示にする

Traditionally, many researchers have addressed word sense disambiguation (WSD) as an independent classification problem for each word in a sentence. However, the problem with their approaches is that they disregard the interdependencies of word senses. Additionally, since they construct an individual sense classifier for each word, their method is limited in its applicability to the word senses for which training instances are served. In this paper, we propose a supervised WSD model based on the syntactic dependencies of word senses. In particular, we assume that strong dependencies between the sense of a syntactic head and those of its dependents exist. We describe these dependencies on the tree-structured conditional random fields (T-CRFs), and obtain the most appropriate assignment of senses optimized over the sentence. Furthermore, we incorporate these sense dependencies in combination with various coarse-grained sense tag sets, which are expected to relieve the data sparseness problem, and enable our model to work even for words that do not appear in the training data. In experiments, we display the appropriateness of considering the syntactic dependencies of senses, as well as the improvements by the use of coarse-grained tag sets. The performance of our model is shown to be comparable to those of state-of-the-art WSD systems. We also present an in-depth analysis of the effectiveness of the sense dependency features by showing intuitive examples.

抄録全体を表示

PDF形式でダウンロード (261K)
共起パターンの学習による事態間関係知識の獲得

阿部修也, 乾健太郎, 松本裕治

2009 年 16 巻 5 号 p. 5_79-5_100
発行日: 2009年
公開日: 2011/07/28

DOIhttps://doi.org/10.5715/jnlp.16.5_79

ジャーナルフリー

抄録を表示する抄録を非表示にする

大規模コーパスから事態表現間の意味的関係の知識の獲得を目的として，実体間関係獲得手法として提案された Espresso を事態間関係に適用できるように拡張した．この拡張は主に 2 つの点からなり，(1) 知識獲得のために事態表現を定義し，(2) 事態間関係に適合するように共起パターンのテンプレートを拡張した．日本語 Web コーパスを用いて実験したところ，(a) 事態間関係獲得に有用な共起パターンが多数存在し，パターンの学習が有効であることがわかった．また行為―効果関係については 5 億文 Web コーパスから少なくとも 5,000 種類の事態対を約 66% の精度で獲得することができた．

抄録全体を表示

PDF形式でダウンロード (694K)

J-STAGEへの登録はこちら（無料）