自然言語処理

巻頭言（査読無）

材料言語処理ワンダー

大熊智子

2023 年 30 巻 3 号 p. 881-882
発行日: 2023年
公開日: 2023/09/15

DOIhttps://doi.org/10.5715/jnlp.30.881

ジャーナルフリー

PDF形式でダウンロード (161K)

一般論文（査読有）

ホライゾン・スキャニングの自動化のための言語処理応用

西野涼子, 石垣達也, 鷲野壮平, 五十嵐広希, 村井昭彦, 鷲田祐一, 永井由佳里

2023 年 30 巻 3 号 p. 883-906
発行日: 2023年
公開日: 2023/09/15

DOIhttps://doi.org/10.5715/jnlp.30.883

ジャーナルフリー

抄録を表示する抄録を非表示にする

未来学分野におけるホライゾン・スキャニング (Sardar 2010) の自動化を目指し，文書探索およびコメント生成問題とデータセットを提案する．ホライゾン・スキャニングは未来に起こりうる社会変化を分析する枠組みで，1) 新聞記事等を大量に収集し，2) 重要な記事には主観的な意見や記事の要約をコメントとして記述する．収集された記事とコメントは社会変化についてのシナリオ記述に活用される．本研究ではデータセットの分析，探索およびコメント生成モデルの構築，評価を行うことで，問題の特徴を明らかにする．データセットの分析より，1) 探索すべき記事は特徴語および意味的な距離の観点から一般的な記事と異なること，2) コメントは談話構造の観点から類型に分類できることがわかった．また，学習した探索モデルは適合率 @100 において 70% の性能を示した．さらに，探索モデルの出力記事は専門家による採点，および実際のシナリオ記述作業の中で用いることで評価され，実利用に耐えうる品質であることを確認した．一方，コメント生成問題は主観的なテキスト部分の生成が特に挑戦的で，生成技術のさらなる高度化が必要であることがわかった．

抄録全体を表示

PDF形式でダウンロード (682K)
共同作業を行う対話における共通基盤構築過程の記録と分析

光田航, 東中竜一郎, 大賀悠平, 吉田仙

2023 年 30 巻 3 号 p. 907-934
発行日: 2023年
公開日: 2023/09/15

DOIhttps://doi.org/10.5715/jnlp.30.907

ジャーナルフリー

抄録を表示する抄録を非表示にする

共通基盤を構築可能な対話システムを実現するためには，対話において共通基盤が構築される過程を明らかにすることが重要である．しかしながら，既存研究では対話の結果得られる最終的な成果物のみを共通基盤と関連付けて分析しており，共通基盤が構築される過程は明らかにされていない．本研究では，共同作業を行う対話において，共通基盤構築の過程を明らかにすることを目指し，データ収集手法を提案する．具体的には，課題の中間結果をその時点における共通基盤に相当するものとして，自動的に記録するデータ収集手法を提案する．提案手法を用いて 984 対話を収集し，共通基盤構築の過程を定量的に調査した結果，共通基盤の構築過程にいくつかのパタンが見られた．また，共感や同意，ポジティブな表現を通じて自身の理解を伝える発話が出現している場合，共通基盤が構築されている傾向にあると考えられる．さらに，対話中の共通基盤構築の度合いを対話や課題の内容から推定する実験を行い，双方の内容が推定に有用なことを示した．

抄録全体を表示

PDF形式でダウンロード (2328K)
日本語症例テキストの複合語解析・推論システム Medc2l

石田真捺, 谷中瞳, 戸次大介

2023 年 30 巻 3 号 p. 935-958
発行日: 2023年
公開日: 2023/09/15

DOIhttps://doi.org/10.5715/jnlp.30.935

ジャーナルフリー

抄録を表示する抄録を非表示にする

医療分野には，電子カルテや退院サマリといった症例テキストが蓄積されており，これらを新たな知識の発見に繋げるために自然言語処理技術を応用する研究が試みられている．しかし，症例テキストには専門的な複合語が頻出するといった解析の難しさがあり，日本語の症例テキストを用いた深い意味解析や，日本語の症例テキスト間の含意関係認識についての研究は発展途上である．そこで本論文では，日本語の高度な意味解析・推論システム ccg2lambda に複合語解析モジュールを追加することで，症例テキストの意味解析と推論を扱える論理推論システム Medc2l を提案する．複合語解析モジュールは (i) 症例テキストに含まれる複合語の抽出，(ii) 複合語を構成する形態素間の意味的な関係を表す意味現象タグの付与，(iii) 意味現象タグに基づく複合語の構文解析，(iv) 意味現象タグに基づく複合語の意味解析から構成される．(ii) では (i) で抽出した複合語に対して意味現象タグのアノテーションを行い，系列変換モデルの学習によって構築した複合語意味現象タグ分類モデルを用いる．(iii) では予測された意味現象タグを元に複合語の構造を CFG 解析したのち CCG 部分木に変換し，(iv) では (iii) の CCG 部分木に基づいて高階論理の意味表示を導出する．日本語の症例テキストを用いて推論データセットを構築し，提案システムの評価を行った結果，深層学習による含意関係認識モデルと同等またはそれ以上の性能を示し，とくに non-entailment のケースを正しく予測する傾向が見られた．

抄録全体を表示

PDF形式でダウンロード (1248K)
A Comprehensive Empirical Study on Personalized Dialogue Generation

Itsugun Cho, Dongyang Wang, Ryota Takahashi, Hiroaki Saito

2023 年 30 巻 3 号 p. 959-990
発行日: 2023年
公開日: 2023/09/15

DOIhttps://doi.org/10.5715/jnlp.30.959

ジャーナルフリー

抄録を表示する抄録を非表示にする

Current studies on the generation of personalized dialogue primarily contribute to an agent presenting a consistent personality and driving a more informative response. However, we found that the responses generated from most previous models were self-centered, with little consideration for the user in the dialogue. Moreover, we consider human-like conversations to be essentially based on inferring information about the persona of the other party. Therefore, we propose a novel personalized dialogue generator that detects implicit user personas. Because it is difficult to collect a large amount of detailed personal facts for each user, we attempted to model the potential persona of a user and its representation from the dialogue history with no external knowledge. The perception and fader variables were conceived using conditional variational inference. The two latent variables simulate the process of people becoming aware of each other’s personas and producing a corresponding expression in conversations. Subsequently, posterior-discriminated regularization was performed to enhance the training procedure. Finally, a selector was designed to help our model provide long-sighted responses. Comprehensive experiments demonstrate that compared to the state-of-the-art methods, our approach is more concerned with the user’s persona and achieves a notable boost across both automatic metrics and human evaluations.

抄録全体を表示

PDF形式でダウンロード (3585K)
編集操作予測に基づく語彙制約付きデコーディングによるテキスト平易化の難易度制御

舌達也, 梶原智之, 荒瀬由紀

2023 年 30 巻 3 号 p. 991-1010
発行日: 2023年
公開日: 2023/09/15

DOIhttps://doi.org/10.5715/jnlp.30.991

ジャーナルフリー

抄録を表示する抄録を非表示にする

テキスト平易化の難易度制御は,目標難易度に応じて文を平易化することで，言語学習支援に貢献する技術である．このタスクに対する既存手法には，入力を大幅に言い換える学習が困難である問題と柔軟な文生成が難しい問題がある．提案手法では，平易な出力文に出現させる単語の制約と，出現させない単語の制約を作成し，それらによって難易度を制御しつつテキスト平易化を行う．制約は，文中の各単語に対する編集操作予測，難易度判定，難解な単語の平易な言い換えにより作成する．提案手法は，正・負の制約を用いることで言い換えを促進しつつ，系列変換モデルで柔軟に文を生成するため，既存手法の問題を解決できる．評価実験によって，提案手法が文法性を損なったり，文の意味を大幅に欠落させることなく目標とする難易度に応じたテキスト平易化を実現できることを確認した．

抄録全体を表示

PDF形式でダウンロード (679K)
Plug-and-Play Attribute-Aware Text Infilling via A New Attention Mechanism and Two-Level Positional Encoding

Dongyuan Li, Kotaro Funakoshi, Manabu Okumura

2023 年 30 巻 3 号 p. 1011-1041
発行日: 2023年
公開日: 2023/09/15

DOIhttps://doi.org/10.5715/jnlp.30.1011

ジャーナルフリー

抄録を表示する抄録を非表示にする

Text infilling aims to restore incomplete texts by filling in blanks and has attracted increasing attention recently because of its wide application in ancient text restoration, conversation generation, and text rewriting. However, attribute-aware text infilling is yet to be explored, and existing methods seldom focus on the infilling length of each blank or the number and location of the blanks. In this study, we propose a plug-and-play Attribute-aware Text Infilling method using a Pre-trained language model (A-TIP) that contains a text-infilling component and a plug-and-play discriminator. Specifically, we first designed a unified text-infilling component with modified attention mechanisms and intra- and inter-blank positional encoding to better perceive the number of blanks and the infilling length for each blank. We then propose a plug-and-play discriminator to guide generation and improve attribute relevance without decreasing text fluency. Finally, automatic and human evaluations on three open-source datasets indicate that, compared to all the baselines, A-TIP achieves state-of-the-art performance. An additional ablation study demonstrated the robustness of A-TIP.

抄録全体を表示

PDF形式でダウンロード (2363K)
調理動作後の物体の視覚的状態予測を目指した Visual Recipe Flow データセットの構築と評価

白井圭佑, 橋本敦史, 西村太一, 亀甲博貴, 栗田修平, 森信介

2023 年 30 巻 3 号 p. 1042-1060
発行日: 2023年
公開日: 2023/09/15

DOIhttps://doi.org/10.5715/jnlp.30.1042

ジャーナルフリー

抄録を表示する抄録を非表示にする

本稿では，調理レシピにおいて，調理動作後の物体の視覚的な状態の予測を目指し，Visual Recipe Flow (VRF) データセットを提案する．VRF データセットは (i) 物体の視覚的な状態遷移と (ii) レシピ全体のワークフローに対するアノテーションから成る．視覚的な状態遷移は動作前後の物体の観測を表す画像の組として，ワークフローはレシピフローグラフとして，それぞれ表現する．ここでは，データセットの構築方法，アノテーション手順について順に説明し，アノテータ間のアノテーション一致率を測ることでデータセットの品質を調査する．最後に，動作前後の画像と物体のテキスト情報を用いたマルチモーダルな情報検索の実験を行うことで，各アノテーション要素の重要性について調べる．

抄録全体を表示

PDF形式でダウンロード (3320K)
Revisiting The Offensive Text Detection Problem with a Chain-of-Reasoning Approach

Qiang Zhang, Jason Naradowsky, Yusuke Miyao

2023 年 30 巻 3 号 p. 1061-1087
発行日: 2023年
公開日: 2023/09/15

DOIhttps://doi.org/10.5715/jnlp.30.1061

ジャーナルフリー

抄録を表示する抄録を非表示にする

We introduce the task of implicit offensive-text detection (OTD) in dialogues, where a statement may have either an offensive or nonoffensive interpretation depending on the listener and context. We argue that reasoning is crucial for understanding this broader class of offensive utterances, and release SLIGHT, a test dataset to support research on this topic. Experiments using the data show that state-of-the-art methods for offense detection perform poorly when tasked with detecting implicitly offensive statements, achieving only ∼11% accuracy. In contrast to the existing OTD datasets, SLIGHT features human-annotated chains of reasoning that describe the mental process through which an offensive interpretation can be reached from an ambiguous statement. We explore the potential of a multihop reasoning approach, by utilizing the existing entailment models to evaluate the probabilities of these chains. Our results demonstrate that reasoning through chains can yield performances better than that of a baseline entailment setting without chains. Furthermore, the analysis of the chains provides insights into the human interpretation process and emphasizes the importance of incorporating additional commonsense knowledge.

抄録全体を表示

PDF形式でダウンロード (762K)

学会記事（査読無）

「文書レベル関係抽出における根拠認識の統合」の完成まで

馬尤咪

2023 年 30 巻 3 号 p. 1088-1093
発行日: 2023年
公開日: 2023/09/15

DOIhttps://doi.org/10.5715/jnlp.30.1088

ジャーナルフリー

PDF形式でダウンロード (497K)
「自己注意機構における注意の集中が相対位置に依存する仕組み」の研究経緯

山本悠士

2023 年 30 巻 3 号 p. 1094-1099
発行日: 2023年
公開日: 2023/09/15

DOIhttps://doi.org/10.5715/jnlp.30.1094

ジャーナルフリー

PDF形式でダウンロード (532K)
「大規模言語モデルに基づく複数の外部ツールを利用した推論フレームワーク」の研究

稲葉達郎

2023 年 30 巻 3 号 p. 1100-1104
発行日: 2023年
公開日: 2023/09/15

DOIhttps://doi.org/10.5715/jnlp.30.1100

ジャーナルフリー

PDF形式でダウンロード (307K)
「埋め込み表現の意味適応による知識ベース語義曖昧性解消」ができるまで

水木栄

2023 年 30 巻 3 号 p. 1105-1109
発行日: 2023年
公開日: 2023/09/15

DOIhttps://doi.org/10.5715/jnlp.30.1105

ジャーナルフリー

PDF形式でダウンロード (601K)
Evaluating the Robustness of Discrete Prompts

石橋陽一

2023 年 30 巻 3 号 p. 1110-1114
発行日: 2023年
公開日: 2023/09/15

DOIhttps://doi.org/10.5715/jnlp.30.1110

ジャーナルフリー

PDF形式でダウンロード (243K)
「通時コーパス」シンポジウム 2023

小木曽智信

2023 年 30 巻 3 号 p. 1116-1121
発行日: 2023年
公開日: 2023/09/15

DOIhttps://doi.org/10.5715/jnlp.30.1116

ジャーナルフリー

PDF形式でダウンロード (266K)

後付記事（査読無）

編集後記・原稿執筆案内・編集スケジュール・統計情報・学会案内

2023 年 30 巻 3 号 p. 1122-1127
発行日: 2023年
公開日: 2023/09/15

DOIhttps://doi.org/10.5715/jnlp.30.1122

ジャーナルフリー

PDF形式でダウンロード (333K)

J-STAGEへの登録はこちら（無料）