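Journal of Natural Language Processing (自然言語処理)

Preface (not peer-reviewed)

The Expansion and Transformation of the Annual Meeting

船越孝太郎

2025, Vol. 32, No. 2, pp. 402-403
Published: 2025
Released: 2025/06/15

DOI: https://doi.org/10.5715/jnlp.32.402

Journal, free access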

General Papers (peer-reviewed)

End-to-end Simultaneous Speech Translation with Style Tags using Human Simultaneous Interpretation Data

Yuka Ko, Ryo Fukuda, Yuta Nishikawa, Yasumasa Kano, Katsuhito Sudoh, S ...

2025, Vol. 32, No. 2, pp. 404-437
Published: 2025
Released: 2025/06/15

DOI: https://doi.org/10.5715/jnlp.32.404

Journal, free access

Simultaneous speech translation (SimulST) translates speech incrementally, requiring a monotonic input-output correspondence to reduce latency. This is particularly challenging for distant language pairs, such as English and Japanese, as most SimulST models are trained using offline speech translation (ST) data, where the entire speech input is observed during translation. In simultaneous interpretation (SI), a simultaneous interpreter translates source language speech into target language speech without waiting for the speaker to finish speaking. Therefore, the SimulST model can learn SI-style translations using SI data. However, owing to the limited availability of SI data, fine-tuning an offline ST model using SI data may result in overfitting. To address this problem, we propose an efficient training method for the speech-to-text SimulST model using a combination of small SI and relatively large offline ST data. We trained a single model with mixed data by incorporating style tags to instruct the model to generate either SI or offline-style outputs. This approach, called mixed fine-tuning with style tags, can be extended further using the multistage self-training approach. In this case, we use the trained model to generate pseudo-SI data. Our experimental results for several test sets demonstrated that our models trained using mixed fine-tuning and multistage self-training outperformed baselines across various latency ranges.

Enhancing Automated Essay Scoring with Grammatical Features using Multi-task Learning and Item Response Theory

Kosuke Doi, Katsuhito Sudoh, Satoshi Nakamura, Taro Watanabe

2025, Vol. 32, No. 2, pp. 438-479
Published: 2025
Released: 2025/06/15

DOI: https://doi.org/10.5715/jnlp.32.438

Journal, free access

In foreign language learning, writing tasks play a crucial role in developing and assessing learners’ language abilities, but manual scoring requires significant time and effort. Automated essay scoring (AES) is a way to mitigate this problem. Although human raters consider grammatical items and their difficulties as clues for judging learners’ proficiency levels while scoring essays, it is unclear whether the current state-of-the-art AES models, which use BERT-based essay representations, consider these factors. In this paper, we propose to incorporate grammatical features into BERT-based AES models in three ways: (1) using grammatical features as additional model inputs, (2) performing multi-task learning (MTL) with holistic and grammar scores while using grammatical features as model inputs, and (3) reconstructing grammatical features through MTL with holistic scores. For grammatical features, we model learners’ grammar usage using item response theory (IRT), which measures learners’ grammar abilities and characteristics of grammatical items, including their difficulties, based on essay data without teacher labels. The experimental results show that grammatical features improve the scoring performance, and that MTL with holistic and grammar scores brings further improvements. We also show that weighting grammatical items using IRT-estimated difficulties improves the scoring performance, and that IRT-estimated grammar abilities can be used as labels for MTL.

Likelihood-based Mitigation of Evaluation Bias in Large Language Models

大井聖也, 金子正弘, 小池隆斗, Mengsay Loem, 岡崎直観

2025, Vol. 32, No. 2, pp. 480-496
Published: 2025
Released: 2025/06/15

DOI: https://doi.org/10.5715/jnlp.32.480

Journal, free access

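Large language models (LLMs) are used as evaluators for language generation tasks. However, when a text is rewritten by changing its word order or structure without changing its meaning, the likelihood computed by an LLM can change substantially. LLM evaluators (LLM-as-a-Judge) may therefore exhibit a likelihood bias: they unfairly underrate low-likelihood texts and unfairly overrate high-likelihood texts. In this study, we show that likelihood bias degrades the performance of LLM evaluators and propose a few-shot method for mitigating the bias. In our experiments, we show that multiple LLMs may have likelihood bias on data-to-text and grammatical error correction tasks. We also succeeded in mitigating the bias by identifying strongly biased instances and using them as few-shot examples. Furthermore, we confirmed that mitigating likelihood bias improves the evaluation performance of LLM evaluators (rank correlation with human evaluation), demonstrating the effectiveness of the proposed method.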

Construction of Japanese Natural Questions and BoolQ

植松拓也, 王昊, 福田創, 河原大輔, 柴田知秀

2025, Vol. 32, No. 2, pp. 497-519
Published: 2025
Released: 2025/06/15

DOI: https://doi.org/10.5715/jnlp.32.497

Journal, free access

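To build high-performing and robust language processing models, training, evaluation, and analysis on diverse question answering (QA) datasets are important. However, English is the only language for which diverse QA datasets exist; other languages have only a small number of QA datasets. Targeting Japanese, which has only a few basic QA datasets, we construct a Japanese version of Natural Questions (NQ), which consists of questions that arise naturally from human information needs. We built Japanese Natural Questions (JNQ) using search engine query logs to collect natural questions and crowdsourcing to reduce annotation costs. We also built Japanese BoolQ (JBoolQ), a Japanese version of BoolQ, which is a derivative of NQ consisting of yes/no questions. In building both datasets, we redefined the dataset specifications of the original NQ or BoolQ to obtain better datasets. JNQ consists of 16,641 questions and JBoolQ of 6,467 questions. Furthermore, we defined three tasks from JNQ and one task from JBoolQ, and built and evaluated baseline models for each. We expect these datasets to promote research on QA models and language processing models for Japanese.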

An Approach Using a Synthetic Logical Reasoning Corpus to Teach Reasoning to Large Language Models

森下皓文, 森尾学, 山口篤季, 十河泰弘

2025, Vol. 32, No. 2, pp. 520-571
Published: 2025
Released: 2025/06/15

DOI: https://doi.org/10.5715/jnlp.32.520

Journal, free access

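Large language models (LLMs) have solved a variety of known tasks thanks to their rich knowledge. However, LLMs struggle to solve novel tasks by reasoning. To address this problem, we propose an approach that improves the reasoning ability of LLMs by training them on synthetic logical reasoning samples generated by rules. We start by discussing what kind of samples should be designed. Drawing on symbolic logic, earlier philosophical discussions, recent prior work, and findings from our own preliminary experiments, we establish design principles. Next, based on these principles, we automatically generate a large number of deep reasoning samples built from diverse inference rules and construct a synthetic logical reasoning corpus, Formal Logic Deduction Diverse (FLD×2). Finally, we confirm that additional training on FLD×2 can improve the reasoning ability of LLMs. As a result, for LLaMA-3.1 (8B/70B), we achieved accuracy gains of up to 30 points on logical reasoning, up to 7 points on mathematics, up to 10 points on coding, and 5 points on the BBH benchmark suite.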

Data-to-Text Generation for Esports Game Commentary of Multiplayer Strategy Game

Zihan Wang, Naoki Yoshinaga

2025, Vol. 32, No. 2, pp. 572-597
Published: 2025
Released: 2025/06/15

DOI: https://doi.org/10.5715/jnlp.32.572

Journal, free access

Esports, a form of sports competition played on video games, has become one of the most important sporting events. Despite the large accumulation of esports play logs, only a small portion is accompanied by text commentaries that help the audience retrieve and understand the plays. In this study, we introduce the task of generating commentaries from esports game data records. We begin by building large-scale esports data-to-text datasets that pair structured data records with textual commentaries from a popular esports game, League of Legends. We then explore several generation models to produce game commentaries from structured data records while also examining the impact of pre-trained language models. To assess the generated commentaries, we designed evaluation metrics that focus on the unique characteristics of esports data, such as strategic depth. The experimental results of the data-to-text generation using our dataset revealed the remaining challenges of this novel task.

Using Linguistic Formalism to Improve Real World Understanding for V&L Models: Case Study on Image Discrimination for Structurally Ambiguous Language Input

Lee Sangmyeong, Seitaro Shinagawa, Koichiro Yoshino, Satoshi Nakamura

2025, Vol. 32, No. 2, pp. 598-632
Published: 2025
Released: 2025/06/15

DOI: https://doi.org/10.5715/jnlp.32.598

Journal, free access

In the context of Real World Understanding (RWU) for vision and language (V&L) models, accurately aligning language with the corresponding visual scene is critical. Since current models typically assume language inputs to be plain text, RWU faces potential issues with structural ambiguity, where a single sentence can have multiple meanings due to different possible phrase structures. This paper proposes to use linguistic formalism as input, which enriches language information and addresses the issue of structural ambiguity. We focus on the Contrastive Language-Image Pre-training (CLIP) model, a prominent V&L model, and on the image discrimination tasks of RWU. Our experiments test various approaches to incorporating formalism into the CLIP model, depending on the type of formalism and its processing method. We aim to determine the effectiveness of formalism in discriminating ambiguous images and identify which formalism works best. Additionally, we employ a gradient-based method to gain insights into how formalism is interpreted within the model’s architecture.

Implicit Sense-labeled Connective Recognition

岡佑依, 柳本大輝, 平尾努, 西田京介

2025, Vol. 32, No. 2, pp. 633-659
Published: 2025
Released: 2025/06/15

DOI: https://doi.org/10.5715/jnlp.32.633

Journal, free access

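Implicit discourse relation recognition (IDRR) is the task of identifying the discourse relation between adjacent text spans. However, the discourse relation labels used in IDRR are coarse and cannot exhaustively represent every discourse. In this paper, we propose Implicit Sense-labeled Connective Recognition (ISCR), the task of identifying the combination of the connective between adjacent text spans and its discourse relation label. ISCR can be treated as a classification task, but the large number of classes and the imbalanced distribution of instances across classes make it difficult to solve with conventional classifiers. We therefore treat ISCR as a text generation task and use an encoder-decoder model to generate both the connective and its discourse relation label. Comparative experiments on PDTB-3.0 and PDTB-2.0 between a conventional classifier and two generation methods showed that the generation approach is effective.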

Collecting Referring Expressions for Location and Route Information Using Maps as Stimuli

大村舞, 川端良子, 小西光, 浅原正幸, 竹内誉羽

2025, Vol. 32, No. 2, pp. 660-678
Published: 2025
Released: 2025/06/15

DOI: https://doi.org/10.5715/jnlp.32.660

Journal, free access

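In this study, we used crowdsourcing to build a database of expressions that refer to location information and route information, and released it as open data. Using 20 maps as stimuli, for location information we had 40 people per map describe the location of a target point, collecting 800 referring expressions. For route information, we set two routes per map and had 40 people per route describe the route between two points, collecting 1,600 referring expressions. For both kinds of information, we judged whether each expression consisted solely of relative references based on landmarks on the map. We classified location referring expressions into four types (first-person viewpoint, within-space viewpoint, within-space movement, and bird's-eye viewpoint), and labeled route referring expressions for the presence or absence of information about the start point, waypoints, and end point. We also conducted a questionnaire survey on how easy each expression was to understand and collected the results as data.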

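Society Articles (not peer-reviewed)

Looking Back on Writing "Achievements and Challenges in Japanese Question Answering as Seen from an Analysis of Quiz Competition Results"

有山知希, 鈴木正敏

2025, Vol. 32, No. 2, pp. 679-683
Published: 2025
Released: 2025/06/15

DOI: https://doi.org/10.5715/jnlp.32.679

Journal, free access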

The Research Background of "Second Language Acquisition of Language Models"

大羽未悠

2025, Vol. 32, No. 2, pp. 684-690
Published: 2025
Released: 2025/06/15

DOI: https://doi.org/10.5715/jnlp.32.684

Journal, free access

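The Research Process behind "An Investigation of the Reasoning Abilities of Pre-trained Language Models on Unknown Knowledge"

坂井優介

2025, Vol. 32, No. 2, pp. 691-698
Published: 2025
Released: 2025/06/15

DOI: https://doi.org/10.5715/jnlp.32.691

Journal, free access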

NLP2025 Theme Session: "Language Processing for the Financial and Economic Domains"

中川慧

2025, Vol. 32, No. 2, pp. 699-703
Published: 2025
Released: 2025/06/15

DOI: https://doi.org/10.5715/jnlp.32.699

Journal, free access

Theme Session 2: Research on Dialogue Systems and Language Use toward the Coexistence of Humans and AI

宇佐美まゆみ, 高橋哲朗, 西川寛之, 東中竜一郎

2025, Vol. 32, No. 2, pp. 704-712
Published: 2025
Released: 2025/06/15

DOI: https://doi.org/10.5715/jnlp.32.704

Journal, free access

NLP2025 Theme Session: "Cognition, the Brain, and Natural Language Processing"

西田知史, 小林一郎, 大関洋平, 日髙昇平, 谷中瞳

2025, Vol. 32, No. 2, pp. 713-719
Published: 2025
Released: 2025/06/15

DOI: https://doi.org/10.5715/jnlp.32.713

Journal, free access

NLP2025 Theme Session: "AIWolf (人狼知能): A Conversation Game of Detecting Lies and Persuading, and LLMs"

狩野芳伸, 鳥海不二夫, 稲葉通将, 大澤博隆, 片上大輔, 大槻恭士, アランニャクラウス, 原田慧, 伊藤毅志

2025, Vol. 32, No. 2, pp. 720-726
Published: 2025
Released: 2025/06/15

DOI: https://doi.org/10.5715/jnlp.32.720

Journal, free access

NLP2025 Theme Session: "Emergence of Language and Communication"

上田亮

2025, Vol. 32, No. 2, pp. 727-732
Published: 2025
Released: 2025/06/15

DOI: https://doi.org/10.5715/jnlp.32.727

Journal, free access

The Present and Future of the Humanities and Language Processing

臼井久生, 大内啓樹, 塚越柚季, 宮川創

2025, Vol. 32, No. 2, pp. 733-737
Published: 2025
Released: 2025/06/15

DOI: https://doi.org/10.5715/jnlp.32.733

Journal, free access

NLP2025 Workshop: The Present and Future of Language Evaluation in the LLM Era

須藤克仁, 小町守, 梶原智之, 三田雅人

2025, Vol. 32, No. 2, pp. 738-745
Published: 2025
Released: 2025/06/15

DOI: https://doi.org/10.5715/jnlp.32.738

Journal, free access

NLP2025 Co-located Workshop: "Fine-tuning Techniques and Evaluation for Large Language Models"

大北剛, 勝又智, 鎌田啓輔, 清丸寛一, 児玉貴志, 鈴木潤, 中山功太, Namgi Han, 宮尾祐介

2025, Vol. 32, No. 2, pp. 746-750
Published: 2025
Released: 2025/06/15

DOI: https://doi.org/10.5715/jnlp.32.746

Journal, free access

Building Japanese Language Resources and Improving Their Usability: JLR2025 Workshop

浅原正幸, 伊藤敬彦, 大村舞, 河原大輔, 久保隆宏, 坂口慶祐, 柴田知秀, 松田寛, 宮尾祐介

2025, Vol. 32, No. 2, pp. 751-754
Published: 2025
Released: 2025/06/15

DOI: https://doi.org/10.5715/jnlp.32.751

Journal, free access

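Supporting Member Articles (not peer-reviewed)

Next-Generation Natural Language Processing Enabled by Dataflow Architecture

中野匡彦

2025, Vol. 32, No. 2, pp. 755-760
Published: 2025
Released: 2025/06/15

DOI: https://doi.org/10.5715/jnlp.32.755

Journal, free access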

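Back Matter (not peer-reviewed)

Editor's Postscript, Manuscript Preparation Guide, Editorial Schedule, Statistics, and Society Information

2025, Vol. 32, No. 2, pp. 761-764
Published: 2025
Released: 2025/06/15

DOI: https://doi.org/10.5715/jnlp.32.761

Journal, free access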
