タスク指向対話システムにおける全モジュールの後処理の同時最適化

大橋 厚元; 東中 竜一郎

doi:10.11517/pjsai.JSAI2025.0_3R1OS4505

Abstract

Post-Processing Networks (PPNs) serve as components that modify outputs from modules in task-oriented dialogue systems, with the goal of enhancing the system's overall task completion capabilities. Traditional PPN approaches, however, have been restricted to handling only a subset of modules within a system, which has significantly constrained potential performance improvements. In this paper, we introduce a method for simultaneously optimizing the post-processing of the outputs of all modules using Universal Post-Processing Networks (UniPPNs). The UniPPN utilizes a single language model capable of processing outputs from any module as a sequence transformation task. We provide a detailed explanation of the UniPPN reinforcement learning algorithm and demonstrate, through both simulation experiments using the MultiWOZ dataset and human evaluation studies, that our approach achieves superior performance compared to conventional PPN approaches.

Content from these authors

Favorites & Alerts

Corresponding author

Conference information

Register with J-STAGE for free!