2026 年 28 巻 2 号 p. 165-174
This study quantitatively investigates the effects of a neural network-based target-speech extraction method on listening effort among deaf and hard-of-hearing (DHH) participants in Japanese multispeaker environments. A five-point listening-effort test was conducted with 22 participants (11 DHH, 11 normal-hearing), who rated both mixtures and extracted speech samples across three signal-to-noise ratios (SNRs; 0, 10, and 20 dB) and two speakers. To appropriately handle ordinal-scale ratings, we employed ordinal logistic mixed models and their hierarchical Bayesian extensions, jointly examining the interactions among SNR, speaker, and participant group. The models showed consistent trends: target-speech extraction benefits were most pronounced at low to mid SNRs, whereas group differences between DHH and normal-hearing participants were observed at high SNR (20 dB). These findings clarify when target-speech extraction most effectively reduces listening effort for DHH participants, providing engineering insights into speech-enhancement technologies for inclusive and accessible real-world speech communication.