Label-Adversarial Jointly Trained Acoustic Word Embedding

Zhaoqi LI; Ta LI; Qingwei ZHAO; Pengyuan ZHANG

doi:10.1587/transinf.2022EDL8012

Regular Section

Label-Adversarial Jointly Trained Acoustic Word Embedding

Zhaoqi LI, Ta LI, Qingwei ZHAO, Pengyuan ZHANG

Author information

Keywords: query-by-example, spoken term detection, acoustic word embeddings, gradient reversal layer

JOURNAL FREE ACCESS

2022 Volume E105.D Issue 8 Pages 1501-1505

DOI https://doi.org/10.1587/transinf.2022EDL8012

Details

Abstract

Query-by-example spoken term detection (QbE-STD) is a task of using speech queries to match utterances, and the acoustic word embedding (AWE) method of generating fixed-length representations for speech segments has shown high performance and efficiency in recent work. We propose an AWE training method using a label-adversarial network to reduce the interference information learned during AWE training. Experiments demonstrate that our method achieves significant improvements on multilingual and zero-resource test sets.

Corresponding author

Register with J-STAGE for free!