Acoustical Science and Technology
Online ISSN : 1347-5177
Print ISSN : 1346-3969
ISSN-L : 0369-4232
PAPERS
Enhancement of esophageal speech using formant synthesis
Kenji MatsuiNoriyo HaraNoriko KobayashiHajime Hirose
Author information
JOURNAL FREE ACCESS

2002 Volume 23 Issue 2 Pages 69-76

Details
Abstract

The feasibility of using the formant analysis-synthesis approach to replace the voicing sources of esophageal speech was explored. Using inverse-filtered signals extracted from normal speakers provided the voicing sources. Pitch extraction was tested with various pitch extraction methods, and then a computationally simple, band-limited auto-correlation method was chosen. To accomplish stable and practical speech enhancement, the input signal was divided into low- and high-frequency channels, then only the low-frequency channel was processed by the formant analysis-synthesis method. A special purpose DSP-hardware unit was designed to perform the proposed analysis-synthesis process in real-time. Subjective evaluation tests (rating scale method) have been made with seven well-trained esophageal speakers and three speech therapists. Results of the subjective test showed that the synthesized speech was significantly improved, especially in cases of “loudness”, “sonority”, “strained”, “stoma noise”, “choppy”, “stability”, “intelligibility”, “recognizability”, and “duration” features.

Content from these authors
© 2002 by The Acoustical Society of Japan
Previous article Next article
feedback
Top