An Efficient Image to Sound Mapping Method Using Speech Spectral Phase and Multi-Column Image

Arata KAWAMURA; Hiro IGARASHI; Youji IIGUNI

doi:10.1587/transfun.E100.A.893

抄録

Image-to-sound mapping is a technique that transforms an image to a sound signal, which is subsequently treated as a sound spectrogram. In general, the transformed sound differs from a human speech signal. Herein an efficient image-to-sound mapping method, which provides an understandable speech signal without any training, is proposed. To synthesize such a speech signal, the proposed method utilizes a multi-column image and a speech spectral phase that is obtained from a long-time observation of the speech. The original image can be retrieved from the sound spectrogram of the synthesized speech signal. The synthesized speech and the reconstructed image qualities are evaluated using objective tests.

著者関連情報

お気に入り & アラート

お気に入りに追加
追加情報アラート
被引用アラート
認証解除アラート

閲覧履歴

[title in Japanese]
ムロトムヨウラン(ラン科)を沖縄に記録する(新産地報告)
ストックホルム滞在記〔第1回〕
[title in Japanese]
Novel Mechanomyogram/electromyogram Hybrid Transducer Measurements Reflect Muscle Strength during Dynamic Exercise — Pedaling of Recumbent Bicycle —

責任著者(Corresponding author)

J-STAGEへの登録はこちら（無料）