Encoding Detection and Bit Rate Classification of AMR-Coded Speech Based on Deep Neural Network

Seong-Hyeon SHIN; Woo-Jin JANG; Ho-Won YUN; Hochong PARK

doi:10.1587/transinf.2017EDL8155

Abstract

A method for encoding detection and bit rate classification of AMR-coded speech is proposed. For each texture frame, 184 features are extracted, consisting of short-term and long-term temporal statistics of speech parameters; these features effectively measure the amount of distortion introduced by AMR coding. A deep neural network then classifies the bit rate of the speech from the extracted features. The proposed features are confirmed to provide better performance than conventional spectral features designed for bit rate classification of coded audio.
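
To illustrate the general idea of temporal statistics computed over a texture frame, the following is a minimal sketch only. The abstract does not specify which speech parameters or which statistics make up the 184 features, so the function name `texture_frame_stats`, the input layout, and the choice of mean, standard deviation, and frame-to-frame differences are all assumptions made for illustration, not the authors' feature set.

```python
import numpy as np


def texture_frame_stats(params: np.ndarray) -> np.ndarray:
    """Summarize per-frame parameters over one texture frame.

    params: array of shape (n_frames, n_params), where each row holds
    hypothetical speech parameters for one short analysis frame.
    Returns a 1-D vector of length 4 * n_params.
    """
    if params.ndim != 2 or params.shape[0] < 2:
        raise ValueError("expected a 2-D array with at least two frames")

    # Statistics over the whole texture frame (assumed "long-term" view).
    mean = params.mean(axis=0)
    std = params.std(axis=0)

    # Statistics of frame-to-frame changes (assumed "short-term" view).
    deltas = np.diff(params, axis=0)
    delta_abs_mean = np.abs(deltas).mean(axis=0)
    delta_std = deltas.std(axis=0)

    return np.concatenate([mean, std, delta_abs_mean, delta_std])


# Example with random placeholder data: 50 frames, 4 parameters per frame.
rng = np.random.default_rng(0)
example = rng.normal(size=(50, 4))
features = texture_frame_stats(example)
print(features.shape)  # (16,)
```

A vector of this kind, computed per texture frame, could then be passed to any classifier; the network architecture used in the paper is not described in this abstract.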
