スペクトログラム画像を用いた室内音の識別手法とデータ拡張による精度向上の検討

佐野 将太; 橋爪 裕貴; 長谷川 啓介; 川喜田 佑介; 宮崎 剛; 田中 博

doi:10.11371/wiieej.22.03.0_93

Reports of the 303rd Technical Conference of the Institute of Image Electronics Engineers of Japan

Session ID : 22-03-17

DOI https://doi.org/10.11371/wiieej.22.03.0_93

Conference information

Host: The Institute of Image Electronics Engineers of Japan

Name : Reports of the 303rd Technical Conference of the Institute of Image Electronics Engineers of Japan

Number : 303

Location : [in Japanese]

Date : February 21, 2023 - February 22, 2023

A Method for Classifying Room Sounds using Spectrogram Images and Accuracy Improvement by Data Augmentation

*Shota SANO, Yuki HASHIZUME, Keisuke HASEGAWA, Yuusuke KAWAKITA, Tsuyoshi MIYAZAKI, Hiroshi TANAKA

Author information

Keywords: Sound Classification, Spectrogram, Transfer Learning, Indoor Sound Environment

CONFERENCE PROCEEDINGS RESTRICTED ACCESS

Details

Abstract

There are many different sound sources in a room, and classifying these sounds has many applications, such as monitoring a living situation. The authors investigate a method for estimating each sound in the environment by converting timeseries data of indoor sounds into spectrogram images and using them as input for transfer learning to build a discriminative model. In this process, the amount of sound data that can be prepared in advance is limited due to the effort required for recording and the variety of data types required. As a result, there may be cases where sufficient classification accuracy cannot be achieved due to insufficient data for training. Therefore, this study proposes and applies a data augmentation method to improve classification accuracy when the number of data is limited, with the aim of classifying single and mixed sounds exist in a room, and describes the results of clarifying its effectiveness.

Corresponding author

Register with J-STAGE for free!