Dilated Semi-Dense Convolutional Network for Multichannel Speech Enhancement

Tomohiro UEYAMA; Koichi ICHIGE; Takahiro MURAKAMI

doi:10.1587/transfun.2025EAL2086

Abstract

This paper proposes a dilated semi-dense convolutional network for multichannel speech enhancement using machine learning. The baseline, SpatialNet, employs a Conformer for narrowband processing but uses a simple 3-layer CNN block, limiting local information extraction. To improve performance, we replace the Convolutional Neural Network (CNN) block with dilated convolution, dilated dense convolution, and the proposed dilated semi-dense convolution.

Content from these authors

Favorites & Alerts

Add to favorites
Additional info alert
Citation alert
Authentication alert

Corresponding author

Register with J-STAGE for free!