Article ID: 2025EAL2086
This paper proposes a dilated semi-dense convolutional network for multichannel speech enhancement using machine learning. The baseline, SpatialNet, employs a Conformer for narrowband processing but uses a simple 3-layer CNN block, limiting local information extraction. To improve performance, we replace the Convolutional Neural Network (CNN) block with dilated convolution, dilated dense convolution, and the proposed dilated semi-dense convolution.