IEICE Transactions on Information and Systems
Online ISSN : 1745-1361
Print ISSN : 0916-8532
Special Section on Enriched Multimedia — Media technologies opening up the future —
CQTXNet: A Modified Xception Network with Attention Modules for Cover Song Identification
Jinsoo SEOJunghyun KIMHyemi KIM
Author information
JOURNAL FREE ACCESS

2024 Volume E107.D Issue 1 Pages 49-52

Details
Abstract

Song-level feature summarization is fundamental for the browsing, retrieval, and indexing of digital music archives. This study proposes a deep neural network model, CQTXNet, for extracting song-level feature summary for cover song identification. CQTXNet incorporates depth-wise separable convolution, residual network connections, and attention models to extend previous approaches. An experimental evaluation of the proposed CQTXNet was performed on two publicly available cover song datasets by varying the number of network layers and the type of attention modules.

Content from these authors
© 2024 The Institute of Electronics, Information and Communication Engineers
Previous article Next article
feedback
Top