IEICE Transactions on Information and Systems
Online ISSN : 1745-1361
Print ISSN : 0916-8532
Regular Section
APW: Asymmetric Padded Winograd to Reduce Thread Divergence for Computational Efficiency on SIMT Architecture
Wonho LEEJong Wook KWAK
Author information
JOURNAL FREE ACCESS

2025 Volume E108.D Issue 5 Pages 436-439

Details
Abstract

In this letter, we propose Asymmetric Padded Winograd called APW, designed to enhance the computational efficiency of Winograd-based convolution algorithms on SIMT architectures. This approach resolves thread divergence, which typically causes delays in execution due to uneven computational distribution across threads. By integrating asymmetric padding into both filters and inputs, APW unifies the size of sub-filters and sub-inputs. This uniformity maintains a consistent execution path for threads throughout Winograd-based convolution process, effectively minimizing thread divergence. Our experimental results demonstrate that APW substantially reduces thread divergence observed in previous work to nearly zero and cuts down the total execution time by up to 17.78%.

Content from these authors
© 2025 The Institute of Electronics, Information and Communication Engineers
Previous article Next article
feedback
Top