ISIJ International
Online ISSN : 1347-5460
Print ISSN : 0915-1559
ISSN-L : 0915-1559

This article has now been updated. Please use the final version.

Resformer-Unet: A U-shaped Framework Combining ResNet and Transformer for Segmentation of Strip Steel Surface Defects
Kun LuWenyan WangXuejuan PanYuming ZhouZhaoquan ChenYuan ZhaoBing Wang
Author information
JOURNAL OPEN ACCESS Advance online publication

Article ID: ISIJINT-2023-222

Details
Abstract

Identifying surface defects is an essential task in the hot-rolled process. Currently, various computer vision-based classification and detection methods have achieved superior results in recognizing surface defects. However, defects typically exhibit irregular shapes caused by intra-class differences. Therefore, these two methods are unable to accurately identify the specific locations of the defects. To address this issue, this work proposes a U-shaped Encoder-Decoder framework called Resformer-Unet, which can effectively detect surface defects of hot-rolled strip at the pixel-level. In this framework, the Convolutional Neural Network (CNN) and Transformer work in parallel to extract multi-scale features from the image, which enhances the ability of network to capture both global and local information. Additionally, feature coupling modules are employed to fuse multi-scale features, thereby compensating for the information loss that occurs during down-sampling. On the SD-saliency-900 dataset for strip steel surface defect segmentation, Resformer-Unet achieves a mean Dice Similarity Coefficient (DSC) of 89.96% and an average Hausdorff Distance of 12.03%. These results outperform those of several advanced methods.

Content from these authors
© 2023 The Iron and Steel Institute of Japan

This is an open access article under the terms of the Creative Commons Attribution-NonCommercial-NoDerivs license
https://creativecommons.org/licenses/by-nc-nd/4.0/
feedback
Top