ISIJ International
Online ISSN : 1347-5460
Print ISSN : 0915-1559
ISSN-L : 0915-1559
Regular Article
Resformer-Unet: A U-shaped Framework Combining ResNet and Transformer for Segmentation of Strip Steel Surface Defects
Kun LuWenyan WangXuejuan PanYuming ZhouZhaoquan ChenYuan ZhaoBing Wang
著者情報
ジャーナル オープンアクセス HTML

2024 年 64 巻 1 号 p. 67-75

詳細
抄録

Identifying surface defects is an essential task in the hot-rolled process. Currently, various computer vision-based classification and detection methods have achieved superior results in recognizing surface defects. However, defects typically exhibit irregular shapes caused by intra-class differences. Therefore, these two methods are unable to accurately identify the specific locations of the defects. To address this issue, this work proposes a U-shaped Encoder-Decoder framework called Resformer-Unet, which can effectively detect surface defects of hot-rolled strip at the pixel-level. In this framework, the Convolutional Neural Network (CNN) and Transformer work in parallel to extract multi-scale features from the image, which enhances the ability of network to capture both global and local information. Additionally, feature coupling modules are employed to fuse multi-scale features, thereby compensating for the information loss that occurs during down-sampling. On the SD-saliency-900 dataset for strip steel surface defect segmentation, Resformer-Unet achieves a mean Dice Similarity Coefficient (DSC) of 89.96% and an average Hausdorff Distance of 12.03%. These results outperform those of several advanced methods.

Fullsize Image
著者関連情報
© 2024 The Iron and Steel Institute of Japan.

This is an open access article under the terms of the Creative Commons Attribution-NonCommercial-NoDerivs license.
https://creativecommons.org/licenses/by-nc-nd/4.0/
前の記事 次の記事
feedback
Top