Abstract
This study explores the application of deep learning (DL) to evaluate welder qualification specimens. The aim is to establish an objective assessment method and reduce inspectors' workload. The study employed a convolutional neural network (CNN) based on the ResNet architecture for image classification, focusing on medium-thickness bend test specimens from the Japan Welding Engineering Society (JWES) examination.
Initial binary classification experiments distinguishing between “pass” and “fail” demonstrated that accuracy improved with training sample size, achieving over 90% on average. A confidence score, defined from CNN output values, enabled quantitative evaluation of classification reliability and the identification of specimens that could potentially be misclassified.
Subsequently, multi-class classification was conducted to approximate real testing standards. Due to the presence of data imbalance, the analysis was conducted on six representative defect categories instead of the original 13. The accuracy exceeded 70% with 40 images per class and further improved to 76.7% after additional training iterations. Furthermore, the augmentation of the dataset, even under imbalanced conditions, led to a substantial enhancement in performance, culminating in 97.1% accuracy when all available images were utilized. The proposed confidence score demonstrated efficacy in multi-class settings, enhancing reliability-informed decision-making.
These findings demonstrate that CNN-based approaches can accurately and objectively evaluate welder qualification test specimens. The proposed methodology could improve inspection efficiency, reduce examiner workload, and contribute to the development of standardized, automated evaluation systems for welding certification.