Text Recognition in Low Resolution Images Using Trainable Regularization

Supatta VIRIYAVISUTHISAKUL; Parinya SANGUANSAT; Teeradaj RACHARAK; Minh Le NGUYEN; Toshihiko YAMASAKI

doi:10.11517/pjsai.JSAI2023.0_3U5IS405

37th (2023)

Session ID: 3U5-IS-4-05

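Conference information

Organizer: The Japanese Society for Artificial Intelligence

Conference: The 37th Annual Conference of the Japanese Society for Artificial Intelligence, 2023

Edition: 37

Venue: Kumamoto-jo Hall and online

Dates: 2023/06/06 - 2023/06/09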

Keywords: Scene Text Image, Super-Resolution, Regularization

Proceedings / abstracts: free access

Abstract

In text recognition tasks, parts of the text in an image often suffer from low resolution, so the recognizer cannot predict the characters correctly. To address this problem, we present a two-stage, end-to-end text recognition approach for low-resolution images, in which image resolution enhancement is applied before recognition. Our focus in this paper is a modified loss function for the resolution enhancement stage. Super-resolution models are prone to overfitting: some characters are reconstructed as other characters with a similar shape. To avoid this overfitting, a regularization term is normally added to the loss function with a fixed weight ratio, which is hard to tune. In this paper, we turn this fixed weight ratio into a trainable parameter that is optimized during backpropagation. We test this approach with multiple recognizers and obtain improved results. It achieves a text recognition accuracy of 76.5% on the test set and the highest image quality assessment (IQA) scores.
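
The idea of training the regularization weight together with the model can be illustrated on a toy problem. The sketch below is not the authors' loss: the abstract does not say how the weight is parameterized or how it is kept from shrinking to zero. As an assumption, the weight is written as `lam = exp(-s)` and a penalty `+ s` is added to the total loss, in the style of uncertainty-based loss weighting. The scalar model `w`, the squared-error data term, the `w**2` regularizer, and the function name `train` are all hypothetical.

```python
import math

def train(xs, ys, steps=500, lr=0.05):
    """Jointly fit a scalar model w and a trainable regularization weight.

    Assumed total loss (illustrative only):
        total = mean((w*x - y)**2) + exp(-s) * w**2 + s
    """
    w = 0.0  # toy model parameter
    s = 0.0  # trainable log-scale; the regularization weight is exp(-s)
    n = len(xs)
    for _ in range(steps):
        lam = math.exp(-s)
        reg = w * w
        # Gradient of the data term with respect to w.
        grad_data_w = sum(2.0 * x * (w * x - y) for x, y in zip(xs, ys)) / n
        # Gradients of the total loss with respect to w and s.
        grad_w = grad_data_w + lam * 2.0 * w
        grad_s = 1.0 - lam * reg
        # Both parameters are updated in the same gradient step.
        w -= lr * grad_w
        s -= lr * grad_s
    return w, math.exp(-s)

w, lam = train([1.0, 2.0, 3.0], [2.0, 4.0, 6.0])
print(f"w = {w:.4f}, learned regularization weight = {lam:.4f}")
```

Without the `+ s` term, gradient descent would simply drive the weight toward zero, because any positive weight on a non-negative regularizer increases the loss. Some such constraint or penalty is needed for a trainable weight to settle at a useful value.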
