Proceedings of the Annual Conference of JSAI
Online ISSN : 2758-7347
36th (2022)
Session ID : 1O4-GS-7-01
Conference information

Object detection-based card detection for OCR applications
*Zhen ZHAOYoshiki HASIOKA
Author information
CONFERENCE PROCEEDINGS FREE ACCESS

Details
Abstract

Extracting information from images of cards such as driver’s licenses or credit cards is a computer vision task with widespread needs. In many cases, images taken with a smartphone are taken from an arbitrary position and angle specified by people. To recognize the card’s text with OCR, it is necessary first to localize the card within the image, transform it to a rectangle, and then rotate it to the correct orientation. Deep learning-based methods are able to perform these localization and rotation tasks with high accuracy. However, handling the two tasks with two separate models results in increased processing times. In this work, we propose a solution to this problem which uses a single object detection model to perform both the localization and rotation tasks, thereby allowing cards to be processed quickly without sacrificing accuracy.

Content from these authors
© 2022 The Japanese Society for Artificial Intelligence
Previous article Next article
feedback
Top