IEICE Transactions on Information and Systems
Online ISSN : 1745-1361
Print ISSN : 0916-8532
Regular Section
Real-Time Video Matting Based on RVM and Mobile ViT
Chengyu WU, Jiangshan QIN, Xiangyang LI, Ao ZHAN, Zhengqiang WANG
Journal, free access

2024 Volume E107.D Issue 6 Pages 792-796

Abstract

Real-time matting is a challenging research topic in deep learning. Conventional CNN (Convolutional Neural Network) approaches tend to misjudge foreground and background semantics and to produce blurry matting edges, because a CNN's limited receptive field restricts its attention to global context. We propose a real-time matting approach called RMViT (Real-time Matting with Vision Transformer), which uses a Transformer structure, attention, and content-aware guidance to address these issues. Semantic accuracy improves substantially because the model establishes global context and long-range pixel information. Experiments show that our approach achieves a reduction of more than 30% in error metrics compared with existing real-time matting approaches.
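
The global-context argument in the abstract can be illustrated with a minimal sketch of generic single-head self-attention over a flattened feature map. This is not the RMViT architecture, which the abstract does not specify. The function name `self_attention`, the toy 8x8x4 feature map, and the random projection matrices are all assumptions made for illustration. The sketch only shows that, unlike a small convolution kernel, one attention layer lets the output at one pixel depend on a pixel at the opposite corner.

```python
import numpy as np

def self_attention(tokens, w_q, w_k, w_v):
    """Single-head scaled dot-product self-attention over (N, C) tokens."""
    q, k, v = tokens @ w_q, tokens @ w_k, tokens @ w_v
    scores = q @ k.T / np.sqrt(k.shape[-1])        # (N, N) pairwise similarities
    scores -= scores.max(axis=-1, keepdims=True)   # subtract row max for stability
    weights = np.exp(scores)
    weights /= weights.sum(axis=-1, keepdims=True) # each row is a softmax over all pixels
    return weights @ v, weights

# Toy feature map (hypothetical sizes): 8x8 spatial positions, 4 channels.
rng = np.random.default_rng(0)
h, w, c = 8, 8, 4
feature_map = rng.standard_normal((h, w, c))
tokens = feature_map.reshape(h * w, c)             # one token per pixel

# Random projections stand in for learned weights (assumption).
w_q, w_k, w_v = (rng.standard_normal((c, c)) for _ in range(3))
out, weights = self_attention(tokens, w_q, w_k, w_v)

# Perturb the bottom-right pixel and measure the change at the top-left output.
perturbed = tokens.copy()
perturbed[-1] += 1.0
out_perturbed, _ = self_attention(perturbed, w_q, w_k, w_v)
top_left_shift = float(np.abs(out_perturbed[0] - out[0]).max())
print("change at top-left output:", top_left_shift)
```

A 3x3 convolution applied once to the same map would leave the top-left output unchanged under this perturbation, since the bottom-right pixel lies outside its receptive field.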

© 2024 The Institute of Electronics, Information and Communication Engineers