Real-Time Video Matting Based on RVM and Mobile ViT

Chengyu WU; Jiangshan QIN; Xiangyang LI; Ao ZHAN; Zhengqiang WANG

doi:10.1587/transinf.2023EDL8071

抄録

Real-time matting is a challenging research in deep learning. Conventional CNN (Convolutional Neural Networks) approaches are easy to misjudge the foreground and background semantic and have blurry matting edges, which result from CNN's limited concentration on global context due to receptive field. We propose a real-time matting approach called RMViT (Real-time matting with Vision Transformer) with Transformer structure, attention and content-aware guidance to solve issues above. The semantic accuracy improves a lot due to the establishment of global context and long-range pixel information. The experiments show our approach exceeds a 30% reduction in error metrics compared with existing real-time matting approaches.

著者関連情報

お気に入り & アラート

お気に入りに追加
追加情報アラート
被引用アラート
認証解除アラート

閲覧履歴

科学技術に係る行政改革の変遷について(<特集>諸外国の科学技術政策、評価/行革問題)
部材の軽量化による輸送機器の省エネ化
仙台湾沿岸における運河群の津波減災効果
Oxidative Stress and Cardiovascular Dysfunction Associated with Cadmium Exposure: Beneficial Effects of Curcumin and Tetrahydrocurcumin
第20回日本総合健診医学会学術大会講演抄録集健診受診者のアポ蛋白A-1, B測定の意義

発行機関からのお知らせ

PPV is available from https://globals.ieice.org/en_transactions/information

責任著者(Corresponding author)

J-STAGEへの登録はこちら（無料）