人工知能学会第二種研究会資料
Online ISSN : 2436-5556
Vision Transformer によるCT画像からの腎腫瘍検出
田中 亨鈴木 淳晟亀谷 由隆山田 啓一堀田 一弘高橋 友一佐々 直人松川 宜久岩野 信吾山本 徳則
著者情報
研究報告書・技術報告書 フリー

2022 年 2022 巻 AIMED-012 号 p. 05-

詳細
抄録

Convolutional neural networks (CNNs) have been adopted as standard deep learn- ing models in medical image analysis owing to their ability to automatically extract high-level features from training images. Recently, Vision Transformer (ViT) models have been proposed, which implement the Transformer architecture originally developed for natural language process- ing. Given their high predictive performance, we built a couple of ViT models to detect kidney cancer based on computed tomography (CT) images. Experimental results show that our ViT models outperformed conventional CNNs in terms of detection accuracy with various types of CT images. Moreover, we visualized the attention maps of our ViT models to help understand the basis for their detection output.

著者関連情報
© 2022 著作者
前の記事 次の記事
feedback
Top