A Classification Method Using Hard-to-Explain Rules with Robustness against Adversarial Example Attacks on MNIST

稲元 勉; 樋上 喜信; 松本 卓也; 榊原 一紀

doi:10.1541/ieejeiss.145.498

Abstract

This paper proposes a classification method that uses hard-to-explain rules and is robust against adversarial example (AE) attacks on MNIST. The aim is to address two technical difficulties of deep learning-based technology: the unexplainability of classifications and the vulnerability to AE attacks. Like the existing Deep k-Nearest Neighbors (DkNN) method, the proposed method uses output vectors computed by an artificial neural network (ANN), and can thereby address the unexplainability difficulty. It differs from DkNN in the architecture of the ANNs used and in the format of the output vectors. These output vectors are discrete and serve as hard-to-explain rules that mitigate the vulnerability to AE attacks. In computational experiments, MNIST is taken as the target problem, and the Fast Gradient Sign Method (FGSM) and the Basic Iterative Method (BIM) are used as the AE attacks. The results show that the proposed method achieved accuracies above 95% under all attacks.
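For readers unfamiliar with the two attacks named in the abstract, the following is a minimal sketch of FGSM and BIM in their standard textbook form. It is not the authors' experimental setup: the tiny logistic-regression model, the weights `w` and `b`, the input `x`, and the values of `eps`, `alpha`, and `steps` are all hypothetical choices made only so that the example runs without any dependencies. The paper itself attacks ANNs trained on MNIST.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def predict(x, w, b):
    """Return class 1 if the logistic model's probability is at least 0.5."""
    z = sum(wi * xi for wi, xi in zip(w, x)) + b
    return 1 if sigmoid(z) >= 0.5 else 0

def loss_gradient(x, y, w, b):
    """Gradient of binary cross-entropy with respect to the input x."""
    p = sigmoid(sum(wi * xi for wi, xi in zip(w, x)) + b)
    return [(p - y) * wi for wi in w]

def sign(v):
    return (v > 0) - (v < 0)

def clip(v, lo, hi):
    return max(lo, min(hi, v))

def fgsm(x, y, w, b, eps):
    """FGSM: one step of size eps along the sign of the loss gradient."""
    g = loss_gradient(x, y, w, b)
    return [clip(xi + eps * sign(gi), 0.0, 1.0) for xi, gi in zip(x, g)]

def bim(x, y, w, b, eps, alpha, steps):
    """BIM: repeated small FGSM steps, kept within eps of the original x."""
    x_adv = list(x)
    for _ in range(steps):
        g = loss_gradient(x_adv, y, w, b)
        x_adv = [
            clip(clip(xa + alpha * sign(gi), xi - eps, xi + eps), 0.0, 1.0)
            for xa, gi, xi in zip(x_adv, g, x)
        ]
    return x_adv

# Hypothetical toy model and input (assumptions, not taken from the paper).
w, b = [2.0, -1.0, 0.5], -0.2
x, y = [0.6, 0.3, 0.5], 1

x_fgsm = fgsm(x, y, w, b, eps=0.3)
x_bim = bim(x, y, w, b, eps=0.3, alpha=0.1, steps=5)

print("clean prediction:", predict(x, w, b))
print("FGSM prediction: ", predict(x_fgsm, w, b))
print("BIM prediction:  ", predict(x_bim, w, b))
```

Both attacks keep every input component within `eps` of the original value and within the valid pixel range [0, 1]. BIM differs from FGSM only in taking several smaller steps of size `alpha` and re-clipping after each one.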
