Proceedings of the Annual Conference of JSAI
Online ISSN : 2758-7347
37th (2023)
Session ID : 3L1-GS-11-03

Backdoor Attacks using the Concepts as a Trigger
*Hideyuki OISO, Kazuto FUKUCHI, Youhei AKIMOTO, Jun SAKUMA

Abstract

Backdoor attacks are a type of attack against machine learning models. A backdoored model classifies an input into the wrong class if the input contains a certain trigger (e.g., noise or a pattern). In this paper, we propose a backdoor attack that uses concepts as triggers, in order to clarify the vulnerabilities of machine learning models and to foster discussion on improving their security. Concepts are interpretable attributes contained in a sample; for example, attributes such as hair color and smiling are concepts of facial images. In existing research, most triggers are assumed to be artificially generated patterns that do not appear in the physical world. In contrast, because the trigger concept occurs naturally in samples, the poisoning samples produced by our attack look natural and stealthy. In our experiments, we demonstrate that concepts can be leveraged as triggers by evaluating the attack success rate of the proposed method and its robustness against existing defense methods.
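As one concrete illustration of the general idea, the sketch below builds a concept-triggered poisoning set: samples that already exhibit a chosen concept (e.g., a CelebA-style "Smiling" attribute) are relabeled to the attacker's target class while the images themselves are left untouched, so the poisons look natural. This is a minimal, hypothetical sketch, not the authors' implementation; the function names, the binary attribute layout, and the poisoning rate are assumptions.

import numpy as np

def poison_labels(labels, concept_flags, target_class, poison_rate=0.1, seed=0):
    """Relabel a fraction of the samples that exhibit the trigger concept.

    labels        : (N,) int array of clean class labels
    concept_flags : (N,) bool array, True where the trigger concept is present
    target_class  : class the backdoored model should predict when the concept appears
    poison_rate   : fraction of concept-bearing samples to relabel
    """
    rng = np.random.default_rng(seed)
    candidates = np.flatnonzero(concept_flags)   # samples that already contain the concept
    n_poison = int(len(candidates) * poison_rate)
    chosen = rng.choice(candidates, size=n_poison, replace=False)

    poisoned = labels.copy()
    poisoned[chosen] = target_class              # labels are flipped; images are untouched
    return poisoned, chosen

if __name__ == "__main__":
    # Illustrative usage with synthetic annotations standing in for real attribute data.
    n = 1000
    rng = np.random.default_rng(1)
    labels = rng.integers(0, 5, size=n)          # 5-class toy task
    smiling = rng.random(n) < 0.4                # hypothetical "Smiling" concept flags
    poisoned_labels, idx = poison_labels(labels, smiling, target_class=0)
    print(f"poisoned {len(idx)} of {smiling.sum()} concept-bearing samples")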

© 2023 The Japanese Society for Artificial Intelligence