Date posted
2022.04.05
Date modified
2022.04.05
Author
최석환

Robust Defense Techniques against Adversarial Examples for Image-based Deep Learning Models

Image-based deep learning models are a core part of many real-world applications and have been widely applied in various fields. However, many studies have shown that these models are highly vulnerable to adversarial attacks. Here, the term “adversarial attack” refers to an attack that targets a deep learning model by modifying legitimate input data with slight, human-imperceptible perturbations. Adversarial attacks are known to cause severe damage to practical image-based deep learning systems such as self-driving systems, face recognition systems, and perceptual ad-blocking systems. In this dissertation, we focus on robust defense techniques for image-based deep learning models against adversarial attacks. To this end, we first propose two new defense methods against white-box adversarial attacks: one detects white-box adversarial attacks, and the other makes image-based deep learning models robust against them. We also propose a new defense method that detects black-box adversarial attacks based on perceptual image hashing. Specifically, three main results are obtained:

 

1. Clustering Approach for Detecting White-box Adversarial Attacks: We note that current detection methods against white-box adversarial attacks can only classify input data as either legitimate or adversarial. That is, they can detect adversarial examples but cannot classify input data into multiple classes, i.e., legitimate input data and the various types of adversarial attacks. To overcome this limitation, we propose an advanced detection method that detects white-box adversarial attacks while also classifying the type of attack. The proposed method extracts key features from the adversarial perturbation and feeds the extracted features into a clustering model. From analysis results on various application datasets, we show that the proposed method can classify the types of adversarial attacks. We also show that its detection accuracy exceeds that of recent detection methods.
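The idea of clustering perturbation features can be illustrated with a minimal sketch. The abstract does not state which features or which clustering model the dissertation uses, so everything here is an assumption made for illustration: the three features (L2 norm, L-infinity norm, and fraction of changed pixels), the plain k-means routine, the names `perturbation_features` and `kmeans`, and the toy perturbations. The sketch also assumes that an estimate of the perturbation is already available as a flattened list of values.

```python
import math
from typing import List, Sequence

Vector = List[float]


def perturbation_features(perturbation: Sequence[float]) -> Vector:
    """Summarize a flattened perturbation (hypothetical feature set)."""
    l2 = math.sqrt(sum(v * v for v in perturbation))
    linf = max(abs(v) for v in perturbation)
    density = sum(1 for v in perturbation if abs(v) > 1e-6) / len(perturbation)
    return [l2, linf, density]


def squared_distance(a: Vector, b: Vector) -> float:
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans(points: List[Vector], k: int, iterations: int = 20) -> List[int]:
    """Plain k-means with deterministic farthest-point initialization."""
    centroids = [points[0]]
    while len(centroids) < k:
        # Pick the point farthest from all centroids chosen so far.
        farthest = max(
            points,
            key=lambda p: min(squared_distance(p, c) for c in centroids),
        )
        centroids.append(farthest)
    labels = [0] * len(points)
    for _ in range(iterations):
        # Assignment step: nearest centroid for each point.
        labels = [
            min(range(k), key=lambda j: squared_distance(p, centroids[j]))
            for p in points
        ]
        # Update step: each centroid becomes the mean of its members.
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:
                centroids[j] = [sum(col) / len(members) for col in zip(*members)]
    return labels


# Toy perturbations over 16 pixels: none, dense and small, sparse and large.
perturbations = [
    [0.0] * 16,                 # legitimate input (no perturbation)
    [0.03, -0.03] * 8,          # dense, small-magnitude perturbation
    [0.9, 0.9] + [0.0] * 14,    # sparse, large-magnitude perturbation
    [0.0] * 16,
    [0.04, -0.04] * 8,
    [0.8, 0.8, 0.8] + [0.0] * 13,
]
labels = kmeans([perturbation_features(p) for p in perturbations], k=3)
```

In this toy setting, perturbations of the same kind end up in the same cluster, which is the behavior a multi-class detector needs: one cluster for legitimate inputs and one per attack type.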

 

2. Two-Step Input Transformation for Defending against White-box Adversarial Attack: Previous defense methods against white-box adversarial attacks suffer from accuracy degradation on legitimate input data. To avoid this degradation while keeping the target image-based deep learning models robust against adversarial examples, we propose a two-step input transformation architecture. Based on this architecture, we also propose two new defense methods, called EEJE and ARGAN, which differ in the defender’s knowledge of the target model. From experimental results under various conditions, we show that the proposed two-step input transformation architecture makes image-based deep learning models robust against white-box adversarial attacks while maintaining high accuracy on legitimate input data. In addition, we show that EEJE and ARGAN outperform the previous defense methods.

 

3. Perceptual Image Hashing for Defending against Black-box Adversarial Attacks, which is called PIHA (Perceptual Image HAshing): To defend against black-box adversarial attacks, state-of-the-art defense methods use the similarity of input data. However, the robustness of those defense methods can be easily undermined by the adversary. To solve this problem, we propose a new defense method, called PIHA, which uses the concept of perceptual image hashing. Given a query image, PIHA generates a hash sequence and compares it with those of previous queries to detect black-box adversarial attacks. Here, the hash sequence is invariant to small perturbations and color changes. From experimental results under various black-box adversarial attacks on representative benchmark datasets, we show that PIHA outperforms the state-of-the-art defense methods, i.e., Stateful Detection and Blacklight, in both the number of detected attack queries and the detected query rate.
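The query-hashing idea in item 3 can be sketched as follows. This is not PIHA itself: the abstract does not describe how its hash sequence is built, so the sketch substitutes a simple average hash (block means thresholded at their overall mean) as a stand-in perceptual hash. The names `average_hash`, `hamming`, and `QueryHashDetector`, the 4x4 hash size, and the distance threshold of 2 bits are all assumptions for illustration, and images are assumed to be already converted to grayscale.

```python
from typing import List

Image = List[List[float]]  # grayscale image as rows of pixel intensities


def average_hash(image: Image, hash_size: int = 4) -> int:
    """Stand-in perceptual hash: block means thresholded at their mean."""
    height, width = len(image), len(image[0])
    block_h, block_w = height // hash_size, width // hash_size
    means = []
    for by in range(hash_size):
        for bx in range(hash_size):
            block = [
                image[y][x]
                for y in range(by * block_h, (by + 1) * block_h)
                for x in range(bx * block_w, (bx + 1) * block_w)
            ]
            means.append(sum(block) / len(block))
    threshold = sum(means) / len(means)
    bits = 0
    for m in means:
        bits = (bits << 1) | (1 if m > threshold else 0)
    return bits


def hamming(a: int, b: int) -> int:
    """Number of differing bits between two hashes."""
    return bin(a ^ b).count("1")


class QueryHashDetector:
    """Flags a query whose hash is close to the hash of an earlier query."""

    def __init__(self, max_distance: int = 2) -> None:
        self.max_distance = max_distance
        self.seen: List[int] = []

    def check(self, image: Image) -> bool:
        h = average_hash(image)
        suspicious = any(hamming(h, prev) <= self.max_distance for prev in self.seen)
        self.seen.append(h)
        return suspicious


# Toy 8x8 images: a base image, a slightly perturbed copy, an unrelated image.
base = [[10.0] * 4 + [200.0] * 4 for _ in range(8)]
perturbed = [
    [v + (2.0 if (x + y) % 2 == 0 else 0.0) for x, v in enumerate(row)]
    for y, row in enumerate(base)
]
other = [[10.0] * 8 for _ in range(4)] + [[200.0] * 8 for _ in range(4)]

detector = QueryHashDetector()
first = detector.check(base)        # no history yet
second = detector.check(perturbed)  # near-duplicate of an earlier query
third = detector.check(other)       # unrelated image
```

Because a black-box attack typically submits many slightly modified copies of one image, the small perturbations leave the coarse hash unchanged and the repeated queries are flagged, while an unrelated image produces a distant hash.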

 

Together, the three defense techniques described in this dissertation provide robustness against both white-box and black-box adversarial attack scenarios. Image-based deep learning models can therefore be used with greater confidence despite the threat of adversarial attacks.

Degree date
August 2022
Advisor
최윤호
Keywords
Adversarial attack, Deep Learning, Security
Project web page
https://sites.google.com/view/seokhwan-choi/home