Universal Distributional Decision-Based Black-Box Adversarial Attack with Reinforcement Learning
Publication: Contribution to book/conference proceedings/anthology/report › Contribution to conference proceedings › Contributed › Peer-reviewed
Contributors
Abstract
The vulnerability of high-performance machine learning models implies a security risk in applications with real-world consequences. Research on adversarial attacks is beneficial for guiding the development of machine learning models on the one hand and for finding targeted defenses on the other. However, most adversarial attacks today leverage gradient or logit information from the models to generate adversarial perturbations. Work in the more realistic decision-based setting, where adversarial perturbations are generated solely by observing the output label of the targeted model, is still relatively rare and mostly relies on gradient-estimation strategies. In this work, we propose a pixel-wise decision-based attack algorithm that finds a distribution of adversarial perturbations through a reinforcement learning algorithm. We call this method Decision-based Black-box Attack with Reinforcement learning (DBAR). Experiments show that the proposed approach outperforms state-of-the-art decision-based attacks, achieving a higher attack success rate and greater transferability.
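As an orienting sketch only (the abstract does not specify the DBAR algorithm itself), the snippet below illustrates the hard-label setting described above: an attacker learns the parameters of a pixel-wise perturbation distribution using nothing but the model's output label, here via a generic REINFORCE/evolution-strategies style update. The stand-in model `query_label`, the reward shaping, and all hyperparameters are assumptions made for illustration, not the authors' method.

```python
# Hypothetical sketch of a hard-label (decision-based) attack loop. This is NOT
# the paper's DBAR algorithm; the stand-in model `query_label`, the reward
# shaping, and every hyperparameter below are assumptions for illustration.
import numpy as np

rng = np.random.default_rng(0)


def query_label(x):
    """Stand-in black-box model: returns only a hard class label (0 or 1)."""
    return int(x.mean() > 0.5)


def decision_based_attack(x, true_label, x_adv_init, steps=200, pop=20,
                          lr=0.01, sigma=0.1, penalty=0.05):
    """Learn the mean of a pixel-wise Gaussian perturbation from label queries only."""
    mu = x_adv_init - x  # start from a perturbation that is already misclassified
    best_adv = np.clip(x_adv_init, 0.0, 1.0)
    best_norm = np.linalg.norm(best_adv - x)
    for _ in range(steps):
        noise = rng.normal(size=(pop,) + x.shape)  # population of candidate perturbations
        rewards = np.empty(pop)
        for i in range(pop):
            delta = mu + sigma * noise[i]
            adv = np.clip(x + delta, 0.0, 1.0)
            flipped = query_label(adv) != true_label  # only the label is observed
            # Reward: stay misclassified while keeping the perturbation small.
            rewards[i] = float(flipped) - penalty * np.linalg.norm(delta)
            if flipped and np.linalg.norm(adv - x) < best_norm:
                best_adv, best_norm = adv, np.linalg.norm(adv - x)
        # REINFORCE / evolution-strategies style update of the distribution mean.
        advantages = (rewards - rewards.mean()) / (rewards.std() + 1e-8)
        mu += lr * (advantages[:, None, None] * noise).mean(axis=0) / sigma
    return best_adv


# Toy usage on a random 8x8 "image" in [0, 1].
x = rng.uniform(0.2, 0.4, size=(8, 8))        # classified as label 0 by the stand-in model
x_adv_init = np.full_like(x, 0.9)             # classified as label 1
adv = decision_based_attack(x, query_label(x), x_adv_init)
print("clean label:", query_label(x), "| adversarial label:", query_label(adv),
      "| perturbation L2 norm:", round(float(np.linalg.norm(adv - x)), 3))
```

The sketch only mirrors the query interface stated in the abstract (labels only, no gradients or logits); the distribution family, reward, and update rule used in the paper are the authors' own design.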
Details
| Original language | English |
| --- | --- |
| Title | Neural Information Processing |
| Editors | Mohammad Tanveer, Sonali Agarwal, Seiichi Ozawa, Asif Ekbal, Adam Jatowt |
| Publisher | Springer, Cham |
| Pages | 206–215 |
| Number of pages | 10 |
| ISBN (electronic) | 978-3-031-30111-7 |
| ISBN (print) | 978-3-031-30110-0 |
| Publication status | Published - 2023 |
| Peer-reviewed | Yes |
| Published externally | Yes |
Publication series
| Series | Lecture Notes in Computer Science, Volume 13625 |
| --- | --- |
| ISSN | 0302-9743 |
External IDs
| Scopus | 85161696199 |
| --- | --- |
Keywords
- Adversarial attack, Reinforcement Learning, Decision attack