Skip to content

Official repository for the CVPRW2023 paper "An Extended Study of Human-like Behavior under Adversarial Training".

License

Notifications You must be signed in to change notification settings

paulgavrikov/adversarial_training_vs_humans

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 

Repository files navigation

An Extended Study of Human-like Behavior under Adversarial Training

Paul Gavrikov, Janis Keuper, Margret Keuper

CC BY-SA 4.0

Presented at: Art of Robustness Workshop @ CVPR 2023

Paper | ArXiv

Abstract: Neural networks have a number of shortcomings. Amongst the severest ones is the sensitivity to distribution shifts which allows models to be easily fooled into wrong predictions by small perturbations to inputs that are often imperceivable to humans and do not have to carry semantic meaning. Adversarial training poses a partial solution to address this issue by training models on worst-case perturbations. Yet, recent work has also pointed out that the reasoning in neural networks is different from humans. Humans identify objects by shape, while neural nets mainly employ texture cues. Exemplarily, a model trained on photographs will likely fail to generalize to datasets containing sketches. Interestingly, it was also shown that adversarial training seems to favorably increase the shift toward shape bias. In this work, we revisit this observation and provide an extensive analysis of this effect on various architectures, the common L2- and Linf-training, and Transformer-based models. Further, we provide a possible explanation for this phenomenon from a frequency perspective.

Citation

If you find our work useful in your research, please consider citing:

@InProceedings{Gavrikov_2023_CVPR,
    author    = {Gavrikov, Paul and Keuper, Janis and Keuper, Margret},
    title     = {An Extended Study of Human-Like Behavior Under Adversarial Training},
    booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops},
    month     = {June},
    year      = {2023},
    pages     = {2360-2367}
}

Legal

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

About

Official repository for the CVPRW2023 paper "An Extended Study of Human-like Behavior under Adversarial Training".

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published