This repository contains modifications and fine-tuning implementations of DVERGE (Diversifying Vulnerabilities for Enhanced Robust Generation of Ensembles) for ongoing research.
Create a Conda environment named dverge
with all dependencies by running:
conda env create -f environment.yml
We use PyTorch 1.4.0, but you may need to adjust the PyTorch version according to the CUDA version on your machine.
The code is tested on a single TITAN Xp GPU. Running on multiple GPUs may require modifications.
Access the pre-trained models and black-box transfer adversarial examples via this link.
- Download and place the
checkpoints/
folder under this repository. - Create a folder named
data/
, downloadtransfer_adv_examples.zip
, unzip it, and place the extractedtransfer_adv_examples/
folder insidedata/
.
Modify and execute the training script for fine-tuning:
bash scripts/training.sh
Run the evaluation script to test the model:
bash scripts/evaluation.sh
The current feature extraction method mainly supports ResNet20. If using a different architecture, modify the feature extraction mechanism using PyTorch forward hooks.
Training DVERGE may result in high variations due to random layer sampling for distillation. See Appendix C.5 of the paper for more details.
To understand decision region plots, refer to this tutorial.
Our code for decision region visualization can be found here. Key settings:
args.steps=1000
: Each axis perturbed 1000 times.args.vmax=0.1
: Maximum perturbation distance 0.1.- Generates 1,000,000 data points per plot.
If you use this work, please cite:
@article{yang2020dverge,
title={DVERGE: Diversifying Vulnerabilities for Enhanced Robust Generation of Ensembles},
author={Yang, Huanrui and Zhang, Jingyang and Dong, Hongliang and Inkawhich, Nathan and Gardner, Andrew and Touchet, Andrew and Wilkes, Wesley and Berry, Heath and Li, Hai},
journal={Advances in Neural Information Processing Systems},
volume={33},
year={2020}
}
This repository adapts the Adaptive Diversity Promoting Regularizer (ADP) training code from its official repo, originally in TensorFlow, and converted into PyTorch here.