NeurIPS 2025
Mojtaba Nafez, Mobina Poulaei*, Nikan Vasei*, Bardia Soltani Moakhar, Mohammad Sabokrou, Mohammad Hossein Rohban
Weakly Supervised Video Anomaly Detection (WSVAD) has achieved notable advancements, yet existing models remain vulnerable to adversarial attacks, limiting their reliability. Because supervision is weak, with only video-level labels available, traditional adversarial defenses such as adversarial training are ineffective, as video-level perturbations provide too weak a training signal.
FrameShield introduces a pseudo-anomaly generation method called Spatiotemporal Region Distortion (SRD), which creates localized synthetic anomalies in normal videos while maintaining temporal consistency. Combined with the model's otherwise noisy pseudo-labels, these synthetic samples reduce overall label noise and enable effective adversarial training.
FrameShield substantially enhances robustness across benchmarks, outperforming state-of-the-art methods by an average of 71.0% AUROC under adversarial settings.
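To make the SRD idea concrete, here is a minimal illustrative sketch, not the repository's actual implementation (region selection, distortion type, and scheduling all differ): the hypothetical `srd_pseudo_anomaly` helper distorts one fixed spatial region across every frame of a normal clip, so the patch looks anomalous while staying temporally consistent.

```python
# Illustrative sketch of SRD only; the repo's real implementation differs in
# region selection, distortion type, and scheduling.
import numpy as np

def srd_pseudo_anomaly(clip: np.ndarray, rng=None) -> np.ndarray:
    """Distort one fixed spatial region across all frames of a normal clip.

    clip: (T, H, W, C) uint8 frames. Keeping the same region over time is what
    preserves temporal consistency while making the patch look anomalous.
    """
    rng = rng or np.random.default_rng()
    t, h, w, _ = clip.shape
    ph, pw = h // 4, w // 4                      # patch size: an assumption
    y = int(rng.integers(0, h - ph))
    x = int(rng.integers(0, w - pw))
    out = clip.copy()
    patch = out[:, y:y + ph, x:x + pw].astype(np.int16)
    noise = rng.integers(-64, 64, patch.shape)   # simple photometric distortion
    out[:, y:y + ph, x:x + pw] = np.clip(patch + noise, 0, 255).astype(np.uint8)
    return out
```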
```
FrameShield/
│
├── clip/              # Base CLIP implementation
├── configs/           # Dataset-specific configuration files
├── datasets/          # Dataset and DataLoader creation scripts
├── models/            # X-CLIP and FrameShield model definitions
├── runners/           # Bash scripts for training/evaluation
├── utils/             # Logging, checkpointing, and helper utilities
│
├── main.py            # Standard (non-adversarial) training / testing
├── main_advtrain.py   # Adversarial training / pseudo-label generation entry point
├── main_attack.py     # Adversarial attack evaluation
│
└── requirements.txt   # Python dependencies
```
First, clone the repository and move into it:

```bash
git clone https://github.com/rohban-lab/FrameShield.git
cd FrameShield
```

Next, create a Python environment; below is an example using conda. Python 3.10 is recommended, as it is the version used during development:

```bash
conda create -n FS python=3.10
conda activate FS
```

Finally, install the required dependencies:

```bash
pip install -r requirements.txt
```

FrameShield supports the benchmark datasets used in the paper:
| Dataset | Source | Notes |
|---|---|---|
| ShanghaiTech | Train (Kaggle) / Test (Kaggle) | Official ShanghaiTech University website |
| TAD | Train+Test (Kaggle) | Official repository |
| UCF Crime | Project Website | Preprocessed for FrameShield |
| MSAD | Project Website | Apply for the dataset directly on their website. |
| UCSD-Ped2 | Official Paper | Preprocessed for FrameShield |
Each dataset should be placed under a root directory, which is specified in your config file:

```yaml
DATA:
  ROOT: '../SHANGHAI/'   # <-- the root directory
  TRAIN_FILE: 'configs/shanghai/SHANGHAI_train.txt'
  VAL_FILE: 'configs/shanghai/SHANGHAI_test.txt'
  DATASET: shanghai
  ...
```

See example configs in `configs/`.
Note: Depending on your dataset format and how clean the frame indices are, you may need to adjust how frames are loaded in the dataloaders; see the sketch below for the usual pattern.
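A common pattern is resolving frame paths from the config's `FILENAME_TMPL`; here is a hypothetical sketch (the helper name, the example directory, and the 1-based indexing are assumptions, not the repo's code):

```python
import os

# Hypothetical helper: builds a frame path from the config's FILENAME_TMPL.
# Adjust if your extracted frames are 0-indexed or named differently.
def frame_path(video_dir: str, idx: int, tmpl: str = "img_{:05d}.jpg") -> str:
    return os.path.join(video_dir, tmpl.format(idx))

# Hypothetical video directory under the dataset root:
print(frame_path("../SHANGHAI/01_001", 1))  # ../SHANGHAI/01_001/img_00001.jpg
```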
| Model | Dataset | Link | Notes |
|---|---|---|---|
| Backbone | Kinetics-400 | Google Drive | Initial weights for the PromptMIL stage. |
Download the weights and specify their path using the `--pretrained PATH` argument:

```bash
python ... --pretrained ../weights/k400_16_8.pth
```
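Before training, it can be worth confirming the download is intact; a quick, hypothetical sanity check (the checkpoint's internal layout is an assumption):

```python
import torch

# Hypothetical sanity check: only confirms the file deserializes on CPU.
state = torch.load("../weights/k400_16_8.pth", map_location="cpu")
inner = state.get("model", state) if isinstance(state, dict) else state
print(f"Checkpoint loaded; {len(inner)} top-level entries")
```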
For each task, you can choose either of these methods:

- Run the pre-implemented bash scripts located in `runners/`. `NUM_GPUs` indicates the number of GPUs you want to use for the run.
- Run the python commands manually. Regarding the arguments:
  - `-cfg`: path to your config file
  - `--batch-size`: training batch size
  - `--accumulation-steps`: the optimizer's gradient accumulation steps
  - `--output`: output directory for saving logs
  - `--pretrained`: path to the pretrained weights
  - `--only_test`: run clean testing or pseudo-label generation only
Below are examples for each task using both methods.
To use the `train.sh` bash file:

```bash
bash runners/train.sh NUM_GPUs
```

To run manually using a python command:

```bash
python -m torch.distributed.launch --rdzv_endpoint=localhost:29450 --nproc_per_node=1 main.py -cfg configs/traffic/traffic_server.yaml --batch-size 1 --accumulation-steps 8 --output output/train --pretrained ../weights/k400_16_8.pth
```

To use the `test.sh` bash file:
```bash
bash runners/test.sh NUM_GPUs
```

To run manually using a python command:

```bash
python -m torch.distributed.launch --rdzv_endpoint=localhost:29450 --nproc_per_node=1 main.py -cfg configs/traffic/traffic_server.yaml --output output/test --pretrained ../weights/best.pth --only_test
```

To use the `genlabels.sh` bash file:
```bash
bash runners/genlabels.sh NUM_GPUs
```

To run manually using a python command:

```bash
python -m torch.distributed.launch --nproc_per_node=1 --rdzv_endpoint=localhost:29450 main_advtrain.py -cfg configs/traffic/traffic_advtrain.yaml --batch-size 1 --accumulation-steps 8 --output output/gen_pseudo_labels --pretrained ../weights/best.pth --only_test
```

To use the `advtrain.sh` bash file:
```bash
bash runners/advtrain.sh NUM_GPUs
```

To run manually using a python command:

```bash
python -m torch.distributed.launch --nproc_per_node=1 --rdzv_endpoint=localhost:29450 main_advtrain.py -cfg configs/traffic/traffic_advtrain.yaml --batch-size 1 --accumulation-steps 8 --output output/adv_train --pretrained ../weights/best.pth
```

To use the `attack.sh` bash file:
```bash
bash runners/attack.sh NUM_GPUs
```

To run manually using a python command:

```bash
python -m torch.distributed.launch --nproc_per_node=1 --rdzv_endpoint=localhost:29450 main_attack.py -cfg configs/traffic/traffic_attack.yaml --batch-size 1 --accumulation-steps 8 --output output/attack --pretrained ../weights/best.pth
```

You can access sample configs, text files, and label lists for each of the datasets in `configs/`.
```yaml
DATA:
  ROOT: <str>                 # Root directory of the dataset
  TRAIN_FILE: <str>           # Path to the training dataset text file
  VAL_FILE: <str>             # Path to the validation/test dataset text file
  DATASET: <str>              # Dataset name (e.g., "shanghai", "traffic", "ucf")
  NUM_CLIPS: <int>            # Number of temporal clips (chunks) per video
  NUM_FRAMES: <int>           # Number of frames per clip
  FRAME_INTERVAL: <int>       # Frame sampling interval
  NUM_CLASSES: <int>          # Number of classes (e.g., 2 for normal/anomaly)
  LABEL_LIST: <str>           # Path to label list file (class names)
  FILENAME_TMPL: <str>        # Template for frame filenames (e.g., "img_{:05d}.jpg")
MODEL:
  ARCH: <str>                 # Model backbone architecture (e.g., "ViT-B/32")
TRAIN:
  BATCH_SIZE: <int>           # Batch size for training
  ACCUMULATION_STEPS: <int>   # Gradient accumulation steps
  AUTO_RESUME: <bool>         # Resume automatically from the latest checkpoint
ADV_TRAIN:
  EPS: <float>                # Perturbation strength (epsilon) for adversarial attacks
  LOSS: <str>                 # Loss type ("ce" for cross-entropy, "mil" for multiple-instance learning)
  PSEUDO_LABEL: <bool>        # Use pseudo-labels generated from the model
  PSEUDO_ANOMALY: <bool>      # Use pseudo-anomalies from SRD
```
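As a quick way to inspect a config before launching a run, here is a minimal sketch assuming the files are plain YAML readable with PyYAML (the repo may route configs through its own loader, and not every file defines every section):

```python
import yaml

# Load and inspect one of the configs used in the example commands above.
with open("configs/traffic/traffic_server.yaml") as f:
    cfg = yaml.safe_load(f)

print(cfg["DATA"]["DATASET"], cfg["DATA"].get("NUM_FRAMES"))
print("epsilon:", cfg.get("ADV_TRAIN", {}).get("EPS"))  # may be absent here
```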
OpenReview:
```bibtex
@inproceedings{
nafez2025frameshield,
title={FrameShield: Adversarially Robust Video Anomaly Detection},
author={Mojtaba Nafez and Mobina Poulaei and Nikan Vasei and Bardia Soltani Moakhar and Mohammad Sabokrou and Mohammad Hossein Rohban},
booktitle={The Thirty-ninth Annual Conference on Neural Information Processing Systems},
year={2025},
url={https://openreview.net/forum?id=7FLKzOqsKd}
}
```

arXiv:
```bibtex
@misc{
nafez2025frameshieldadversariallyrobustvideo,
title={FrameShield: Adversarially Robust Video Anomaly Detection},
author={Mojtaba Nafez and Mobina Poulaei and Nikan Vasei and Bardia Soltani Moakhar and Mohammad Sabokrou and MohammadHossein Rohban},
year={2025},
eprint={2510.21532},
archivePrefix={arXiv},
primaryClass={cs.LG},
url={https://arxiv.org/abs/2510.21532},
}
```