
Active Patterns Perceived for Stochastic Video Prediction (ASVP)

This is the implementation of 'Active Patterns Perceived for Stochastic Video Prediction' in ACM MM 2022.


1. Important updates

  • 2022-04-01: This project will be released soon; please wait.

  • 2022-04-03: The data preparation procedure and preprocessing scripts for KTH are uploaded for inference.

  • 2022-04-04: Code for separating active patterns and non-active patterns from videos is uploaded.

  • 2022-04-06: Released models and code for inference and training are uploaded.

  • ...

2. Getting started

The hardware and software requirements are given below.

2.1. Prerequisites

CPU: Intel(R) Core(TM) i7-6900K CPU @ 3.20GHz

GPU: GeForce GTX 1080 Ti

CUDA Version: 10.2

OS: Ubuntu 16.04.6 LTS

2.2. Installing

Configure the virtual environment on Ubuntu.

  • Create a virtual environment with Python 3.6
conda create -n asvp python=3.6
conda activate asvp
  • Install requirements (note that we use tensorflow-gpu==1.10.0)
pip install -r requirements.txt
  • Additionally install ffmpeg
conda install x264 ffmpeg -c conda-forge

The virtual environment is now set up on Ubuntu. A quick sanity check is sketched below.
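To confirm the environment is working, the minimal Python sketch below (assuming tensorflow-gpu==1.10.0 was installed from requirements.txt) prints the TensorFlow version and whether a GPU is visible; the file name is only a suggestion.

# check_env.py (hypothetical file name) -- confirm TensorFlow and GPU visibility
import tensorflow as tf

print("TensorFlow version:", tf.__version__)          # expected: 1.10.0
print("GPU available:", tf.test.is_gpu_available())   # expected: True on a CUDA-enabled machine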

2.3. Dataset

Two datasets are used: the KTH human action dataset and the BAIR action-free robot pushing dataset. To reproduce the experiments, the processed data should be downloaded:

  • For KTH, the raw data and the subsequence file should be downloaded first. For now, please download them from:

raw data and subsequence file. After downloading, place all .zip and .tar.gz files in the ./data directory, and run

bash data/preprocess_kth.sh

All preprocessed frames, split into subsequences, are then available in ./data/kth/processed.

  • If you only need to run inference with the released models, run the command below to convert the images into tfrecords (a quick sanity check on the output is sketched at the end of this section):
bash data/kth2tfrecords.sh 

Otherwise, please skip this step and turn to Section 3 for separating active patterns and non-active patterns from the videos.
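As a quick check that the conversion worked, the sketch below counts the records in each generated file. The glob pattern is an assumption about where data/kth2tfrecords.sh writes its output; adjust it to the actual location.

# count_records.py (hypothetical) -- verify the generated tfrecords are readable
import glob
import tensorflow as tf

for path in sorted(glob.glob("data/kth/**/*.tfrecord*", recursive=True)):  # assumed layout
    n = sum(1 for _ in tf.python_io.tf_record_iterator(path))
    print("{}: {} records".format(path, n))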

3. Active pattern mining

Active pattern mining is only needed for training; it can be skipped if you only run inference with the released models.

  • To separate active patterns and non-active patterns from videos, please refer to details (a toy illustration of the idea is sketched at the end of this section).

  • After all active patterns and non-active patterns are mined, the images are converted to tfrecords for training:

bash data/kth2tfrecords_ap.sh

The final data can also be downloaded from drive.
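For intuition only, the sketch below splits a frame into moving ("active") and static ("non-active") pixels with a simple thresholded frame difference. This is not the mining procedure used by ASVP; it only illustrates the kind of separation the tfrecords above encode. See the linked details and the paper for the actual method.

# toy_split.py (illustration only, NOT the ASVP mining procedure)
import numpy as np

def split_active(prev_frame, curr_frame, threshold=15):
    """Split curr_frame into active (moving) and non-active (static) pixels.

    prev_frame, curr_frame: uint8 grayscale arrays of the same shape.
    threshold: minimum absolute intensity change for a pixel to count as active.
    """
    diff = np.abs(curr_frame.astype(np.int16) - prev_frame.astype(np.int16))
    mask = diff > threshold                      # True where the pixel changed
    active = np.where(mask, curr_frame, 0)       # keep only moving pixels
    non_active = np.where(mask, 0, curr_frame)   # keep only static pixels
    return active, non_active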

4. Inference with released models

After downloading the released models, place them as follows:

——./pretrained/pretrained_models/kth/ours_asvp

——./pretrained/pretrained_models/bair_action_free/ours_asvp

and the pre-trained baseline models should be placed as:

——./pretrained/pretrained_models/kth/savp

——./pretrained/pretrained_models/bair_action_free/savp
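Before running the commands below, you can verify that the checkpoints are readable with the minimal sketch here. The checkpoint prefixes follow the KTH inference commands and are assumptions; adjust them (and add the BAIR ones) to match where you actually placed the models.

# check_checkpoints.py (hypothetical) -- verify the released checkpoints are in place
import tensorflow as tf

prefixes = [
    "pretrained_models/kth/ours_asvp/model-300000",   # assumed location, see commands below
    "pretrained_models/kth/savp/model-300000",
]
for prefix in prefixes:
    print(prefix, "->", tf.train.checkpoint_exists(prefix))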

4.1. Inference on KTH human action

  • For our released model, please run
CUDA_VISIBLE_DEVICES=0 python scripts/evaluate.py --input_dir data/kth --dataset_hparams sequence_length=30 --checkpoint pretrained_models/kth/ours_asvp/model-300000 --mode test --results_dir results_test_samples/kth --batch_size 3
  • For the baseline, please run
CUDA_VISIBLE_DEVICES=0 python scripts/evaluate.py --input_dir data/kth --dataset_hparams sequence_length=30 --checkpoint pretrained_models/kth/savp/model-300000 --mode test --results_dir results_test_samples/kth --batch_size 3

4.2. Inference on BAIR action-free robot pushing

  • For our released model, please run
CUDA_VISIBLE_DEVICES=0 python scripts/evaluate.py --input_dir data/bair --dataset_hparams sequence_length=22 --checkpoint logs/bair_action_free/ours_asvp/model-300000 --mode test --results_dir results_test_samples/bair_action_free --batch_size 8
  • For the baseline, please run
CUDA_VISIBLE_DEVICES=0 python scripts/evaluate.py --input_dir data/bair --dataset_hparams sequence_length=22 --checkpoint pretrained_models/bair_action_free/savp/model-300000 --mode test --results_dir results_test_samples/bair_action_free --batch_size 8
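For reference, PSNR is one of the standard frame-level metrics for video prediction; the sketch below shows how a single predicted frame can be compared with its ground truth (both uint8 arrays of the same shape). The evaluation script writes its outputs to --results_dir; this sketch is only for ad-hoc comparisons.

# psnr.py -- minimal PSNR between a predicted frame and its ground truth
import numpy as np

def psnr(ground_truth, prediction, max_val=255.0):
    mse = np.mean((ground_truth.astype(np.float64) - prediction.astype(np.float64)) ** 2)
    if mse == 0:
        return float("inf")                      # identical frames
    return 10.0 * np.log10((max_val ** 2) / mse)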

5. Training

  • For training our model with active patterns and non-active patterns on KTH, please run
CUDA_VISIBLE_DEVICES=0,1 python scripts/train.py --input_dir data/kth --dataset kth --model asvp --model_hparams_dict hparams/kth/ours_asvp/model_hparams.json --output_dir logs/kth/ours_asvp
  • For training our model with active patterns and non-active patterns on BAIR action-free, please run
CUDA_VISIBLE_DEVICES=0,1 python scripts/train.py --input_dir data/bair --dataset bair --model asvp --model_hparams_dict hparams/bair_action_free/ours_asvp/model_hparams.json --output_dir logs/bair_action_free/ours_asvp

6. More cases

Additional notes about deploying this on a live system will be added here.

7. License

This project is licensed under the MIT License - see the LICENSE.md file for details

Acknowledgments
