MHSAN: Multi-Head Self-Attention Network for Visual Semantic Embedding

This repository is the official PyTorch implementation of MHSAN: Multi-Head Self-Attention Network for Visual Semantic Embedding (WACV2020) by Geondo Park, Chihye Han, Wonjun Yoon, Daeshik Kim.

Prepare Dataset

Download the dataset files: coco, Flickr

We use splits produced by Andrej Karpathy. Download the json file from here. Then, put it same directory, respectively. (ref, get_data_path in utils.py)

Train Coco

For train the new model with coco dataset, run train_coco.py Overall hyperparameters for training are set by default.

python train_coco.py -hop 10 -name <model_name> -p-coeff 0.1

Train flickr

For train the new model with flickr dataset, run train_flickr.py Overall hyperparameters for training are set by default.

python train_flickr.py -hop 10 -name <model_name> -p-coeff 0.1

Implementation

Our code is implemented based on : vsepp

Citation

@inproceedings{park2020mhsan,
  title={MHSAN: Multi-Head Self-Attention Network for Visual Semantic Embedding},
  author={Park, Geondo and Han, Chihye and Yoon, Wonjun and Kim, Daeshik},
  booktitle={The IEEE Winter Conference on Applications of Computer Vision},
  pages={1518--1526},
  year={2020}
}

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
README.md		README.md
dataset.py		dataset.py
evaluation.py		evaluation.py
example.py		example.py
loss.py		loss.py
model.py		model.py
train_coco.py		train_coco.py
train_flickr.py		train_flickr.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MHSAN: Multi-Head Self-Attention Network for Visual Semantic Embedding

Prepare Dataset

Train Coco

Train flickr

Implementation

Citation

About

Uh oh!

Releases

Packages

Languages

GeondoPark/MHSAN

Folders and files

Latest commit

History

Repository files navigation

MHSAN: Multi-Head Self-Attention Network for Visual Semantic Embedding

Prepare Dataset

Train Coco

Train flickr

Implementation

Citation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages