Skip to content

.t7 caption files #7

@shenkev

Description

@shenkev

Hi Scot,

Quick question about the bird dataset you're using.

I downloaded the bird dataset as per your instructions:

#####How to train a char-CNN-RNN model:
1. Download the birds and flowers data.

Inside the cvpr2016_cub/text_c10 directory, there are .t7 files. E.G 200.Common_Yellowthroat.t7

Upon opening them, I found that they were 60x201x10 tensors of integers. I guessed 60 is the images/specie, 10 is the caption/image. What is the 201 dimension? Is it the vocabulary size of the captions? What are the actual integers? I notice values from 0 to 70ish with a lot of the values being 0.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions