FMoW dataset and results variance

Hi, I'm using the DataComp evaluation, and the FMoW dataset seems to dramatically increase the variance of the results. Its main metric is worst-region accuracy. There are 5 regions: 4 of them have more than 700 samples each, but 1 has only 4 images. This means the answer on a single image can change the FMoW metric from 0 to 0.25, and the 38-task average then changes by 0.25/38 ≈ 0.0066 (about 0.66 percentage points). For instance, an average accuracy of 70.0 and one of 69.4 may differ only by the answer on one picture!
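To make the arithmetic concrete, here is a minimal sketch (not the DataComp evaluation code). The region names and the per-region counts are made up for illustration; only the 4-image region and the divisor of 38 come from the numbers above.

```python
from typing import Dict, Tuple

def worst_region_accuracy(counts: Dict[str, Tuple[int, int]]) -> float:
    """counts maps region name -> (num_correct, num_total)."""
    return min(correct / total for correct, total in counts.values())

# Hypothetical counts: four large regions and one region with only 4 images.
counts = {
    "region_a": (450, 800),
    "region_b": (500, 900),
    "region_c": (400, 750),
    "region_d": (600, 1000),
    "tiny_region": (0, 4),
}

before = worst_region_accuracy(counts)   # all 4 images in the tiny region are wrong

counts["tiny_region"] = (1, 4)           # flip the answer on a single image
after = worst_region_accuracy(counts)

NUM_TASKS = 38                           # divisor used for the overall average
shift = (after - before) / NUM_TASKS

print(f"FMoW metric: {before:.2f} -> {after:.2f}")
print(f"Shift in the average: {shift:.4f} ({100 * shift:.2f} points)")
```

In this sketch the large regions never matter: the minimum is decided entirely by the 4-image region.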

Because the dataset itself can't be improved, I suggest simply excluding this region when computing the metric.
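One possible way to do this, sketched under assumptions: skip any region with fewer than some minimum number of samples before taking the minimum. The function name `robust_worst_region_accuracy`, the `min_samples` threshold of 100, and the counts are all hypothetical and not part of the DataComp code.

```python
from typing import Dict, Tuple

def robust_worst_region_accuracy(
    counts: Dict[str, Tuple[int, int]], min_samples: int = 100
) -> float:
    """Worst-region accuracy over regions with at least min_samples images.

    counts maps region name -> (num_correct, num_total).
    min_samples is an assumed threshold, chosen here only for illustration.
    """
    eligible = [(c, n) for c, n in counts.values() if n >= min_samples]
    if not eligible:
        raise ValueError("no region has enough samples")
    return min(c / n for c, n in eligible)

# Hypothetical counts: the tiny region has 4 images, the others 750+.
large_regions = {
    "region_a": (450, 800),
    "region_b": (500, 900),
    "region_c": (400, 750),
    "region_d": (600, 1000),
}
all_wrong = {**large_regions, "tiny_region": (0, 4)}
one_right = {**large_regions, "tiny_region": (1, 4)}

# The metric no longer depends on the 4-image region.
score_all_wrong = robust_worst_region_accuracy(all_wrong)
score_one_right = robust_worst_region_accuracy(one_right)
print(score_all_wrong, score_one_right)
```

With the tiny region excluded, a single image in it can no longer move the metric; the result is set by the weakest of the large regions.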

FMoW dataset and results variance #61
