-
Notifications
You must be signed in to change notification settings - Fork 64
Closed
Description
Hi, I'm using datacomp evaluation and it seems that FMoW dataset dramatically increases variance. The main metric is 'worst-region accuracy'. There are 5 regions, 4 of them have more than 700 samples. But 1 have only 4 images. It means that it's possible when the answer in 1 image can change the FMoW metric from 0 to 0.25. The average will be changed to 0.25/38≈0.0066 accordingly. For instance, average accuracy 70.0 and average accuracy 69.4 may differ by the answer in one picture!
Because it's impossible to improve the dataset, I suggest just to remove this region from predictions
Metadata
Metadata
Assignees
Labels
No labels