-
Notifications
You must be signed in to change notification settings - Fork 299
Add ResNet training code #436
Conversation
|
@Hakuyume |
1b0a075 to
7abbaf1
Compare
7abbaf1 to
fbcbe56
Compare
|
Both |
| 'url': 'https://chainercv-models.preferred.jp/' | ||
| 'resnet152_imagenet_trained_2018_11_26.npz' | ||
| }, | ||
| }, |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Don't we need 'cv2': True?
|
I am checking how slow the training gets without options like RESULT |
|
Thanks for reviewing. |
examples/classification/README.md
Outdated
|
|
||
| The training procedure carefully follows the "ResNet in 1 hour" paper [5]. | ||
|
|
||
| ##### Performance tip |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Why did you use 5th level? (4th level #### is not enough?)
|
|
||
| def __call__(self, in_data): | ||
| img, label = in_data | ||
| _, H, W = img.shape |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
H and W are not used?
examples/classification/README.md
Outdated
|
|
||
| ### Weight conversion | ||
|
|
||
| Single crop error rate. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
For the consistency with Trained model, it would be nice to add something like ~ of the models converted from Caffe model.
| # Classification | ||
|
|
||
| ## Performance | ||
| ## ImageNet |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
he and fb should be distinguished from each other in score board.
Hakuyume
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
Merge after #652Merge after #432, #435, #321Aims at reproducing scheme used in https://github.com/facebook/fb.resnet.torch.Edit:
The training code reproduces the scheme introduced below.
This performs better than
fb.resnet.torchand scales to the situation with many GPUs.https://arxiv.org/pdf/1706.02677.pdf
Edit 11/16:
Although "ResNet in 1 hour" paper is new, the training scheme is adapted by some researchers (e.g. mixup).