Order of computations in ResNet blocks

https://github.com/bkj/basenet/blob/c61f558c7bd6341dcc1610135553c96bd7d2ca5b/examples/cifar/cifar10.py#L118-L122

What is the motivation behind computing the batch norm and relu before sending the data into the convolutional layer? 

In the implementation done by https://github.com/kuangliu/pytorch-cifar, the computation is done in the following order which seems more conventional, so I am curious why it is changed!

out = F.relu(self.bn1(self.conv1(x)))
out = self.bn2(self.conv2(out))
shortcut = self.shortcut(x) if hasattr(self, 'shortcut') else x 
out += shortcut
out = F.relu(out)

	out = F.relu(self.bn1(x))
	shortcut = self.shortcut(out) if hasattr(self, 'shortcut') else x
	out = self.conv1(out)
	out = self.conv2(F.relu(self.bn2(out)))
	return out + shortcut

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Order of computations in ResNet blocks #2

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Order of computations in ResNet blocks #2

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions