BatchNorm after ReLU

Hi,

I am performing somehow similar benchmark, but on caffenet128 (and moving to ResNets now) on ImageNet. 
One thing, that I have found - the best position of BN in non-res net is after ReLU and without scale+bias layer (https://github.com/ducha-aiki/caffenet-benchmark/blob/master/batchnorm.md):

| Name | Accuracy | LogLoss | Comments |
| --- | --: | --: | :-- |
| Before | 0.474 | 2.35 | As in paper |
| Before + scale&bias layer | 0.478 | 2.33 | As in paper |
| After | **0.499** | **2.21** |  |
| After + scale&bias layer | 0.493 | 2.24 |  |

May be, it is worth testing too.

Second, results on CIFAR-10 often contradicts results on ImageNet. I.e., leaky ReLU > ReLU on CIFAR, but worse on ImageNet. 

P.S. We could cooperate in ImageNet testing, if you agree.


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

BatchNorm after ReLU #5

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Name	Accuracy	LogLoss	Comments
Before	0.474	2.35	As in paper
Before + scale&bias layer	0.478	2.33	As in paper
After	0.499	2.21
After + scale&bias layer	0.493	2.24

BatchNorm after ReLU #5

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions