Processing math: 100%

 

 

 

Parameters to train, common settings

With parameter sharing, the convolution involves thus for each filter F×F×D1 weights plus one bias parameter.

In total we have

(F×F×D1)×K+Kbiases,

parameters to train by back propagation.

It is common to let K come in powers of 2, that is 32, 64, 128 etc.

  1. F=3S=1P=1
  2. F=5S=1P=2
  3. F=5S=2P=open
  4. F=1S=1P=0