Processing math: 100%

 

 

 

Derivatives of the hidden layer

Using the chain rule we have the following expressions for say one of the weight parameters (it is easy to generalize to the other weight parameters)

Cw(1)00=Ca(2)a(2)z(2)z(2)z(1)0z(1)0w(1)00=δ(2)z(2)z(1)0z(1)0w(1)00,

which, noting that

z(2)=w(2)0a(1)0+w(2)1a(1)1+b(2),

allows us to rewrite

z(2)z(1)0z(1)0w(1)00=w(2)0a(1)0z(1)0a(1)0.