Processing math: 100%

 

 

 

Analyzing the last results

This is an important expression. The second term on the right handside measures how fast the cost function is changing as a function of the $j$th output activation. If, for example, the cost function doesn't depend much on a particular output node j, then δLj will be small, which is what we would expect. The first term on the right, measures how fast the activation function f is changing at a given activation value zLj.