With parameter sharing, the convolution involves thus for each filter F×F×D1 weights plus one bias parameter.
In total we have
(F×F×D1)×K+Kbiases,parameters to train by back propagation.
It is common to let K come in powers of 2, that is 32, 64, 128 etc.