More LSTM details

The first stage is called the forget gate. Here we combine the input at time $t$ with the hidden state from the previous time step $t-1$ and pass the result through the sigmoid activation function $\sigma$. The output, whose elements lie between 0 and 1, is then applied to the previous cell state through an element-wise multiplication, denoted by $\odot$.

It follows

$$
\mathbf{f}^{(t)} = \sigma\left(W_f\mathbf{x}^{(t)} + U_f\mathbf{h}^{(t-1)} + \mathbf{b}_f\right)
$$

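where $W_f$ and $U_f$ are the weight matrices acting on the input and on the previous hidden state, respectively, and $\mathbf{b}_f$ is the bias.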
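The forget gate above can be sketched in a few lines of NumPy. This is a minimal illustration rather than a full LSTM cell: the function names `sigmoid` and `forget_gate`, the dimensions, the random weights, and the name `c_prev` for the previous cell state are all assumptions made for this example.

```python
import numpy as np

def sigmoid(z):
    # Logistic function, maps every element into the open interval (0, 1)
    return 1.0 / (1.0 + np.exp(-z))

def forget_gate(x_t, h_prev, W_f, U_f, b_f):
    # f^(t) = sigma(W_f x^(t) + U_f h^(t-1) + b_f)
    return sigmoid(W_f @ x_t + U_f @ h_prev + b_f)

# Illustrative sizes and random parameters (assumptions, not from the text)
rng = np.random.default_rng(0)
n_in, n_hidden = 3, 4
x_t = rng.normal(size=n_in)            # input at time t
h_prev = rng.normal(size=n_hidden)     # hidden state at time t-1
c_prev = rng.normal(size=n_hidden)     # previous cell state (assumed name)
W_f = rng.normal(size=(n_hidden, n_in))
U_f = rng.normal(size=(n_hidden, n_hidden))
b_f = np.zeros(n_hidden)

f_t = forget_gate(x_t, h_prev, W_f, U_f, b_f)

# Element-wise multiplication: each component of the previous cell state
# is scaled by the matching gate value
kept = f_t * c_prev
```

Because every element of `f_t` lies strictly between 0 and 1, gate values close to 1 keep the corresponding component of the previous cell state, while values close to 0 suppress it.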