Target Pattern
0
0
0
0
Test Input Sequence
0
0
0
0
LSTM Cell (8 units)
xt (current bit)  +  ht−1 (prev hidden)
linear projections
Gates
⟠ computed in parallel
Forget ft
σ(Wf·[h,x]+b)
Input it
σ(Wi·[h,x]+b)
Candidate gt
tanh(Wg·[h,x]+b)
Output ot
σ(Wo·[h,x]+b)
ft ⊙ ct−1 + it ⊙ gt
Cell State ct
ot ⊙ tanh(ct)
Hidden State ht
Dense(1) → σ
Output
0