introduction to `RNN`

学习地址：
https://www.youtube.com/watch?v=lWkFhVq9-nc

Feed Forward Neural Network, `FFNN`

In a Feed-Forward Network, information flow only in forward direction.

1
2
3
4
5
graph LR;

I{input}; IL(Input Layer); HL(Hidden Layer); OL(Output Layer); O{Predicted output}

I-->IL; IL-->HL; HL-->OL; OL-->O;

Question:

1
2
3
4
5
graph LR;

I{input}; IL(Input Layer); HL(Hidden Layer); OL(Output Layer); O{Predicted output}

I-->IL; IL-->HL; HL-->OL; OL-->O; HL-->HL;

How does RNN works?

Types of RNN：

Solution to Gradient Problem:

LSTMs are special kind of RNN, capable of learning long-term dependencies.

Three steps of LSTMs:

It looks at the previous state ($$h_{t-1}$$) and the current input $$x_t$$ and computes the function.

There are 2 parts:

One is sigmoid function, it decides which values to let through (0 or 1).
$$i_t = \sigma(W_i [h_{t-1}, x_t] + b_i)$$
The other is tanh function, which gives the weightage to the value which are passed deciding their level of importance (-1 to 1).
$$\overset{\sim}{C_t} = tanh(W_C [h_{t-1}, x_t] + b_c)$$

$$o_t = \sigma(W_o [h_{t-1}, x_t] + b_0)$$, $$o_t$$ is called output gate.

$$h_t = o_t * tanh(C_t)$$