Majid
1 min readJun 28, 2019

--

There is a part I don’t get. At state 3 of your derivations, while computing the derivation of S_3 with respect to W_x, why would you omit the term involving S_2? Because S_3 depends on S_2 which in turn depends on W_x

--

--

No responses yet