Hi,
Thanks for the huge effort on the project. However, there is a minor mistake in the LSTM node.
The current LSTM implementation seems to handle the backward pass incorrectly. After generated the code for the reverse direction, it should iterate over the time sequence dimension ("s" variable in the implementation), not the input features ("di"). This leads to wrong outputs for bidirectional or reverse LSTMs.