first version of training loop for dyna#5
first version of training loop for dyna#5Misterhoonster wants to merge 32 commits intoKempnerInstitute:mainfrom
Conversation
examples/crafter/dynax.py
Outdated
|
|
||
| # --- Network Definition --- | ||
|
|
||
| class RecurrentQNetwork(nn.Module): |
There was a problem hiding this comment.
Have a single Network with
- setup
- initialize_carry
- call for when its used with read data
- apply_model
There was a problem hiding this comment.
call: takes in observation, applies rnn, returns state + predictions(q_vals)
apply_model: takes in state and action, applies model (for us environment, for somebody else a neural network), returns next_state + predictions(q_vals)
There was a problem hiding this comment.
call(self, observation) --> [AgentState, Predictions]
apply_model(self, state, action) --> [AgentState, Predictions]
There was a problem hiding this comment.
Think of this as the Agent
examples/crafter/dynax.py
Outdated
|
|
||
| # --- Network Definition --- | ||
|
|
||
| class RecurrentQNetwork(nn.Module): |
There was a problem hiding this comment.
call: takes in observation, applies rnn, returns state + predictions(q_vals)
apply_model: takes in state and action, applies model (for us environment, for somebody else a neural network), returns next_state + predictions(q_vals)
examples/crafter/dynax.py
Outdated
|
|
||
| # --- Network Definition --- | ||
|
|
||
| class RecurrentQNetwork(nn.Module): |
There was a problem hiding this comment.
call(self, observation) --> [AgentState, Predictions]
apply_model(self, state, action) --> [AgentState, Predictions]
examples/crafter/dynax.py
Outdated
|
|
||
| # --- Network Definition --- | ||
|
|
||
| class RecurrentQNetwork(nn.Module): |
There was a problem hiding this comment.
Think of this as the Agent
|
Hey Wilka, just updated my dyna file with your suggested changes! |
|
Pushed the new version with the Loss class! Can you take a look, please? |
Could you take a look at my starter training loop logic?