Releases: whitemech/temprl
Release v0.3.0
- Simplify the APIs of `TemporalWrapper`: remove the `feature_extractor` and `combine` parameters, as well as reward shaping support. In the OpenAI Gym "philosophy", these functionalities should be delegated to other Gym wrappers, e.g. an `ObservationWrapper` for combining the features and the automata states.
- Remove the `flloat` dependency. Since `TemporalGoal` now only requires a `pythomata.DFA` object, it is up to the user to decide how to generate the reward automaton.
- Update dependencies to their latest versions, e.g. `pythomata`.
- The `reset()` method of the temporal wrapper now first resets the temporal goals, and then makes a step on each of them according to the fluents extracted from the environment's initial state. Otherwise, the initial state of the wrapped environment would be ignored.
- Remove support for terminating conditions from the temporal goals. Again, the only job of the DFAs is to provide rewards according to the history of the episode; any other customization of the underlying environment, or of the wrapper, must be done with other wrappers.
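The `reset()` change and the removal of terminating conditions can be illustrated with a small, dependency-free sketch. It does not use the real temprl, Gym, or pythomata APIs: `TinyDFA`, `TinyTemporalGoal`, `TinyTemporalWrapper`, `CorridorEnv`, and `extract_fluents` are hypothetical stand-ins, and the classic Gym 4-tuple `step()` return is assumed. The goal is "see fluent `a`, then fluent `b`"; because `a` holds in the initial state, the goal can only be satisfied if `reset()` also steps the automaton.

```python
from typing import Callable, Dict, FrozenSet, List, Set, Tuple

Fluents = FrozenSet[str]


class TinyDFA:
    """Hypothetical stand-in for a reward automaton (not pythomata.DFA)."""

    def __init__(self, initial: int, accepting: Set[int],
                 transitions: Dict[Tuple[int, Fluents], int]) -> None:
        self.initial = initial
        self.accepting = accepting
        self.transitions = transitions

    def step(self, state: int, fluents: Fluents) -> int:
        # Pairs without an explicit transition leave the state unchanged.
        return self.transitions.get((state, fluents), state)


class TinyTemporalGoal:
    """Hypothetical goal: yields `reward` while the DFA is in an accepting state."""

    def __init__(self, dfa: TinyDFA, reward: float) -> None:
        self.dfa, self.reward, self.state = dfa, reward, dfa.initial

    def reset(self) -> None:
        self.state = self.dfa.initial

    def step(self, fluents: Fluents) -> float:
        self.state = self.dfa.step(self.state, fluents)
        return self.reward if self.state in self.dfa.accepting else 0.0


class CorridorEnv:
    """Toy environment with positions 0..2; actions are -1 or +1."""

    def __init__(self, start: int = 0) -> None:
        self.start = self.pos = start

    def reset(self) -> int:
        self.pos = self.start
        return self.pos

    def step(self, action: int) -> Tuple[int, float, bool, dict]:
        self.pos = max(0, min(2, self.pos + action))
        return self.pos, 0.0, False, {}


class TinyTemporalWrapper:
    """Hypothetical wrapper mirroring the behaviour described in the notes."""

    def __init__(self, env: CorridorEnv, goals: List[TinyTemporalGoal],
                 extract_fluents: Callable[[int], Fluents]) -> None:
        self.env, self.goals, self.extract_fluents = env, goals, extract_fluents

    def reset(self) -> int:
        obs = self.env.reset()
        fluents = self.extract_fluents(obs)
        for goal in self.goals:
            goal.reset()        # 1) back to the DFA's initial state
            goal.step(fluents)  # 2) account for the initial observation
        return obs

    def step(self, action: int) -> Tuple[int, float, bool, dict]:
        obs, reward, done, info = self.env.step(action)
        fluents = self.extract_fluents(obs)
        reward += sum(goal.step(fluents) for goal in self.goals)
        # `done` comes from the wrapped environment only; goals never terminate.
        return obs, reward, done, info


def extract_fluents(obs: int) -> Fluents:
    # Position 0 makes "a" true, position 2 makes "b" true.
    return frozenset({0: {"a"}, 2: {"b"}}.get(obs, set()))


A, B = frozenset({"a"}), frozenset({"b"})
dfa = TinyDFA(initial=0, accepting={2}, transitions={(0, A): 1, (1, B): 2})

env = TinyTemporalWrapper(CorridorEnv(start=0), [TinyTemporalGoal(dfa, 1.0)],
                          extract_fluents)
env.reset()
print(env.goals[0].state)  # 1: the initial observation already satisfied "a"
print(env.step(+1))        # (1, 0.0, False, {})
print(env.step(+1))        # (2, 1.0, False, {})
```

Without the extra step in `reset()`, the automaton would still be in state 0 when the agent reaches position 2, and no reward would ever be produced in this toy setup. Ending the episode once the goal is reached would, in the spirit of the notes, be the job of a separate wrapper.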
Release 0.1.2.post1
Merge branch 'release-0.1.2' for postfix 1