Releases · whitemech/temprl · GitHub

20 Jun 15:54

marfvr

Release v0.3.0 Latest

Latest

Simplify APIs of TemporalWrapper: remove feature_extractor and combine parameters, as well as reward shaping support. The reason is that these functionalities, in the OpenAI Gym "philosophy", should be delegated to other Gym wrappers, e.g. ObservationWrapper for combining the features and the automata states.
Remove flloat dependency. Since TemporalGoal now only requires a pythomata.DFA object, it is up to the user to decide how to generate the reward automaton.
Update dependencies to their latest version, e.g. pythomata.
The reset() method of the temporal wrapper now first resets the temporal goals, and then makes a step on each of them according to the fluents extracted from the environment's initial state. This is needed because otherwise the initial state of the wrapped environment is ignored.
The support for terminating conditions from the temporal goals is removed. Again, this is because the only job of the DFAs is to provide rewards according to the history of the episode; any other customization of the underlying envrionment, or the wrapper, must be done by using other wrappers.

Assets 2

24 Mar 20:08

marfvr

Release 0.1.2.post1 Pre-release

Pre-release

Merge branch 'release-0.1.2' for postfix 1

Assets 2