Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Functionality for Stein variational policy gradient and/or regularization through the use of prior policies #44

Open
leonhalgryn opened this issue Jan 12, 2024 · 2 comments
Assignees
Labels
enhancement New feature or request

Comments

@leonhalgryn
Copy link

Are there any plans to add functionality to allow using prior policies for regularization similar to that of the Stein variational policy gradient (SVPG) (SVPG paper available at: https://arxiv.org/abs/1704.02399)

@GreatArcStudios
Copy link

Yeah this would be pretty great if added. Looks like as of now they'd have to add it on a per algorithm basis as it is a modification to the loss function. Perhaps they could abstract away the loss into a module.

@rodrigodesalvobraz
Copy link
Contributor

Thank you. This would be good but we are currently working on higher-priority items. We will leave the issue open and update it when we get to it. Thank you.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

4 participants