Skip to content

Conversation

@dependabot
Copy link

@dependabot dependabot bot commented on behalf of github Aug 12, 2025

Bumps stable-baselines3 from 2.0.0 to 2.7.0.

Release notes

Sourced from stable-baselines3's releases.

v2.7.0: n-step returns for all off-policy algorithms via the n_steps argument

SB3 Contrib (more algorithms): https://github.com/Stable-Baselines-Team/stable-baselines3-contrib RL Zoo3 (training framework): https://github.com/DLR-RM/rl-baselines3-zoo Stable-Baselines Jax (SBX): https://github.com/araffin/sbx

To upgrade:

pip install stable_baselines3 sb3_contrib rl_zoo3 --upgrade

New Features:

  • Added support for n-step returns for off-policy algorithms via the n_steps parameter
from stable_baselines3 import SAC
SAC with n-step returns
model = SAC("MlpPolicy", "Pendulum-v1", n_steps=3, verbose=1)
model.learn(10_000)

  • Added NStepReplayBuffer that allows to compute n-step returns without additional memory requirement (and without for loops)
  • Added Gymnasium v1.2 support

Bug Fixes:

  • Fixed docker GPU image (PyTorch GPU was not installed)
  • Fixed segmentation faults caused by non-portable schedules during model loading (@​akanto)

SB3-Contrib

  • Added support for n-step returns for off-policy algorithms via the n_steps parameter
  • Use the FloatSchedule and LinearSchedule classes instead of lambdas in the ARS, PPO, and QRDQN implementations to improve model portability across different operating systems

RL Zoo

  • linear_schedule now returns a SimpleLinearSchedule object for better portability
  • Renamed LunarLander-v2 to LunarLander-v3 in hyperparameters
  • Renamed CarRacing-v2 to CarRacing-v3 in hyperparameters
  • Docker GPU images are now working again
  • Use ConstantSchedule, and SimpleLinearSchedule instead of constant_fn and linear_schedule
  • Fixed CarRacing-v3 hyperparameters for newer Gymnasium version

SBX (SB3 + Jax)

  • Added support for n-step returns for off-policy algorithms via the n_steps parameter
  • Added KL Adaptive LR for PPO and LR schedule for SAC/TQC

Deprecations:

... (truncated)

Commits
  • bf51a62 Release 2.7.0 (#2161)
  • 2dce430 Added sb3-extra-buffers to the project page (#2157)
  • 30ceaf3 Update to Gymnasium 1.2.0 (#2150)
  • 9e5ccc1 Minor Documentation and Comment Improvements (#2147)
  • e206fc5 Add NStepReplayBuffer and n_steps arguments for off-policy algorithms (#2...
  • 6af0601 Update LunarLander and LunarLanderContinuous Environments from v2 to v3 in th...
  • ef03d33 Fix typos and grammar mistakes (#2140)
  • f9c4ca5 Use classes instead of lambdas for schedules (#2125)
  • 19df267 Update doc about training exceeding total timesteps (#2133)
  • c1e503c Fix docker GPU build (#2120)
  • Additional commits viewable in compare view

Dependabot compatibility score

You can trigger a rebase of this PR by commenting @dependabot rebase.


Dependabot commands and options

You can trigger Dependabot actions by commenting on this PR:

  • @dependabot rebase will rebase this PR
  • @dependabot recreate will recreate this PR, overwriting any edits that have been made to it
  • @dependabot merge will merge this PR after your CI passes on it
  • @dependabot squash and merge will squash and merge this PR after your CI passes on it
  • @dependabot cancel merge will cancel a previously requested merge and block automerging
  • @dependabot reopen will reopen this PR if it is closed
  • @dependabot close will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually
  • @dependabot show <dependency name> ignore conditions will show all of the ignore conditions of the specified dependency
  • @dependabot ignore this major version will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)
  • @dependabot ignore this minor version will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)
  • @dependabot ignore this dependency will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)

Note
Automatic rebases have been disabled on this pull request as it has been open for over 30 days.

Bumps [stable-baselines3](https://github.com/DLR-RM/stable-baselines3) from 2.0.0 to 2.7.0.
- [Release notes](https://github.com/DLR-RM/stable-baselines3/releases)
- [Commits](DLR-RM/stable-baselines3@v2.0.0...v2.7.0)

---
updated-dependencies:
- dependency-name: stable-baselines3
  dependency-version: 2.7.0
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <[email protected]>
@dependabot dependabot bot added dependencies Pull requests that update a dependency file python Pull requests that update Python code labels Aug 12, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

dependencies Pull requests that update a dependency file python Pull requests that update Python code

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant