dependabot bot commented on behalf of github on Sep 29, 2022

Bumps stable-baselines3 from 1.5.0 to 1.6.1.

Release notes

Sourced from stable-baselines3's releases.

SB3 v1.6.1: Bug fix release

SB3 Contrib: https://github.com/Stable-Baselines-Team/stable-baselines3-contrib

Breaking Changes:

  • Switched minimum tensorboard version to 2.9.1

New Features:

  • Support logging hyperparameters to tensorboard (@​timothe-chaumont)
  • Added checkpoints for replay buffer and VecNormalize statistics (@​anand-bala)
  • Added option for Monitor to append to existing file instead of overriding (@​sidney-tio)
  • The env checker now raises an error when using dict observation spaces and observation keys don't match observation space keys
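
The env checker item above can be pictured with a minimal sketch of a key-consistency check for dict observations. It does not use SB3 or Gym: the function name `check_dict_obs_keys`, the plain list standing in for the observation space keys, and the choice of `ValueError` are assumptions for illustration, not the library's implementation.

```python
from typing import Any, Dict, Iterable


def check_dict_obs_keys(obs: Dict[str, Any], space_keys: Iterable[str]) -> None:
    """Raise if the observation keys differ from the declared space keys.

    Hypothetical helper: `space_keys` stands in for the keys of a dict
    observation space; the exception type is an assumption.
    """
    expected = set(space_keys)
    actual = set(obs)
    if actual != expected:
        # Report both directions of the mismatch to make debugging easier.
        missing = sorted(expected - actual)
        extra = sorted(actual - expected)
        raise ValueError(
            f"Observation keys do not match space keys: "
            f"missing={missing}, extra={extra}"
        )
```

Calling it with matching keys returns silently; an observation that lacks a declared key, or carries an undeclared one, fails fast instead of surfacing later during training.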

SB3-Contrib

  • Fixed the issue of wrongly passing policy arguments when using CnnLstmPolicy or MultiInputLstmPolicy with RecurrentPPO (@​mlodel)

Bug Fixes:

  • Fixed issue where PPO gives NaN if rollout buffer provides a batch of size 1 (@​hughperkins)
  • Fixed the issue that predict does not always return action as np.ndarray (@​qgallouedec)
  • Fixed division by zero error when computing FPS when only a small amount of time has elapsed on operating systems with low-precision timers.
  • Added multidimensional action space support (@​qgallouedec)
  • Fixed missing verbose parameter passing in the EvalCallback constructor (@​burakdmb)
  • Fixed the issue that when updating the target network in DQN, SAC, TD3, the running_mean and running_var properties of batch norm layers are not updated (@​honglu2875)
  • Fixed incorrect type annotation of the replay_buffer_class argument in common.OffPolicyAlgorithm initializer, where an instance instead of a class was required (@​Rocamonde)
  • Fixed loading a saved model with a different number of environments
  • Removed forward() abstract method declaration from common.policies.BaseModel (already defined in torch.nn.Module) to fix type errors in subclasses (@​Rocamonde)
  • Fixed the return type of .load() and .learn() methods in BaseAlgorithm so that they now use TypeVar (@​Rocamonde)
  • Fixed an issue where keys with different tags but the same key raised an error in common.logger.HumanOutputFormat (@​Rocamonde and @​AdamGleave)
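
As an illustration of the FPS division-by-zero item in the list above, here is a minimal sketch of one way to guard the computation. The helper name `compute_fps` is hypothetical, and clamping to machine epsilon is shown as one plausible approach, not necessarily the exact change made in SB3.

```python
import sys


def compute_fps(num_timesteps: int, elapsed_seconds: float) -> int:
    """Return frames per second, guarding against a zero elapsed time."""
    # On a low-precision timer two consecutive readings can be equal,
    # so clamp the elapsed time to machine epsilon before dividing.
    safe_elapsed = max(elapsed_seconds, sys.float_info.epsilon)
    return int(num_timesteps / safe_elapsed)
```

With a normal elapsed time the clamp has no effect; with an elapsed time of zero the function returns a very large number rather than raising `ZeroDivisionError`.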

Others:

  • Fixed DictReplayBuffer.next_observations typing (@​qgallouedec)
  • Added support for device="auto" in buffers and made it default (@​qgallouedec)
  • Updated ResultsWriter (used internally by the Monitor wrapper) to automatically create missing directories when filename is a path (@​dominicgkerr)
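
The directory-creation behaviour in the last item can be sketched with the standard library alone. The helper `open_results_file` below is hypothetical and only mirrors the idea of creating missing parent directories before opening the file; it is not the ResultsWriter code.

```python
import os


def open_results_file(filename: str):
    """Open `filename` for writing, creating missing parent directories first."""
    parent = os.path.dirname(filename)
    if parent:  # a bare file name has no directory component to create
        os.makedirs(parent, exist_ok=True)
    return open(filename, "w")
```

Passing `exist_ok=True` keeps the call safe when the directory already exists, so the same code path works for first and subsequent runs.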

Documentation:

... (truncated)

Commits
  • 21300c9 Release v1.6.1 (#1080)
  • def0574 Fixed typos (#1076)
  • e22e372 Fix duplicate key error in HumanOutputFormat (#1079)
  • 432b3f8 Fix return type for load, learn in BaseAlgorithm (#1043)
  • 899eee6 Automatically create missing directories of filenames passed to ``Results...
  • b745639 Transfer ABC inheritance from BaseModel to BasePolicy (#1062)
  • d0b129e Updated custom policy docs (#1067)
  • 440735c Fix loading a model with different number of environments (#1058)
  • 18b29a6 Remove forward() method from common.policies.BaseModel (#1061)
  • 98e786f Clarify and standardize verbosity documentation (#1056)
  • Additional commits viewable in compare view

Dependabot compatibility score

Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.


Dependabot commands and options

You can trigger Dependabot actions by commenting on this PR:

  • @dependabot rebase will rebase this PR
  • @dependabot recreate will recreate this PR, overwriting any edits that have been made to it
  • @dependabot merge will merge this PR after your CI passes on it
  • @dependabot squash and merge will squash and merge this PR after your CI passes on it
  • @dependabot cancel merge will cancel a previously requested merge and block automerging
  • @dependabot reopen will reopen this PR if it is closed
  • @dependabot close will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually
  • @dependabot ignore this major version will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)
  • @dependabot ignore this minor version will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)
  • @dependabot ignore this dependency will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)

Bumps [stable-baselines3](https://github.com/DLR-RM/stable-baselines3) from 1.5.0 to 1.6.1.
- [Release notes](https://github.com/DLR-RM/stable-baselines3/releases)
- [Commits](DLR-RM/stable-baselines3@v1.5.0...v1.6.1)

---
updated-dependencies:
- dependency-name: stable-baselines3
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>
dependabot bot added the dependencies label (Pull requests that update a dependency file) on Sep 29, 2022
dependabot bot commented on behalf of github Oct 10, 2022

Superseded by #64.

dependabot bot closed this on Oct 10, 2022
dependabot bot deleted the dependabot/pip/stable-baselines3-1.6.1 branch on October 10, 2022 20:03