Skip to content

Conversation

@machine-moon
Copy link
Contributor

Pull Request Overview

Summary

  • Introduced 'multi_device' option in configuration for better device management.
  • Refactored training loop in Trainer class for improved logging and clarity.
  • Added vectorized n-step return functions in jax_utils for efficiency.
  • Implemented randint method in KeyManager for random integer generation.

Change Type

  • Bug fix
  • New feature
  • Breaking change
  • Documentation
  • Refactor
  • Performance
  • Other:

Related Issues

What Was Changed

Testing

  • [X ] Added/updated tests
  • [ X] All tests pass locally
  • [ X] Linter and type checks pass

Checklist

  • [ X] Follows style guidelines
  • [ X] Self-reviewed
  • [ X] Code is commented where needed
  • Documentation updated
  • [X ] No new warnings

Screenshots / Notes

… management.

- Refactored training loop in Trainer class for improved logging and clarity.
- Added vectorized n-step return functions in jax_utils for efficiency.
- Implemented randint method in KeyManager for random integer generation.
@machine-moon
Copy link
Contributor Author

old pr, v0.1.95 rules

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants