Skip to content

Conversation

@rsyue
Copy link
Contributor

@rsyue rsyue commented Dec 10, 2025

What does this PR do? Please describe:
When more than 1 GPU present, model sharding can be tested with torchrun --nproc-per-node 8 test_hg_factory.py

Does your PR introduce any breaking changes? If yes, please list them:
None aware of

Check list:

  • Was the content of this PR discussed and approved via a GitHub issue? (no need for typos or documentation improvements)
  • Did you read the contributor guideline?
  • Did you make sure that your PR does only one thing instead of bundling different changes together?
  • Did you make sure to update the documentation with your changes? (if necessary)
  • Did you write any new necessary tests?
  • Did you verify new and existing tests pass locally with your changes?
  • Did you update the CHANGELOG? (no need for typos, documentation, or minor internal changes)

zyaoj and others added 24 commits September 20, 2025 11:48
…del) and changed allowed patterns to allow for the json index file
@meta-cla meta-cla bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Dec 10, 2025
@rsyue
Copy link
Contributor Author

rsyue commented Dec 11, 2025

This is a PR to merge both the HuggingFace branch as well as a new hardware-specific test for sharding with multiple GPUs

@rsyue rsyue marked this pull request as draft December 11, 2025 17:25
@rsyue rsyue marked this pull request as ready for review December 11, 2025 19:34
@rsyue
Copy link
Contributor Author

rsyue commented Dec 11, 2025

Module renamed to hg_qwen_omni and references updated

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants