-
Notifications
You must be signed in to change notification settings - Fork 332
Support PolyLingua on Gaudi Platform #2367
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
Dependency Review✅ No vulnerabilities or license issues found.Scanned FilesNone |
for more information, see https://pre-commit.ci
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Pull request overview
This PR adds support for running PolyLingua on Intel Gaudi HPU platform by introducing Gaudi-specific configurations and test infrastructure.
Key Changes
- Added comprehensive end-to-end test script for Gaudi platform deployment
- Created Docker Compose configuration for Gaudi/HPU deployment using vLLM with Habana runtime
- Added NUM_CARDS environment variable to support multi-card Gaudi configurations
Reviewed changes
Copilot reviewed 5 out of 5 changed files in this pull request and generated 7 comments.
Show a summary per file
| File | Description |
|---|---|
| PolyLingua/tests/test_compose_on_gaudi.sh | New end-to-end test script for validating PolyLingua on Gaudi platform |
| PolyLingua/docker_compose/intel/hpu/gaudi/compose.yaml | Docker Compose configuration for Gaudi platform with vLLM service using Habana runtime |
| PolyLingua/set_env.sh | Added NUM_CARDS environment variable for Gaudi multi-card support |
| PolyLingua/.env.example | Documented NUM_CARDS configuration option |
| PolyLingua/README.md | Added Qwen2.5-7B-Instruct as Gaudi default model |
💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.
Signed-off-by: Yao, Qing <qing.yao@intel.com>
Signed-off-by: Yao, Qing <qing.yao@intel.com>
|
|
||
| services: | ||
| vllm-service: | ||
| image: opea/vllm-gaudi:1.4 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Please align the vllm-gaudi version with the other Gaudi examples.
Signed-off-by: Yao, Qing <qing.yao@intel.com>
Signed-off-by: Yao, Qing <qing.yao@intel.com>
Signed-off-by: Yao, Qing <qing.yao@intel.com>
Signed-off-by: Yao, Qing <qing.yao@intel.com>
Signed-off-by: Yao, Qing <qing.yao@intel.com>
Description
Support PolyLingua on Gaudi Platform
Issues
n/a.Type of change
List the type of change like below. Please delete options that are not relevant.
Dependencies
List the newly introduced 3rd party dependency if exists.
Tests
Describe the tests that you ran to verify your changes.