fix: read expert_wise_scale per-model instead of from global wrapper …

417fe1b
Open

feat: add expert_wise_scale support for per-expert FP8 quantization in MoE models #35

Workflow runs completed with no jobs