Enable qwen3 vl moe quant and load #1182
base: main
Conversation
…fp UT Signed-off-by: Zhang, Weiwei1 <weiwei1.zhang@intel.com>
for more information, see https://pre-commit.ci
Pull request overview
This PR enables quantization and loading support for the Qwen3-VL-MoE model by implementing expert-to-linear conversion and adding comprehensive test coverage.
Key Changes:
- Added Qwen3-VL-MoE model handler with expert conversion logic similar to existing MoE models
- Implemented device-aware E2M1 tensor caching to improve performance
- Added test fixtures and test cases for both CPU and CUDA environments
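The device-aware caching mentioned above can be sketched as follows. This is a pure-Python stand-in with illustrative names, not the actual `fp4_utils.py` code; in the real module the cached object would be a torch tensor allocated on the requested device.

```python
# Hypothetical sketch of device-aware caching for the E2M1 lookup table.
# Names and structure are illustrative; the real fp4_utils.py may differ.

# The 16 representable E2M1 (FP4) values: 1 sign bit, 2 exponent bits,
# 1 mantissa bit.
_E2M1_VALUES = (0.0, 0.5, 1.0, 1.5, 2.0, 3.0, 4.0, 6.0,
                -0.0, -0.5, -1.0, -1.5, -2.0, -3.0, -4.0, -6.0)

_lookup_cache: dict = {}

def get_e2m1_lookup(device: str = "cpu"):
    """Return the E2M1 lookup for `device`, building it once per device.

    In the real module this would return a torch.Tensor placed on
    `device`; caching it avoids rebuilding the table (and re-copying it
    to the accelerator) on every dequantization call.
    """
    if device not in _lookup_cache:
        # Stand-in for torch.tensor(_E2M1_VALUES, device=device).
        _lookup_cache[device] = list(_E2M1_VALUES)
    return _lookup_cache[device]
```

Subsequent calls with the same device string return the cached object, so the per-call cost drops to a dictionary lookup.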
Reviewed changes
Copilot reviewed 5 out of 5 changed files in this pull request and generated 5 comments.
Summary per file:
| File | Description |
|---|---|
| auto_round/modelling/qwen3_vl_moe.py | New module implementing LinearQwen3VLMoeTextSparseMoeBlock for expert-to-linear conversion during quantization |
| auto_round/special_model_handler.py | Registered qwen3_vl_moe in supported models list and expert conversion mapping |
| test/test_cuda/test_moe_model.py | Added fixture and test case for Qwen3-VL-MoE MXFP4 quantization on CUDA |
| test/test_cpu/test_moe_model.py | Added fixture and test case for Qwen3-VL-MoE MXFP4 quantization on CPU |
| auto_round/experimental/qmodules/fp4_utils.py | Refactored E2M1 lookup tensor to use device-aware caching mechanism |
| auto_round/data_type/utils.py | Added trailing newline for consistency |
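The expert-to-linear conversion referenced in the table can be sketched roughly like this. All names here are hypothetical stand-ins (no torch dependency); the actual `LinearQwen3VLMoeTextSparseMoeBlock` in `auto_round/modelling/qwen3_vl_moe.py` operates on real `nn.Linear` modules and will differ in detail.

```python
# Hypothetical sketch of expert-to-linear conversion for a MoE block.
# Fused expert weights of shape (num_experts, out, in) are split into
# per-expert 2-D "linear" layers so that quantizers which only handle
# plain linear layers can process each expert independently.

class Linear:
    """Minimal stand-in for nn.Linear holding a 2-D weight."""
    def __init__(self, weight):
        self.weight = weight  # nested lists, shape (out_features, in_features)

def split_experts_to_linears(fused_weight):
    """Split a fused (num_experts, out, in) weight into per-expert Linears."""
    return [Linear(expert_w) for expert_w in fused_weight]

# Example: 2 experts, out_features=2, in_features=3.
fused = [
    [[1.0, 2.0, 3.0], [4.0, 5.0, 6.0]],
    [[7.0, 8.0, 9.0], [10.0, 11.0, 12.0]],
]
linears = split_experts_to_linears(fused)
```

After conversion, each expert looks like an ordinary linear layer, which is what lets the existing quantization path (and the MXFP4 tests above) treat the MoE experts uniformly.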
…auto-round into enable_qwen3_vl_moe_quant
yiliu30 left a comment
Others LGTM
Co-authored-by: Yi Liu <yi4.liu@intel.com>