Skip to content

Comments

Enable native ModelOpt quantization support (3/3)#2

Open
Edwardf0t1 wants to merge 348 commits intozhiyu/modelopt-sglang-api-2from
zhiyu/modelopt-sglang-api-3
Open

Enable native ModelOpt quantization support (3/3)#2
Edwardf0t1 wants to merge 348 commits intozhiyu/modelopt-sglang-api-2from
zhiyu/modelopt-sglang-api-3

Conversation

@Edwardf0t1
Copy link
Owner

@Edwardf0t1 Edwardf0t1 commented Sep 9, 2025

Original PR: sgl-project#10154

This PR only shows the diff between the 2nd and 3rd PR for a three-part series to enable native ModelOpt quantization in SGLang

Motivation

Modifications

Accuracy Tests

Benchmarking and Profiling

Checklist

@Edwardf0t1 Edwardf0t1 force-pushed the zhiyu/modelopt-sglang-api-2 branch from 2674259 to aed7dd2 Compare September 12, 2025 23:32
@Edwardf0t1 Edwardf0t1 force-pushed the zhiyu/modelopt-sglang-api-3 branch 2 times, most recently from 19fcedb to 95fc54b Compare September 13, 2025 01:48
@Edwardf0t1 Edwardf0t1 force-pushed the zhiyu/modelopt-sglang-api-2 branch from f074579 to c13b457 Compare September 18, 2025 06:21
@Edwardf0t1 Edwardf0t1 force-pushed the zhiyu/modelopt-sglang-api-3 branch from 95fc54b to d25e5d1 Compare September 23, 2025 08:18
@Edwardf0t1 Edwardf0t1 force-pushed the zhiyu/modelopt-sglang-api-2 branch from 1c16530 to 54524e2 Compare September 25, 2025 23:19
@Edwardf0t1 Edwardf0t1 force-pushed the zhiyu/modelopt-sglang-api-3 branch from d25e5d1 to a9e4353 Compare September 26, 2025 06:25
@Edwardf0t1 Edwardf0t1 force-pushed the zhiyu/modelopt-sglang-api-2 branch from c118561 to e75fbf3 Compare September 29, 2025 23:56
@Edwardf0t1 Edwardf0t1 force-pushed the zhiyu/modelopt-sglang-api-3 branch from b66d1dc to c5181b3 Compare September 30, 2025 00:34
@Edwardf0t1 Edwardf0t1 force-pushed the zhiyu/modelopt-sglang-api-2 branch from e75fbf3 to 5c1587f Compare September 30, 2025 05:26
@Edwardf0t1 Edwardf0t1 force-pushed the zhiyu/modelopt-sglang-api-3 branch from c5181b3 to 15dd13e Compare September 30, 2025 05:34
@Edwardf0t1 Edwardf0t1 force-pushed the zhiyu/modelopt-sglang-api-2 branch 2 times, most recently from 7134aa5 to fe3ee4e Compare October 7, 2025 08:23
@Edwardf0t1 Edwardf0t1 force-pushed the zhiyu/modelopt-sglang-api-3 branch 2 times, most recently from 9c2eaac to 6c34fd9 Compare October 9, 2025 07:56
@Edwardf0t1 Edwardf0t1 force-pushed the zhiyu/modelopt-sglang-api-2 branch from fcfd22b to ff8cb61 Compare October 9, 2025 08:11
@Edwardf0t1 Edwardf0t1 force-pushed the zhiyu/modelopt-sglang-api-3 branch 2 times, most recently from f1dd65e to 6769000 Compare October 9, 2025 20:11
@Edwardf0t1 Edwardf0t1 force-pushed the zhiyu/modelopt-sglang-api-2 branch from 40fefb3 to 9bc99e7 Compare October 11, 2025 03:13
@Edwardf0t1 Edwardf0t1 force-pushed the zhiyu/modelopt-sglang-api-3 branch 3 times, most recently from 7b27705 to 456a3f9 Compare October 14, 2025 08:24
CatherineSue and others added 30 commits October 20, 2025 18:03
Co-authored-by: Simo Lin <linsimo.mark@gmail.com>
Signed-off-by: Zhiyu Cheng <zhiyuc@nvidia.com>
Co-authored-by: luoyuan.luo <luoyuan.luo@antgroup.com>
Co-authored-by: 羽癫 <yudian.zy@antgroup.com>
Co-authored-by: b8zhong <b8zhong@uwaterloo.ca>
Co-authored-by: guangyey <guangye.yu@intel.com>
Co-authored-by: DiweiSun <105627594+DiweiSun@users.noreply.github.com>
Signed-off-by: Zhiyu Cheng <zhiyuc@nvidia.com>
Signed-off-by: xintin <gaurav.verma@amd.com>
 sgl-project#6572] (sgl-project#11416)

Signed-off-by: ybyang <ybyang7@iflytek.com>
Co-authored-by: YorkSu <york_su@qq.com>
Signed-off-by: zhengkezhou1 <madzhou1@gmail.com>
Co-authored-by: Liangsheng Yin <lsyincs@gmail.com>
Signed-off-by: vincentzed <207368749+vincentzed@users.noreply.github.com>
…e_intermediate_size` / `weight_block_size_n` (sgl-project#11702)

Signed-off-by: Kai-Hsun Chen <khchen@x.ai>
Signed-off-by: Shangming Cai <csmthu@gmail.com>
Signed-off-by: Serge Panev <spanev@nvidia.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.