Mcv binary cache by maryamtahhan · Pull Request #166 · redhat-et/MCU

maryamtahhan · 2026-02-09T12:26:22Z

Enable vllm binary cache support for MCV

Signed-off-by: Maryam Tahhan <mtahhan@redhat.com>

maryamtahhan · 2026-02-10T15:39:51Z

TODO - add torch_inductor dir

maryamtahhan · 2026-02-23T11:14:32Z

No precache

(EngineCore_DP0 pid=22) INFO 02-23 01:35:37 [backends.py:812] Using cache directory: /root/.cache/vllm/torch_compile_cache/8d0a361fbc/rank_0_0/backbone for vLLM's torch.compile
(EngineCore_DP0 pid=22) INFO 02-23 01:35:37 [backends.py:872] Dynamo bytecode transform time: 28.30 s
(EngineCore_DP0 pid=22) [rank0]:W0223 01:35:45.613000 22 torch/_inductor/utils.py:1613] Not enough SMs to use max_autotune_gemm mode
(EngineCore_DP0 pid=22) INFO 02-23 01:35:55 [backends.py:302] Cache the graph of compile range (1, 2048) for later use
(EngineCore_DP0 pid=22) INFO 02-23 01:36:01 [backends.py:319] Compiling a graph for compile range (1, 2048) takes 18.20 s
(EngineCore_DP0 pid=22) INFO 02-23 01:36:01 [monitor.py:34] torch.compile takes 46.50 s in total

with pre-cache:

(EngineCore_DP0 pid=22) INFO 02-23 03:12:47 [backends.py:812] Using cache directory: /root/.cache/vllm/torch_compile_cache/8d0a361fbc/rank_0_0/backbone for vLLM's torch.compile
(EngineCore_DP0 pid=22) INFO 02-23 03:12:47 [backends.py:872] Dynamo bytecode transform time: 7.85 s
(EngineCore_DP0 pid=22) INFO 02-23 03:12:54 [backends.py:267] Directly load the compiled graph(s) for compile range (1, 2048) from the cache, took 1.273 s
(EngineCore_DP0 pid=22) INFO 02-23 03:12:54 [monitor.py:34] torch.compile takes 9.12 s in total

Signed-off-by: Maryam Tahhan <mtahhan@redhat.com>

maryamtahhan added 3 commits February 9, 2026 10:28

mcv: add binary cache create support

5957165

Signed-off-by: Maryam Tahhan <mtahhan@redhat.com>

mcv: build binary cache examples

c27558c

Signed-off-by: Maryam Tahhan <mtahhan@redhat.com>

mcv: binary cache extraction

5c88000

Signed-off-by: Maryam Tahhan <mtahhan@redhat.com>

maryamtahhan force-pushed the mcv-binary-cache branch 2 times, most recently from 0a95604 to c602a2a Compare February 9, 2026 12:35

mcv: update binary cache docs

89b9145

Signed-off-by: Maryam Tahhan <mtahhan@redhat.com>

maryamtahhan force-pushed the mcv-binary-cache branch from c602a2a to 89b9145 Compare February 9, 2026 13:16

maryamtahhan marked this pull request as ready for review February 9, 2026 13:17

maryamtahhan added 4 commits February 9, 2026 14:00

mcv: binary cache fixes

41aa192

Signed-off-by: Maryam Tahhan <mtahhan@redhat.com>

mcv: skip precommit on example caches

fe1faa2

Signed-off-by: Maryam Tahhan <mtahhan@redhat.com>

mcv: fix golang linting issues

33bc359

Signed-off-by: Maryam Tahhan <mtahhan@redhat.com>

mcv: don't sanitize triton paths

0b4cd89

Signed-off-by: Maryam Tahhan <mtahhan@redhat.com>

maryamtahhan force-pushed the mcv-binary-cache branch from 194babb to caf79c1 Compare February 10, 2026 10:03

maryamtahhan requested a review from Billy99 February 10, 2026 10:43

maryamtahhan force-pushed the mcv-binary-cache branch from caf79c1 to 5ab5cd3 Compare February 10, 2026 11:02

maryamtahhan removed the request for review from Billy99 February 10, 2026 16:07

maryamtahhan marked this pull request as draft February 10, 2026 16:07

maryamtahhan marked this pull request as ready for review February 23, 2026 11:13

maryamtahhan requested a review from Billy99 February 23, 2026 11:14

mcv: fix pre-commit issues

5d5d4eb

Signed-off-by: Maryam Tahhan <mtahhan@redhat.com>

maryamtahhan force-pushed the mcv-binary-cache branch from 4329659 to 5d5d4eb Compare February 23, 2026 11:37

Merge branch 'main' into mcv-binary-cache

77efdb4

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Comments

Mcv binary cache#166

Mcv binary cache#166
maryamtahhan wants to merge 10 commits intoredhat-et:mainfrom
maryamtahhan:mcv-binary-cache

maryamtahhan commented Feb 9, 2026 •

edited

Loading

Uh oh!

maryamtahhan commented Feb 10, 2026

Uh oh!

maryamtahhan commented Feb 23, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Comments

Conversation

maryamtahhan commented Feb 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

maryamtahhan commented Feb 10, 2026

Uh oh!

maryamtahhan commented Feb 23, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

maryamtahhan commented Feb 9, 2026 •

edited

Loading