Skip to content

Conversation

@Teeeio
Copy link

@Teeeio Teeeio commented Jan 20, 2026

PR Category

Serve

PR Types

Bug Fixes

PR Description

The --limit-mm-per-prompt argument in vLLM uses json.loads for parsing, which requires a valid JSON string format. The original configuration files used an invalid format (image=X), causing a JSON parse error during service startup.

This PR fixes the issue by updating the format to valid JSON string syntax in 6 serve configuration files:

Before: limit_mm_per_prompt: image=X
After: limit_mm_per_prompt: '{image: X}'
✅ Verified with vLLM v0.13.0 — the service now starts successfully without parsing errors.

  The vLLM --limit-mm-per-prompt argument uses type=json.loads for
  parsing, which requires valid JSON string format. Original config
  used incorrect format causing JSON parse error.

  Changes:
  - Fixed format in 6 serve configuration files
  - Changed from: limit_mm_per_prompt: image=X
  - Changed to:   limit_mm_per_prompt: '{image: X}'

  Tested with vLLM 0.13.0, service starts successfully.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants