Add LLM-D PoC with KIND by Jooho · Pull Request #46 · Jooho/loopy

Jooho · 2025-06-18T03:05:53Z

Summary by CodeRabbit

New Features
- Introduced a complete playbook and role for deploying LLM-D on Kubernetes (kind) clusters, including simulator-based and standard deployments.
- Added automated scripts and configuration files to streamline installation, deployment, and smoke testing of LLM-D components.
- Provided comprehensive Kubernetes manifests for deploying model serving, routing, caching (Redis), scheduling, prefill, and decode services with all necessary RBAC and service resources.
- Included detailed documentation and example test scripts for manual and automated deployment verification.
Documentation
- Added step-by-step README guides for both standard and simulator-based LLM-D deployments, including prerequisites, deployment flow, and testing instructions.

Signed-off-by: Jooho Lee <jlee@redhat.com>

coderabbitai · 2025-06-18T03:05:59Z

Walkthrough

This update introduces a comprehensive set of Kubernetes manifests, scripts, and configuration files to automate the deployment and testing of the LLM-D (Large Language Model - Distributed) system on KIND clusters. It adds roles, playbooks, and supporting scripts for deploying LLM-D components, including Redis-backed KVCache, scheduler, decode, prefill, routing, and simulator-based deployments, along with smoke test scripts and deployment documentation.

Changes

File(s)	Change Summary
hacks/download-cli.sh	Enhanced script to detect OS/architecture and add Helm 3.17.3 installation logic.
unofficial_components/playbooks/llm-d-on-kind/config.yaml	New playbook config for installing LLM-D on KIND using llm-d-simulator with sequential roles.
unofficial_components/roles/poc-llm-d-install-with-manifests/config.yaml	New role config for `llm-d-install-with-manifests`, defining manifests, env vars, and test script for LLM-D deployment.
unofficial_components/roles/poc-llm-d-install-with-manifests/files/test-request.sh	New bash script for smoke-testing LLM-D deployments via decode pod and gateway.
unofficial_components/roles/poc-llm-d-install-with-manifests/main.sh	New main automation script for deploying LLM-D roles using manifests and running tests.
.../populated_manifests/README.md .../populated_manifests_using_simulator/README.md	New documentation files detailing manual deployment and testing procedures for LLM-D with/without simulator.
.../populated_manifests/decode/ .../populated_manifests/prefill/ .../populated_manifests/kvcache/ .../populated_manifests/scheduler/ .../populated_manifests/route/*	New Kubernetes manifests for deploying decode, prefill, kvcache (Redis), scheduler, and routing components for LLM-D. Includes deployments, services, roles, configmaps, and custom resources.
.../populated_manifests/llm-d-llmIsvc.yaml .../populated_manifests/llm-d-llmIsvcConfig.yaml	New KServe custom resource manifests for LLMInferenceService and its configuration.
.../populated_manifests_using_simulator/decode/ .../populated_manifests_using_simulator/prefill/	New manifests for deploying LLM-D decode and prefill roles using the simulator image.

Sequence Diagram(s)

LLM-D Deployment and Test Flow

sequenceDiagram
    participant User
    participant main.sh
    participant Kubernetes
    participant test-request.sh

    User->>main.sh: Execute deployment script
    main.sh->>Kubernetes: Apply CRDs and manifests (KVCache, Scheduler, Prefill, Decode, Route)
    main.sh->>Kubernetes: Wait for pods to be ready
    main.sh->>test-request.sh: Run smoke test script
    test-request.sh->>Kubernetes: Discover decode pod/gateway
    test-request.sh->>Kubernetes: Send test requests (GET/POST)
    test-request.sh->>main.sh: Return test result
    main.sh->>User: Report deployment and test status

Playbook Flow for LLM-D on KIND

sequenceDiagram
    participant User
    participant Playbook
    participant kind-install role
    participant llm-d-install-with-manifests role

    User->>Playbook: Run llm-d-on-kind playbook
    Playbook->>kind-install role: Setup KIND cluster
    Playbook->>llm-d-install-with-manifests role: Deploy LLM-D components with manifests

Poem

Oh what a flurry of YAML and bash,
The cluster now hums—no need to dash!
Redis and models, schedulers too,
All deployed swiftly, as rabbits do.
Smoke tests are hopping, results are in sight—
LLM-D on KIND, installed just right!
🐇✨

✨ Finishing Touches

📝 Generate Docstrings

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

🪧 Tips

Chat

There are 3 ways to chat with CodeRabbit:

Review comments: Directly reply to a review comment made by CodeRabbit. Example:
- I pushed a fix in commit <commit_id>, please review it.
- Explain this complex logic.
- Open a follow-up GitHub issue for this discussion.
Files and specific lines of code (under the "Files changed" tab): Tag @coderabbitai in a new review comment at the desired location with your query. Examples:
- @coderabbitai explain this code block.
- @coderabbitai modularize this function.
PR comments: Tag @coderabbitai in a new PR comment to ask questions about the PR branch. For the best results, please provide a very specific query, as very limited context is provided in this mode. Examples:
- @coderabbitai gather interesting stats about this repository and render them as a table. Additionally, render a pie chart showing the language distribution in the codebase.
- @coderabbitai read src/utils.ts and explain its main purpose.
- @coderabbitai read the files in the src/scheduler package and generate a class diagram using mermaid and a README in the markdown format.
- @coderabbitai help me debug CodeRabbit configuration file.

Support

Need help? Create a ticket on our support page for assistance with any issues or questions.

Note: Be mindful of the bot's finite context window. It's strongly recommended to break down tasks such as reading entire modules into smaller chunks. For a focused discussion, use review comments to chat about specific files and their changes, instead of using the PR comments.

CodeRabbit Commands (Invoked using PR comments)

@coderabbitai pause to pause the reviews on a PR.
@coderabbitai resume to resume the paused reviews.
@coderabbitai review to trigger an incremental review. This is useful when automatic reviews are disabled for the repository.
@coderabbitai full review to do a full review from scratch and review all the files again.
@coderabbitai summary to regenerate the summary of the PR.
@coderabbitai generate docstrings to generate docstrings for this PR.
@coderabbitai generate sequence diagram to generate a sequence diagram of the changes in this PR.
@coderabbitai resolve resolve all the CodeRabbit review comments.
@coderabbitai configuration to show the current CodeRabbit configuration for the repository.
@coderabbitai help to get help.

Other keywords and placeholders

Add @coderabbitai ignore anywhere in the PR description to prevent this PR from being reviewed.
Add @coderabbitai summary to generate the high-level summary at a specific location in the PR description.
Add @coderabbitai anywhere in the PR title to generate the title automatically.

CodeRabbit Configuration File (`.coderabbit.yaml`)

You can programmatically configure CodeRabbit by adding a .coderabbit.yaml file to the root of your repository.
Please see the configuration documentation for more information.
If your editor has YAML language server enabled, you can add the path at the top of this file to enable auto-completion and validation: # yaml-language-server: $schema=https://coderabbit.ai/integrations/schema.v2.json

Documentation and Community

Visit our Documentation for detailed information on how to use CodeRabbit.
Join our Discord Community to get help, request features, and share feedback.
Follow us on X/Twitter for updates and announcements.

coderabbitai

Actionable comments posted: 31

♻️ Duplicate comments (4)

unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests/route/httproute.yaml (1)

9-9: Invalid namespace placeholder.

The %test_namespace% token isn't valid YAML and will cause a parsing error. Please apply the same fix as in decode/svc.yaml.

unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests/kvcache/svc.yaml (1)

10-10: Invalid namespace placeholder.

The %test_namespace% token isn't valid YAML and will cause a parsing error. Please apply the same fix as in decode/svc.yaml.

unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests_using_simulator/decode/svc.yaml (1)

5-5: Invalid namespace placeholder.

The %test_namespace% token isn't valid YAML and will cause a parsing error. Please apply the same fix as in decode/svc.yaml.

unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests/route/inferencePool.yaml (1)

8-8: Invalid namespace placeholder.

The %test_namespace% token isn't valid YAML and will cause a parsing error. Please apply the same fix as in decode/svc.yaml.

🧹 Nitpick comments (22)

hacks/download-cli.sh (1)

54-61: Add broader architecture support and consistent error handling

Consider extending the case statement to cover additional architectures (e.g., armv7l, ppc64le) and consolidating error messaging by invoking a die helper (instead of echo; exit) for consistent logging and exit codes.

unofficial_components/playbooks/llm-d-on-kind/config.yaml (1)

11-11: Add missing newline at end-of-file and remove trailing spaces.

YAML lint reports no newline at EOF and trailing spaces on the final blank line.
unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests/kvcache/redis-script-configmap.yaml (1)
18-20: Include ACL file in Redis startup arguments.

Although you copy users.acl if present, it's not passed to redis-server. Consider adding:
 ARGS+=("--include" "/opt/bitnami/redis/etc/master.conf")
+ARGS+=("--include" "/opt/bitnami/redis/etc/users.acl")
This ensures ACL rules are loaded.
unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests_using_simulator/prefill/deployment.yaml (1)

26-26: Remove server-assigned field.

creationTimestamp: null is a server-populated field and should be omitted from the spec.

unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests/kvcache/redis-configuration-configmap.yaml (1)

30-30: Remove superfluous blank line.

YAMLlint warns about extra blank lines; clean up to adhere to style guidelines.

unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests/scheduler/role.yaml (1)

46-46: Add trailing newline at end-of-file.

YAMLlint flags the missing newline at EOF. Please ensure the file ends with a single newline character.
unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests_using_simulator/decode/deployment.yaml (1)
76-77: Remove deprecated serviceAccount field.

You’re specifying both serviceAccount and serviceAccountName. The former is deprecated—keep only serviceAccountName:
-      serviceAccount: meta-llama-llama-3-2-3b-instruct-sa
       serviceAccountName: meta-llama-llama-3-2-3b-instruct-sa
unofficial_components/roles/poc-llm-d-install-with-manifests/files/test-request.sh (1)
226-227: Ensure cleanup on script exit

The final pod deletion may be skipped on early exit. Add a trap to remove the curl pod on any exit or interruption.
trap 'kubectl delete pod curl-$ID -n "$NAMESPACE" --force --grace-period=0' EXIT
unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests_using_simulator/README.md (3)
4-8: Improve prerequisites list formatting

Items under Download required binaries are not consistently formatted as a list. Use proper Markdown bullets:
- yq
- kubectl or oc
9-13: Clarify the $MODE variable usage

The snippets reference $MODE (e.g., kubectl $MODE -k ...) but it isn’t defined. Document acceptable values (e.g., apply/delete) or remove it to avoid confusion.

11-14: Use proper Markdown code fences

Replace the literal ~~~ with triple backticks for valid code blocks.
-   ~~~
-   kubectl $MODE -k https://...
-   ~~~
+```bash
+kubectl $MODE -k https://...
+``` 
unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests/prefill/deployment.yaml (1)
41-43: Use block scalar for JSON config

Embedding complex JSON on one line is hard to maintain. Refactor to a block scalar:
- --kv-transfer-config
- '{"kv_connector":"MultiConnector", ...}'
+ --kv-transfer-config
+ |
+   {
+     "kv_connector": "MultiConnector",
+     "kv_role": "kv_both",
+     "kv_connector_extra_config": { ... }
+   }
unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests/README.md (3)
29-34: Use proper Markdown code fences

The snippet uses ~~~; switch to standard backticks for code blocks.
-	~~~
-	oc new-project llmd-test-manual
-	~~~
+```bash
+oc new-project llmd-test-manual
+``` 
42-53: Inconsistent manifest path references

The Routing section refers to ../populated_manifests/${component}, whereas other sections use ./${component}. Align these paths to the correct relative directory.

121-124: Incorrect namespace placeholder in test snippet

The test output includes %test_namespace%; replace with a real namespace or use an environment variable placeholder (${TEST_NAMESPACE}).
unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests/scheduler/deployment.yaml (1)
126-127: Remove deprecated serviceAccount field

The serviceAccount field is deprecated in favor of serviceAccountName. Remove it to avoid redundancy:
-      serviceAccount: meta-llama-llama-3-2-3b-instruct-epp-sa
unofficial_components/roles/poc-llm-d-install-with-manifests/main.sh (2)

1-5: Enable strict mode and quote variable expansions
Add set -euo pipefail at the top and quote all $VAR expansions (e.g., in cp, find, source, and oc commands) to harden the script and avoid word-splitting.

27-53: Remove or use unused variables
Variables role_name, prefill_manifests_dir_path, decode_manifests_dir_path, and PORT are assigned but never used. Either utilize them or delete these assignments to clear ShellCheck warnings.

unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests/llm-d-llmIsvcConfig.yaml (1)

148-150: Trim excessive blank lines at EOF
YAML files should end with a single newline. Remove the extra blank lines after line 148.
unofficial_components/roles/poc-llm-d-install-with-manifests/config.yaml (3)
13-13: Remove trailing spaces
Line 13 has trailing whitespace. Delete the extra spaces to satisfy YAML linting.

16-17: Fix typo and spacing in description
Correct “ofLLM-D” → “of LLM-D” and “manfiests” → “manifests” for clarity.
-    This role is used to install Prefill/Decode type deployment ofLLM-D using llm-d-simulator.
+    This role deploys a Prefill/Decode LLM-D setup using llm-d-simulator.
80-80: Ensure newline at end of file
Add a single newline character at the end of the file.

📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 49f0be9 and 1eee02e.

📒 Files selected for processing (35)

hacks/download-cli.sh (2 hunks)
unofficial_components/playbooks/llm-d-on-kind/config.yaml (1 hunks)
unofficial_components/roles/poc-llm-d-install-with-manifests/config.yaml (1 hunks)
unofficial_components/roles/poc-llm-d-install-with-manifests/files/test-request.sh (1 hunks)
unofficial_components/roles/poc-llm-d-install-with-manifests/main.sh (1 hunks)
unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests/README.md (1 hunks)
unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests/decode/deployment.yaml (1 hunks)
unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests/decode/sa.yaml (1 hunks)
unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests/decode/svc.yaml (1 hunks)
unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests/kvcache/deployment.yaml (1 hunks)
unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests/kvcache/redis-configuration-configmap.yaml (1 hunks)
unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests/kvcache/redis-health-configmap.yaml (1 hunks)
unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests/kvcache/redis-script-configmap.yaml (1 hunks)
unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests/kvcache/sa.yaml (1 hunks)
unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests/kvcache/svc.yaml (1 hunks)
unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests/llm-d-llmIsvc.yaml (1 hunks)
unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests/llm-d-llmIsvcConfig.yaml (1 hunks)
unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests/prefill/deployment.yaml (1 hunks)
unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests/prefill/svc.yaml (1 hunks)
unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests/route/gateway.yaml (1 hunks)
unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests/route/httproute.yaml (1 hunks)
unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests/route/inferenceModel.yaml (1 hunks)
unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests/route/inferencePool.yaml (1 hunks)
unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests/scheduler/deployment.yaml (1 hunks)
unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests/scheduler/destinationRule.yaml (1 hunks)
unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests/scheduler/role.yaml (1 hunks)
unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests/scheduler/rolebinding.yaml (1 hunks)
unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests/scheduler/sa.yaml (1 hunks)
unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests/scheduler/svc.yaml (1 hunks)
unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests_using_simulator/README.md (1 hunks)
unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests_using_simulator/decode/deployment.yaml (1 hunks)
unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests_using_simulator/decode/sa.yaml (1 hunks)
unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests_using_simulator/decode/svc.yaml (1 hunks)
unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests_using_simulator/prefill/deployment.yaml (1 hunks)
unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests_using_simulator/prefill/svc.yaml (1 hunks)

🧰 Additional context used

🪛 YAMLlint (1.37.1)

unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests/scheduler/svc.yaml

[error] 8-8: syntax error: found character '%' that cannot start any token

(syntax)

unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests/kvcache/sa.yaml

[error] 9-9: syntax error: found character '%' that cannot start any token

(syntax)

unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests/scheduler/destinationRule.yaml

[error] 5-5: syntax error: found character '%' that cannot start any token

(syntax)

unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests_using_simulator/decode/sa.yaml

[error] 5-5: syntax error: found character '%' that cannot start any token

(syntax)

unofficial_components/playbooks/llm-d-on-kind/config.yaml

[error] 11-11: no new line character at the end of file

(new-line-at-end-of-file)

[error] 11-11: trailing spaces

(trailing-spaces)

unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests/kvcache/redis-script-configmap.yaml

[error] 27-27: syntax error: found character '%' that cannot start any token

(syntax)

unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests/route/httproute.yaml

[error] 9-9: syntax error: found character '%' that cannot start any token

(syntax)

unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests/scheduler/sa.yaml

[error] 5-5: syntax error: found character '%' that cannot start any token

(syntax)

unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests/prefill/svc.yaml

[error] 5-5: syntax error: found character '%' that cannot start any token

(syntax)

unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests/route/inferencePool.yaml

[error] 8-8: syntax error: found character '%' that cannot start any token

(syntax)

unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests/kvcache/redis-configuration-configmap.yaml

[warning] 30-30: too many blank lines

(1 > 0) (empty-lines)

[error] 29-29: syntax error: found character '%' that cannot start any token

(syntax)

unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests_using_simulator/decode/svc.yaml

[error] 5-5: syntax error: found character '%' that cannot start any token

(syntax)

unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests/decode/svc.yaml

[error] 5-5: syntax error: found character '%' that cannot start any token

(syntax)

unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests_using_simulator/prefill/svc.yaml

[error] 5-5: syntax error: found character '%' that cannot start any token

(syntax)

unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests/decode/sa.yaml

[error] 5-5: syntax error: found character '%' that cannot start any token

(syntax)

unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests/kvcache/svc.yaml

[error] 10-10: syntax error: found character '%' that cannot start any token

(syntax)

unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests/llm-d-llmIsvc.yaml

[error] 5-5: syntax error: found character '%' that cannot start any token

(syntax)

unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests_using_simulator/decode/deployment.yaml

[error] 9-9: syntax error: found character '%' that cannot start any token

(syntax)

unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests/scheduler/rolebinding.yaml

[error] 5-5: syntax error: found character '%' that cannot start any token

(syntax)

unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests/kvcache/redis-health-configmap.yaml

[error] 103-103: syntax error: found character '%' that cannot start any token

(syntax)

unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests/scheduler/role.yaml

[error] 46-46: no new line character at the end of file

(new-line-at-end-of-file)

unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests_using_simulator/prefill/deployment.yaml

[error] 9-9: syntax error: found character '%' that cannot start any token

(syntax)

unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests/route/gateway.yaml

[error] 13-13: syntax error: found character '%' that cannot start any token

(syntax)

unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests/route/inferenceModel.yaml

[error] 8-8: syntax error: found character '%' that cannot start any token

(syntax)

unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests/prefill/deployment.yaml

[error] 9-9: syntax error: found character '%' that cannot start any token

(syntax)

unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests/scheduler/deployment.yaml

[error] 10-10: syntax error: found character '%' that cannot start any token

(syntax)

unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests/llm-d-llmIsvcConfig.yaml

[error] 5-5: syntax error: found character '%' that cannot start any token

(syntax)

[warning] 149-149: too many blank lines

(3 > 2) (empty-lines)

unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests/decode/deployment.yaml

[error] 9-9: syntax error: found character '%' that cannot start any token

(syntax)

unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests/kvcache/deployment.yaml

[error] 12-12: syntax error: found character '%' that cannot start any token

(syntax)

unofficial_components/roles/poc-llm-d-install-with-manifests/config.yaml

[error] 13-13: trailing spaces

(trailing-spaces)

[error] 63-63: trailing spaces

(trailing-spaces)

[error] 74-74: trailing spaces

(trailing-spaces)

[error] 75-75: trailing spaces

(trailing-spaces)

[error] 77-77: trailing spaces

(trailing-spaces)

[error] 80-80: no new line character at the end of file

(new-line-at-end-of-file)

[error] 80-80: trailing spaces

(trailing-spaces)

🪛 Shellcheck (0.10.0)

unofficial_components/roles/poc-llm-d-install-with-manifests/main.sh

[warning] 28-28: role_name appears unused. Verify use (or export if used externally).

(SC2034)

[warning] 50-50: prefill_manifests_dir_path appears unused. Verify use (or export if used externally).

(SC2034)

[warning] 52-52: decode_manifests_dir_path appears unused. Verify use (or export if used externally).

(SC2034)

[warning] 53-53: PORT appears unused. Verify use (or export if used externally).

(SC2034)

⏰ Context from checks skipped due to timeout of 90000ms (3)

GitHub Check: kind-cluster-lifecycle-test (3.12)
GitHub Check: cluster-test (3.12)
GitHub Check: non-cluster-test (3.12)

🔇 Additional comments (1)

unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests/decode/deployment.yaml (1)

38-40: Verify containerPort matches the --port argument
The container is launched with --port 8001 but only ports 5557 and 80 are exposed. Confirm whether port 80 is intentional; otherwise, add or replace a containerPort: 8001 entry so Kubernetes networking aligns with the service port.

Also applies to: 91-92

coderabbitai · 2025-06-18T03:12:19Z

hacks/download-cli.sh

+# Install Helm
+HELM_VERSION="v3.17.3"
+HELM_TARBALL="helm-${HELM_VERSION}-${OS}-${ARCH}.tar.gz"
+
+if ! check_binary_exists "${root_directory}/bin/helm"; then
+  echo "⬇️ Downloading HELM (v${HELM_VERSION})..."
+  wget --progress=bar:force:noscroll "https://get.helm.sh/${HELM_TARBALL}"
+  tar -zxvf "${HELM_TARBALL}"
+  delete_if_exists "${root_directory}/bin/helm"
+  mv "${OS}-${ARCH}/helm" "${root_directory}/bin/helm"
+  chmod +x "${root_directory}/bin/helm"
+  rm -rf "${OS}-${ARCH}" "${HELM_TARBALL}"
+fi


🛠️ Refactor suggestion

Verify Helm download integrity

It’s best practice to validate the Helm tarball (e.g., by checking its SHA256 checksum or GPG signature) before extraction to prevent installation of corrupted or malicious binaries. Consider fetching and verifying the checksum or signature alongside the download.

🤖 Prompt for AI Agents

In hacks/download-cli.sh around lines 159 to 171, the Helm tarball is downloaded and extracted without verifying its integrity. To fix this, add steps to download the official SHA256 checksum or GPG signature for the Helm tarball, then verify the downloaded file against this checksum or signature before extracting it. If the verification fails, abort the installation to prevent using corrupted or malicious binaries.

coderabbitai · 2025-06-18T03:12:19Z

...oc-llm-d-install-with-manifests/manifests/populated_manifests_using_simulator/decode/sa.yaml

+kind: ServiceAccount
+metadata:
+  name: meta-llama-llama-3-2-3b-instruct-sa
+  namespace: %test_namespace%


⚠️ Potential issue

Invalid namespace placeholder syntax

namespace: %test_namespace% is not valid YAML. Use a templating syntax supported by your deployment tooling (e.g., {{ test_namespace }} for Ansible or ${TEST_NAMESPACE} for envsubst) or document how %test_namespace% is substituted.

🧰 Tools

🪛 YAMLlint (1.37.1)

[error] 5-5: syntax error: found character '%' that cannot start any token

(syntax)

🤖 Prompt for AI Agents

In unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests_using_simulator/decode/sa.yaml at line 5, the namespace placeholder uses invalid syntax `%test_namespace%` which is not valid YAML. Replace this with a valid templating syntax supported by your deployment tool, such as `{{ test_namespace }}` for Ansible or `${TEST_NAMESPACE}` for envsubst, or add documentation explaining how `%test_namespace%` is substituted during deployment.

coderabbitai · 2025-06-18T03:12:19Z

...nents/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests/scheduler/sa.yaml

+kind: ServiceAccount
+metadata:
+  name: meta-llama-llama-3-2-3b-instruct-epp-sa
+  namespace: %test_namespace%


⚠️ Potential issue

Invalid namespace placeholder syntax

namespace: %test_namespace% will break YAML parsing. Replace with a valid template or environment variable reference (e.g., {{ test_namespace }} or ${TEST_NAMESPACE}).

🧰 Tools

🪛 YAMLlint (1.37.1)

[error] 5-5: syntax error: found character '%' that cannot start any token

(syntax)

🤖 Prompt for AI Agents

In unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests/scheduler/sa.yaml at line 5, the namespace placeholder uses invalid syntax `%test_namespace%` which breaks YAML parsing. Replace this with a valid template or environment variable syntax such as `{{ test_namespace }}` or `${TEST_NAMESPACE}` to ensure proper parsing and substitution.

coderabbitai · 2025-06-18T03:12:20Z

...mponents/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests/decode/sa.yaml

+kind: ServiceAccount
+metadata:
+  name: meta-llama-llama-3-2-3b-instruct-sa
+  namespace: %test_namespace%


⚠️ Potential issue

Invalid namespace placeholder syntax

The %test_namespace% placeholder is not YAML-compliant. Switch to a supported variable syntax (e.g., Ansible’s {{ test_namespace }} or shell’s ${TEST_NAMESPACE}) or clarify the substitution mechanism.

🧰 Tools

🪛 YAMLlint (1.37.1)

[error] 5-5: syntax error: found character '%' that cannot start any token

(syntax)

🤖 Prompt for AI Agents

In unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests/decode/sa.yaml at line 5, the namespace placeholder `%test_namespace%` uses invalid syntax for YAML. Replace it with a valid variable substitution format such as Ansible's `{{ test_namespace }}` or shell's `${TEST_NAMESPACE}`, depending on the intended templating system, to ensure proper parsing and substitution.

coderabbitai · 2025-06-18T03:12:20Z

...oc-llm-d-install-with-manifests/manifests/populated_manifests/scheduler/destinationRule.yaml

+kind: DestinationRule
+metadata:
+  name: meta-llama-llama-3-2-3b-instruct-insecure-tls
+  namespace: %test_namespace%


⚠️ Potential issue

Invalid namespace placeholder syntax

namespace: %test_namespace% isn’t valid YAML. Use a proper templating expression ({{ test_namespace }} or ${TEST_NAMESPACE}) or explain how %test_namespace% is replaced at runtime.

🧰 Tools

🪛 YAMLlint (1.37.1)

[error] 5-5: syntax error: found character '%' that cannot start any token

(syntax)

🤖 Prompt for AI Agents

In unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests/scheduler/destinationRule.yaml at line 5, the namespace placeholder `%test_namespace%` is not valid YAML syntax. Replace `%test_namespace%` with a proper templating expression such as `{{ test_namespace }}` or `${TEST_NAMESPACE}`, depending on the templating engine used, or add a comment explaining how `%test_namespace%` is replaced at runtime to clarify its usage.

coderabbitai · 2025-06-18T03:12:22Z

...roles/poc-llm-d-install-with-manifests/manifests/populated_manifests/kvcache/deployment.yaml

+    app.kubernetes.io/name: redis
+    app.kubernetes.io/version: 7.4.3
+  name: llm-d-redis-master
+  namespace: %test_namespace%


🛠️ Refactor suggestion

Quote the namespace placeholder to satisfy YAML syntax
The raw %test_namespace% token on line 12 will trigger a YAML parser error. Wrap it in quotes so that the sed replacement still works and the manifest remains valid YAML.

Apply this diff:

- namespace: %test_namespace% + namespace: "%test_namespace%"

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

namespace: %test_namespace%

namespace: "%test_namespace%"

🧰 Tools

🪛 YAMLlint (1.37.1)

[error] 12-12: syntax error: found character '%' that cannot start any token

(syntax)

🤖 Prompt for AI Agents

In unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests/kvcache/deployment.yaml at line 12, the namespace placeholder %test_namespace% is not quoted, causing YAML syntax errors. Fix this by enclosing %test_namespace% in double quotes to ensure the YAML parser accepts it and the sed replacement works correctly.

coderabbitai · 2025-06-18T03:12:23Z

unofficial_components/roles/poc-llm-d-install-with-manifests/main.sh

+if [[ $DEBUG == "0" ]]; then
+  set -x
+fi


🛠️ Refactor suggestion

Invert debug flag logic
Currently, debugging (set -x) is enabled when DEBUG == "0", which is counterintuitive. Swap the condition so set -x runs when debug is explicitly turned on (e.g. DEBUG=1).

Proposed change:

-if [[ $DEBUG == "0" ]]; then - set -x -fi +if [[ "${DEBUG}" == "1" ]]; then + set -x +fi

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

if [[ $DEBUG == "0" ]]; then

set -x

fi

if [[ "${DEBUG}" == "1" ]]; then

set -x

fi

🤖 Prompt for AI Agents

In unofficial_components/roles/poc-llm-d-install-with-manifests/main.sh around lines 3 to 5, the debug flag logic is inverted causing set -x to run when DEBUG is "0". Change the condition to enable set -x only when DEBUG is explicitly set to "1" or a truthy value, so debugging activates when intended.

coderabbitai · 2025-06-18T03:12:23Z

.../roles/poc-llm-d-install-with-manifests/manifests/populated_manifests/decode/deployment.yaml

+    llm-d.ai/model: meta-llama-llama-3-2-3b-instruct
+    llm-d.ai/role: decode
+  name: meta-llama-llama-3-2-3b-instruct-decode
+  namespace: %test_namespace%


🛠️ Refactor suggestion

Quote the namespace placeholder to satisfy YAML syntax
The unquoted %test_namespace% token on line 9 breaks standard YAML parsing. Wrap it in quotes before performing the sed substitution.

- namespace: %test_namespace% + namespace: "%test_namespace%"

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

namespace: %test_namespace%

namespace: "%test_namespace%"

🧰 Tools

🪛 YAMLlint (1.37.1)

[error] 9-9: syntax error: found character '%' that cannot start any token

(syntax)

🤖 Prompt for AI Agents

In unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests/decode/deployment.yaml at line 9, the namespace placeholder %test_namespace% is unquoted, causing YAML parsing errors. Fix this by enclosing %test_namespace% in double quotes to ensure valid YAML syntax before any sed substitution is applied.

coderabbitai · 2025-06-18T03:12:23Z

...oles/poc-llm-d-install-with-manifests/manifests/populated_manifests/llm-d-llmIsvcConfig.yaml

+      - name: LMCACHE_LOOKUP_URL
+        value: llm-d-redis-master.llmd-test.svc.cluster.local:8100
+      - name: HF_HUB_CACHE
+        value: /models


⚠️ Potential issue

Hard-coded namespace in Redis addresses
Values like llm-d-redis-master.llmd-test.svc.cluster.local:8100 bypass the %test_namespace% parameter and will fail in other namespaces. Replace llmd-test with the placeholder (e.g., %test_namespace%) or make it configurable.

Example fix:

- value: llm-d-redis-master.llmd-test.svc.cluster.local:8100 + value: llm-d-redis-master.%test_namespace%.svc.cluster.local:8100

Also applies to: 213-216

🤖 Prompt for AI Agents

In unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests/llm-d-llmIsvcConfig.yaml around lines 87 to 90 and also lines 213 to 216, the Redis service address is hard-coded with the namespace 'llmd-test', which prevents reuse in other namespaces. Replace the hard-coded namespace segment 'llmd-test' in the Redis URL with the placeholder '%test_namespace%' or another configurable variable to allow dynamic namespace substitution during deployment.

coderabbitai · 2025-06-18T03:12:23Z

...oles/poc-llm-d-install-with-manifests/manifests/populated_manifests/llm-d-llmIsvcConfig.yaml

+kind: LLMInferenceServiceConfig
+metadata:
+  name: llm-d-config
+  namespace: %test_namespace%


🛠️ Refactor suggestion

Quote the namespace placeholder to satisfy YAML syntax
The %test_namespace% on line 5 must be quoted to form valid YAML.

- namespace: %test_namespace% + namespace: "%test_namespace%"

🧰 Tools

🪛 YAMLlint (1.37.1)

[error] 5-5: syntax error: found character '%' that cannot start any token

(syntax)

🤖 Prompt for AI Agents

In unofficial_components/roles/poc-llm-d-install-with-manifests/manifests/populated_manifests/llm-d-llmIsvcConfig.yaml at line 5, the namespace placeholder %test_namespace% is not quoted, which breaks YAML syntax. Fix this by enclosing %test_namespace% in quotes to ensure it is treated as a string and the YAML remains valid.

Jooho · 2025-06-18T13:27:23Z

/lgtm
/approve

Add LLM-D PoC with KIND

1eee02e

Signed-off-by: Jooho Lee <jlee@redhat.com>

coderabbitai bot reviewed Jun 18, 2025

View reviewed changes

github-actions bot added the lgtm label Jun 18, 2025

github-actions bot approved these changes Jun 18, 2025

View reviewed changes

Jooho merged commit ad88906 into main Jun 18, 2025
7 checks passed

coderabbitai bot mentioned this pull request Jun 18, 2025

Add llm poc #47

Merged

Conversation

Jooho commented Jun 18, 2025 • edited by coderabbitai bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary by CodeRabbit

Uh oh!

coderabbitai bot commented Jun 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Sequence Diagram(s)

LLM-D Deployment and Test Flow

Playbook Flow for LLM-D on KIND

Poem

Chat

Support

CodeRabbit Commands (Invoked using PR comments)

Other keywords and placeholders

CodeRabbit Configuration File (.coderabbit.yaml)

Documentation and Community

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Jun 18, 2025

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Jun 18, 2025

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Jun 18, 2025

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Jun 18, 2025

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Jun 18, 2025

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Jun 18, 2025

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Jun 18, 2025

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Jun 18, 2025

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Jun 18, 2025

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Jun 18, 2025

Choose a reason for hiding this comment

Uh oh!

Jooho commented Jun 18, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Jooho commented Jun 18, 2025 •

edited by coderabbitai bot

Loading

coderabbitai bot commented Jun 18, 2025 •

edited

Loading

CodeRabbit Configuration File (`.coderabbit.yaml`)