@maxyanghu commented Jan 26, 2026

📌 Description

Pass the correct strides to cuDNN prefill.

This PR patches flashinfer-ai#2414 into the cut branch for MLPerf Inference v6.0 submission.

🔍 Related Issues

🚀 Pull Request Checklist

Thank you for contributing to FlashInfer! Before we review your pull request, please make sure the following items are complete.

✅ Pre-commit Checks

  • I have installed pre-commit by running pip install pre-commit (or used your preferred method).
  • I have installed the hooks with pre-commit install.
  • I have run the hooks manually with pre-commit run --all-files and fixed any reported issues.

If you are unsure about how to set up pre-commit, see the pre-commit documentation.

🧪 Tests

  • Tests have been added or updated as needed.
  • All tests are passing (unittest, etc.).

Reviewer Notes

@wangshangsam wangshangsam merged commit 4f66641 into mlperf-inf-mm-q3vl-v6.0 Jan 26, 2026
1 check passed