Cutlass APIS tests #642

rishi-yadav · 2025-11-21T10:03:49Z

No description provided.

Copilot

Pull request overview

This PR adds comprehensive test coverage for CUTLASS APIs on Intel Xe architectures, specifically testing grouped GEMM operations and various epilogue fusion operations. The tests validate different precision configurations (FP8, BF16, mixed precision), scaling operations, and per-row bias with element-wise activation.
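For readers unfamiliar with the two FP8 formats named above, the following is a minimal host-side sketch of how an 8-bit pattern maps to a value. It assumes the common OCP FP8 conventions (E4M3 with no infinities and a single NaN mantissa pattern, E5M2 with IEEE-style infinities and NaNs). The helper names `decode_fp8`, `decode_e4m3`, and `decode_e5m2` are hypothetical and are not taken from the PR or from CUTLASS.

```cpp
#include <cmath>
#include <cstdint>
#include <limits>

// Hypothetical decoder for an 8-bit float: 1 sign bit, `exp_bits` exponent
// bits, `man_bits` mantissa bits. Assumes OCP FP8 conventions, not PR code.
float decode_fp8(std::uint8_t bits, int exp_bits, int man_bits, bool ieee_specials) {
  const int bias = (1 << (exp_bits - 1)) - 1;
  const int exp_max = (1 << exp_bits) - 1;
  const int man_max = (1 << man_bits) - 1;
  const int sign = bits >> 7;
  const int exp = (bits >> man_bits) & exp_max;
  const int man = bits & man_max;
  const float s = sign ? -1.0f : 1.0f;

  if (ieee_specials && exp == exp_max) {
    // E5M2 style: all-ones exponent encodes infinity (mantissa 0) or NaN.
    return man == 0 ? s * std::numeric_limits<float>::infinity()
                    : std::numeric_limits<float>::quiet_NaN();
  }
  if (!ieee_specials && exp == exp_max && man == man_max) {
    // E4M3 style: only the all-ones pattern is NaN; there are no infinities.
    return std::numeric_limits<float>::quiet_NaN();
  }
  if (exp == 0) {
    // Subnormal: (man / 2^man_bits) * 2^(1 - bias).
    return s * std::ldexp(static_cast<float>(man), 1 - bias - man_bits);
  }
  // Normal: (1 + man / 2^man_bits) * 2^(exp - bias).
  return s * std::ldexp(static_cast<float>((1 << man_bits) + man),
                        exp - bias - man_bits);
}

float decode_e4m3(std::uint8_t b) { return decode_fp8(b, 4, 3, false); }
float decode_e5m2(std::uint8_t b) { return decode_fp8(b, 5, 2, true); }
```

Under these assumptions the largest finite E4M3 value is 448 (`0x7E`) and the largest finite E5M2 value is 57344 (`0x7B`), which is why the two formats trade precision against range differently in scaling tests.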

Key changes include:

- New test files for grouped GEMM operations with different precision types
- Tests for linear combination with per-row bias and element-wise activation
- FP8 scaling tests for E4M3 and E5M2 formats
- Enhanced testbed infrastructure with grouped GEMM support
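To make the "linear combination with per-row bias and element-wise activation" item concrete, here is a rough host-side reference of what such an epilogue computes when the activation is ReLU. This is an illustrative sketch only: the function name, the row-major layout, and the `float` element type are assumptions, and it is not the reference used by the PR's testbed.

```cpp
#include <algorithm>
#include <cstddef>
#include <vector>

// Hypothetical reference (not from the PR):
//   D[i][j] = relu(alpha * sum_k A[i][k] * B[k][j] + beta * C[i][j] + bias[i])
// All matrices are row-major; bias holds one value per output row.
std::vector<float> reference_lincomb_per_row_bias_relu(
    const std::vector<float>& A,     // M x K
    const std::vector<float>& B,     // K x N
    const std::vector<float>& C,     // M x N
    const std::vector<float>& bias,  // M entries
    int M, int N, int K, float alpha, float beta) {
  std::vector<float> D(static_cast<std::size_t>(M) * N);
  for (int i = 0; i < M; ++i) {
    for (int j = 0; j < N; ++j) {
      float acc = 0.0f;
      for (int k = 0; k < K; ++k) {
        acc += A[i * K + k] * B[k * N + j];
      }
      // Linear combination, then the per-row bias, then the activation.
      const float z = alpha * acc + beta * C[i * N + j] + bias[i];
      D[i * N + j] = std::max(z, 0.0f);
    }
  }
  return D;
}
```

A device test for this kind of fusion would typically compare the kernel output against a reference like this one, with a tolerance chosen for the element types involved.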

Reviewed changes

Copilot reviewed 7 out of 7 changed files in this pull request and generated 6 comments.

| File | Description |
| --- | --- |
| `test/unit/gemm/device/gemm_universal_mainloopintelxexmx16group_precision.cpp` | Tests for grouped GEMM with mixed precision (int8/bf16) covering various batch sizes and parallelization patterns |
| `test/unit/gemm/device/gemm_universal_mainloopintelxexmx16group.cpp` | FP8 (E4M3) grouped GEMM tests with basic, single-group, and multi-group configurations |
| `test/unit/gemm/device/gemm_universal_mainloopintelxexmx16_group_fp8.cpp` | Duplicate FP8 grouped GEMM tests (identical to previous file) |
| `test/unit/gemm/device/gemm_universal_lincomb_per_rowbias_eltact.cpp` | Tests for linear combination epilogue with per-row bias and ReLU activation, covering edge cases and various matrix sizes |
| `test/unit/gemm/device/gemm_universal_fp_scaling.cpp` | FP8 scaling tests for both E4M3 and E5M2 formats with convert-only operations |
| `test/unit/gemm/device/gemm_testbed_3x.hpp` | New grouped GEMM test infrastructure including runner class, reference computation, and device synchronization |
| `test/unit/gemm/device/CMakeLists.txt` | Build configuration for unified test executable combining all new test files |
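As a rough illustration of the "reference computation" that grouped GEMM tests compare against, the sketch below runs an independent host GEMM per group, where each group may have its own problem shape. The `GroupProblem` struct and `reference_grouped_gemm` function are hypothetical names chosen for this example; they are not the classes in `gemm_testbed_3x.hpp`, and the row-major `float` layout is an assumption.

```cpp
#include <cstddef>
#include <utility>
#include <vector>

// Hypothetical per-group problem description (not the testbed's type).
struct GroupProblem {
  int M, N, K;
  std::vector<float> A;  // M x K, row-major
  std::vector<float> B;  // K x N, row-major
};

// Computes D_g = A_g * B_g for every group independently.
// Groups may differ in M, N, and K, which is what distinguishes a grouped
// GEMM from a batched GEMM with one shared shape.
std::vector<std::vector<float>> reference_grouped_gemm(
    const std::vector<GroupProblem>& groups) {
  std::vector<std::vector<float>> outputs;
  outputs.reserve(groups.size());
  for (const GroupProblem& g : groups) {
    std::vector<float> D(static_cast<std::size_t>(g.M) * g.N, 0.0f);
    for (int i = 0; i < g.M; ++i) {
      for (int k = 0; k < g.K; ++k) {
        for (int j = 0; j < g.N; ++j) {
          D[i * g.N + j] += g.A[i * g.K + k] * g.B[k * g.N + j];
        }
      }
    }
    outputs.push_back(std::move(D));
  }
  return outputs;
}
```

Single-group and multi-group configurations, as listed in the table, would correspond to calling such a reference with one or several entries in `groups`.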

test/unit/gemm/device/gemm_universal_mainloopintelxexmx16group.cpp

test/unit/gemm/device/gemm_universal_lincomb_per_rowbias_eltact.cpp

test/unit/gemm/device/gemm_testbed_3x.hpp

Copilot

Pull request overview

Copilot reviewed 7 out of 7 changed files in this pull request and generated 5 comments.

Comments suppressed due to low confidence (1)

test/unit/gemm/device/gemm_universal_mainloopintelxexmx16group.cpp:1

The file contains duplicate copyright headers and code sections (lines 1-203 are duplicated at lines 204-406). This appears to be an accidental file concatenation. Remove the duplicate section starting at line 204.

/***************************************************************************************************

test/unit/gemm/device/gemm_universal_mainloopintelxexmx16group.cpp

test/unit/gemm/device/gemm_testbed_3x.hpp

rishi-yadav added 7 commits November 21, 2025 10:03

Cutlass APIS tests

5434ac0

Update gemm_universal_fp_scaling.cpp

cd8265f

Update gemm_universal_lincomb_per_rowbias_eltact.cpp

b60cffc

Update gemm_universal_mainloopintelxexmx16_group_fp8.cpp

cd2763b

Update gemm_universal_mainloopintelxexmx16group.cpp

b28a695

Update gemm_universal_mainloopintelxexmx16group_precision.cpp

bdeced9

Update gemm_universal_mainloopintelxexmx16group.cpp

5ff13dc

rishi-yadav requested a review from Copilot November 21, 2025 15:59

Copilot AI reviewed Nov 21, 2025

rishi-yadav marked this pull request as draft November 21, 2025 16:02

Update gemm_universal_mainloopintelxexmx16group.cpp

6e0cbfc

rishi-yadav requested a review from Copilot November 21, 2025 16:07

Copilot AI reviewed Nov 21, 2025

rishi-yadav added 6 commits November 21, 2025 21:43

Update gemm_universal_mainloopintelxexmx16group.cpp

713d0b3

Update CMakeLists.txt

81a6dce

Update gemm_testbed_3x_ptr_array.hpp

2f368d2

Update gemm_universal_mainloopintelxexmx16_group_fp8.cpp

55ddbdf

Update gemm_universal_mainloopintelxexmx16group.cpp

0684e75

Update gemm_universal_mainloopintelxexmx16group_precision.cpp

473c3a7

rishi-yadav marked this pull request as ready for review November 26, 2025 12:06

rishi-yadav requested review from Antonyvance, aschabana, rolandschulz and tdeng5 November 26, 2025 12:07

aschabana approved these changes Nov 26, 2025

rishi-yadav and others added 5 commits November 28, 2025 14:20

Merge branch 'main' into cutlass_fusion_group_apis

725185a

Merge branch 'main' into cutlass_fusion_group_apis

bb26189

Merge branch 'main' into cutlass_fusion_group_apis

706c414

Merge branch 'main' into cutlass_fusion_group_apis

0249f7a

Update gemm_testbed_3x_ptr_array.hpp

5961253

tdeng5 approved these changes Dec 3, 2025

tdeng5 enabled auto-merge (squash) December 4, 2025 00:29

Merge branch 'main' into cutlass_fusion_group_apis

4e98942

3 participants
