Add FP8/BF8 support for LDS transpose load #2210

stefankoncarevic · 2026-01-20T12:05:35Z

⚠️ Do not merge until #2184 is merged - this PR depends on LDS transpose load attention support

Implement ds_read_tr8_b64 offset formulas for FP8/BF8 MFMA (16x32, 32x16). Enable mixed fp8/bf8 type combinations for GEMM operations on gfx950.

Motivation

Add FP8 and BF8 data type support for LDS transpose load optimization on gfx950.
This enables efficient matrix loads using ds_read_tr8_b64 hardware instruction

Technical Details

LdsTransposeLoad.cpp: Implemented FP8/BF8 offset formulas in getBasePanelOffsets()
LdsTransposeLoad.cpp: Updated type compatibility check in makeDecision()
Added areBothFp8Types() check to allow mixed fp8/bf8 combinations

Test Plan

Add e2e tests for FP8/BF8 GEMM
Add e2e tests for mixed fp8/bf8 combinations

Test Result

Implement ds_read_tr8_b64 offset formulas for FP8/BF8 MFMA (16x32, 32x16). Enable mixed fp8/bf8 type combinations for GEMM operations on gfx950.

Add FP8/BF8 support for LDS transpose load

9027654

Implement ds_read_tr8_b64 offset formulas for FP8/BF8 MFMA (16x32, 32x16). Enable mixed fp8/bf8 type combinations for GEMM operations on gfx950.

stefankoncarevic requested a review from causten as a code owner January 20, 2026 12:05

stefankoncarevic marked this pull request as draft January 20, 2026 12:05

Add PR CI tests for FP8/BF8 LDS transpose load GEMM operations

84c9425

stefankoncarevic marked this pull request as ready for review January 20, 2026 14:35

stefankoncarevic requested review from dhernandez0, djramic, justinrosner, pabloantoniom and umangyadav January 20, 2026 14:36

stefankoncarevic changed the title ~~[WIP] Add FP8/BF8 support for LDS transpose load~~ Add FP8/BF8 support for LDS transpose load Jan 20, 2026

stefankoncarevic mentioned this pull request Jan 23, 2026

Add INT8 support for LDS transpose load #2214

Open

1 task

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add FP8/BF8 support for LDS transpose load #2210

Add FP8/BF8 support for LDS transpose load #2210

stefankoncarevic commented Jan 20, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Add FP8/BF8 support for LDS transpose load #2210

Are you sure you want to change the base?

Add FP8/BF8 support for LDS transpose load #2210

Conversation

stefankoncarevic commented Jan 20, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Motivation

Technical Details

Test Plan

Test Result

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

stefankoncarevic commented Jan 20, 2026 •

edited

Loading