Fix memory layout of attention outputs #2582

chunhuanMeng · 2025-12-16T05:53:32Z

Fix memory layout of attention outputs

Copilot

Pull request overview

This PR fixes the memory layout of attention outputs in the XPU implementation by adding contiguous permutation operations and replacing empty placeholder tensors with properly initialized values.

Key Changes:

Added contiguous memory layout conversion for attention output tensor
Replaced empty placeholder tensors with initialized tensors using at::full and at::scalar_tensor

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

src/ATen/native/transformers/Attention.cpp

CuiYifeng · 2025-12-23T08:10:10Z

src/ATen/native/transformers/Attention.cpp

+  Tensor out =
+      attention.permute({0, 2, 1, 3}).contiguous().permute({0, 2, 1, 3});


Why does out Tensor need to be contiguous in BLHE format?

CuiYifeng · 2025-12-23T08:10:29Z

src/ATen/native/transformers/Attention.cpp

+      at::full(
+          {B, H, (compute_log_sumexp ? ceil_div(L, kAlignLSE) * kAlignLSE : 0)},
+          0.0,
+          attention.options()),


Why do we need to align L to 32?

fix layout

728e899

Copilot AI review requested due to automatic review settings December 16, 2025 05:53

chunhuanMeng changed the title ~~fix layout~~ Fix memory layout of attention outputs Dec 16, 2025

Copilot AI reviewed Dec 16, 2025

View reviewed changes

src/ATen/native/transformers/Attention.cpp Show resolved Hide resolved

src/ATen/native/transformers/Attention.cpp Outdated Show resolved Hide resolved

src/ATen/native/transformers/Attention.cpp Outdated Show resolved Hide resolved

chunhuanMeng requested a review from CuiYifeng December 16, 2025 07:07

fix the shape of log_sumexp

2bc8dc4

CuiYifeng reviewed Dec 23, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix memory layout of attention outputs #2582

Fix memory layout of attention outputs #2582

Uh oh!

chunhuanMeng commented Dec 16, 2025

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

CuiYifeng Dec 23, 2025

Uh oh!

CuiYifeng Dec 23, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

		Tensor out =
		attention.permute({0, 2, 1, 3}).contiguous().permute({0, 2, 1, 3});

Fix memory layout of attention outputs #2582

Are you sure you want to change the base?

Fix memory layout of attention outputs #2582

Uh oh!

Conversation

chunhuanMeng commented Dec 16, 2025

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Uh oh!

CuiYifeng Dec 23, 2025

Choose a reason for hiding this comment

Uh oh!

CuiYifeng Dec 23, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants