Skip to content

Conversation

@yucai-intel
Copy link
Contributor

@yucai-intel yucai-intel commented Dec 2, 2025

Issue: #2182 The Q (Query) tensor output size from torch.transform_bias_rescale_qkv was mismatched against the expected reference size in test cases involving Nested Tensors where the sequence length (T) was not a multiple of 8 after implicit padding.

Resolution: The resolution involved introducing logic within the C++ function transform_bias_rescale_qkv_xpu specifically for the Nested Tensor case to explicitly use the calculated sequence length T to resize the output q, k, and v tensors, thereby ensuring their final size matches the shape derived by the Python reference implementation.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants