Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[AMDGPU] SIPeepholeSDWA: Add V_CNDMASK_B32 support #133431

Open
frederik-h opened this issue Mar 28, 2025 · 2 comments
Open

[AMDGPU] SIPeepholeSDWA: Add V_CNDMASK_B32 support #133431

frederik-h opened this issue Mar 28, 2025 · 2 comments

Comments

@frederik-h
Copy link
Contributor

The isConvertibleToSDWA function form llvm/lib/Target/AMDGPU/SIPeepholeSDWA.cpp contains this FIXME:

// FIXME: has SDWA but require handling of implicit VCC use
if (Opc == AMDGPU::V_CNDMASK_B32_e32)
    return false;
@llvmbot
Copy link
Member

llvmbot commented Mar 28, 2025

@llvm/issue-subscribers-backend-amdgpu

Author: Frederik Harwath (frederik-h)

The `isConvertibleToSDWA` function form `llvm/lib/Target/AMDGPU/SIPeepholeSDWA.cpp` contains this FIXME: ```c++ // FIXME: has SDWA but require handling of implicit VCC use if (Opc == AMDGPU::V_CNDMASK_B32_e32) return false; ```

@frederik-h
Copy link
Contributor Author

Note that the AMDGPU::V_CNDMASK_B32_e32 instruction seems to be unproblematic, but the function converts e64 to e32 opcodes at the top and hence the if statement rejects both opcodes. The conversion of AMDGPU::V_CNDMASK_B32_e64 to an SDWA instruction requires that the explicit src2/carryin be provided through vcc.

@frederik-h frederik-h changed the title [AMDGPU] SIPeepholeSDWA: Add AMDGPU::V_CNDMASK_B32 support [AMDGPU] SIPeepholeSDWA: Add V_CNDMASK_B32 support Mar 28, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants