You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The `isConvertibleToSDWA` function form `llvm/lib/Target/AMDGPU/SIPeepholeSDWA.cpp` contains this FIXME:
```c++
// FIXME: has SDWA but require handling of implicit VCC use
if (Opc == AMDGPU::V_CNDMASK_B32_e32)
return false;
```
Note that the AMDGPU::V_CNDMASK_B32_e32 instruction seems to be unproblematic, but the function converts e64 to e32 opcodes at the top and hence the if statement rejects both opcodes. The conversion of AMDGPU::V_CNDMASK_B32_e64 to an SDWA instruction requires that the explicit src2/carryin be provided through vcc.
frederik-h
changed the title
[AMDGPU] SIPeepholeSDWA: Add AMDGPU::V_CNDMASK_B32 support
[AMDGPU] SIPeepholeSDWA: Add V_CNDMASK_B32 support
Mar 28, 2025
The
isConvertibleToSDWA
function formllvm/lib/Target/AMDGPU/SIPeepholeSDWA.cpp
contains this FIXME:The text was updated successfully, but these errors were encountered: