Skip to content

Fixes top‑k attention masking and safe casting

d7d87ef
Select commit
Loading
Failed to load commit list.
Sign in for the full log view
Open

[BUG FIX] Optimize top-k mask construction: prevent unsafe gradient flow and eliminate unnecessary memory allocations #184

Fixes top‑k attention masking and safe casting
d7d87ef
Select commit
Loading
Failed to load commit list.
add-reviewers
succeeded Oct 4, 2025 in 7s