Skip to content

Pull requests: NVIDIA/TransformerEngine

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Fix meta device check failure when passing torch.device objects
#2519 opened Dec 16, 2025 by LucienXian Loading…
6 of 13 tasks
[PyTorch] Support cudagraph recomputation
#2518 opened Dec 16, 2025 by buptzyb Loading…
1 of 13 tasks
[JAX] HLO FFI tests
#2517 opened Dec 16, 2025 by jberchtold-nvidia Loading…
7 of 13 tasks
Remove test skip logic for GEMM-AR tests
#2516 opened Dec 16, 2025 by vcherepanov-nv Loading…
4 of 13 tasks
Cpu optimizations v2 cpu_overhead
#2514 opened Dec 12, 2025 by vthumbe1503 Draft
13 tasks
Testing v2.6 + pr2201
#2513 opened Dec 12, 2025 by KshitijLakhani Draft
13 tasks
[Common] Optimize fused RoPE kernel performance performance Performance issues
#2508 opened Dec 11, 2025 by yaox12 Draft
13 tasks
[common] Add support for cuBLASLt GEMM for GroupedTensor MoE
#2502 opened Dec 10, 2025 by pggPL Loading…
8 tasks done
Add logic for block-scaled tensors with GEMM swizzled scales enhancement New feature or request MoE performance Performance issues refactor
#2486 opened Dec 6, 2025 by timmoon10 Loading…
14 of 19 tasks
Add support for SWA (left, right) with FusedAttention 2.11.0
#2477 opened Dec 4, 2025 by sudhakarsingh27 Loading…
22 of 28 tasks
[JAX] Einsum with quantization
#2474 opened Dec 3, 2025 by phu0ngng Draft
13 tasks
[PyTorch] Documentation for op fuser API documentation Improvements or additions to documentation
#2447 opened Dec 3, 2025 by timmoon10 Loading…
8 of 13 tasks
Add ccache support to TE and use it in GitHub actions build Build system
#2444 opened Dec 2, 2025 by ptrendx Loading…
1 of 6 tasks
[PyTorch] Enable post-RHT amax estimation fp4
#2442 opened Dec 2, 2025 by negvet Draft
1 of 13 tasks
support cuda graph capture offloading module
#2435 opened Dec 1, 2025 by lhb8125 Draft
13 tasks
[PyTorch] Add FA4 Support
#2432 opened Nov 28, 2025 by yaox12 Draft
1 of 16 tasks
Fix FusedAdam DTensor compatibility issue
#2425 opened Nov 26, 2025 by shjwudp Loading…
13 tasks
[JAX] Wrapper for Permutation Triton kernel MoE
#2419 opened Nov 25, 2025 by tdophung Draft
9 of 16 tasks
ProTip! no:milestone will show everything without a milestone.