Pull requests: NVIDIA/TransformerEngine

Pull requests list

[QA] Add pytest xml report for all tests in qa folder that use pytest
#2169 opened Sep 10, 2025 by shengfangd
7 of 13 tasks
[JAX] Restore Shardy Rule with CompoundFactor
#2167 opened Sep 9, 2025 by phu0ngng Draft
1 of 13 tasks
[JAX] Collective gemm
#2166 opened Sep 9, 2025 by phu0ngng Draft
13 tasks
Fix issue with RNG state shape
#2164 opened Sep 8, 2025 by epwalsh Draft
5 of 13 tasks
Support for Swiglu Activation used in GPT OSS
#2161 opened Sep 8, 2025 by vthumbe1503
8 of 12 tasks
Fix unjoined comm stream in UB communicator
#2160 opened Sep 8, 2025 by djns99 Draft
1 of 13 tasks
[PyTorch] Support activation CPU offloading in fusible ops (labels: bug, enhancement, testing)
#2158 opened Sep 6, 2025 by timmoon10
9 of 13 tasks
Lower precision gated-act to accelerate FP8 current-scaling.
#2153 opened Sep 5, 2025 by mingxu1067
8 of 13 tasks
[Common][PyTorch][Rework] PDL for Quantization
#2150 opened Sep 4, 2025 by yaox12
1 of 13 tasks
[PyTorch] Add sink attention support from cuDNN 2.8.0
#2148 opened Sep 2, 2025 by cyanguwa
8 of 13 tasks
[PyTorch] CPU Overhead Micro-optimizations
#2146 opened Sep 2, 2025 by zhongbozhu
13 tasks
[main][feature][under updating]adapt for offload activation
#2145 opened Sep 2, 2025 by GeYuhong
1 of 13 tasks
ci: Build and attach bdist wheels to release page
#2138 opened Aug 29, 2025 by ko3n1g
13 tasks
[PyTorch Debug] Add max_blockwise_X_dynamic_range stats
#2137 opened Aug 29, 2025 by pggPL
8 of 13 tasks
FP8 Output Quantization for GEMM
#2123 opened Aug 26, 2025 by vthumbe1503
7 of 13 tasks
Adds dst.dtype information in copy_ method of quantized tensors.
#2120 opened Aug 26, 2025 by zobeideThePlayer
3 of 13 tasks
[PyTorch Debug] Fix issue with microbatching + debug value caching
#2108 opened Aug 25, 2025 by pggPL
8 of 13 tasks
[PyTorch Debug] Fix issue with negative underflow.
#2107 opened Aug 25, 2025 by pggPL
8 of 13 tasks
Fix test of FSDP2 by correcting init logic and applying autocast
#2105 opened Aug 24, 2025 by ntenenz
4 of 13 tasks