Pull requests: NVIDIA/TransformerEngine
[QA] Add pytest xml report for all tests in qa folder that use pytest
#2169
opened Sep 10, 2025 by
shengfangd
7 of 13 tasks
blockwise fp8 weight memory optimization: on-demand columnwise fp8 weight creation
#2168
opened Sep 10, 2025 by
skydoorkai
7 of 13 tasks
Support for Swiglu Activation used in GPT OSS
#2161
opened Sep 8, 2025 by
vthumbe1503
8 of 12 tasks
[PyTorch] Support activation CPU offloading in fusible ops
bug
enhancement
testing
#2158
opened Sep 6, 2025 by
timmoon10
9 of 13 tasks
Lower precision gated-act to accelerate FP8 current-scaling.
#2153
opened Sep 5, 2025 by
mingxu1067
8 of 13 tasks
[Common][PyTorch][Rework] PDL for Quantization
#2150
opened Sep 4, 2025 by
yaox12
1 of 13 tasks
[PyTorch] Add sink attention support from cuDNN
2.8.0
#2148
opened Sep 2, 2025 by
cyanguwa
8 of 13 tasks
[main][feature][under updating]adapt for offload activation
#2145
opened Sep 2, 2025 by
GeYuhong
1 of 13 tasks
[PyTorch] Add record_stream and untyped_storage func op in QuantizedTensor
#2144
opened Sep 2, 2025 by
xiaoxi-wangfj
1 of 13 tasks
[PyTorch Debug] Support precision debug tools for fp8 model parameters.
#2141
opened Sep 1, 2025 by
pggPL
8 of 13 tasks
ci: Build and attach bdist wheels to release page
#2138
opened Aug 29, 2025 by
ko3n1g
13 tasks
[PyTorch Debug] Add max_blockwise_X_dynamic_range stats
#2137
opened Aug 29, 2025 by
pggPL
8 of 13 tasks
Fix memory overhead of linear layer when all gather from sequence parallel
#2125
opened Aug 27, 2025 by
yuzhongw-nvidia
13 tasks
Adds dst.dtype information in copy_ method of quantized tensors.
#2120
opened Aug 26, 2025 by
zobeideThePlayer
3 of 13 tasks
[PyTorch Debug] Fix issue with microbatching + debug value caching
#2108
opened Aug 25, 2025 by
pggPL
8 of 13 tasks
[PyTorch Debug] Fix issue with negative underflow.
#2107
opened Aug 25, 2025 by
pggPL
8 of 13 tasks
Fix test of FSDP2 by correcting init logic and applying autocast
#2105
opened Aug 24, 2025 by
ntenenz
4 of 13 tasks