v1.0.0rc1 #5697
nv-guomingz
announced in
Announcements
v1.0.0rc1
#5697
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Model Support
What's Changed
max_beam_width=1
for TorchSampler by @netanel-haber in start OAIServer withmax_beam_width=1
for TorchSampler #5427test_fp8_block_scales_4gpus[ep4-mtp_nextn=0-fp8kv=True-attention_dp=True-cuda_graph=True-overlap_scheduler=True-torch_compile=False]
by @venkywonka in [CI] Waivetest_fp8_block_scales_4gpus[ep4-mtp_nextn=0-fp8kv=True-attention_dp=True-cuda_graph=True-overlap_scheduler=True-torch_compile=False]
#5494New Contributors
Full Changelog: v1.0.0rc0...v1.0.0rc1
This discussion was created from the release v1.0.0rc1.
Beta Was this translation helpful? Give feedback.
All reactions