-
Notifications
You must be signed in to change notification settings - Fork 3.1k
Pull requests: microsoft/onnxruntime
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Compile lib instead of executable when checking compiler features
#23329
opened Jan 11, 2025 by
bearbones
•
Review required
fixes bug and enables scatternd for jsep
ep:WebGPU
ort-web webgpu provider
#23311
opened Jan 9, 2025 by
prathikr
•
Review required
ThreadPool: Spend less time busy waiting. (2nd Attempt)
#23278
opened Jan 7, 2025 by
goldsteinn
•
Review required
[WebNN] Support Cast fusion specific for int64 data type
ep:WebNN
WebNN execution provider
#23256
opened Jan 6, 2025 by
Honry
Loading…
[js/node] allow arenaExtendStrategy and gpuMemLimit option for CUDA EP
#23176
opened Dec 21, 2024 by
nomagick
Loading…
[webgpu] support Pad operator
ep:WebGPU
ort-web webgpu provider
#23141
opened Dec 18, 2024 by
xhcao
Loading…
Make MultiHeadAttention op return attention probabilities
#23125
opened Dec 16, 2024 by
amancini-N
Loading…
[Fix] in Xnnpack EP, the conversion for fused activation param isn't correct
#23115
opened Dec 16, 2024 by
mszhanyi
Loading…
[js/webgpu] Optimize matmulnbits with M > 1
ep:WebGPU
ort-web webgpu provider
#23092
opened Dec 12, 2024 by
qjia7
Loading…
[DML] Don't save resources to be released later when the GPU is already done with them.
#22995
opened Dec 3, 2024 by
BTurkelson
Loading…
[Test only] BFloat16 test for SkipSimplifiedLayerNormalization
#22941
opened Nov 25, 2024 by
jiafatom
Loading…
Bump onnx from 1.16.1 to 1.17.0 in /onnxruntime/python/tools/transformers/models/phi2
dependencies
Pull requests that update a dependency file
python
Pull requests that update Python code
#22928
opened Nov 22, 2024 by
dependabot
bot
Loading…
[js/webgpu] support FlashAttention-2 for attention operator
ep:WebGPU
ort-web webgpu provider
#22915
opened Nov 21, 2024 by
xhcao
Loading…
ProTip!
Add no:assignee to see everything that’s not assigned.