Skip to content

Pull requests: waybarrios/vllm-mlx

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

feat: Reasoning parser fix + jump-forward tool logits bias
#114 opened Feb 25, 2026 by raullenchai Loading…
4 tasks done
fix: Use streaming detokenizer for UTF-8-safe incremental decode
#109 opened Feb 24, 2026 by janhilgard Loading…
1 of 4 tasks
feat: Add --gpu-memory-utilization for configurable memory limits
#108 opened Feb 23, 2026 by janhilgard Loading…
5 tasks
feat: add --prefill-step-size CLI flag enhancement New feature or request
#105 opened Feb 23, 2026 by kol22 Loading…
feat: Add MiniMax-M2 tool call parser
#98 opened Feb 18, 2026 by janhilgard Loading…
6 tasks done
fix: Coerce tool call arguments to match schema types
#95 opened Feb 16, 2026 by janhilgard Loading…
5 tasks
Add Step 3.5 Flash model support with MTP
#94 opened Feb 16, 2026 by janhilgard Loading…
4 tasks
Fix streaming tool calls when reasoning parser is active
#93 opened Feb 16, 2026 by janhilgard Loading…
5 tasks done
Fix Metal resource leak under high concurrency
#92 opened Feb 16, 2026 by janhilgard Loading…
4 tasks
feat: add prompt prefix caching to SimpleEngine enhancement New feature or request
#90 opened Feb 15, 2026 by panbanda Loading…
7 tasks done
feat: per-request thinking/reasoning control
#85 opened Feb 15, 2026 by janhilgard Loading…
8 tasks
Add Metal memory stats to MLLMScheduler
#78 opened Feb 12, 2026 by janhilgard Loading…
1 of 2 tasks
Add resumable model download with retry, timeout, and offline mode
#77 opened Feb 12, 2026 by janhilgard Loading…
4 tasks done
Refactor: review improvements for KV cache quantization
#72 opened Feb 11, 2026 by janhilgard Loading…
3 tasks done
feat: repetition detector for degenerate token loops
#65 opened Feb 11, 2026 by janhilgard Loading…
1 of 3 tasks
Add MiniMax M2.1 tool call parser
#51 opened Feb 9, 2026 by janhilgard Loading…
3 of 4 tasks
feat: Add speculative decoding support with draft models
#45 opened Feb 5, 2026 by janhilgard Loading…
7 tasks done
ProTip! Exclude everything labeled bug with -label:bug.