|
| 1 | +apiVersion: model.hydra.io/v1alpha1 |
| 2 | +kind: ModelSpec |
| 3 | +metadata: |
| 4 | + name: qwen3-coder-480b-a35b-instruct-fp8 |
| 5 | +spec: |
| 6 | + config: |
| 7 | + maxTokens: 262144 |
| 8 | + deployments: |
| 9 | + - customRuntimeArgs: |
| 10 | + - --max-model-len=131072 |
| 11 | + - --enable-auto-tool-choice |
| 12 | + - --tool-call-parser |
| 13 | + - qwen3_coder |
| 14 | + resourceRequirements: |
| 15 | + cpu: 16 |
| 16 | + gpuCount: 8 |
| 17 | + gpuType: vgpu |
| 18 | + memory: 640 |
| 19 | + perGPUMemoryGB: 80 |
| 20 | + runtime: vllm |
| 21 | + versionRequired: '>=0.8.5' |
| 22 | + descriptor: |
| 23 | + description: |
| 24 | + enUS: 'Qwen3-Coder is the latest generation of agentic code models in the |
| 25 | + Qwen series, designed for advanced coding, reasoning, and agentic tasks. |
| 26 | + The flagship model, Qwen3-Coder-480B-A35B-Instruct, delivers state-of-the-art |
| 27 | + performance across Agentic Coding, Browser-Use, and fundamental programming |
| 28 | + benchmarks, matching Claude Sonnet in capability. It supports long-context |
| 29 | + understanding with native 256K tokens and extension to 1M via Yarn, |
| 30 | + optimized for large-scale repository comprehension, and features robust |
| 31 | + agentic coding support across platforms like Qwen Code and CLINE with a |
| 32 | + specialized function call design' |
| 33 | + zhCN: 'Qwen3-Coder 是通义系列中最新一代具备智能体能力的代码模型,专为高级编程、推理和智能体任务设计。 |
| 34 | + 旗舰模型 Qwen3-Coder-480B-A35B-Instruct 在智能体编程、浏览任务及基础代码基准测试中表现卓越, |
| 35 | + 可与 Claude Sonnet 相媲美。其原生支持 256K Token 长上下文,并可通过 Yarn 扩展至 1M, |
| 36 | + 优化用于仓库级代码理解;同时在 Qwen Code、CLINE 等平台上提供完善的智能体编程支持,具备专门设计的函数调用格式' |
| 37 | + display: Qwen3-Coder-480B-A35B-Instruct-FP8 |
| 38 | + icon: |
| 39 | + src: https://public-resources.d.run/models/logos/qwen-model-logo.svg |
| 40 | + type: image/svg |
| 41 | + links: |
| 42 | + - description: About |
| 43 | + url: https://github.com/QwenLM |
| 44 | + provider: |
| 45 | + id: alibaba |
| 46 | + name: |
| 47 | + enUS: Alibaba |
| 48 | + zhCN: 通义千问 |
| 49 | + tags: |
| 50 | + - TEXT_GENERATION |
| 51 | + - TOOLS |
| 52 | + source: |
| 53 | + huggingface: |
| 54 | + name: Qwen/Qwen3-Coder-480B-A35B-Instruct-FP8 |
| 55 | + modelscope: |
| 56 | + name: Qwen/Qwen3-Coder-480B-A35B-Instruct-FP8 |
0 commit comments