Skip to content

Commit 1fce179

Browse files
committed
update qwen3-coder-480b-a35b-instruct-fp8 metadata
1 parent 398b98d commit 1fce179

File tree

2 files changed

+56
-48
lines changed

2 files changed

+56
-48
lines changed
Lines changed: 56 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,56 @@
1+
apiVersion: model.hydra.io/v1alpha1
2+
kind: ModelSpec
3+
metadata:
4+
name: qwen3-coder-480b-a35b-instruct-fp8
5+
spec:
6+
config:
7+
maxTokens: 262144
8+
deployments:
9+
- customRuntimeArgs:
10+
- --max-model-len=131072
11+
- --enable-auto-tool-choice
12+
- --tool-call-parser
13+
- qwen3_coder
14+
resourceRequirements:
15+
cpu: 16
16+
gpuCount: 8
17+
gpuType: vgpu
18+
memory: 640
19+
perGPUMemoryGB: 80
20+
runtime: vllm
21+
versionRequired: '>=0.8.5'
22+
descriptor:
23+
description:
24+
enUS: 'Qwen3-Coder is the latest generation of agentic code models in the
25+
Qwen series, designed for advanced coding, reasoning, and agentic tasks.
26+
The flagship model, Qwen3-Coder-480B-A35B-Instruct, delivers state-of-the-art
27+
performance across Agentic Coding, Browser-Use, and fundamental programming
28+
benchmarks, matching Claude Sonnet in capability. It supports long-context
29+
understanding with native 256K tokens and extension to 1M via Yarn,
30+
optimized for large-scale repository comprehension, and features robust
31+
agentic coding support across platforms like Qwen Code and CLINE with a
32+
specialized function call design'
33+
zhCN: 'Qwen3-Coder 是通义系列中最新一代具备智能体能力的代码模型,专为高级编程、推理和智能体任务设计。
34+
旗舰模型 Qwen3-Coder-480B-A35B-Instruct 在智能体编程、浏览任务及基础代码基准测试中表现卓越,
35+
可与 Claude Sonnet 相媲美。其原生支持 256K Token 长上下文,并可通过 Yarn 扩展至 1M,
36+
优化用于仓库级代码理解;同时在 Qwen Code、CLINE 等平台上提供完善的智能体编程支持,具备专门设计的函数调用格式'
37+
display: Qwen3-Coder-480B-A35B-Instruct-FP8
38+
icon:
39+
src: https://public-resources.d.run/models/logos/qwen-model-logo.svg
40+
type: image/svg
41+
links:
42+
- description: About
43+
url: https://github.com/QwenLM
44+
provider:
45+
id: alibaba
46+
name:
47+
enUS: Alibaba
48+
zhCN: 通义千问
49+
tags:
50+
- TEXT_GENERATION
51+
- TOOLS
52+
source:
53+
huggingface:
54+
name: Qwen/Qwen3-Coder-480B-A35B-Instruct-FP8
55+
modelscope:
56+
name: Qwen/Qwen3-Coder-480B-A35B-Instruct-FP8

models/qwen/qwen3-coder-480b-a35b-instruct-fp8-/metadata.yaml

Lines changed: 0 additions & 48 deletions
This file was deleted.

0 commit comments

Comments
 (0)