[Issue]: Deploy GLM-4.6 on MI300X failed. Error msg "aiter: wrapper_mha_batch_prefill() Expected a value of type 'int' for argument 'max_seqlen_k' but instead found type 'NoneType'" #1549

### Problem Description

Deploying GLM-4.6 on MI300X with SGLang fails. The error comes from aiter: "wrapper_mha_batch_prefill() Expected a value of type 'int' for argument 'max_seqlen_k' but instead found type 'NoneType'".
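The message has the shape of a strict argument-type check rejecting `None` where an `int` is required. The following is a minimal pure-Python sketch of that class of failure, assuming the caller left `max_seqlen_k` unset. The names `check_int_arg` and `batch_prefill_stub` are hypothetical and exist only for illustration; this is not aiter or SGLang code, and it does not identify where the `None` originates in the real stack.

```python
from typing import Optional


def check_int_arg(name: str, value: object) -> int:
    """Mimic a strict schema check: accept only real ints (not bool, not None)."""
    if not isinstance(value, int) or isinstance(value, bool):
        raise TypeError(
            f"Expected a value of type 'int' for argument '{name}' "
            f"but instead found type '{type(value).__name__}'"
        )
    return value


def batch_prefill_stub(max_seqlen_q: int, max_seqlen_k: Optional[int]) -> int:
    # Hypothetical stand-in for a typed kernel wrapper; not the real aiter function.
    q = check_int_arg("max_seqlen_q", max_seqlen_q)
    k = check_int_arg("max_seqlen_k", max_seqlen_k)
    return q * k


def demo() -> str:
    """Call the stub with max_seqlen_k unset and return the resulting message."""
    try:
        batch_prefill_stub(max_seqlen_q=4, max_seqlen_k=None)
    except TypeError as exc:
        return str(exc)
    return "no error"


if __name__ == "__main__":
    print(demo())
```

In this sketch the check fires before any computation runs, which is consistent with the error naming the argument rather than reporting a numerical failure.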

Model: GLM-4.6 FP8

https://www.modelscope.cn/models/ZhipuAI/GLM-4.6-FP8

Docker image: rocm/sglang-0.5.2

Reference link: https://deepwiki.com/zai-org/GLM-4.5/5.3-sglang-deployment

Service start command:

python3 -m sglang.launch_server \
  --model-path /data/models/GLM-4.6 \
  --tp-size 8 \
  --speculative-algorithm EAGLE \
  --speculative-num-steps 3 \
  --speculative-eagle-topk 1 \
  --speculative-num-draft-tokens 4 \
  --mem-fraction-static 0.8 \
  --disable-shared-experts-fusion \
  --served-model-name glm-4.6 \
  --host 0.0.0.0 \
  --port 8011

<img width="1749" height="863" alt="Image" src="https://github.com/user-attachments/assets/75bd3bcf-f7d5-4c47-b144-51309addc14a" />

### Operating System

Ubuntu 22.04

### CPU

AMD EPYC Genoa 9654

### GPU

8 × AMD MI300X

### ROCm Version

ROCm 7.0

### ROCm Component

_No response_

### Steps to Reproduce

Launch the server in the rocm/sglang-0.5.2 Docker image, with the GLM-4.6 FP8 model listed in the Problem Description, using the command below. The deployment fails with the wrapper_mha_batch_prefill() error quoted there.

python3 -m sglang.launch_server \
  --model-path /data/models/GLM-4.6 \
  --tp-size 8 \
  --speculative-algorithm EAGLE \
  --speculative-num-steps 3 \
  --speculative-eagle-topk 1 \
  --speculative-num-draft-tokens 4 \
  --mem-fraction-static 0.8 \
  --disable-shared-experts-fusion \
  --served-model-name glm-4.6 \
  --host 0.0.0.0 \
  --port 8011

### (Optional for Linux users) Output of /opt/rocm/bin/rocminfo --support

_No response_

### Additional Information

_No response_
