Skip to content

Conversation

@cheese-head
Copy link
Contributor

What?

Added support for Qwen3 and GPT-OSS model configurations in kvbench

How?

Added Qwen3 model configurations
Added GPT-OSS model configurations

@copy-pr-bot
Copy link

copy-pr-bot bot commented Dec 10, 2025

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

# SPDX-FileCopyrightText: Copyright (c) 2025 NVIDIA CORPORATION & AFFILIATES. All rights reserved.
# SPDX-License-Identifier: Apache-2.0
#
# Licensed under the Apache License, Version 2.0 (the "License");
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

You can shorten the license to the first two lines only:

SPDX-FileCopyrightText: Copyright (c) 2025 NVIDIA CORPORATION & AFFILIATES. All rights reserved# SPDX-License-Identifier: Apache-2.0

kv_size = int(kv_size / self.num_layers)
if kv_size <= 0:
raise ValueError("Invalid KV Size: 0")

Copy link
Contributor

@aranadive aranadive Dec 11, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Fix the extra spacing issue here and others below

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants