feat: add initial qwen2.5-vl model and test #2971

drbh · 2025-01-30T17:50:03Z

This PR adds support for qwen2.5-vl models and currently loads the weights and supports reasonable responses. Opening early for exposure and any feedback.

These changes are dependent on #2943 and must be rebased/merged after it is merged

items

small reproducible example:

text-generation-launcher --model-id Qwen/Qwen2.5-VL-3B-Instruct

script

import requests
import json

url = "http://127.0.0.1:3000/generate"
headers = {"Content-Type": "application/json"}
image_urls = [
    "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg",
    "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/tgi/rabbit.png",
]

for image in image_urls:
    query = "Describe the image"
    payload = {
        "inputs": f"<|im_start|>system\nYou are a helpful assistant.<|im_end|>\n<|im_start|>user\n![]({image}){query}<|im_end|>\n<|im_start|>assistant\n",
        "parameters": {"max_new_tokens": 50},
    }
    response = requests.post(url, headers=headers, json=payload)
    print(json.dumps(response.json(), indent=4))

output

{
    "generated_text": "The image showcases the iconic Statue of Liberty in New York City, with the New York City skyline in the background. The statue is a large, green-colored sculpture on a stone pedestal, with the American flag on a flagpole in the foreground. The"
}
{
    "generated_text": "The image depicts a character in a space suit, set in a rocky, desert-like environment with a warm, orange hue. The character is a large, brown, and white rabbit with long, pointed ears and a small, beak-like nose."
}

HuggingFaceDocBuilderDev · 2025-01-31T17:48:17Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

Narsil

LGTM!

Narsil · 2025-02-18T15:58:05Z

nix/overlay.nix

+        transformers = python-super.transformers.overrideAttrs (
+          _: _: {
+            src = final.fetchFromGitHub {
+              owner = "huggingface";
+              repo = "transformers";
+              rev = "8d73a38606bc342b370afe1f42718b4828d95aaa";
+              hash = "sha256-MxroG6CWqrcmRS+eFt7Ej87TDOInN15aRPBUcaycKTI=";
+            };
+          }
+        );


Do we still need that with 4.49 release ?

this should be able to be removed with 4.49

Narsil · 2025-02-18T15:58:39Z

server/text_generation_server/models/custom_modeling/qwen2_5_vl.py

+    }
+
+
+class Qwen2_5_VLProcessor(ProcessorMixin):


Is this not defined in transformers ?

this was added in 4.49 so can be be removed with the update too

Narsil · 2025-02-18T15:59:58Z

server/text_generation_server/models/vlm_causal_lm.py

+        if (
+            self.model.config.model_type == "qwen2_vl"
+            or self.model.config.model_type == "qwen2_5_vl"
+        ):


Suggested change

if (

self.model.config.model_type == "qwen2_vl"

or self.model.config.model_type == "qwen2_5_vl"

):

if self.model.config.model_type in {"qwen2_vl", "qwen2_5_vl"}:

Fixing the chatbot for you :)

wish I could blame it on a bot 😅

thanks!

Narsil · 2025-02-18T16:00:20Z

server/text_generation_server/models/vlm_causal_lm.py

+                    if (
+                        config.model_type == "qwen2_vl"
+                        or config.model_type == "qwen2_5_vl"
+                    ):


NIT: Simpler condition.

drbh force-pushed the add-qwen25vl-support branch from 1adfee4 to e9b5806 Compare January 31, 2025 17:36

drbh added 2 commits February 4, 2025 15:06

feat: support qwen2.5 vl model

10aa62f

fix: bump support models doc

1f58577

drbh force-pushed the add-qwen25vl-support branch from 17c93ff to 1f58577 Compare February 4, 2025 20:06

drbh added 2 commits February 5, 2025 02:27

feat: check before rope type adjustment and small refactors

76d526d

fix: add transformer overlay for processor support

07c0080

drbh marked this pull request as ready for review February 5, 2025 15:43

fix: vendor processor and config from transformers

e4e6ea2

Narsil previously approved these changes Feb 18, 2025

View reviewed changes

Narsil reviewed Feb 18, 2025

View reviewed changes

fix: refactor/simplify conditionals

05333b7

drbh dismissed Narsil’s stale review via 05333b7 February 18, 2025 23:36

Narsil approved these changes Feb 19, 2025

View reviewed changes

Narsil merged commit d6a0c67 into main Feb 19, 2025
21 checks passed

Narsil deleted the add-qwen25vl-support branch February 19, 2025 11:38

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add initial qwen2.5-vl model and test #2971

feat: add initial qwen2.5-vl model and test #2971

drbh commented Jan 30, 2025 •

edited

Loading

HuggingFaceDocBuilderDev commented Jan 31, 2025

Narsil left a comment

Narsil Feb 18, 2025

drbh Feb 18, 2025

Narsil Feb 18, 2025

drbh Feb 18, 2025

Narsil Feb 18, 2025

drbh Feb 18, 2025

Narsil Feb 18, 2025

		}


		class Qwen2_5_VLProcessor(ProcessorMixin):

feat: add initial qwen2.5-vl model and test #2971

feat: add initial qwen2.5-vl model and test #2971

Conversation

drbh commented Jan 30, 2025 • edited Loading

items

HuggingFaceDocBuilderDev commented Jan 31, 2025

Narsil left a comment

Choose a reason for hiding this comment

Narsil Feb 18, 2025

Choose a reason for hiding this comment

drbh Feb 18, 2025

Choose a reason for hiding this comment

Narsil Feb 18, 2025

Choose a reason for hiding this comment

drbh Feb 18, 2025

Choose a reason for hiding this comment

Narsil Feb 18, 2025

Choose a reason for hiding this comment

drbh Feb 18, 2025

Choose a reason for hiding this comment

Narsil Feb 18, 2025

Choose a reason for hiding this comment

drbh commented Jan 30, 2025 •

edited

Loading