
fix: improve vLLM plugin compatibility and NCCL receive handling #109

Merged

chaokunyang merged 1 commit into inclusionAI:main from garrett4wade:main on Apr 22, 2026
Conversation

@garrett4wade (Contributor)

Summary

  • Add compatibility paths in awex/vllm_plugin.py for newer vLLM OpenAI protocol/router changes, and ensure Awex routes are attached by patching build_app when a shared router is absent (see the router-patching sketch below).
  • Update the NCCL reader test flow to copy received data into non-contiguous receive tensors, matching production recv behavior (see the staging-copy sketch below).
  • Harden config and process-group utilities (torch version comparison via packaging.version, registry sharding strategy accessor, and formatting/cleanup updates).
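
For orientation, here is a minimal, hypothetical sketch of the build_app-patching approach, assuming vLLM exposes a `build_app` factory in `vllm.entrypoints.openai.api_server` (true of recent versions); `attach_awex_routes` and the bare handlers are illustrative stand-ins for the actual plugin code:

```python
# Hypothetical sketch of wrapping vLLM's ``build_app`` so Awex routes get
# registered when no shared router is available. ``attach_awex_routes`` is
# illustrative; the real implementation lives in awex/vllm_plugin.py.
from fastapi import FastAPI


def attach_awex_routes(app: FastAPI) -> None:
    @app.post("/areal_awex_init")
    async def awex_init():  # request models omitted for brevity
        ...

    @app.post("/areal_awex_update")
    async def awex_update():
        ...


def patch_build_app() -> None:
    import vllm.entrypoints.openai.api_server as api_server

    original_build_app = api_server.build_app

    def build_app_with_awex(*args, **kwargs) -> FastAPI:
        app = original_build_app(*args, **kwargs)
        attach_awex_routes(app)  # register routes on the finished app
        return app

    # Replacing the module attribute also affects internal callers, since
    # they resolve ``build_app`` through the module's globals at call time.
    api_server.build_app = build_app_with_awex
```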
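
And a minimal sketch of the non-contiguous receive pattern the test now mirrors, assuming a torch.distributed NCCL group is already initialized; `recv_into` is an illustrative name, not the repo's API:

```python
# Illustrative sketch (not the repo's code): NCCL requires contiguous
# buffers, so a non-contiguous destination is filled through a contiguous
# staging tensor followed by an in-place copy.
import torch
import torch.distributed as dist


def recv_into(dst: torch.Tensor, src_rank: int) -> None:
    if dst.is_contiguous():
        dist.recv(dst, src=src_rank)
        return
    staging = torch.empty_like(dst, memory_format=torch.contiguous_format)
    dist.recv(staging, src=src_rank)
    dst.copy_(staging)  # scatter back into the non-contiguous view
```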

@garrett4wade (Contributor, Author)

Test results (docker image): [screenshot attached]

@gemini-code-assist (Bot) left a comment

Code Review

This pull request implements compatibility for newer vLLM versions by dynamically importing components and patching the build_app function to register Awex routes. It also refines PyTorch version checks and adds support for non-contiguous tensors in weight synchronization tests. Review feedback identifies a security vulnerability regarding unauthenticated endpoints and suggests a more direct method for retrieving the PyTorch version.

Comment thread: awex/vllm_plugin.py, lines 459 to 460

```python
@router.post("/areal_awex_init")
async def awex_init(request: AwexInitRequest, raw_request: Request):
```

Severity: high (security)

The new endpoints /areal_awex_init and /areal_awex_update are registered without any explicit authentication or authorization dependencies. Since these endpoints can trigger significant state changes (like re-initializing the NCCL group or updating model weights), they could be exploited if the vLLM server is exposed. Consider ensuring these routes are protected by the same security mechanisms (e.g., API key checks) used for the standard OpenAI-compatible endpoints.
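
As one possible mitigation (a sketch only, not vLLM's actual auth middleware; `API_KEY` and `verify_api_key` are illustrative names):

```python
# Hypothetical hardening sketch: gate the Awex routes behind the same kind
# of bearer-token check that protects the OpenAI-compatible endpoints.
from fastapi import APIRouter, Depends, Header, HTTPException, Request

API_KEY = "change-me"  # in practice, read from server configuration


async def verify_api_key(authorization: str | None = Header(default=None)) -> None:
    if authorization != f"Bearer {API_KEY}":
        raise HTTPException(status_code=401, detail="Unauthorized")


# Attach the dependency at the router level so every route inherits it.
router = APIRouter(dependencies=[Depends(verify_api_key)])


@router.post("/areal_awex_init")
async def awex_init(raw_request: Request):  # request model omitted
    ...
```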

Comment on the changed process-group options check:

```diff
 pg_options_param_name = (
-    "backend_options" if str(torch.__version__) >= "2.6" else "pg_options"
+    "backend_options"
+    if Version(version("torch")) >= Version("2.6")
```

Severity: medium

While using packaging.version.Version correctly fixes the string comparison bug, using importlib.metadata.version("torch") is less direct and potentially less robust than using the __version__ attribute already available on the imported torch module. The metadata query can fail in certain environments (e.g., non-standard installations) even if the module is successfully loaded.

Suggested change:

```diff
-if Version(version("torch")) >= Version("2.6")
+if Version(torch.__version__) >= Version("2.6")
```
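
For context on the underlying bug the packaging.version change fixes: Python compares strings lexicographically, so multi-digit version components sort incorrectly, while Version compares them numerically:

```python
from packaging.version import Version

# String comparison is character-by-character: '1' < '6', so "2.10" sorts
# before "2.6" even though 2.10 is the newer release.
assert ("2.10.0" >= "2.6") is False
assert Version("2.10.0") >= Version("2.6")  # semantic comparison: 2.10 > 2.6
```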

@chaokunyang merged commit 45a917b into inclusionAI:main on Apr 22, 2026. 3 checks passed.