
Conversation

@aleien95 (Collaborator)

What does this PR do?
Extends the existing text-only sequence parallelism to support multimodal (image + text) training for SFT and DPO, with a padding-removal optimization.

Key Changes
Multimodal sequence parallelism for SFT/DPO: adds support for vision-language models in both Supervised Fine-Tuning (SFT) and Direct Preference Optimization (DPO) training while keeping backward compatibility with text-only training
Padding-removal optimization: dynamically removes padding tokens to reduce memory usage and improve training efficiency (see the sketch after this list)
Enhanced parallel communication: updates the tensor-distribution patterns so multimodal data is sharded correctly across sequence-parallel ranks
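
A minimal sketch of the padding-removal plus sequence-sharding idea, assuming PyTorch and a `torch.distributed` sequence-parallel process group. All names here are illustrative, not the PR's actual API:

```python
import torch
import torch.distributed as dist

def unpad_and_shard(input_ids: torch.Tensor,
                    attention_mask: torch.Tensor,
                    sp_group: dist.ProcessGroup):
    """Pack a padded (batch, seq_len) batch into one flat token stream,
    then give each sequence-parallel rank an equal contiguous slice."""
    # 1) Padding removal: keep only positions the attention mask marks real.
    indices = torch.nonzero(attention_mask.flatten()).flatten()
    packed = input_ids.flatten()[indices]

    # Cumulative lengths let varlen attention kernels recover sample
    # boundaries inside the packed stream.
    seqlens = attention_mask.sum(dim=1, dtype=torch.int32)
    cu_seqlens = torch.nn.functional.pad(
        torch.cumsum(seqlens, dim=0, dtype=torch.int32), (1, 0))

    # 2) Round the packed length up to a multiple of the SP world size so
    #    every rank receives a slice of identical length.
    world = dist.get_world_size(sp_group)
    rank = dist.get_rank(sp_group)
    pad = (-packed.numel()) % world
    if pad:
        packed = torch.nn.functional.pad(packed, (0, pad))

    # 3) This rank's local shard of the sequence dimension.
    shard = packed.chunk(world)[rank]
    return shard, cu_seqlens, indices  # indices allow re-padding the output
```

After the backbone runs on the local shard, an all-gather along the SP group followed by a scatter back through `indices` restores the padded layout for the loss.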

lilin3 added 3 commits July 30, 2025 16:35
- Remove data printing functions from SFT and DPO trainers for better performance
- Replace 360-example-vl.sh with separate SFT and DPO training scripts
- Add SFT visual-language demo dataset (data/sft-vl-demo/)
- Update dataset configuration to support new data structure
… code style

- Add a multimodal_forwards module to centrally manage multimodal model forward logic (see the sketch after this list)
- Extract and optimize the forward implementations for Qwen2 VL and Qwen2.5 VL
- Improve the structure of the sequence_parallel code
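
For context, here is a rough sketch of what such a centralized forward registry can look like. Everything below is an assumption for illustration (`self.visual`, `self.language_model`, and `config.image_token_id` roughly mirror the Hugging Face Qwen2-VL layout; the PR's actual `multimodal_forwards` module may differ):

```python
from typing import Callable, Dict

import torch

# Registry mapping a model type to its sequence-parallel-aware forward.
MULTIMODAL_FORWARDS: Dict[str, Callable] = {}

def register_forward(model_type: str) -> Callable:
    """Decorator: register a patched forward for one VLM family."""
    def wrapper(fn: Callable) -> Callable:
        MULTIMODAL_FORWARDS[model_type] = fn
        return fn
    return wrapper

@register_forward("qwen2_vl")
def qwen2_vl_sp_forward(self, input_ids, pixel_values=None, **kwargs):
    """Splice vision embeddings into the text stream at the image
    placeholder positions, then run the (sharded) language model."""
    inputs_embeds = self.get_input_embeddings()(input_ids)
    if pixel_values is not None:
        image_embeds = self.visual(pixel_values)              # vision tower (assumed attr)
        image_mask = input_ids == self.config.image_token_id  # assumed config field
        inputs_embeds[image_mask] = image_embeds.to(inputs_embeds.dtype)
    return self.language_model(inputs_embeds=inputs_embeds, **kwargs)

def patch_forward(model: torch.nn.Module, model_type: str) -> None:
    """Swap in the registered forward; text-only models are left untouched."""
    fn = MULTIMODAL_FORWARDS.get(model_type)
    if fn is not None:
        model.forward = fn.__get__(model)
```

Keeping the Qwen2 VL and Qwen2.5 VL forwards in one module means the SP sharding and padding-removal hooks live in one place instead of being duplicated per trainer.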
@HaoshengZou HaoshengZou merged commit 5f64acf into Qihoo360:sp Oct 8, 2025
@HaoshengZou HaoshengZou changed the title Sp vl SP on VLMs Oct 8, 2025