Skip to content

v3.1.1

Choose a tag to compare

@Jintao-Huang Jintao-Huang released this 20 Feb 06:31
· 1481 commits to main since this release

中文版

新特性

  1. 支持大模型、多模态模型、Agent、多节点GRPO训练,参考这里
  2. 支持Embeding模型训练,参考这里
  3. swift sample支持MCTS、蒸馏方式数据采样,支持多模态模型采样。
  4. 支持自定义数据集评测,参考这里

新模型

  1. AIDC-AI/Ovis2-2B系列
  2. Qwen/Qwen2.5-VL-72B-Instruct-AWQ系列
  3. stepfun-ai/GOT-OCR-2.0-hf
  4. stepfun-ai/Step-Audio-Chat
  5. mistralai/Mistral-Small-24B-Instruct-2501

新数据集

  1. GRPO相关
    • AI-ModelScope/MATH-lighteval
    • LLM-Research/xlam-function-calling-60k
    • AI-MO/NuminaMath-TIR
  2. R1相关
    • liucong/Chinese-DeepSeek-R1-Distill-data-110k-SFT
    • modelscope/MathR, modelscope/MathR-32B-Distill

New Features

  1. Support for large models, multimodal models, Agents, and multi-node GRPO training. Refer to this documentation.
  2. Support for Embedding model training. Refer to this script.
  3. swift sample supports MCTS and distillation data sampling, as well as multimodal model sampling.
  4. Support for custom dataset evaluation. Refer to this documentation.

New Models

  1. AIDC-AI/Ovis2-2B series
  2. Qwen/Qwen2.5-VL-72B-Instruct-AWQ series
  3. stepfun-ai/GOT-OCR-2.0-hf
  4. stepfun-ai/Step-Audio-Chat
  5. mistralai/Mistral-Small-24B-Instruct-2501

New Datasets

  1. Related to GRPO
    • AI-ModelScope/MATH-lighteval
    • LLM-Research/xlam-function-calling-60k
    • AI-MO/NuminaMath-TIR
  2. Related to R1
    • liucong/Chinese-DeepSeek-R1-Distill-data-110k-SFT
    • modelscope/MathR, modelscope/MathR-32B-Distill

What's Changed

New Contributors

Full Changelog: v3.1.0...v3.1.1