Merged
Changes from 4 commits
5 changes: 3 additions & 2 deletions notebooks/en/_toctree.yml
@@ -88,8 +88,9 @@
    title: Hyperparameter Optimization with Optuna and Transformers
  - local: function_calling_fine_tuning_llms_on_xlam
    title: Fine-tuning LLMs for Function Calling with the xLAM Dataset


  - local: grpo_vllm_online_training
    title: Efficient Online Training with GRPO and vLLM in TRL


- title: Computer Vision Recipes
  isExpanded: false
17,209 changes: 17,209 additions & 0 deletions notebooks/en/grpo_vllm_online_training.ipynb

Large diffs are not rendered by default.

2 changes: 1 addition & 1 deletion notebooks/en/index.md
@@ -7,11 +7,11 @@ applications and solving various machine learning tasks using open-source tools

Check out the recently added notebooks:

- [Efficient Online Training with GRPO and vLLM in TRL](grpo_vllm_online_training)
- [Fine-tuning LLMs for Function Calling with the xLAM Dataset](function_calling_fine_tuning_llms_on_xlam)
- [Post training an VLM for reasoning with GRPO using TRL](fine_tuning_vlm_grpo_trl)
- [TRL GRPO Reasoning with Advanced Reward](trl_grpo_reasoning_advanced_reward)
- [Fine-Tuning a Vision Language Model with TRL using MPO](fine_tuning_vlm_mpo)
- [Fine tuning a VLM for Object Detection Grounding using TRL](fine_tuning_vlm_object_detection_grounding)

You can also check out the notebooks in the cookbook's [GitHub repo](https://github.com/huggingface/cookbook).
