Skip to content

Conversation

@albertodepaola
Copy link
Contributor

What does this PR do?

Adds an image finetuning recipe for Llama 3.2 with torchtune. Focused on structured data extraction.

Feature/Issue validation/testing

Tested by running the FT and custom evaluation multiple times, checking result repeatability.
Details in the Readme: https://github.com/meta-llama/llama-cookbook/tree/image-finetuning/getting-started/finetuning/vision

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you read the contributor guideline,
    Pull Request section?
  • Was this discussed/approved via a Github issue? Please add a link
    to it if that's the case.
  • Did you make sure to update the documentation with your changes?
  • Did you write any new necessary tests?

Thanks for contributing 🎉!

@meta-cla meta-cla bot added the cla signed label Oct 30, 2025
@varunfb varunfb self-assigned this Nov 3, 2025
@varunfb varunfb self-requested a review November 3, 2025 16:24
@varunfb varunfb merged commit 2f22a9e into main Nov 3, 2025
4 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants