Image finetuning #1012

albertodepaola · 2025-10-30T22:36:07Z

What does this PR do?

Adds an image finetuning recipe for Llama 3.2 using torchtune, focused on structured data extraction.

Feature/Issue validation/testing

Tested by running the finetuning and the custom evaluation multiple times and checking that the results are repeatable.
Details in the Readme: https://github.com/meta-llama/llama-cookbook/tree/image-finetuning/getting-started/finetuning/vision
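The repeatability check described above could be sketched roughly as follows. This is a minimal illustration, not the recipe's actual evaluation code: the function names, the sample fields, and the 0.01 tolerance are all hypothetical, and exact-match scoring per field is an assumption.

```python
def field_accuracy(predicted: dict, expected: dict) -> float:
    """Fraction of expected fields whose predicted value matches exactly."""
    if not expected:
        return 0.0
    hits = sum(1 for key, value in expected.items() if predicted.get(key) == value)
    return hits / len(expected)


def is_repeatable(run_scores: list[float], tolerance: float = 0.01) -> bool:
    """True when the spread (max - min) of scores across runs is within tolerance."""
    return max(run_scores) - min(run_scores) <= tolerance


# Hypothetical extraction result for one document (not real benchmark data).
expected = {"employer": "ACME", "wages": "50000.00", "year": "2024"}
predicted = {"employer": "ACME", "wages": "50000.00", "year": "2023"}

score = field_accuracy(predicted, expected)  # two of three fields match

# Hypothetical scores from three separate finetuning + evaluation runs.
stable = is_repeatable([0.910, 0.915, 0.912])
unstable = is_repeatable([0.80, 0.95])
```

Comparing the spread across runs rather than a single score makes it easier to tell a real improvement from run-to-run noise.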

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline, Pull Request section?
Was this discussed/approved via a GitHub issue? Please add a link to it if that's the case.
Did you make sure to update the documentation with your changes?
Did you write any new necessary tests?

Thanks for contributing 🎉!


meta-cla bot added the cla signed label Oct 30, 2025

varunfb reviewed Oct 30, 2025

2 resolved review comments on getting-started/finetuning/vision/README.md

albertodepaola added 11 commits October 31, 2025 16:51

W2 finetuning initial commit

10fb4a5

decoder frozen

0d53284

adding readme

655daad

Adding gitignore for results and output folders

a0d025c

Updating defaults to most memory efficient setup

079f05c

Adding all packages to pip install

d331392

Updating readme, adding images from run. Updating evaluate function to work with Together correctly

543985f

Fixing percentage in custom benchmark

2a0a1d0

Fixing logger level. Tweaks to Readme.

ad95911

fixing wordlist for benchmark names and other config names

7f7c72f

Adding reference in other readmes

1f3380c

albertodepaola force-pushed the image-finetuning branch from a7c19cf to 1f3380c November 1, 2025 00:02

albertodepaola added 2 commits October 31, 2025 17:24

Fixing typos

f681f2b

Wordlist

e0e9621

albertodepaola requested review from connortreacy and fbnav November 3, 2025 16:12

varunfb self-assigned this Nov 3, 2025

varunfb self-requested a review November 3, 2025 16:24

varunfb approved these changes Nov 3, 2025


varunfb merged commit 2f22a9e into main Nov 3, 2025
4 checks passed


3 participants
