Multi-stage freeze/unfreeze callback #236

Draft · wants to merge 13 commits into master

Conversation

ryspark (Contributor) commented Aug 14, 2025

Adds a MultiStageFineTuning callback for flexible, epoch-based freeze/unfreeze schedules.

  • Define which layers to freeze or unfreeze at specific epochs via freeze_selectors and unfreeze_selectors (a rough sketch of how one stage is applied follows this list).
  • Unfreezing can add new param groups at a scaled LR (base_lr / unfreeze_lr_factor).
  • Existing group LRs can optionally be rescaled (scale_existing_groups) when deeper layers are unfrozen.
  • Keeps ReduceLROnPlateau.min_lrs aligned with the optimizer's param groups.
  • Each stage runs exactly once and overrides earlier freezes, making fine-tuning resume-safe.
  • Adds test coverage for the new callback, since it is more complex than the existing FreezeUnfreeze.
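
Roughly, applying one stage looks like the sketch below. This is illustrative only: apply_stage, the substring-based selector matching, and the scheduler handling are assumptions for exposition, not the actual implementation.

def apply_stage(model, optimizer, stage, base_lr, scheduler=None):
    """Sketch of one stage: flip requires_grad, then adjust the optimizer."""

    def matches(name, selectors):
        return any(sel in name for sel in selectors)

    # Freeze first, then unfreeze, so unfreeze_selectors can re-enable
    # specific layers inside modules matched by freeze_selectors.
    for name, param in model.named_parameters():
        if matches(name, stage.get("freeze_selectors", [])):
            param.requires_grad = False
        if matches(name, stage.get("unfreeze_selectors", [])):
            param.requires_grad = True

    # Optionally rescale the LR of every group that already exists.
    scale = stage.get("scale_existing_groups")
    if scale is not None:
        for group in optimizer.param_groups:
            group["lr"] *= scale

    # Newly trainable params not yet tracked by the optimizer join as a
    # fresh param group at base_lr / unfreeze_lr_factor.
    known = {id(p) for g in optimizer.param_groups for p in g["params"]}
    fresh = [p for p in model.parameters()
             if p.requires_grad and id(p) not in known]
    if fresh:
        factor = stage.get("unfreeze_lr_factor", 1.0)
        optimizer.add_param_group({"params": fresh, "lr": base_lr / factor})

    # Keep ReduceLROnPlateau.min_lrs at one entry per param group.
    if scheduler is not None and hasattr(scheduler, "min_lrs"):
        missing = len(optimizer.param_groups) - len(scheduler.min_lrs)
        if missing > 0:
            pad = scheduler.min_lrs[-1] if scheduler.min_lrs else 0.0
            scheduler.min_lrs += [pad] * missing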

Should be merged after #221 and this helios PR (the tests will fail until the helios PR is merged, due to a missing import from helios). The only new changes are to freeze_unfreeze.py and the corresponding test file; the moe module was moved to helios, so it is deleted here.
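
For flavor, here is a minimal check in the spirit of the new coverage; it exercises the apply_stage sketch above rather than the real callback:

import torch
from torch import nn

class Toy(nn.Module):
    def __init__(self):
        super().__init__()
        self.backbone = nn.Linear(4, 4)
        self.head = nn.Linear(4, 2)

def test_stage_zero_freezes_backbone():
    model = Toy()
    opt = torch.optim.Adam(model.head.parameters(), lr=1e-3)
    stage = {"freeze_selectors": ["backbone"], "unfreeze_selectors": ["head"]}
    apply_stage(model, opt, stage, base_lr=1e-3)
    assert not model.backbone.weight.requires_grad
    assert model.head.weight.requires_grad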

Example usage:

from rslearn.train.callbacks.freeze_unfreeze import MultiStageFineTuning

cb = MultiStageFineTuning([
    # Stage 0: Train head only
    {"at_epoch": 0,
     "freeze_selectors": ["backbone"],
     "unfreeze_selectors": ["head"]},

    # Stage 1: Unfreeze LoRA adapters at a lower LR and scale the head LR down.
    # unfreeze_selectors re-enables specific layers inside modules matched by
    # freeze_selectors; the new group joins at base_lr / 10 while existing
    # groups are multiplied by 0.5.
    {"at_epoch": 3,
     "freeze_selectors": ["backbone"],
     "unfreeze_selectors": ["lora_adapter", "head"],
     "unfreeze_lr_factor": 10.0,
     "scale_existing_groups": 0.5},

    # Stage 2: Train full model
    {"at_epoch": 6,
     "freeze_selectors": [],
     "unfreeze_selectors": ["backbone", "head"],  # unfreeze everything
     "unfreeze_lr_factor": 1.0},
])
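
Assuming the usual Lightning training setup in rslearn, the callback attaches like any other (the Trainer arguments here are illustrative):

from lightning.pytorch import Trainer

trainer = Trainer(max_epochs=10, callbacks=[cb])
# trainer.fit(...) then proceeds as usual; the callback applies stage 0 at
# epoch 0, stage 1 at epoch 3, and stage 2 at epoch 6.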
