Added OLMo(E) v1 #816
base: dev
Originally from TransformerLensOrg#718.
Add OLMoE
Fix to OLMo 2 normalization
Hey @jonasrohw, looks like you've got this feature pretty much ready to go - just seeing type check failures blocking it. I'd be interested in taking a stab at fixing those type issues if you're not actively working on it.
@taziksh Yeah, I didn't have time to fix some of the type issues. Go ahead!

Description
Adds support for OLMo v1 model family and OLMoE.
Transformers >3.40 will let numpy do a major upgrade; pyproject.toml prevents this now.
OLMo v2 will require dropping Python 3.8 support because the required Transformers version also drops it. It will be added in a separate PR based on TransformerLens 3.
This also completes PR: #718
Fixes # (issue)
Type of change
Please delete options that are not relevant.
Checklist: