generalize deepspeed linear and implement it for non cuda systems #6932
@loadams I think the failure in cpu-torch isn't related to the changes, as it passed yesterday (in the first commit). Thanks in advance.
@loadams tests are passing now. Looks like it was a momentary issue.
@loadams Hi, can you please help review this PR? I see there was one round of review; are there any leftovers?
This all looks good to me, thanks for this generalization! However, please address @tjruwase's suggestions.