[fun] llama.triton #119
Comments
omw
I implemented GPT-2 in Triton a few days back. I guess Llama would be similar; we'd just need to implement some specific layers. Sharing in case someone wants the starting code!
@thevasudevgupta Could you please share the code so that I can look into it directly?
Ohh, I forgot to link it. Sorry!
Do you guys think a Triton-based inference engine would be a good path?
@kerthcet No, we want to do training here. Triton-based inference already has plenty of options, like vLLM.
About the mm kernel: I wrote something like that, if that interests anyone. Once the TODOs there are fixed, I think a PR would make sense?
🚀 The feature, motivation and pitch
@thomwolf and I have an idea to implement Llama from scratch in pure Triton, inspired by Karpathy. Liger Kernel already contains most of the kernels except matmul. We would love to hear from anyone interested! It can be added under our example/ folder!
Alternatives
No response
Additional context
No response