[fun] llama.triton #119

ByronHsu · 2024-08-27T15:23:38Z

🚀 The feature, motivation and pitch

@thomwolf and i have an idea to implement llama from scratch in pure triton, inspired by karpathy. liger kernel already contains most of the kernels except matmul. We would love to call out for any interested! It can be added under our example/ folder!

Alternatives

No response

Additional context

No response

ziliangpeng · 2024-08-27T15:47:58Z

omw

thevasudevgupta · 2024-08-28T02:53:41Z

i implemented gpt-2 in triton few days back. Ig llama would be similar- just need to implement some specific layers.

sharing if someone wants the starting code!

vigneshbp · 2024-08-28T03:16:42Z

@thevasudevgupta Could you please share the specific code so that I can directly look into it ?

thevasudevgupta · 2024-08-28T03:50:03Z

ohh; I forgot to link it. sorry;

https://github.com/thevasudevgupta/gpt-triton

kerthcet · 2024-08-28T04:57:02Z

Do you guys think a triton based inference engine would be a good path?

ByronHsu · 2024-08-28T04:59:21Z

@kerthcet no we want to do training here. triton based inference already has too many options like vllm

ghostway0 · 2024-08-31T13:12:19Z

about the mm kernel, I wrote something like that, if that interests anyone

if the todos there are fixed, I think a pr would make sense?

ByronHsu added fun hacking and removed fun hacking labels Aug 28, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[fun] llama.triton #119

[fun] llama.triton #119

ByronHsu commented Aug 27, 2024 •

edited

Loading

ziliangpeng commented Aug 27, 2024

thevasudevgupta commented Aug 28, 2024

vigneshbp commented Aug 28, 2024

thevasudevgupta commented Aug 28, 2024

kerthcet commented Aug 28, 2024

ByronHsu commented Aug 28, 2024

ghostway0 commented Aug 31, 2024 •

edited

Loading

[fun] llama.triton #119

[fun] llama.triton #119

Comments

ByronHsu commented Aug 27, 2024 • edited Loading

🚀 The feature, motivation and pitch

Alternatives

Additional context

ziliangpeng commented Aug 27, 2024

thevasudevgupta commented Aug 28, 2024

vigneshbp commented Aug 28, 2024

thevasudevgupta commented Aug 28, 2024

kerthcet commented Aug 28, 2024

ByronHsu commented Aug 28, 2024

ghostway0 commented Aug 31, 2024 • edited Loading

ByronHsu commented Aug 27, 2024 •

edited

Loading

ghostway0 commented Aug 31, 2024 •

edited

Loading