Feat/faster rms norm #182

S1ro1 · 2024-08-30T19:49:42Z

Summary

Implements partial aggregation in rms_norm, similar to the one in layer_norm, as described in #179.
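For readers unfamiliar with the idea, here is a minimal pure-Python sketch of what partial aggregation of the weight gradient could look like. It is not the kernel from this PR: the function names, the `num_programs` parameter, and the row-chunking scheme are illustrative assumptions. Each simulated program accumulates dW over its own chunk of rows into a private buffer, and the buffers are summed at the end.

```python
import math


def _rstd(row, eps):
    # Reciprocal RMS of one row: 1 / sqrt(mean(x^2) + eps).
    return 1.0 / math.sqrt(sum(v * v for v in row) / len(row) + eps)


def rms_norm_dw_reference(x, dy, eps=1e-6):
    # Reference: dW[j] = sum over rows i of dy[i][j] * x[i][j] * rstd_i.
    n_cols = len(x[0])
    dw = [0.0] * n_cols
    for xi, dyi in zip(x, dy):
        r = _rstd(xi, eps)
        for j in range(n_cols):
            dw[j] += dyi[j] * xi[j] * r
    return dw


def rms_norm_dw_partial(x, dy, num_programs, eps=1e-6):
    # Hypothetical partial aggregation: one private dW buffer per program.
    n_rows, n_cols = len(x), len(x[0])
    rows_per_program = math.ceil(n_rows / num_programs)
    partial = [[0.0] * n_cols for _ in range(num_programs)]
    for pid in range(num_programs):
        start = pid * rows_per_program
        end = min(start + rows_per_program, n_rows)
        for i in range(start, end):  # empty range if this program has no rows
            r = _rstd(x[i], eps)
            for j in range(n_cols):
                partial[pid][j] += dy[i][j] * x[i][j] * r
    # Final reduction over the per-program buffers.
    return [sum(partial[p][j] for p in range(num_programs)) for j in range(n_cols)]


x = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]
dy = [[0.5, -1.0], [1.5, 2.0], [-0.25, 0.75]]
print(rms_norm_dw_reference(x, dy))
print(rms_norm_dw_partial(x, dy, num_programs=2))
```

The two results agree up to floating-point summation order, which is why a comparison against a reference implementation needs a tolerance rather than exact equality.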

Testing Done

Hardware Type:
run make test to ensure correctness
run make checkstyle to ensure code style
run make test-convergence to ensure convergence

S1ro1 · 2024-08-30T19:52:25Z

I have made the gradients w.r.t. the weights work properly, except for one test, which computes a seemingly random number of elements incorrectly with no apparent cause (on every run, the number and indices of the wrong elements are different). It is always the same test, though (test/transformers/test_rms_norm.py::test_correctness[LlamaRMSNorm-0.0-llama-dtype1-0.2-0.2-16-1024-4096]), so help fixing this issue would be appreciated, as I'm running out of ideas. (Setting the seed makes the wrong elements deterministic, but the issue remains.)

lancerts · 2024-09-04T04:09:10Z

@S1ro1 a fix to rmsnorm landed on the main branch. Can you check whether the same error persists after a rebase? ty

S1ro1 · 2024-09-04T07:37:27Z

E           Mismatch at index (150,): tensor1[(150,)] = 0.076171875, tensor2[(150,)] = -0.2099609375
E           Mismatch at index (806,): tensor1[(806,)] = -1.6953125, tensor2[(806,)] = -1.1875
E           Mismatch at index (2538,): tensor1[(2538,)] = -1.3125, tensor2[(2538,)] = -0.80078125
E           Mismatch at index (3000,): tensor1[(3000,)] = -0.7578125, tensor2[(3000,)] = -0.373046875
E           Mismatch at index (3927,): tensor1[(3927,)] = -0.1591796875, tensor2[(3927,)] = -0.71484375

@lancerts the error stays the same: just a few mismatched elements, always in the same single test. To be fair, I have no clue how to fix it, so I guess you can unassign me. I've been working on this on and off for 3-4 days but haven't made a single step forward since the bug appeared. I guess I bit off more than I could chew. The branch should have some base work done on the issue, but starting from scratch might be a better option.
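To make the size of these mismatches concrete, the sketch below checks the five reported pairs against an allclose-style bound, |a - b| <= atol + rtol * |b|. It assumes that the `0.2-0.2` in the test ID stands for atol and rtol and that tensor2 is the reference side of the comparison; neither is confirmed in this thread. The helper name `within_tolerance` is hypothetical.

```python
# Assumption: the "0.2-0.2" in the test ID is (atol, rtol).
ATOL = 0.2
RTOL = 0.2

# (index, tensor1 value, tensor2 value), copied from the log above.
reported = [
    (150, 0.076171875, -0.2099609375),
    (806, -1.6953125, -1.1875),
    (2538, -1.3125, -0.80078125),
    (3000, -0.7578125, -0.373046875),
    (3927, -0.1591796875, -0.71484375),
]


def within_tolerance(actual, reference, atol=ATOL, rtol=RTOL):
    # allclose-style check, with the second argument treated as the reference.
    return abs(actual - reference) <= atol + rtol * abs(reference)


for index, t1, t2 in reported:
    diff = abs(t1 - t2)
    bound = ATOL + RTOL * abs(t2)
    print(f"index {index}: |diff| = {diff:.4f}, bound = {bound:.4f}, ok = {within_tolerance(t1, t2)}")
```

Under these assumptions every pair exceeds its bound by a clear margin. That margin is larger than summation-order rounding alone would usually explain, although this sketch cannot identify the actual cause.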

lancerts · 2024-10-02T21:43:51Z

#255 resolved by this PR

S1ro1 added 5 commits August 30, 2024 18:01

Smth?

8faf4f1

Having results

b09ae4b

Some weights work

3f891f0

More tests pass

ead988d

1 failing test for dW

27123e3

Cleanup

5487509

S1ro1 added 2 commits September 4, 2024 07:03

Merge branch 'main' into feat/faster-rms-norm

cdbb9f8

Change to patched backward

b0f731d

test rms_norm.py

c64d18d

lancerts closed this Oct 2, 2024
