Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
From NVIDIA Megatron-LM for visibility #18
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: multi-query-attention
Are you sure you want to change the base?
Uh oh!
There was an error while loading. Please reload this page.
From NVIDIA Megatron-LM for visibility #18
Changes from all commits
94a3711
13fd57a
410222b
2f1027d
a5f0057
cde92b2
2dd030e
d87bfd1
91e2ee5
2b6b46b
9545270
16ad771
0d33682
82e5ff6
d1a8777
7704169
4819438
46eb0a3
29a0607
b6a7f40
2e88416
1b07529
7a31b35
c86819f
a100a3c
d7444d0
2106bd6
254ef23
c769b67
69b65e0
6b62015
de512dc
c08d89b
d93743a
79d04be
d3df238
66a1dfc
8e11c52
551b734
f778f7b
781e765
6850cc6
8653fad
c47cf0a
8d9dbed
4cd81e8
af28b5a
4dd2f2b
237080b
4db3c78
7139518
c6aab54
eb0c03e
09ca1d2
d6d094a
bf341cb
7683298
0913599
4d5dc62
d9270aa
2e7e438
6299ea7
0532e92
d775d4a
724580d
bebc0e4
ed1eaa9
312f300
c40a446
f6a675a
3d19693
3d784cb
1c29678
f364164
c7fd91a
4840669
f8f6e9b
188435a
66c12ce
77eaa9a
875ad2a
dcf7d36
a6c6250
16c0d28
d6526b1
8db4323
d7bf5aa
4b30ec5
799cee0
b7a6f90
37ee3d1
d6301fb
5b2cb28
7b8bbf2
8efa2a0
6740f5e
028f079
4f6ab63
7aad147
7ceafd9
bdad881
fcbde8a
d1e4fc6
b74396f
1d0995d
500333c
17cb145
3ec579a
1deafac
0e3d8ec
c2527ba
fa4d12c
256c855
d0d8a5c
180ebf0
d7ed78d
28925b8
a6237d0
19db79c
6f178d2
5c05330
223533f
f0edd1c
3271c42
726bc3f
2c69af5
f4fa7d6
39247c2
5cdac7d
6462307
4025494
d8714d9
84111cc
40af198
5cc85f3
96f1e01
3075197
122324c
972553d
e0b9fbb
a8fee91
0118b97
708f565
86660c1
a15a6d4
e674a29
1cf07f3
1e9e94c
2c7e98a
50502b9
9cbd0d7
7b49ca7
02a1dd0
b6517c6
cca55cc
76622ed
0889ce9
1b432dd
49be51a
3a51832
ca9797e
3e3c3c9
9be09c9
3a9a060
ebc82b1
63e8566
0269b8a
dd7de1a
f573042
e000263
72d2354
e58d908
641bf8b
3616ce5
ba3e244
07b22a0
c7a3aa4
230e0cb
a99f647
020abf0
696977f
2108cd8
85a8340
55c1433
3230510
dc68d29
8c1a3f5
b615e73
3a4b71e
4af25fe
ba97a7e
a949f69
294395c
dbc4129
5704c92
3a9cff6
ffe2af1
5b75141
4370d3a
270a0b4
ecedef0
d44b513
efeb85b
136b7f5
aacc3b8
fb452ba
25b8af1
2e29a5e
de48755
1aef9f8
83609d7
746c913
1e0cb14
4971290
bc663b1
90f39a3
5e8c9c4
18420b6
1dc7019
9a4002e
159a6a0
2d60db7
8ed1e2c
6023444
1584dca
2fbece5
bdf57ae
1c0eb4a
a43b818
1e6f75a
c76ed86
ef5e03c
848c8c9
84e9c3a
23d2ada
8399280
2ebb6ee
6c666b6
15a0d47
a3f9e56
5653514
9f72f47
199113b
5a58976
74bec5b
93a0d8e
fa5082f
c223178
e5bc924
8479eb3
File filter
Filter by extension
Conversations
Uh oh!
There was an error while loading. Please reload this page.
Jump to
Uh oh!
There was an error while loading. Please reload this page.
There are no files selected for viewing
This file was deleted.
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.