
Moves LayerNorm to output of the Encoder's sub-layers #47

Open

patrickgadd wants to merge 1 commit into allegro:master from patrickgadd:master

Conversation

@patrickgadd

Hi there,

As explained in #46, I believe the implementation diverges slightly from what's described in the paper ("Context-Aware Learning to Rank with Self-Attention") when it comes to the Transformer architecture: the paper applies LayerNorm to the output of each encoder sub-layer, while the code applies it to the input.

Sadly I can't say whether this affects performance in practice, as I'm attempting to use this work for something entirely different. That said, learning does look a tad more stable with the fix.
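For context, the two formulations differ only in where LayerNorm sits relative to the residual connection. Here's a minimal PyTorch sketch of both variants; the class and the `pre_norm` flag are illustrative only, not the actual diff in this PR:

```python
import torch.nn as nn


class SublayerConnection(nn.Module):
    """Residual connection around a sub-layer, with LayerNorm on either side.

    pre_norm=True  -> x + Sublayer(LayerNorm(x))   (norm on the sub-layer input)
    pre_norm=False -> LayerNorm(x + Sublayer(x))   (norm on the output, as in
                      Vaswani et al. and the paper referenced above)
    """

    def __init__(self, size: int, dropout: float, pre_norm: bool = True):
        super().__init__()
        self.norm = nn.LayerNorm(size)
        self.dropout = nn.Dropout(dropout)
        self.pre_norm = pre_norm

    def forward(self, x, sublayer):
        if self.pre_norm:
            # Normalize the input before the sub-layer (pre-norm).
            return x + self.dropout(sublayer(self.norm(x)))
        # Normalize after adding the residual (post-norm).
        return self.norm(x + self.dropout(sublayer(x)))
```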

At any rate, thank you once again for this work and for publishing it!

@PrzemekPobrotyn
Contributor

Please see my response in #46.

