NN Built-In for Embedding Layers #2237

MaximilianSchreff · 2025-02-24T12:00:09Z

This PR adds the embedding layer as a built-in operator in our nn/layers library. The functionality is similar to torch.nn.Embedding (https://pytorch.org/docs/stable/generated/torch.nn.Embedding.html).

The layer takes a list of indices into an embedding dictionary as input and returns an embedding matrix whose row i is the embedding vector stored at position indices[i] of the dictionary.
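To make the lookup concrete, here is a minimal pure-Python sketch of the forward pass as described above. It is not the code from this PR: the function name `embedding_forward`, the list-of-lists representation, and 0-based indexing (as in PyTorch) are assumptions made only for illustration.

```python
def embedding_forward(indices, embeddings):
    """Hypothetical sketch: gather one dictionary row per input index.

    indices    -- list of integer positions into the dictionary (0-based here)
    embeddings -- embedding dictionary as a list of rows (vocab_size x dim)
    Returns a new matrix whose row i is a copy of embeddings[indices[i]].
    """
    return [list(embeddings[idx]) for idx in indices]


# Example: a dictionary with 3 entries of dimension 2.
dictionary = [[0.1, 0.2], [0.3, 0.4], [0.5, 0.6]]
output = embedding_forward([2, 0, 2], dictionary)
```

Note that a repeated index (here 2) yields repeated rows in the output, which matters for the backward pass.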

This layer is a standard component of transformer architectures. There, the indices usually come from a tokenizer, and the resulting embedding matrix is the input to the actual transformer model.

Testing

Testing forward pass and backward pass for correctness
Implemented as a component test in NNComponentTest.java
Manually calculated test cases for the forward pass
For the backward pass, comparison against PyTorch's autograd module
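For readers who want to see what the backward-pass check is comparing, the sketch below shows the usual gradient of an embedding lookup with respect to the dictionary: each upstream gradient row is added to the dictionary row it was gathered from. This is an illustrative assumption about the standard technique, not the implementation in this PR; `embedding_backward`, its parameters, and 0-based indexing are hypothetical.

```python
def embedding_backward(dout, indices, vocab_size):
    """Hypothetical sketch: scatter-add upstream gradients into the dictionary.

    dout       -- upstream gradient, one row per input index (len(indices) x dim)
    indices    -- the indices used in the forward pass (0-based here)
    vocab_size -- number of rows in the embedding dictionary
    Returns the gradient w.r.t. the dictionary (vocab_size x dim).
    """
    dim = len(dout[0]) if dout else 0
    d_embeddings = [[0.0] * dim for _ in range(vocab_size)]
    for grad_row, idx in zip(dout, indices):
        for j in range(dim):
            # Rows that were looked up more than once accumulate their gradients.
            d_embeddings[idx][j] += grad_row[j]
    return d_embeddings


# Example: index 2 was looked up twice, index 1 never.
grad = embedding_backward([[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]], [2, 0, 2], 3)
```

Rows of the dictionary that never appear in `indices` receive a zero gradient, and repeated indices sum their contributions, which is the behavior a comparison against autograd would exercise.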

codecov · 2025-02-25T11:22:12Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 72.47%. Comparing base (78b23cf) to head (336ef19).

Additional details and impacted files

@@            Coverage Diff            @@
##               main    #2237   +/-   ##
=========================================
  Coverage     72.46%   72.47%           
- Complexity    45453    45465   +12     
=========================================
  Files          1469     1469           
  Lines        170893   170893           
  Branches      33325    33325           
=========================================
+ Hits         123846   123863   +17     
+ Misses        37630    37617   -13     
+ Partials       9417     9413    -4

phaniarnab · 2025-03-05T14:05:20Z

Thanks @MaximilianSchreff. I will merge it in.

MaximilianSchreff added 5 commits February 13, 2025 15:03

Init function

5b541c9

Forward pass

c8d9969

Backward pass

1d7b500

Testing script

edef682

Added testing to automated component tests

336ef19
