How to implement attention when query and value have different hidden dims? #3121
Labels
feature request
New feature or request
question
Further information is requested
triaged
Issue has been triaged by maintainers
Hi, I'm trying to export an attention layer with different hidden dimensions for query and value to a TensorRT engine. Do you have any tips
The text was updated successfully, but these errors were encountered: