
🐛 [Bug] Reduce perf gap on RAFT #3731

@zewenli98

Bug Description

A user reported that Torch-TensorRT has slower inference on RAFT than the original PyTorch model:

Backend           Time (ms)   Speedup
-------------------------------------
Original             7.12        -
Torch-TensorRT      20.35      0.35x
ONNX-TRT             2.96      2.41x
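
For context, a minimal sketch of how such a comparison might be reproduced. It assumes torchvision's raft_large model, the dynamo frontend, FP32 precision, and illustrative input shapes and iteration counts; none of these are confirmed by the report.

```python
import torch
import torch_tensorrt
from torchvision.models.optical_flow import raft_large, Raft_Large_Weights

# Hypothetical input size (divisible by 8, as RAFT requires); the user's
# actual shapes were not given in the report.
img1 = torch.randn(1, 3, 520, 960, device="cuda")
img2 = torch.randn(1, 3, 520, 960, device="cuda")

model = raft_large(weights=Raft_Large_Weights.DEFAULT).eval().cuda()

# Compile with the dynamo frontend at FP32 (assumed precision).
trt_model = torch_tensorrt.compile(
    model,
    ir="dynamo",
    inputs=[img1, img2],
    enabled_precisions={torch.float32},
)

def time_ms(fn, iters=50, warmup=10):
    # CUDA-event timing: warm up first, then average over iters runs.
    for _ in range(warmup):
        fn()
    torch.cuda.synchronize()
    start = torch.cuda.Event(enable_timing=True)
    end = torch.cuda.Event(enable_timing=True)
    start.record()
    for _ in range(iters):
        fn()
    end.record()
    torch.cuda.synchronize()
    return start.elapsed_time(end) / iters

with torch.inference_mode():
    print(f"Original:       {time_ms(lambda: model(img1, img2)):.2f} ms")
    print(f"Torch-TensorRT: {time_ms(lambda: trt_model(img1, img2)):.2f} ms")
```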

I printed out the engine profiles for both Torch-TensorRT and ONNX-TRT; the Torch-TRT engine contains roughly twice as many layers as the ONNX-TRT engine.
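
For reference, one way to count layers and dump per-layer information from a serialized TensorRT engine is TensorRT's EngineInspector; the engine path below is a placeholder, not a file from the report.

```python
import tensorrt as trt

logger = trt.Logger(trt.Logger.WARNING)
runtime = trt.Runtime(logger)

# "raft.engine" is a placeholder for a serialized engine built by either path.
with open("raft.engine", "rb") as f:
    engine = runtime.deserialize_cuda_engine(f.read())

print(f"num_layers: {engine.num_layers}")

# EngineInspector is available in TensorRT >= 8.2; full per-layer detail
# requires the engine to be built with ProfilingVerbosity.DETAILED.
inspector = engine.create_engine_inspector()
print(inspector.get_engine_information(trt.LayerInformationFormat.JSON))
```

For the ONNX-TRT path, something like `trtexec --onnx=raft.onnx --dumpLayerInfo --profilingVerbosity=detailed` should produce comparable per-layer output.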
