We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
There was an error while loading. Please reload this page.
1 parent c286767 commit 92f62f8Copy full SHA for 92f62f8
tools/llm/run_llm.py
@@ -71,7 +71,7 @@ def get_model(args):
71
else:
72
model = model.to(torch.float32)
73
74
- return model.cuda()
+ return model
75
76
77
def compile_torchtrt(model, input_ids, args):
0 commit comments