[CUDA][shared memory allocation]fix 'ptxas error : Entry function 'fu… #17267

AIYoungcino · 2024-08-12T02:54:56Z

I convert a vit model from onnx, and then run relay.build with NVIDAI-RTX4090 for compilation.

with tvm.transform.PassContext(opt_level=3):
lib = relay.build(mod, target=target, params=params)
and then meet an error like this: Compilation error:
ptxas error : Entry function 'tvmgen_default_fused_nn_conv2d_add_kernel' uses too much shared data (0x2ab44 bytes, 0x29000 max)

I apologize for resorting to this temporary solution to address the issue I encountered. As a stepping stone, I hope the experts can offer some advice to help me resolve this problem more effectively. Thank you.

AIYoungcino · 2024-08-12T03:00:39Z

I searched for information provided by NVIDIA, and the following are the maximum shared memory limits corresponding to each generation of GPU architecture，
5.x : 64kb
6.x : 64kb
7.x : 96kb
8.x : 164kb

…sion_##' uses too much shared data'

vinx13 · 2024-08-16T00:01:18Z

Dynamic shared memory (shared.dyn scope) should be used in this case to bypass the size limit

AIYoungcino · 2024-08-16T02:33:39Z

Dynamic shared memory (shared.dyn scope) should be used in this case to bypass the size limit

Thank you for your advice. if the size of the result of conv2d exceeds the maximum shared memory limit, storing it in shared memory would lead to overflow. It's typically passed as a parameter to the kernel function or allocated GDRAm space through static extern at compile time.

[CUDA][shared memory allocation]fix 'ptxas error : Entry function 'fu…

72e90d8

…sion_##' uses too much shared data'

AIYoungcino force-pushed the fixmem branch from 6e09af0 to 72e90d8 Compare August 12, 2024 05:37

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[CUDA][shared memory allocation]fix 'ptxas error : Entry function 'fu… #17267

[CUDA][shared memory allocation]fix 'ptxas error : Entry function 'fu… #17267

AIYoungcino commented Aug 12, 2024

AIYoungcino commented Aug 12, 2024

vinx13 commented Aug 16, 2024

AIYoungcino commented Aug 16, 2024

[CUDA][shared memory allocation]fix 'ptxas error : Entry function 'fu… #17267

Are you sure you want to change the base?

[CUDA][shared memory allocation]fix 'ptxas error : Entry function 'fu… #17267

Conversation

AIYoungcino commented Aug 12, 2024

AIYoungcino commented Aug 12, 2024

vinx13 commented Aug 16, 2024

AIYoungcino commented Aug 16, 2024