https://github.com/pytorch/pytorch/blob/main/torch/_inductor/codegen/cpp_wrapper_gpu.py
https://github.com/pytorch/pytorch/blob/main/torch/_inductor/codegen/cpp_wrapper_gpu.py