We should use Halide's device interface to generate three pieces of code: the HLS kernel (device side), the testbench wrapper (host side), and the Zynq host code.