Hello!
Is there sample code for using dlpackrs (https://github.com/ehsanmok/dlpackrs.git) to convert TVM’s NDArray into an Array3<f32>?
Context: I’m using YOLOv5 converted to TVM. Inference runs in microseconds, but I’m forced to pull the whole output from GPU to CPU (output.to_vec()) too early, which takes about 50 to 100 milliseconds.
Thank you!