how do I use dlpackrs with apache TVM for zero-copy?

Hello!

is there a sample code for using dlpackrs (https://github.com/ehsanmok/dlpackrs.git) to convert tvm’s NDArray into Array3<f32> ?

context: I’m using YoloV5 converted to TVM. inference runs in microseconds, but I’m forced to pull the whole output from gpu to cpu (output.to_vec()) too early, which takes about 50~100 milliseconds.

thank you!