I used the auto tuning to optimize the performance of mobilenetv2-1.0-224x224 in Linux RK3399, and the time is 62ms without auto tuning and 64ms with auto tuning.
In my case, the target = tvm.target.arm_cpu(“rk3399”) and other parts are same as the tutarials.
How do I debug this problem?