Hi community,
I am new to TVM. I am using AutoScheduler to tune the mobilenet v2 quantized tflite model. I am tuning against Snapdragon 660 on Android devices
Here is my target string: llvm -device=arm_cpu -mtriple=arm64-linux-android -mattr=+neon
And I am using NDK aarch64-linux-android-clang++ to compile
The number of the trial is set to 20000 for 36 tasks. Here is the log last line:
Estimated total latency: 39.393 ms Trials: 98 Used time : 3294 s Next ID: 21
From the above log, it should be around 40ms, however, when I ran the model on device, it gives about 140ms.
Anyone has any clues about what is going wrong?
Thanks a lot!