TVM runtime automatically determines number of threads to use based on number of Big cores. Also number of threads to use can be specified by env var TVM_NUM_THREADS
.
But I’m not sure we can change number of threads while the program is running.
That is different from TFLite. TFLite Interpreter API allows to set number of threads at any time while the inference is running.
https://www.tensorflow.org/lite/api_docs/cc/class/tflite/interpreter#classtflite_1_1_interpreter_1ac6d9205e23cfe0a77ed2e69005bb7ec1