Hi there,
By default, TVM uses as many threads as it can (on a big.LITTLE board, that means the big cores). You can change that maximum by setting the environment variable TVM_NUM_THREADS=[num_threads] on the board being tuned.
For example, to cap it at two threads, you would start your RPC server with:
export TVM_NUM_THREADS=2 && python3 -m tvm.exec.rpc_server --host 0.0.0.0 --port=2083 --key=my_key
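If you start the server from a script instead of a shell, the same idea can be sketched in Python. This is only a rough sketch: `build_rpc_server_launch` and `launch_rpc_server` are helper names I made up for illustration (they are not part of TVM), and the host, port, and key defaults are just the values from the command above.

```python
import os
import subprocess
import sys


def build_rpc_server_launch(num_threads, host="0.0.0.0", port=2083, key="my_key"):
    """Return (argv, env) for starting the TVM RPC server with a thread cap.

    Hypothetical helper: it only assembles the command line and environment,
    mirroring the shell one-liner above.
    """
    if num_threads < 1:
        raise ValueError("num_threads must be at least 1")

    # Copy the current environment and add the thread cap for the child process.
    env = dict(os.environ)
    env["TVM_NUM_THREADS"] = str(num_threads)

    argv = [
        sys.executable, "-m", "tvm.exec.rpc_server",
        "--host", host,
        f"--port={port}",
        f"--key={key}",
    ]
    return argv, env


def launch_rpc_server(num_threads, **kwargs):
    """Start the RPC server as a child process (requires TVM on the board)."""
    argv, env = build_rpc_server_launch(num_threads, **kwargs)
    return subprocess.Popen(argv, env=env)


# Example (run on the board): server = launch_rpc_server(2)
```

Setting the variable only in the child's environment keeps the cap local to that one server process, so other TVM jobs on the same board are not affected.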
Personally, I’m still trying to figure out how to use all big cores and all LITTLE cores together.