Limit CPU cores for Auto tuned Model

Thanks!

I’ve tested the performance by limiting threads_number to 2, the latency increase from 13ms to 16ms which is totally acceptable to us. CPU usage decrease from %300 to about %120;

One another question: we want to try different option to resolve the “CPU too hot” problem; and run TVM on LITTLE cores with low energy cost is also one option we want to try; by reading the code in threading_backend.cc, it seems we can also configure TVM runtime to prefer LITTLE cores instead of big cores? how could we configure that(e.g. does calling the method before load TVM module works? do we need to re-tune to achieve that?)