I am using TVM to tune a model. With batch size 1 I get a good result, about 0.9 ms. But with batch size 8 I get 7.5 ms, which is similar to 8 * 0.9 ms. All results were measured after 20000 autotuning trials.
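To make the comparison concrete, here is a minimal sketch of the arithmetic. It uses only the two latencies quoted above; the variable names are just for illustration and nothing here calls TVM.

```python
# Latencies reported in this post, in milliseconds.
batch1_ms = 0.9   # measured at batch size 1
batch8_ms = 7.5   # measured at batch size 8
batch = 8

# Latency per sample when running at batch size 8.
per_sample_ms = batch8_ms / batch

# What batch size 8 would cost if latency scaled purely linearly from batch size 1.
linear_ms = batch * batch1_ms

# Per-sample gain from batching; a value below 1 means batching did not help.
speedup = batch1_ms / per_sample_ms

print(f"per-sample latency at batch 8: {per_sample_ms:.4f} ms")
print(f"linear-scaling estimate:       {linear_ms:.1f} ms")
print(f"per-sample speedup vs batch 1: {speedup:.2f}x")
```

So per sample, batch size 8 is no faster than batch size 1, which is why the number looks wrong to me.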
I don't think this is a good result. Did I miss an important setting?