AutoTVM and CPU vectorization: should I split?

If you already know the optimal split factor (e.g. from the width of the vector registers), you can split directly without searching. Calling s[A].vectorize(x) on an unsplit axis, by contrast, asks for the whole loop to be vectorized, which is impossible in many cases. On CPU, LLVM will decide how to handle such vectorization.
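
Here is a minimal sketch of the "split directly" approach. It assumes a TVM version that still ships the legacy te schedule API (te.create_schedule). The helper names vector_lanes and split_and_vectorize are made up for illustration and are not part of TVM, and the 256-bit register width is only an example value.

```python
def vector_lanes(register_bits, dtype_bits):
    """Number of elements of the given width that fit in one vector register."""
    return register_bits // dtype_bits


def split_and_vectorize(n, factor, dtype="float32"):
    """Split the loop by a fixed factor and vectorize only the inner part.

    Hypothetical helper: TVM is imported lazily so the arithmetic above
    can be used without TVM installed.
    """
    import tvm
    from tvm import te

    A = te.placeholder((n,), name="A", dtype=dtype)
    B = te.compute((n,), lambda i: A[i] * 2.0, name="B")

    s = te.create_schedule(B.op)
    # Outer loop stays scalar; inner loop has a fixed extent of `factor`.
    xo, xi = s[B].split(B.op.axis[0], factor=factor)
    # Vectorize only the fixed-size inner loop, not the whole axis.
    s[B].vectorize(xi)
    return tvm.lower(s, [A, B], simple_mode=True)


# Example assumption: 256-bit registers and float32 data give 8 lanes.
factor = vector_lanes(256, 32)
```

Calling split_and_vectorize(1024, factor) and printing the result should show the lowered loop nest, so you can check what the inner loop was turned into. If you don't know a good factor up front, that is when an AutoTVM split knob is worth searching over.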