Compiling model with target="llvm" not faster

I see in quote a part of the tuning trace starting from ElapsedTime(s) 17787 EstimatedLatency(ms) 471.479 Trials 10176, it refers to 10899 trial, and I referred it as “first line”. while if you take a look into full file, the first line should start from 29*64=1856 trial. and perf from this 1856 to 10176 should be improved significantly