TVM's get_output function is time-consuming with Mali openCL on RK3399

run on CPU should cost “real” running time.