Quick question. Does time_evaluator
include time transferring data from CPU to GPU, and GPU to CPU? What else are included besides kernel launch? Thanks!
It should only include kernel launch time, see here.
Transfer from CPU to GPU happens when you create tvm.nd.array
from numpy array, and transfer from GPU to CPU happens when you do asnumpy()
on tvm.nd.array.
1 Like