Is one goal of TVM to accelerate the inference speed?
I think if one goal for TVM is to accelerate the inference speed, it should be introduced clearly.
Inference speed on CPU is a rigid demand for users.
Is one goal of TVM to accelerate the inference speed?
I think if one goal for TVM is to accelerate the inference speed, it should be introduced clearly.
Inference speed on CPU is a rigid demand for users.
Just googled “tvm bert cpu”.
Problem solved.