How can I test the performance of a single operator?

Hi guys, I’m new to TVM and I was trying to test the performance of a single operator on NVGPU. So far I found the doc about how to test model benchmark(https://github.com/apache/incubator-tvm/blob/main/apps/benchmark/README.md). And the doc about [Tuning High Performance Convolution on NVIDIA GPUs] (https://tvm.apache.org/docs/tutorials/autotvm/tune_conv2d_cuda.html). But neither of them satisfies my needs. What I want is to test the single op performance instead of testing a whole model nor self-defining an operator and tune it. Can anybody tell me how to do this?