How CUDA kernel is launched in TVM stack

I don’t know or think if we are exposing CUDA stream abstraction to python frontend. We typically don’t care about cuda stream (we don’t support any concurrency at runtime).

What is your use case?