Is It enough to set tvm.target.Target(‘cuda’) in order for TVM to take full advantage of specific hardware platforms? because I have seen targets like “nvidia/geforce-rtx-3090”, “nvidia/nvidia-t4” and so on, but it is not a right target for “nvidia/geforce-rtx-a6000”. When I use target "nvidia/geforce-rtx-3090, the target specific information like this:
cuda -keys=cuda,gpu -arch=sm_86 -max_num_threads=1024 -max_threads_per_block=1024 -registers_per_block=65536 -shared_memory_per_block=49152 -thread_warp_size=3,
but if I set the target “cuda -arch=sm_86”, the target specific information only such like:
cuda -keys=cuda,gpu -arch=sm_86 -max_num_threads=1024 -thread_warp_size=32
should I add all other arget specific information for getting better tuning performance? Or could TVM get these information by it self?