I attempt to use cuda auto tuning by running tune_conv2d_cuda.py in tutorial section. However, it does not stop. When I manually stop the process, it returns “ConnectionRefusedError: [Errno 111] Connection refused”.
I build the TVM by using LLVM-10 and GCC 6.3 and turn on the cuda and LLVM flag and turn off the VTA flags. I run the code by suing python 3.5. The hardware is Tesla P100. How can I solve this issue? Is it related to RPC?
I list the HW info, freezing image and error image below.
I think the tune_conv2d_cuda.py is using localrunner in line 190.
"
measure_option = autotvm.measure_option(
builder=autotvm.LocalBuilder(),
runner=autotvm.LocalRunner(repeat=3, min_repeat_ms=100, timeout=4)
)
"
Am I correct?
Sorry, I have misunderstood the problem. It seems that you have finished the tuning and got a best config. The only problem may be the rpc sever will not exit. I think it will not influence the result, so it is ok to kill it after the final output.
I think the issue is related to python version I ran. I used to use python 3.5.2 and programs froze because of non-join threads. When I switch to 3.6.3, issue is gone.