When I run a modified version of the tutorial file “tune_relay_cuda.py” (using target = “rocm”), I get the following error some time auto-tuning starts
Tuning...
Task(func_name=topi_nn_conv2d, args=(('TENSOR', (1, 512, 14, 14), 'float32'), ('TENSOR', (512, 512, 3, 3), 'float32'), (1, 1), (1, 1, 1, 1), (1, 1), 'NCHW', 'float32'), kwargs={}, workload=('conv2d', (1, 512, 14, 14, 'float32'), (512, 512, 3, 3, 'float32'), (1, 1), (1, 1, 1, 1), (1, 1), 'NCHW', 'float32'))
rocm
[Task 1/ 9] Current/Best: 21.86/3183.80 GFLOPS | Progress: (60/100) | 174.55 s
Segmentation fault (core dumped)
I am using a Vega 20 AMD GPU and I was wondering if I should add the -model xx
definition to the target to avoid this.
I was wondering if somebody has experienced the same issue in the past. Any information on this issue would be greatly appreciated