Hi,
When I test the [tune_conv2d_cuda.py] sample, some erros are reported:
Best config:
[(‘tile_f’, [8, 2, 8, 2]), (‘tile_y’, [7, 2, 1, 2]), (‘tile_x’, [1, 1, 28, 1]), (‘tile_rc’, [128, 2, 2]), (‘tile_ry’, [1, 3, 1]), (‘tile_rx’, [1, 3, 1]), (‘auto_unroll_max_step’, 0), (‘unroll_explicit’, 1)],None,453139402
Finish loading 10724 records
Traceback (most recent call last):
File “tune_conv2d_cuda.py”, line 208, in
func(a_tvm, w_tvm, c_tvm)
File “/home/yzw/tvm/python/tvm/_ffi/function.py”, line 128, in call
return f(*args)
File “/home/yzw/tvm/python/tvm/_ffi/_ctypes/function.py”, line 184, in call
ctypes.byref(ret_val), ctypes.byref(ret_tcode)))
File “/home/yzw/tvm/python/tvm/_ffi/base.py”, line 66, in check_call
raise TVMError(py_str(_LIB.TVMGetLastError()))
tvm._ffi.base.TVMError: [13:20:02] /home/yzw/tvm/src/runtime/module_util.cc:52: Check failed: ret == 0 (-1 vs. 0) Assert fail: (int32(arg0.shape[1]) == 512), Argument arg0.shape[1] has an unsatisfied constraint
Stack trace returned 10 entries:
[bt] (0) /home/yzw/tvm/build/libtvm.so(dmlc::StackTraceabi:cxx11+0x5b) [0x7f08de935dab]
[bt] (1) /home/yzw/tvm/build/libtvm.so(+0xa029ce) [0x7f08ded529ce]
[bt] (2) /home/yzw/tvm/build/libtvm.so(TVMFuncCall+0x5e) [0x7f08ded3778e]
[bt] (3) /home/lychee/anaconda3/envs/yzw/lib/python3.7/lib-dynload/…/…/libffi.so.6(ffi_call_unix64+0x4c) [0x7f08e43d5ec0]
[bt] (4) /home/lychee/anaconda3/envs/yzw/lib/python3.7/lib-dynload/…/…/libffi.so.6(ffi_call+0x22d) [0x7f08e43d587d]
[bt] (5) /home/lychee/anaconda3/envs/yzw/lib/python3.7/lib-dynload/_ctypes.cpython-37m-x86_64-linux-gnu.so(_ctypes_callproc+0x2ce) [0x7f08e45eaf8e]
[bt] (6) /home/lychee/anaconda3/envs/yzw/lib/python3.7/lib-dynload/_ctypes.cpython-37m-x86_64-linux-gnu.so(+0x129c4) [0x7f08e45eb9c4]
[bt] (7) python(_PyObject_FastCallKeywords+0x49b) [0x55f94951211b]
[bt] (8) python(_PyEval_EvalFrameDefault+0x523e) [0x55f94957e2ce]
[bt] (9) python(_PyEval_EvalCodeWithName+0x2e8) [0x55f9494b7528]
test case: N, H, W, CO, CI, KH, KW, strides, padding = 1, 28, 28, 512, 256, 3, 3, (1, 1), (1, 1), the erro can recurrent