Hi, when I run the example in test_cutlass.py, I get the following error:
INFO:topi:dense pad_to_tensorcore, extra_flops_ratio 0.002197802197802198
INFO:te_compiler:Using injective.cuda for nn.pad based on highest priority (10)
WARNING:autotvm:One or more operators have not been tuned. Please tune your model for better performance. Use DEBUG logging level to see more details.
INFO:te_compiler:Using dense_tensorcore.cuda for nn.dense based on highest priority (20)
INFO:te_compiler:Using injective.cuda for strided_slice based on highest priority (10)
INFO:cutlass:Picked the first kernel found cutlass_tensorop_h1688gemm_256x128_32x2_tn_align4
Traceback (most recent call last):
File "test_cutlass.py", line 542, in <module>
test_dense()
File "test_cutlass.py", line 301, in test_dense
verify_dense(get_dense(M, N, K), M, N, K)
File "test_cutlass.py", line 248, in verify_dense
out = get_output(rt_mod, ["data"], [x])
File "test_cutlass.py", line 62, in get_output
rt_mod.run()
File "/data/tvm-0.9/python/tvm/contrib/graph_executor.py", line 207, in run
self._run()
File "/data/tvm-0.9/python/tvm/_ffi/_ctypes/packed_func.py", line 237, in __call__
raise get_last_ffi_error()
tvm._ffi.base.TVMError: Traceback (most recent call last):
6: TVMFuncCall
5: tvm::runtime::GraphExecutor::Run()
4: _ZNSt17_Function_handlerIFvvEZN3tvm7runtime13GraphExecutor11CreateTVMOpERKNS2_10TVMOpParamERKSt6vectorI8DLTensorSaIS8_EEEUlvE1
3: std::_Function_handler<void (tvm::runtime::TVMArgs, tvm::runtime::TVMRetValue*), tvm::runtime::WrapPackedFunc(int (*)(TVMValue*, int*, int, TVMValue*, int*, void*), tvm::runtime::ObjectPtr<tvm::runtime::Object> const&)::{lambda(tvm::runtime::TVMArgs, tvm::runtime::TVMRetValue*)#1}>::_M_invoke(std::_Any_data const&, tvm::runtime::TVMArgs&&, tvm::runtime::TVMRetValue*&&)
2: tvmgen_default_cutlass_main
1: _Z38tvmgen_defau
0: tvmgen_default_cutlass_main_0_(DLTensor*, DLTensor*, DLTensor*)
File "./tmp/lib0.cu", line 81
TVMError: Check failed: (status == cutlass::Status::kSuccess) is false:
Environment: TVM f94ea8, CUDA 11.0, Python 3.7.7
Can you help me figure out what the problem is? @masahi, thanks.