N= 2**5
D = 64*64
M = 512
with tvm.target.cuda():
X = te.placeholder((N, D), name="X")
W = te.placeholder((D, M), name="W")
B = te.placeholder((1, M), name="B")
H = topi.einsum("ik,kj->ij",X,W)
Y = H+B
s=topi.cuda.schedule_injective([Y])
code =tvm.lower(s, [X, W,B,H,Y], simple_mode=True)
f = tvm.build(code,target = 'cuda',target_host="llvm")
I just want to build a simple perceptron with topi. And it comes up with this fault, printing some generate cuda codes while there is no print call.
I am using 0.8 dev of TVM, llvm 10.0. gcc7 and cuda 11.2 if that helps.