N= 2**5
D = 64*64
M = 512
with tvm.target.cuda():
X = te.placeholder((N, D), name="X")
W = te.placeholder((D, M), name="W")
B = te.placeholder((1, M), name="B")
H = topi.einsum("ik,kj->ij",X,W)
Y = H+B
s=topi.cuda.schedule_injective([Y])
code =tvm.lower(s, [X, W,B,H,Y], simple_mode=True)
f = tvm.build(code,target = 'cuda',target_host="llvm")
I just want to build a simple perceptron with topi. And it comes up with this fault, printing some generate cuda codes while there is no print call.
Thanks for your suggestion. I am also learning TVM. But I don’t see the point of how the input size relates to the build process. IMO, it’s translating the generated C-like codes(the output of lower ) to executable dynamic lib(I saw .so file was created when the script was running).