Two Questions:
-
TVMArrayAlloc
API don’t support allocDLTensor
with strides, although we can set strides outside, but the data memory can’t change again if strides is larger than shape. -
If we export an op library and using it in c/c++ runtime, and bind input/output with decl_buffer that using strides option, should we care about strides in python compute primitives?
Thanks.