Hello, I’m trying to compile VITS (GitHub - jaywalnut310/vits: VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech) model using TVM. I converted VITS into ONNX then giving it to TVMC it throws error “Check failed: (!axes.defined() || static_cast(axes.size()) == ndim) is false: Dimension mismatch: axes has 3 elements, but data.ndim = 4”. I’m getting any trace for the node where this error is generated. I have also verified model using ONNXRuntime, It working fine on ONNXRuntime. I also tried to compile model dividing into parts, this gives me following error “NotImplementedError: Currently the operator ‘gather_nd’ is not supported.” Any solution for this ? I’m really stuck at this compilation.
Model file is here : vits.onnx - Google Drive