Support for TVM Relay with WASM Runtime

I have an int8-quantized model (graph JSON and params) compiled to WASM, plus a separate params file that holds the dequantization data. When loading the model in a Node.js app, I need to dequantize the weights back to fp32. That means reading the parameter files, dequantizing the weights, and saving the parameter dictionary again.
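To make the dequantization step concrete, here is a minimal sketch in TypeScript. It assumes a per-tensor affine scheme, fp32 = scale * (q - zeroPoint); the actual scheme and the layout of the dequantization params file are not stated in this thread. The `QuantParams` type and the `dequantize` helper are hypothetical names, not part of the TVM API.

```typescript
// Hypothetical per-tensor quantization metadata (field names are assumptions).
interface QuantParams {
  scale: number;      // fp32 step size per int8 unit
  zeroPoint: number;  // int8 value that maps to 0.0
}

// Affine dequantization: real = scale * (q - zeroPoint).
function dequantize(q: Int8Array, p: QuantParams): Float32Array {
  const out = new Float32Array(q.length);
  for (let i = 0; i < q.length; i++) {
    out[i] = p.scale * (q[i] - p.zeroPoint);
  }
  return out;
}

// Example: three int8 weights with scale 0.5 and zero point 0.
const weights = dequantize(new Int8Array([-128, 0, 127]), { scale: 0.5, zeroPoint: 0 });
console.log(Array.from(weights)); // [ -64, 0, 63.5 ]
```

The arithmetic itself is simple. The open part is reading and writing the serialized parameter dictionary from JavaScript, which is what the question below is about.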

Is there a way to use the TVM Relay functions that give access to the graph and parameters (save_param_dict, load_param_dict) from a WASM library, the same way the TVM Runtime is supported?

Following up here: Are there plans to extend TVM Relay support to WASM?

Hi, I am curious about your method for compiling the module, JSON, and params into a single .wasm file. Could I get some guidance on that?

Here’s a great example by Tianqi of how to compile the model.

I took this example and applied it to a pretrained TF model to compile it to WASM. If you look carefully at the repo, you will also find a sample script for deploying the model with WebGPU.