[Unity][Tutorial] TVM Unity BYOC

sunggg · March 20, 2023, 10:05pm

As we presented in TVMCon’23. in TVM Unity, compilation flow is modular and composable including BYOC.

This tutorial walk-throughs how BYOC offloading works in Unity pass infra and how it works with other passes, such as lowering and MetaSchedule tuning pass.

Hope this helps and please follow-up on this thread if you have any question or feedback. Thank you!

qzylalala · March 21, 2023, 12:10pm

Great! BTW, when will ‘relax’ be merged into the main branch?

tqchen · March 21, 2023, 2:09pm

Checkout Establish TVM Unity Branch for background, more tutorials will be published and you can play with the code in unity branch atm

tqchen · March 21, 2023, 5:15pm

Please do post followup questions in here(there is a unity tags in forum) and I am sure the community would love to hear about more feedbacks and bring discussions together

biboyang · September 2, 2023, 1:35am

Great tutorial! I have watched the TVMCon23 recording for this tutorial.

My situation is: I have my Python Runtime that can call my HW primitives library from host CPU to a PCIe accelerator card. I can follow the ‘FuseOpsByPattern’ way to map a Relax subgraph (conv_relu for example) to one of my primitives. I want to compile the CPU Runtime with my Python Runtime into one executable, so I can accelerate certain subgraphs within each layer in a multi-layer neural network model.

My question is: How can I call my Python Runtime from the Relax VM (, multiple times)?

My assumption is: I need to register my Python Runtime with MyMod.attrs[‘external_mods’]. This part is missing in the tutorial, since TensorRT is a registered BYOC runtime.

biboyang · September 2, 2023, 6:45am

Is it possible to do the registration as UMA tutorial does: tvm/apps/uma/_template/backend.py? It suits my situation since I also have C Runtime API for my accelerator.

biboyang · September 3, 2023, 12:59am

Or I can register my Python Runtime API without re-compiling TVM, as shown in Registering Runtime Function in Chap.4 of MLC.ai class?

biboyang · September 18, 2023, 11:57pm

Just verified: We are able to call the Python Runtime API from Relax.