from the perspective of deployment flow, are these truly required ?
first let’s clarify deployment flow. so far we’ve demonstrated:
- standalone deployment, graph runtime on device.
- deployment with an RPC server and host-driven graph runtime
ultimately I presume deployment means #1. from that perspective, you absolutely don’t need a flasher or transport. these are just useful for case #2 and for autotuning.
As for the debugging, that would be debugging of operator codegen of tvm and is it an expectation for the user to debug them ?
certainly we would hope this would not be the typical use case, but we should design a debug flow that works.
I think we need to discuss the possibility of a tuner app
could you elaborate more on this? i think we should avoid linking outside of gcc if we can help it, but if there’s good motivation we should discuss it.