I am trying to learn TVM with a task of porting the whisper speech to text model to android phone. I understand there are two ways to address my problem i.e relay → TIR and then relax → TIR. Which approach should I consider? If it is relax, can somebody suggests some examples. I tried to understand mlc-llm but still it is not straight forward.
1 Like
if you are working on foundational models that involves dynamic shape(e.g. whisper) likely relax is something that would be relevant. We also have an in progress update of mlc-llm through nn.Module that hopefully makes some of the porting easier
1 Like
Hi, was there any progress on whisper using tvm? I’d like to contribute. Thanks!