Relay vs Relax with the use case of whisper speech to text model for android phones

musram · November 15, 2023, 4:55am

I am trying to learn TVM with a task of porting the whisper speech to text model to android phone. I understand there are two ways to address my problem i.e relay → TIR and then relax → TIR. Which approach should I consider? If it is relax, can somebody suggests some examples. I tried to understand mlc-llm but still it is not straight forward.

tqchen · November 15, 2023, 8:48pm

if you are working on foundational models that involves dynamic shape(e.g. whisper) likely relax is something that would be relevant. We also have an in progress update of mlc-llm through nn.Module that hopefully makes some of the porting easier

nundys · January 5, 2024, 2:22pm

Hi, was there any progress on whisper using tvm? I’d like to contribute. Thanks!