I am trying to do something similar using MLC-LLM. Could you take a look at this issue: "How to do kernel level profiling for LLMs using MLC-LLM"?