Hey, I want to profile a TVM module, e.g. hardware counters such as cache-misses and last-level cache (LLC) misses. How can I do this?
I see, but perf only gives summary statistics for the whole program, which adds a lot of noise when profiling a single module. It seems there is no tool that lets us inspect cache behavior for just a region of Python code.
Hmm, why do we need to profile Python? It is slow anyway.
Thanks for your reply. I see TVM’s C++ API now, I’ll try.
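For anyone landing here later: one way to reduce the whole-program noise mentioned above is to put only the module call in a small standalone program and run that under `perf stat`. Below is a minimal sketch, not an official TVM workflow. The event names are assumptions (check `perf list` for what your CPU exposes), and `run_module.py` in the usage note is a hypothetical script that just loads and runs the module in a loop.

```python
# Event names are assumptions; availability depends on the CPU and kernel.
EVENTS = ["cache-misses", "LLC-load-misses"]


def build_perf_cmd(target_cmd, events=EVENTS):
    """Return an argv list that runs target_cmd under `perf stat`."""
    # -x, switches perf stat to comma-separated output (written to stderr).
    return ["perf", "stat", "-x,", "-e", ",".join(events), "--"] + list(target_cmd)


def parse_perf_csv(text):
    """Map event name -> count from `perf stat -x,` output."""
    counts = {}
    for line in text.splitlines():
        line = line.strip()
        fields = line.split(",")
        # Skip comment lines and anything too short to hold value,unit,event.
        if line.startswith("#") or len(fields) < 3:
            continue
        value, event = fields[0], fields[2]
        # Unsupported or unscheduled events are reported as
        # "<not supported>" / "<not counted>" instead of a number.
        counts[event] = int(value) if value.isdigit() else None
    return counts
```

To use it, something like `subprocess.run(build_perf_cmd(["python", "run_module.py"]), stderr=subprocess.PIPE, text=True)` and then `parse_perf_csv(result.stderr)` should work. The counts still include interpreter startup, so running the module many times in the loop (or using a small C++ binary instead of Python) helps the module's own behavior dominate.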