Hexagon simulator + Apache TVM : memory benchmarking

I want to benchmark matrix multiplication in terms of memory for TVM + Hexagon simulator. Are there any resources related to below topics ?

  • Memory utilization
  • Cache utilization
  • SIMD utilization