PAPI counters with basic matmul Relay function

Here is a PR doing what you want: https://github.com/apache/tvm/pull/9553 (though it takes an IRModule instead of a PackedFunc).