I was experiencing an issue while following the “How To Guide” Getting Started With PAPI¶. I’ve had PAPI installed, and built tvm with papi. However, the output of the code in the link above doesn’t output as expected, but as below.
Name Duration (us) Percent Device Count Argument Shapes Hash VM::Argument Shapes
fused_nn_dense_nn_bias_add_nn_relu 24.12 23.20 cpu0 1 float32[1, 784], float32[128, 784], float32[128], float32[1, 128] 35ac6d50e6e03a62
fused_nn_dense_nn_bias_add_nn_relu_1 6.52 6.27 cpu0 1 float32[1, 128], float32[64, 128], float32[64], float32[1, 64] 7c89e1efbba1ce3b
fused_nn_dense_nn_bias_add 4.91 4.72 cpu0 1 float32[1, 64], float32[10, 64], float32[10], float32[1, 10] 8a679957c4723fed
VM::AllocStorage 3.82 3.68 cpu0 5 float32[3136]
fused_nn_batch_flatten 1.39 1.33 cpu0 1 float32[1, 1, 28, 28], float32[1, 784] cafe14d2106368be
VM::AllocTensor 0.99 0.96 cpu0 2 float32[1, 10]
fused_nn_softmax 0.74 0.71 cpu0 1 float32[1, 10], float32[1, 10] 0cc19816e7a3c070
VM::AllocTensor 0.65 0.63 cpu0 1 float32[1, 784]
VM::AllocTensor 0.58 0.56 cpu0 1 float32[1, 64]
VM::AllocTensor 0.49 0.47 cpu0 1 float32[1, 128]
----------
Sum 44.21 42.54 15
Total 103.94 cpu0 1
Also, chaging the metrics to be collected didn’t hava an effect on the output. The output of the code remained same. I’ve tried papi_avail
command and it showed
Of 108 possible events, 0 are available, of which 0 are derived.
No events detected! Check papi_component_avail to find out why.
And papi_component_avail
command showed
Compiled-in components:
Name: perf_event Linux perf_event CPU counters
\-> Disabled: Unknown libpfm4 related error
Name: perf_event_uncore Linux perf_event CPU uncore and northbridge
\-> Disabled: No uncore PMUs or events found
I reinstalled perf with command apt-get install linux-tools-common linux-tools-generic linux-tools-`uname -r`
and now perf -v
command shows perf version 6.5.13
. But the issue above still exists. Does anyone know how to solve it?