FP16 is slower than FP32 in TVM?

See the thread "ARM FP16 instrin support in M1 chip" (post #4 by jwfromm).

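Try adding -mattr=+fullfp16 to your target string. Without that attribute, LLVM does not emit the ARMv8.2-A half-precision arithmetic instructions and instead promotes FP16 values to FP32 for the computation, which can leave FP16 no faster (or slower) than FP32.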
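As a rough sketch of what that looks like, here is a small helper that adds the attribute to a target string. The helper name `with_fullfp16` and the base target string are just illustrative assumptions, not something from the linked thread; swap in whatever target you are already using.

```python
def with_fullfp16(target: str) -> str:
    """Return the target string with +fullfp16 present in its -mattr option.

    Hypothetical helper for illustration; it only edits the string.
    """
    parts = target.split()
    for i, part in enumerate(parts):
        if part.startswith("-mattr="):
            # Merge into the existing attribute list instead of adding a second -mattr.
            attrs = part[len("-mattr="):].split(",")
            if "+fullfp16" not in attrs:
                attrs.append("+fullfp16")
            parts[i] = "-mattr=" + ",".join(attrs)
            return " ".join(parts)
    # No -mattr option yet, so append one.
    parts.append("-mattr=+fullfp16")
    return " ".join(parts)


# Assumed example target for an Apple Silicon machine; adjust to your setup.
base = "llvm -mtriple=arm64-apple-darwin"
target_str = with_fullfp16(base)
print(target_str)
```

The resulting string can then be passed wherever you currently pass your target, for example `tvm.target.Target(target_str)`, assuming TVM is installed and built with an LLVM that knows this attribute.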