Why are neither 256bit SIMD vectors nor FMAD instructions used by autotuning?

I realized that the docs for tvm.target.Target state that mcpu options only serves as an annotation. Thus, the mattr string increases in importance, and I need to pass the +fma flag too.