Best way to deal with kernel layout?

This looks more like a hack. :slight_smile:

If I want to do it in Relay, I should add a variant of nn.dense (say, named nn.dense_transposed_kernel) and then register a function convert_dense(…) with register_convert_op_layout("nn.dense"), right?
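
To make sure we mean the same thing by "transposed kernel", here is a minimal plain-Python sketch of the two layouts. It assumes nn.dense keeps its weight as (units, in_dim) and computes x times the transposed weight, while the proposed variant would take the weight as (in_dim, units). The helper names below are hypothetical and only illustrate the math; none of this is TVM code.

```python
# Plain-Python illustration only; these helpers are hypothetical, not TVM APIs.

def dense(x, w):
    """nn.dense convention: x is (batch, in_dim), w is (units, in_dim).
    Returns x @ w.T as a list of lists."""
    return [[sum(a * b for a, b in zip(row, w_row)) for w_row in w] for row in x]


def dense_transposed_kernel(x, w_t):
    """Hypothetical variant: w_t is (in_dim, units). Returns x @ w_t."""
    units = len(w_t[0])
    return [
        [sum(row[k] * w_t[k][j] for k in range(len(row))) for j in range(units)]
        for row in x
    ]


def transpose(m):
    """Swap the two axes of a list-of-lists matrix."""
    return [list(col) for col in zip(*m)]


x = [[1, 2, 3], [4, 5, 6]]  # (batch=2, in_dim=3)
w = [[1, 0, 1], [0, 1, 0]]  # (units=2, in_dim=3)

# Both layouts give the same result once the kernel is transposed up front.
assert dense(x, w) == dense_transposed_kernel(x, transpose(w))
```

So the layout conversion would only need to swap the op and transpose the (constant) weight once, and the output shape stays the same.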