[Question] Does the current version of auto_scheduler support search using wmma?

chenugray · July 18, 2022, 4:06am

Same question as in the title. Does anyone know?

vinx13 · July 19, 2022, 12:10am

It’s not supported in auto_scheduler. However, meta schedule does support it. We recently upstreamed auto tensorization, and it is now available in the main branch. Here’s an example of tuning with meta schedule: tune_gemm.py · GitHub

chenugray · July 19, 2022, 12:04pm

Thank you for your reply! I read the paper "TensorIR: An Abstraction for Automatic Tensorized Program Optimization". It says that TensorIR can run the BERT-large model as fast as TensorRT. Does it use meta_schedule to search for better op implementations?

When I use auto_scheduler to tune the model exported from TensorFlow (bert-base, batch 8, seq 384), it achieves about 100 fps, while FasterTransformer reaches 1100 fps. Should I use meta_schedule for the search?

@vinx13

vinx13 · July 19, 2022, 10:53pm

Yes, it uses meta schedule, which applies tensorization (CUDA fp16 tensor cores) and software pipelining for optimization.

eAzure · October 17, 2022, 8:21am

Does the order of the rules have any influence on meta schedule? I think the order of the rules could not be changed in Ansor before. @vinx13

vinx13 · October 17, 2022, 11:31pm

The order matters. The rules are applied to the workload in the specified order.
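
To make the point concrete, here is a minimal toy sketch in plain Python. It is not the TVM API: the "rules" `tile` and `tensorize_if_tiled16` and the helper `apply_in_order` are hypothetical stand-ins that transform a list of candidate schedules (represented as strings). It only illustrates why applying the same rules in a different order can produce a different design space.

```python
# Toy model only: these "rules" are plain functions on a list of candidate
# schedules (strings). They are NOT TVM ScheduleRule objects.

def tile(candidates):
    # Each candidate forks into two tiling choices.
    return [c + "+tile8" for c in candidates] + [c + "+tile16" for c in candidates]

def tensorize_if_tiled16(candidates):
    # Hypothetical rule: only fires on candidates already tiled by 16.
    return [c + "+wmma" if c.endswith("+tile16") else c for c in candidates]

def apply_in_order(rules, start="matmul"):
    # Apply each rule to the full candidate list, in the order given.
    candidates = [start]
    for rule in rules:
        candidates = rule(candidates)
    return candidates

tile_first = apply_in_order([tile, tensorize_if_tiled16])
tensorize_first = apply_in_order([tensorize_if_tiled16, tile])

print("tile, then tensorize:", tile_first)
print("tensorize, then tile:", tensorize_first)
```

In this toy, the tensorize rule finds nothing to match when it runs before tiling, so the second ordering never produces a `+wmma` candidate.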

eAzure · October 18, 2022, 12:58am

OK. If I want to see the changes after each rule is applied, how can I do it?

vinx13 · October 18, 2022, 5:32pm

You can add some logging here: https://github.com/apache/tvm/blob/main/src/meta_schedule/space_generator/post_order_apply.cc#L173. You can also run each rule separately (see this test case for an example: https://github.com/apache/tvm/blob/main/tests/python/unittest/test_meta_schedule_schedule_rule_mlt.py).
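
The logging idea can be sketched in plain Python as follows. This is a hypothetical stand-in, not the code in post_order_apply.cc or any TVM API: `split_loop`, `unroll`, and `apply_with_trace` are made-up names, and candidates are just strings. It shows the general pattern of recording the candidate set after each rule in an apply loop.

```python
# Hypothetical sketch: log the candidates after every rule in an apply loop.
# The rules below are toy functions on strings, not TVM schedule rules.
import logging

logging.basicConfig(level=logging.INFO, format="%(message)s")
log = logging.getLogger("rule_trace")

def split_loop(candidates):
    # Toy rule: fork every candidate into two split factors.
    return [c + "+split4" for c in candidates] + [c + "+split8" for c in candidates]

def unroll(candidates):
    # Toy rule: annotate every candidate with unrolling.
    return [c + "+unroll" for c in candidates]

def apply_with_trace(rules, candidates):
    # Apply rules in order, logging and recording the state after each one.
    trace = []
    for rule in rules:
        candidates = rule(candidates)
        log.info("after %s: %s", rule.__name__, candidates)
        trace.append((rule.__name__, list(candidates)))
    return candidates, trace

final, trace = apply_with_trace([split_loop, unroll], ["matmul"])
```

Running a single rule on its own, as the linked test case does with the real rules, corresponds here to passing a one-element rule list.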

eAzure · October 19, 2022, 12:55am

Ok, thanks for your reply!