Hi, are there any TE schedule examples that outperform cuBLAS using Tensor Cores, for example for the dense op? Thanks!
I’m afraid not. Please try TensorIR with MetaSchedule.
May I ask whether that is due to a limitation of TE schedule?
Could you tell me why? Thanks!
There are several reasons:
- Technical: TE schedule is built on scalar computation and the schedule tree, which makes it hard (or even impossible) to generate tensorized programs that perform well.
- Community: we are pushing TensorIR and MetaSchedule, which are more expressive and perform better in most cases, so there is little motivation to reproduce that result with legacy techniques at great difficulty.
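To make the technical point a bit more concrete, here is a rough plain-Python sketch (not TVM code) of the two views of a matmul. In the scalar view the innermost statement is a single multiply-add; in the block view the innermost statement is a whole-tile multiply-accumulate, which is roughly the shape a Tensor Core instruction expects. `TILE`, `matmul_scalar`, `mma_tile`, and `matmul_blocked` are hypothetical names made up for illustration, and the tile size is kept small for readability.

```python
# Illustrative only: plain Python, not TVM API. All names here are hypothetical.
TILE = 4  # assumption: small tile for readability; hardware fragments are larger


def matmul_scalar(A, B, n):
    """Scalar view: the innermost statement is one multiply-add."""
    C = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            for k in range(n):
                C[i][j] += A[i][k] * B[k][j]
    return C


def mma_tile(C, A, B, i0, j0, k0):
    """Stand-in for one matrix-multiply-accumulate on a TILE x TILE block."""
    for i in range(TILE):
        for j in range(TILE):
            for k in range(TILE):
                C[i0 + i][j0 + j] += A[i0 + i][k0 + k] * B[k0 + k][j0 + j]


def matmul_blocked(A, B, n):
    """Block view: the innermost statement is a whole-tile operation."""
    assert n % TILE == 0, "n must be a multiple of TILE in this sketch"
    C = [[0.0] * n for _ in range(n)]
    for i0 in range(0, n, TILE):
        for j0 in range(0, n, TILE):
            for k0 in range(0, n, TILE):
                mma_tile(C, A, B, i0, j0, k0)
    return C
```

Both functions compute the same result; the difference is only in what counts as the basic unit of work. A scheduler that reasons about scalar statements has to recover the tile structure after the fact, whereas one that treats the tile as a unit can map it to a hardware intrinsic directly.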
OK, thanks! I’m trying TensorIR, and it’s very good!