I have been studying the features extracted from the low-level AST in the paper:
“Learning to Optimize Tensor Programs”
I came across the example below,
but I am having trouble understanding the numbers | x | 8 | 8 | 64 |, since I do not see “x” indexing “A”. The same applies to | y | 64 | 64 | 64 |, since I do not see “y” indexing matrix “B”.
I would really appreciate it if someone could explain this in more detail, along with the general concept of touched memory.