[TIR] Problem inlining addition into matmul block

not immediately same as this code, but 2.4. TensorIR: Tensor Program Abstraction Case Study — Machine Learing Compiler 0.0.1 documentation should be relevant on reverse inlining