Matrix tiling problem

dongyeye10 · June 14, 2019, 1:47am

I’m trying to tile matrix in different direction to let them fit the buffer size of blocks in GPU for matrix multiplication. I follow the reduction tutorial and the other type of tiling works well, but i found that we can’t tile matrix throw reduce axis, it won’t work because TVM doesn’t support blocks synchronization. %E6%8D%95%E8%8E%B7 I got these three ways but none of them is easy for me. 1.Add blocks synchronization feature but it seems like too complicated and it will spend lot of time. 2.writting this tiling method directly with cuda but its seems like difficult to combine it with TVM. 3.I tried to reshape these two matrix to tile them in other axis like this

I realize that I have to let tvm.compute do calculation in a certain range, I don’ know if it is possible. I’m trapped here, could you please show me a way to solve this prblms?