[Discussion] About the weight layout of Dense/BatchMatmul

rasagna-quic · November 30, 2022, 1:37pm

Should we extend qnn.dense to a qnn.matmul op so that the transpose can be handled in quantized models? We are facing similar weight-layout issues in quantized BERT models.
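
To make the layout question concrete, here is a minimal NumPy sketch of the difference. It is not TVM code: `quantized_dense` and `quantized_matmul` are hypothetical helper names, the zero points are arbitrary, and requantization of the int32 result is left out. It assumes the dense convention stores the weight as (units, in_features) and computes x @ W^T, while a matmul-style op takes the weight as (in_features, units) with optional transpose flags.

```python
import numpy as np


def quantized_dense(x_q, w_q, x_zp, w_zp):
    """Dense convention (assumed): w_q is (units, in_features), result is x @ w.T."""
    x = x_q.astype(np.int32) - x_zp  # remove zero point, accumulate in int32
    w = w_q.astype(np.int32) - w_zp
    return x @ w.T


def quantized_matmul(a_q, b_q, a_zp, b_zp, transpose_a=False, transpose_b=False):
    """Hypothetical matmul-style op: operands are used as given unless a flag is set."""
    a = a_q.astype(np.int32) - a_zp
    b = b_q.astype(np.int32) - b_zp
    if transpose_a:
        a = a.T
    if transpose_b:
        b = b.T
    return a @ b


rng = np.random.default_rng(0)
x_q = rng.integers(-128, 128, size=(4, 8), dtype=np.int8)   # (batch, in_features)
w_kn = rng.integers(-128, 128, size=(8, 3), dtype=np.int8)  # (in_features, units)

# Matmul consumes the (in_features, units) weight directly.
out_matmul = quantized_matmul(x_q, w_kn, 2, -1)

# Dense needs the same weight transposed to (units, in_features) first.
w_nk = np.ascontiguousarray(w_kn.T)
out_dense = quantized_dense(x_q, w_nk, 2, -1)

# With transpose_b=True, matmul also accepts the dense-style layout.
out_matmul_tb = quantized_matmul(x_q, w_nk, 2, -1, transpose_b=True)

print(np.array_equal(out_matmul, out_dense))
print(np.array_equal(out_matmul_tb, out_dense))
```

With only the dense form available, a model whose weights arrive as (in_features, units) needs an explicit transpose in front of the op; transpose flags would let the op absorb it.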