Hello,
I am trying to debug some issues I see with quantization in the compiler chain I am trying to build. To simpify testing I want to import a really basic test, which is easiest with fromtext
, something like this:
mod = tvm.relay.fromtext("""
#[version = "0.0.5"]
def @main(%layer1_input: Tensor[(1, 16, 16, 16), uint8] /* span=aten::quantize_per_tensor_0.layer1_input:0:0 */, %c1_weight: Tensor[(16, 16, 3, 3), int8] /* span=quantized::conv2d_0:0:0 ) {
%0 = qnn.conv2d(%0, %1, 0 /* span=quantized::conv2d_0:0:0 */, 0 /* span=quantized::conv2d_0:0:0 */, 1f /* span=quantized::conv2d_0:0:0 */, 1f /* span=quantized::conv2d_0:0:0 */, padding=[0, 0, 0, 0], channels=16, kernel_size=[3, 3], out_dtype="int32") /* span=quantized::conv2d_0:0:0 */;
}
""")
I want to start very small with zero_point = 0
and scale = 1
. Is there a way to back this with data? For starters, a way to just set every value in the input and kernel to one would be good enough for me. Thanks in advance!