TVM graph quantization is giving poor accuracy

joyalbin · September 4, 2019, 3:06am

Hi All,

I tried to quantize the inception model using tvm quantization. Non-Quantized model accuracy on imagenet validation dataset is 76.1% Quantized model accuracy on imagenet validation dataset is 15.7%

Below script modification we have done to enable the quantization.

shape_dict = {‘DecodeJpeg/contents’: (299, 299, 3)} dtype_dict = {‘DecodeJpeg/contents’: ‘uint8’} mod, params = relay.frontend.from_tensorflow(graph_def, layout=layout, shape=shape_dict) mod = relay.quantize.quantize(mod[‘main’], params) with relay.build_config(opt_level=3): graph, lib, params = relay.build(mod, target=target, target_host=target_host, params=params) m = graph_runtime.create(graph, lib, ctx) m.set_input(‘DecodeJpeg/contents’, tvm.nd.array(x.astype(dtype))) m.set_input(**params) m.run()

Someone please correct me if anything wrong in the script? Or the current TVM quantization is giving poor accuracy?

FrozenGene · September 4, 2019, 5:31am

I think @vinx13 @ziheng could help this.

ydy · September 4, 2019, 5:40am

What is the structure of your model，I have similar test results. My test results show that all the results are not good except Inception_v3

joyalbin · September 6, 2019, 4:13am

@ydy I have used inceptionV1 for my test and found the above results.
I am trying to run inceptionV3, but getting some issues related to labels mapping while measuring accuracy. I working on this part.
@ziheng @vinx13 Is this a known issue that the accuracy will be poor? and working is going on towards improving the accuracy?

vinx13 · September 6, 2019, 2:08pm

The recent change may hurt the accuracy, see [Quantization] Problems with recent refactoring changes in the quantization pass