Basically this assertion means that you are adding a bias to some activation (int32 result) and there might be some cases missing here. So you can add a check on lhs_kind and rhs_kind and then handle that case
You should handle it like the other cases. QAnnotateKind.ACTIVATION means the expression is output of a quantized layer (e.g. output of quantized conv2d) QAnnotateKind.INPUT means the expression has been quantized to int8
I am still trying to figure out how to handle this. However, I realized that by using the NCHW layout there is no issue but if I use the default layout is when I hit the assertion. The problem with NCHW is that I see lots of transpose operations being added in the Relay IR