Hello!
I recently came across TVM as a potential solution for deploying a fast NN to a Raspberry Pi. However, I am unclear on how to use the optimized bit-reduction system (such as 2-bit binary convolutions).
I noticed that a PR came in that implements this, but it is not clear to me how to use it.
I have successfully run the autotuner on a Raspberry Pi using the default configuration, but I am not sure how to adapt this process to use fewer bits or apply more optimization.
Please advise!