Hi! I’m trying out VTA on a different board (ZCU104) and have managed to run all the tutorial examples. I would like to explore how the hardware resources could be better utilized, for example with a different GEMM design.
I learned from the VTA technical report that “The VTA architecture is fully parameterizable: the shape of the GEMM tensor intrinsic can be modified to influence the utilization of hardware resources.” If I understand correctly, the shape of the GEMM should be configurable in “vta_config.json” through “LOG_BATCH” and “LOG_BLOCK”, and I should also be able to change the buffer sizes through “LOG_UOP/INP/WGT/ACC_BUFF_SIZE”.
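To make my understanding concrete, here is a rough Python sketch of how I read these parameters. It is not VTA code: the helper name `describe_vta_config` is hypothetical, the example values are only illustrative, and I am assuming that each “LOG_” value is a base-2 logarithm, that “LOG_BLOCK” sets both the input and output block dimension, and that the buffer sizes are in bytes.

```python
def describe_vta_config(cfg):
    """Derive GEMM shape and buffer sizes from log2-encoded parameters.

    Assumption: every LOG_* value is a base-2 logarithm, and LOG_BLOCK
    applies to both the input and output block dimensions.
    """
    batch = 1 << cfg["LOG_BATCH"]
    block = 1 << cfg["LOG_BLOCK"]
    return {
        # (batch x block) input tile times (block x block) weight tile
        "gemm_shape": ((batch, block), (block, block)),
        # multiply-accumulates performed by one GEMM intrinsic
        "macs_per_gemm": batch * block * block,
        # assumed to be sizes in bytes
        "buffer_bytes": {
            name: 1 << cfg[f"LOG_{name}_BUFF_SIZE"]
            for name in ("UOP", "INP", "WGT", "ACC")
        },
    }


# Illustrative values only; check them against your own vta_config.json.
example_cfg = {
    "LOG_BATCH": 0,
    "LOG_BLOCK": 4,
    "LOG_UOP_BUFF_SIZE": 15,
    "LOG_INP_BUFF_SIZE": 15,
    "LOG_WGT_BUFF_SIZE": 18,
    "LOG_ACC_BUFF_SIZE": 17,
}

if __name__ == "__main__":
    print(describe_vta_config(example_cfg))
```

Under this reading, raising “LOG_BLOCK” by one would quadruple the multiply-accumulates per GEMM intrinsic, which is why I expected it to be the main knob for resource usage.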
However, when I change “LOG_BATCH” and “LOG_BLOCK”, the tutorial scripts produce wrong calculation results. And when I change the buffer sizes, the bitstream generation process fails.
Is there a walkthrough guide on how to explore the design space for different resource usage, including how to change the number of units in the tensor ALU? Is my understanding correct, or is there more that needs to be done?
Thanks a lot!