Before start working on actual FPGA, we could evaluate with TSIM (cycle-accurate simulation) in TVM; For experiments with vision models, please refer to Deploy Pretrained Vision Model from MxNet on VTA, and there you can setup SIM/TSIM based simulation of vision tasks like object detection.