Hi, I’ve used Ansor before. If I understand correctly, MS should be a more powerful version of Ansor. I’ve run an MS-tunning process on an Arm Device for 3 days and collected up to 1GB+ tunning records but it ends up strangly not getting any improvements in latency. Below are the script:
from tvm import meta_schedule as ms
from tvm import relay,rpc
from tvm.contrib import graph_executor
database = ms.Database.create(work_dir="a dir with tunning record and workload json")
lib = ms.relay_integration.compile_relay(database, mod, target, params)
lib.export_library(os.path.join(model_path, "model.tar"))
tracker = rpc.connect_tracker("127.0.0.1",port=9190)
remote = tracker.request("xxx")
dev = remote.cpu()
remote.upload(os.path.join(model_path, "model.tar"))
f=remote.load_module("model.tar")
m = graph_executor.GraphModule(f["default"](dev))
print(m.benchmark(dev))
I think the JSON database should be loaded correctly since the script is taking 10G+ RAM when compiling the model. Is this expected or did I miss anything?