and this are some error logs:
Weights exist at dist/models/vicuna-7b-delta-v0, skipping download.
Using path “dist/models/vicuna-7b-delta-v0” for model “vicuna-7b-delta-v0”
Database paths: [‘log_db/dolly-v2-3b’, ‘log_db/redpajama-3b-q4f16’, ‘log_db/redpajama-3b-q4f32’, ‘log_db/rwkv-raven-1b5’, ‘log_db/rwkv-raven-3b’, ‘log_db/rwkv-raven-7b’, ‘log_db/vicuna-v1-7b’]
Target configured: cuda -keys=cuda,gpu -arch=sm_70 -max_num_threads=1024 -max_shared_memory_per_block=49152 -max_threads_per_block=1024 -registers_per_block=65536 -thread_warp_size=32
Automatically using target for weight quantization: cuda -keys=cuda,gpu -arch=sm_70 -max_num_threads=1024 -max_shared_memory_per_block=49152 -max_threads_per_block=1024 -registers_per_block=65536 -thread_warp_size=32
Traceback (most recent call last):
File “build.py”, line 420, in
main()
File “build.py”, line 398, in main
mod = mod_transform_before_build(mod, params, ARGS)
File “build.py”, line 281, in mod_transform_before_build
new_params = utils.transform_params(mod_transform, model_params, args)
File “/home/repo/mlc-llm/mlc_llm/utils.py”, line 255, in transform_params
vm = relax.vm.VirtualMachine(ex, device)
File “/home//repo/tvm-unity/python/tvm/runtime/relax_vm.py”, line 81, in init
rt_mod = rt_mod.jit()
File “/home//repo/tvm-unity/python/tvm/relax/vm_build.py”, line 89, in jit
not_runnable_list = self.mod._collect_from_import_tree(_not_runnable)
File “/home/repo/tvm-unity/python/tvm/runtime/module.py”, line 426, in _collect_from_import_tree
assert (
AssertionError: Module stackvm should be either dso exportable or binary serializable.