[Design] Torchy: Productive Model Definition in TVM Unity

A full-featured Llama2 implementation in only 200 lines of code based on this project: https://github.com/mlc-ai/mlc-llm/pull/631

1 Like