Phasing out Legacy Components

tqchen · March 2, 2025, 11:40pm

For new flow we don’t have caliberation flow inside framework, as many calberations are now moved to the upper layer.

For example, frameworks like MLC-LLM that usually runs quantization first then build up the quantized model through fused dequant mm operators