Question about the "pack-sum" loss of XGBoost in Ansor

In my mind, this part in Ansor is almost similar to AutoTVM, I’m not sure if this has been explained in these two papers.

Recently there’s also another work about the cost model of Ansor: TenSet: A Large-scale Program Performance Dataset for Learned Tensor Compilers | OpenReview

cc @merrymercy