Hierarchy in TVM

Interesting point. I agree that having a hierachy would make the IR more readable. Perhaps the nested structure can be achieved in A-norm form and can be flatten to graph-norm when we need to tune the model?