Our current PoC implementation uses KCompiler Attributes and the Standard MergeComposite, AnnotateTarget, MergeCompilerRegions.
Following up on the above question, what are your thoughts on moving the UMAPartitioner inside relay.build(…) ?
The current plan is to move to the collage implementation by @mbs-octoml as soon as possible which would move partitioning into the relay.build.