Hi all, I’m looking to do to the TensorRT BYOC integration what I just did for CUTLASS, namely make sure all compilation configuration is captured within a “tensorrt” Target instance rather than the current combination of PassContext and environment variables. This helps Collage, both because the overall configuration is just a list-of-Targets, and for some infrastructure issues internal to us in OctoML.
(I’ll also switch TensoRT to be IRModule-at-a-time instead of function-at-a-time, however since TensorRT engines can only have one entry point this won’t have any performance or sharing benefits, it will just be an internal engineering cleanup.)
Just want to check if there are any existing users of the partition_for_tensorrt function and how sensitive I should be to maintaining backwards compatibility?
Given we’ve broken large parts of the integration at various stages over the last few months I suspect this is not being actively used, but please give me a shout otherwise.