Where does the Lowering from Relay to TIR happen?

I tried to correspond the output of relay.analysis.extract_fused_functions with their execution times measured by the debug executor, but was not able to find a way to match these two sets.

I already asked for help a couple of weeks ago, but did not know about extract_fused_functions at this time.

Is there a solution? I also tried to find the function that lowers the fused Relay ops into the TIR PackedFuncs, but was not able to find it. I was hoping to modify it, to store the correspondence there.

Have you found out anything about explicit relay to tir translation. I am currently wondering, about the same question.

1 Like