Insert Relay Pass After Optimizations

r.stahl · April 27, 2022, 10:31am

When implementing a module pass, I decided to have it run after the default Relay optimization passes, but before the transition to TIR. This lets me benefit from the simple and canonical Relay representation, while already being able to reason about the likely temporary buffers between fused operations.

However, in my top-level code that uses relay.build, I did not see a way to register my pass so that it runs after the default Relay passes.

My solution is a simple optional hook that lets the user register a function that takes a module and params (the params seem to be gone in current TVM) and returns a module:

const runtime::PackedFunc* pfPostPass = runtime::Registry::Get("relay.backend.PostOptPass");
if (pfPostPass) {
  Map<String, Constant> argParams;
  for (const auto& param : params) {
    argParams.Set(param.first, Constant(param.second));
  }
  relay_module = (*pfPostPass)(relay_module, argParams);
}

It is inserted right after the OptimizeImpl call:

https://github.com/apache/tvm/blob/c09a24dcdce3bc71133712c003c2135842b64be1/src/relay/backend/build_module.cc#L409

 * \brief Compile a Relay IR module to runtime module.
 *
 * \param relay_module The Relay IR module.
 * \param params The parameters.
 */
void BuildRelay(IRModule relay_module, const String& mod_name) {
  // Relay IRModule -> IRModule optimizations.
  IRModule module = WithAttrs(
      relay_module, {{tvm::attr::kExecutor, executor_}, {tvm::attr::kRuntime, runtime_}});
  relay_module = OptimizeImpl(std::move(module));


  // Get the updated function and new IRModule to build.
  // Instead of recreating the IRModule, we should look at the differences between this and the
  // incoming IRModule to see if we can just pass (IRModule, Function) to the code generator.
  Function func = Downcast<Function>(relay_module->Lookup("main"));
  IRModule func_module = WithAttrs(IRModule::FromExpr(func),
                                   {{tvm::attr::kExecutor, executor_},
                                    {tvm::attr::kRuntime, runtime_},
                                    {tvm::attr::kWorkspaceMemoryPools, workspace_memory_pools_}});


  // Generate code for the updated function.

It would be used as follows:

@tvm.register_func("relay.backend.PostOptPass")
def _post_pass(mod, params):
    return MyCustomPass()(mod)
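
To make the control flow of the hook concrete, here is a minimal sketch in plain Python that does not use TVM at all. Everything in it is a toy stand-in and an assumption for illustration: `_REGISTRY`, `register_func`, `get_func`, `optimize_impl` and `build_relay` only mimic the roles of the function registry, OptimizeImpl and BuildRelay, and a module is modelled as a list of strings.

```python
from typing import Callable, Dict, List, Optional

# Toy stand-in for a global function registry (hypothetical, not TVM's API).
_REGISTRY: Dict[str, Callable] = {}


def register_func(name: str) -> Callable:
    """Decorator that stores a function under a global name."""
    def decorator(fn: Callable) -> Callable:
        _REGISTRY[name] = fn
        return fn
    return decorator


def get_func(name: str) -> Optional[Callable]:
    """Return the registered function, or None if nothing is registered."""
    return _REGISTRY.get(name)


def optimize_impl(module: List[str]) -> List[str]:
    """Stand-in for the default optimization pipeline."""
    return module + ["default_opts"]


def build_relay(module: List[str], params: Dict[str, float]) -> List[str]:
    """Toy build: optimize, run the optional hook, then 'generate code'."""
    module = optimize_impl(module)
    post_pass = get_func("relay.backend.PostOptPass")
    if post_pass is not None:  # the hook is optional; no hook, no change
        module = post_pass(module, dict(params))
    return module + ["codegen"]


# Without a registered hook the build behaves as before.
before = build_relay(["main"], {})


@register_func("relay.backend.PostOptPass")
def _post_pass(mod: List[str], params: Dict[str, float]) -> List[str]:
    # Hypothetical custom step appended after the default optimizations.
    return mod + ["my_custom_pass"]


# With the hook registered, the custom step runs between optimization and codegen.
after = build_relay(["main"], {})
```

The point of the design is that a build without a registered hook is unaffected, so the hook costs nothing for users who do not need it.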

Is this something that is useful to anyone else? Would you be open to including this in the code base? Is there a better way to achieve this?

yuchenj · April 27, 2022, 10:01pm

Hi @r.stahl, we are aware of the challenge of adding a custom pass to the current compilation pipeline (see challenge C3 in @sunggg’s Relax Pass Infrastructure).

In the design of Relax (Relay Next), we follow these design principles:

Each pass is an IRModule → IRModule transformation.
Decouple the optimization passes from the build system. For example, the AutoTIR (MetaSchedule) tuning is an optimization pass in Relax, and it is outside of the build.
A minimal, universal build that can build every valid IRModule into a runtime.Module. The build can be invoked at any stage of compilation, including from inside a pass, for example to take performance measurements for tuning passes.
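
As a rough illustration of these three principles, here is a toy model in plain Python. It is not Relax's API: the `Module` type, `build`, `sequential`, `tune` and the two example passes are all hypothetical names made up for this sketch, and the "cost" returned by the built module is just an op count.

```python
from typing import Callable, Dict, List

Module = Dict[str, List[str]]       # toy "IRModule": function name -> op names
Pass = Callable[[Module], Module]   # principle 1: every pass is Module -> Module


def build(mod: Module) -> Callable[[], int]:
    """Toy universal build: any module becomes a callable 'runtime module'.

    Calling the result returns the total op count as a stand-in for cost.
    """
    total = sum(len(ops) for ops in mod.values())
    return lambda: total


def drop_identity(mod: Module) -> Module:
    """Remove no-op 'identity' ops."""
    return {name: [op for op in ops if op != "identity"] for name, ops in mod.items()}


def fuse_add_relu(mod: Module) -> Module:
    """Fuse each adjacent ('add', 'relu') pair into one 'add_relu' op."""
    out: Module = {}
    for name, ops in mod.items():
        fused: List[str] = []
        for op in ops:
            if fused and fused[-1] == "add" and op == "relu":
                fused[-1] = "add_relu"
            else:
                fused.append(op)
        out[name] = fused
    return out


def tune(candidates: List[Pass]) -> Pass:
    """Principle 3: a pass may call build() itself to measure candidates."""
    def _pass(mod: Module) -> Module:
        results = [candidate(mod) for candidate in candidates]
        return min(results, key=lambda m: build(m)())
    return _pass


def sequential(passes: List[Pass]) -> Pass:
    """Principle 2: the pipeline is composed by the user, outside of build()."""
    def _pass(mod: Module) -> Module:
        for p in passes:
            mod = p(mod)
        return mod
    return _pass


mod = {"main": ["conv", "identity", "add", "relu"]}
pipeline = sequential([drop_identity, tune([fuse_add_relu, lambda m: m])])
optimized = pipeline(mod)
cost_before = build(mod)()
cost_after = build(optimized)()
```

In this model a custom pass goes in by adding one more entry to the list given to `sequential`, without changing `build`.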

These design principles enable flexible and customizable compilation pipelines without the need to hack into the core of the compiler, and allow developers and researchers to explore new spaces. Feel free to check out @sunggg’s discussion thread for more details.

r.stahl · April 28, 2022, 8:14am

@yuchenj Thank you for making me aware of this! This should definitely give sufficient flexibility to integrate a pass after other optimization passes.

However, since Relax is a larger effort that may take quite some time to upstream, I was wondering whether my proposed simple hook could be useful to anyone else and find support for a quick upstreaming.

masahi · April 28, 2022, 8:32am

I have a feeling that we already have various kinds of “hook”, but not sure if they are relevant here. cc @Mousius @lhutton1 @mbs-octoml

r.stahl · April 28, 2022, 9:50am

@masahi Thanks for your input. I’m aware of the RelayToTIR hook (https://github.com/apache/tvm/pull/8423). It would fit the requirement of “Relay After Optimization”, but it would put the burden of translating to TIR on my pass. Is there an easy way to call into the default TIR conversion? If so, this also seems like a good approach.

lhutton1 · May 4, 2022, 12:46pm

Hi @r.stahl, apologies for chiming in late, I’ve been away the past few days.

It sounds as though you’d like to insert a pass into the standard flow of TVM - after the default relay optimizations, but before TIR. I’m not convinced the RelayToTIR hook would work for your use case, since this targets functions considered “external”. Your functions would need to be marked with a Compiler attribute for the relevant RelayToTIR hook to then be run.

I’m curious, what’s the reason for wanting to implement this using a hook, rather than just inserting the pass where you desire? If the concern is that the pass shouldn’t be enabled by default, I think we could make use of PassContext to manually turn it on.
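
The opt-in idea can be sketched as follows. This is a TVM-free toy in plain Python, not the real PassContext: `pass_context`, `current_config`, `optimize` and the config key `relay.MyCustomPass.enable` are hypothetical names used only to show a pass that lives in the default pipeline but stays off unless a surrounding context enables it.

```python
from contextlib import contextmanager
from typing import Dict, Iterator, List, Optional

# Stack of active configs; the innermost context is the last entry.
_CONFIG_STACK: List[Dict[str, bool]] = [{}]


@contextmanager
def pass_context(config: Optional[Dict[str, bool]] = None) -> Iterator[None]:
    """Toy context manager that scopes a config dict to a 'with' block."""
    _CONFIG_STACK.append(dict(config or {}))
    try:
        yield
    finally:
        _CONFIG_STACK.pop()


def current_config() -> Dict[str, bool]:
    """Return the config of the innermost active context."""
    return _CONFIG_STACK[-1]


def optimize(module: List[str]) -> List[str]:
    """Toy default pipeline with one built-in pass that is off by default."""
    module = module + ["default_opts"]
    # Hypothetical config key; the pass only runs when explicitly enabled.
    if current_config().get("relay.MyCustomPass.enable", False):
        module = module + ["my_custom_pass"]
    return module


default_result = optimize(["main"])
with pass_context({"relay.MyCustomPass.enable": True}):
    enabled_result = optimize(["main"])
after_result = optimize(["main"])
```

The trade-off against a hook is that the pass itself has to live inside the pipeline, so its dependencies come along with it.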

Hope this helps

r.stahl · May 6, 2022, 9:03am

@lhutton1 Thank you for clarifying.

In my opinion, the pass makes more sense as an external project, because it carries quite a few dependencies with it, while not being suitable for the default optimization.