Is it possible to run TinyLlama using Relay?

Relay does not support dynamic shapes or a KV cache, so it is hard to run TinyLlama through Relay.
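
To illustrate why those two gaps go together, here is a minimal sketch in plain Python. It is not TVM code, and `decode_steps` and `head_dim` are hypothetical names made up for this example. It only shows that the KV cache grows by one row per generated token, so the sequence dimension is different at every decoding step, which is awkward for a flow that expects shapes to be fixed at compile time.

```python
# Illustrative only: plain Python, no TVM APIs involved.
# `decode_steps` and `head_dim` are hypothetical names for this sketch.
def decode_steps(prompt_len, new_tokens, head_dim=4):
    """Return the KV-cache shape seen at each decoding step."""
    # The cache starts with one row per prompt token.
    cache = [[0.0] * head_dim for _ in range(prompt_len)]
    shapes = []
    for _ in range(new_tokens):
        # Each generated token appends its key/value row to the cache.
        cache.append([0.0] * head_dim)
        shapes.append((len(cache), head_dim))
    return shapes


shapes = decode_steps(prompt_len=3, new_tokens=3)
print(shapes)  # [(4, 4), (5, 4), (6, 4)]
```

The first dimension changes on every step, so the model cannot be described with a single static input shape.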

We are phasing out Relay, so please try moving to Relax instead.
