Is there any reason why we don’t look to use pytest-xdist or pytest-parallel ? Currently it takes for ever to run all the tests before submitting , it does look like using pytest-xdist and the pytest -n option might help with improving time for test. We don’t need to use all cores - I suspect we will have issues with TVM_NUM_THREADS et al in some parts of our testing but still it feels quite wasteful for some of the relay tests to be running purely single core.
I don’t have any measurements yet, but playing with pytest -n 16 on my box shows quite a few speed ups in tests/scripts/task_python_integration.sh while running task_python_integration.sh.
Ramana