I’m not sure. I tested it on various models, such as mobilenet_v1, and it failed on all of them (see this post I opened a few weeks ago). However, on a Bifrost device it works for all the models.
In additional testing, the problem occurred even with PyTorch models that have only a single small 2D convolutional layer. I also tested a network with a single fully connected layer, and I don’t recall it producing errors (but I may have forgotten…).
In addition, for the simple single-layer networks the error was fairly rare (roughly 1 in 10 times), but for the larger models it almost always converged to an optimization that gave wrong outputs.
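
To put a number on "1 in 10 times", something like the sketch below could be used. It is only an illustration in plain Python: `outputs_match`, `wrong_output_rate`, and `fake_candidate` are hypothetical names, and `fake_candidate` is a stand-in that deliberately returns a wrong value on every tenth trial instead of running a real tuned model on the device. In practice that function would be replaced by whatever builds, runs, and returns the output of the tuned model, and `EXPECTED` by a trusted reference output (for example from the original PyTorch model).

```python
import math


def outputs_match(got, expected, rel_tol=1e-4, abs_tol=1e-5):
    """Return True if two flat lists of floats agree element-wise within tolerance."""
    if len(got) != len(expected):
        return False
    return all(
        math.isclose(g, e, rel_tol=rel_tol, abs_tol=abs_tol)
        for g, e in zip(got, expected)
    )


def wrong_output_rate(run_candidate, expected, trials):
    """Call run_candidate(trial) `trials` times and return the fraction of mismatches."""
    failures = sum(
        1 for trial in range(trials)
        if not outputs_match(run_candidate(trial), expected)
    )
    return failures / trials


# Hypothetical reference output; a real test would use the framework's result.
EXPECTED = [0.25, -1.5, 3.0]


def fake_candidate(trial):
    """Stand-in for one tune-and-run cycle: wrong on every 10th trial (assumption)."""
    if trial % 10 == 9:
        return [0.25, -1.5, 0.0]  # deliberately wrong last element
    return list(EXPECTED)


rate = wrong_output_rate(fake_candidate, EXPECTED, trials=20)
print(f"wrong outputs in {rate:.0%} of trials")
```

Comparing with a tolerance rather than exact equality matters here, since small floating-point differences between backends are expected and should not be counted as wrong outputs.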