Wrong Impl of relay.nn.cross_entropy_with_logits

Cross entropy should be equal to NLLLoss(LogSoftmax(pred), label), but the current relay implementation only contains the NLL-loss part, without the softmax.
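For illustration, here is a minimal NumPy sketch of the difference; it assumes relay's op computes only -sum(label * pred) / batch, which is what the numbers below suggest:

import numpy as np

def log_softmax(x, axis=-1):
    # numerically stable log-softmax
    x = x - x.max(axis=axis, keepdims=True)
    return x - np.log(np.exp(x).sum(axis=axis, keepdims=True))

def relay_like_loss(pred, label):
    # assumed relay behaviour: no softmax, just -sum(label * pred) / batch
    return -np.sum(label * pred) / pred.shape[0]

def pytorch_like_loss(pred, label):
    # F.cross_entropy semantics: NLL of log-softmax
    return -np.sum(label * log_softmax(pred)) / pred.shape[0]

# The two only agree when pred already holds log-probabilities.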

import torch
import torch.nn.functional as F

# prepare the data
tx = torch.randn(1, 10)
tty = torch.zeros(1, 10)
tty[:, 1] = 1
ty = tty.argmax(-1).long()

# PyTorch's results
F.cross_entropy(tx, ty).item()
# 3.778721332550049

F.nll_loss(F.log_softmax(tx, dim=-1), ty).item()
# 3.778721332550049

TVM’s results

import tvm
from tvm import relay
from tvm.contrib import graph_executor

x = relay.var("x", shape=[1, 10], dtype="float32")
y = relay.var("y", shape=[1, 10], dtype="float32")
z = relay.nn.cross_entropy_with_logits(x, y)
fn = relay.Function([x, y], z)
mod = tvm.IRModule.from_expr(fn)
mod = relay.transform.InferType()(mod)
lib = relay.build(mod, target="llvm")
g = graph_executor.GraphModule(lib["default"](tvm.cpu(0)))


ttx = tx

dx = ttx.numpy()
dy = tty.numpy()

g.set_input("x", dx)
g.set_input("y", dy)

g.run()
g.get_output(0)
# <tvm.nd.NDArray shape=(), cpu(0)>
#  array(0.45164683, dtype=float32)
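For the same dx and dy, this value can be reproduced with the plain dot-product formula (again assuming that is all the op computes):

import numpy as np

-np.sum(dy * dx) / dx.shape[0]
# should match the graph output above if the assumption holds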

After manually adding the log_softmax, the result matches:

ttx = F.log_softmax(tx, dim=-1)
# ttx = tx

dx = ttx.numpy()
dy = tty.numpy()

g.set_input("x", dx)
g.set_input("y", dy)

g.run()
g.get_output(0)
# <tvm.nd.NDArray shape=(), cpu(0)>
# array(3.7787213, dtype=float32)
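If the goal is to feed raw logits and get the PyTorch number directly out of TVM, a possible workaround is to put the log_softmax into the relay graph itself; a sketch reusing the variables from the build above:

z2 = relay.nn.cross_entropy_with_logits(relay.nn.log_softmax(x, axis=-1), y)
fn2 = relay.Function([x, y], z2)
mod2 = relay.transform.InferType()(tvm.IRModule.from_expr(fn2))
lib2 = relay.build(mod2, target="llvm")
g2 = graph_executor.GraphModule(lib2["default"](tvm.cpu(0)))

g2.set_input("x", tx.numpy())   # raw logits, no manual log_softmax needed
g2.set_input("y", tty.numpy())
g2.run()
g2.get_output(0)
# expected to match F.cross_entropy(tx, ty)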

Hi @Lyken17, thanks for the interesting finding.

Meanwhile, I also remember that there can be many "cross entropy" variants in TensorFlow.

This actually makes me wonder if it is just a naming difference. But anyhow, I think at least the documentation should point this out to avoid confusion.

While there might be multiple APIs, the mathematical part should be the same across all implementations.

As shown in the Cross entropy - Wikipedia article, the loss takes the log-probability as input, so PyTorch's impl should be correct.
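For reference, the definition from the Wikipedia article, with p the true distribution and q the model's predicted distribution:

H(p, q) = -\sum_x p(x) \log q(x)

For a one-hot label y and logits z this becomes -\sum_c y_c \log(\mathrm{softmax}(z)_c), i.e. exactly NLLLoss(LogSoftmax(z), y).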
