Aten::mul.out receives tensors from mixed devices

In general, I’d probably try to match what PyTorch does with cuda, but I guess you would already be doing that.

Best regards

Thomas