TPU 上的损失发散

问题描述

GPU：

TPU：

我已经初始化了相同的模型并训练了 1 个 epoch，但它的预测结果在 GPU 和 TPU 之间有所不同。训练数据集的损失在 TPU 上发散 (NaN)，而在 GPU 上收敛。

GPU：

model = deblurnet()
model.compile(optimizer="adam",loss="categorical_crossentropy",metrics=["accuracy"])
model.fit(train_ds,epochs=1)
output = model.predict(input)

热塑性聚氨酯：

with tpu_strategy.scope():
    model = deblurnet()
    model.compile(optimizer="adam",epochs=1)
output = model.predict(input)

型号代码：https://gist.github.com/yuntan/8198f80593b6897844236c5a5a7b07da

解决方法

暂无找到可以解决该程序问题的有效方法，小编努力寻找整理中！

如果你已经找到好的解决方法，欢迎将解决方案带上本链接一起发送给小编。

小编邮箱:dio#foxmail.com (将#修改为@）

tensorflow tensorflow tensorflow tpu