在 Google Colab 上中断会话后，从最后一步恢复训练 fast-rccn 模型

问题描述

使用 fast-rcnn 在 Google Colab 上训练我的模型。但是，由于 GPU 时间限制，我的模型在 10,000 步中的 7000 步时停止训练。

我使用以下代码训练模型

!python model_main_tf2.py --model_dir=models/my_faster_rcnn_inception_resnet_v2 --pipeline_config_path=models/my_faster_rcnn_inception_resnet_v2/pipeline.config

如何从最后一步（即 7000）继续训练我的模型？

解决方法

我在学习 this 教程后设法弄明白了。所以为了恢复训练，我将fine_tune_checkpoint的目录位置从预训练的模型目录（即pretrained_mode/checkpoint/ckpt-0）更改为保存模型检查点的模型训练目录位置（即/my_training_model/ckpt-#） . 将# 替换为最后一个 ckpt 的编号，例如ckpt-9

然后我重新运行以下命令以恢复训练：

!python model_main_tf2.py --model_dir=models/my_faster_rcnn_inception_resnet_v2 --pipeline_config_path=models/my_faster_rcnn_inception_resnet_v2/pipeline.config

faster-rcnn google-colaboratory python tensorflow tensorflow tensorflow

在 Google Colab 上中断会话后，从最后一步恢复训练 fast-rccn 模型

问题描述

解决方法

相关问答