
How to use a saved model to resume training in Google Colab?

I am training a TensorFlow object detection model on a custom dataset in Google Colab, and I have a saved model that has already been trained for 5000 steps. Is it possible to use this saved model to resume training? I plan to train for another 20000 steps. The training will take around 36 hours on Colab, so I plan to use checkpoints. How can I store the best model checkpoints and use them when the session runs out?

shivu kumar asked Oct 31 '25 15:10



1 Answer

To resume training from the weights in a saved checkpoint, edit your pipeline.config file: change the fine_tune_checkpoint value from <path_to_ckpt>/model.ckpt to <path_to_ckpt>/model.ckpt-XXXX, where XXXX is the step number of your checkpoint (for example, 5000 for a model trained for 5000 steps).
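The edit described above can also be scripted, which helps when a Colab session restarts and the newest checkpoint number has changed. The following is only a minimal sketch using the Python standard library. It assumes the checkpoints are stored as files named model.ckpt-XXXX.index in one directory and that pipeline.config holds the path in double quotes. The helper names latest_checkpoint_prefix and set_fine_tune_checkpoint are hypothetical and are not part of the TensorFlow Object Detection API.

```python
import re
from pathlib import Path


def latest_checkpoint_prefix(ckpt_dir):
    """Return '<ckpt_dir>/model.ckpt-XXXX' for the highest XXXX found, or None.

    Assumption: each checkpoint has a 'model.ckpt-XXXX.index' file.
    """
    best_step, best_prefix = -1, None
    for index_file in Path(ckpt_dir).glob("model.ckpt-*.index"):
        match = re.fullmatch(r"model\.ckpt-(\d+)\.index", index_file.name)
        if match and int(match.group(1)) > best_step:
            best_step = int(match.group(1))
            # Drop the '.index' suffix to get the checkpoint prefix.
            best_prefix = index_file.with_suffix("").as_posix()
    return best_prefix


def set_fine_tune_checkpoint(config_text, ckpt_prefix):
    """Return config_text with the fine_tune_checkpoint value replaced.

    Only the fine_tune_checkpoint field is matched; a neighbouring
    fine_tune_checkpoint_type field is left untouched.
    """
    pattern = r'(fine_tune_checkpoint\s*:\s*)"[^"]*"'
    new_text, count = re.subn(
        pattern, lambda m: f'{m.group(1)}"{ckpt_prefix}"', config_text
    )
    if count == 0:
        raise ValueError("no fine_tune_checkpoint line found in the config text")
    return new_text
```

To use it, read pipeline.config as text, pass the text and the result of latest_checkpoint_prefix(<path_to_ckpt>) to set_fine_tune_checkpoint, and write the returned text back to the file before restarting training. Keeping the checkpoint directory on persistent storage such as a mounted Google Drive folder is one way to make the files survive a session reset.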

As for saving only the best weights, you can refer to this post and/or this GitHub link.

Jitesh Malipeddi answered Nov 03 '25 06:11



