
How to interpret zigzag training loss?

My training data consists of about 700 unique samples (this is a regression problem). The data is not shuffled, so the first N samples all have the same label (say, 1.25), the next M samples all have the same label (say, 2.99), and so on. In total there are around 15 unique labels.

I'm using a simple CNN, as the input is an image (64x64x3). Even with no dropout or any other form of regularization, I can't get the training loss to stabilize close to zero.

[image: plot of training and validation loss]

What does this pattern in the training loss indicate? (The gray line is the training loss; the orange line is the validation loss.)

rodrigo-silveira asked Oct 21 '25 17:10



1 Answer

The only thing such a pattern indicates is that the learning rate is too large. Decrease it until the loss starts to decrease.
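To make the "decrease it until the loss starts to decrease" advice concrete, here is a minimal sketch on a toy problem, not the CNN from the question. It assumes plain gradient descent on the loss f(w) = w**2 as a stand-in; the names `train` and `find_stable_lr`, the starting rate of 1.5, and the halving factor are all hypothetical choices made for illustration.

```python
def train(lr, steps=20, w0=5.0):
    """Run plain gradient descent on the toy loss f(w) = w**2.

    Returns the loss after every step, starting with the initial loss.
    """
    w = w0
    losses = [w * w]
    for _ in range(steps):
        grad = 2.0 * w      # derivative of w**2
        w -= lr * grad      # each step multiplies w by (1 - 2 * lr)
        losses.append(w * w)
    return losses


def find_stable_lr(lr=1.5, factor=0.5, max_tries=10):
    """Shrink the learning rate until the loss ends lower than it started.

    The stopping rule is an assumption for this toy example; on a real
    model you would look at the training curve instead.
    """
    for _ in range(max_tries):
        losses = train(lr)
        if losses[-1] < losses[0]:
            return lr, losses
        lr *= factor        # too large: the loss grew, so try a smaller rate
    raise RuntimeError("no learning rate in the tried range reduced the loss")


if __name__ == "__main__":
    lr, losses = find_stable_lr()
    print("learning rate that reduced the loss:", lr)
    print("initial loss:", losses[0], "final loss:", losses[-1])
```

On this toy loss, every step overshoots the minimum whenever the rate is above 0.5, and above 1.0 the overshoot grows each time, so the loss climbs instead of falling. With a real network you would apply the same idea by lowering the optimizer's learning rate (for example by a factor of 2 to 10) and re-running training.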

Dr. Snoopy answered Oct 24 '25 09:10



