Why does the model not converge?
人工智能技术与咨询 2022-09-07 10:22:26


All replies (1)
人工智能技术与咨询

1. The backpropagation chain is broken. Some variables may have been converted to NumPy arrays along the way. They can still take part in calculations, but they no longer carry gradient information, so gradients cannot flow back through them to the variables they were computed from.
2. The learning rate is set unreasonably. If it is too large, the loss can easily become NaN and the model will not converge. If it is too small, the model learns very slowly.
3. The parameters of the network layers are poorly initialized. Parameter initialization affects how fast the model trains.
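
To make point 2 concrete, here is a minimal sketch in plain Python, not tied to any framework. It runs gradient descent on the toy loss f(w) = w * w. The function name `final_loss`, the toy loss, and the three learning rates are assumptions chosen for illustration, not something from the original question.

```python
import math


def final_loss(lr, steps=2000, w0=1.0):
    """Run plain gradient descent on f(w) = w * w and return the last loss.

    Hypothetical helper for illustration only; a real model has many
    parameters, but the effect of the learning rate is the same in spirit.
    """
    w = w0
    for _ in range(steps):
        grad = 2.0 * w       # derivative of w * w
        w = w - lr * grad    # standard gradient descent update
    return w * w             # loss at the final parameter value


if __name__ == "__main__":
    for lr in (1.5, 0.1, 1e-4):
        loss = final_loss(lr)
        status = "NaN (diverged)" if math.isnan(loss) else f"{loss:.3e}"
        print(f"lr={lr}: final loss = {status}")
```

With the large rate, each step overshoots the minimum and the parameter grows until it overflows, after which the loss is NaN. With the tiny rate, the loss goes down but is still far from zero after the same number of steps. If your loss turns into NaN early in training, lowering the learning rate is usually the first thing worth trying.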
