简体   繁体   English

人工神经网络软件程序中的学习率参数

[英]Learning Rate parameter in an Artificial Neural Network software program

I am using a software program of the type that is known as an Artificial Neural Network. 我正在使用一种称为人工神经网络的软件程序。 One of the parameters of the software is called Learning Rate (also known as alpha). 该软件的参数之一称为学习率(也称为alpha)。 The learning rate setting can be controlled by moving a slider back and forth. 可以通过前后移动滑块来控制学习率设置。 On one side of the slider is the value 1E-05 on the other side is just 1. In between are various values such as 9E-05, .000045, etc. What I want to know is which one of these 2 learning rates is the fastest learning rate, 1E-05 on one side or 1 on the other. 滑块的一侧是1E-05,另一侧是1。介于两者之间的是各种值,例如9E-05,.000045等。我想知道的是,这两个学习率中的哪个是学习速度最快,一侧为1E-05,另一侧为1E。 Thanks. 谢谢。

Learning rate is not about speed of training it is about size of the step when using quite naive approximation of the function (linear - for 1st order optimizers, or quadratic - for 2nd order). 学习速度与训练速度无关与步伐的大小有关 (使用线性幼稚的函数逼近(线性-用于一阶优化器,或二次函数-用于二阶))。 Consequently very small learning rate should lead to slow training, but big learning rate can lead to lack of training. 因此,很小的学习率会导致训练缓慢,但是很大的学习率会导致训练不足。 Furthermore - values in between can still be not monotonic (you can have training where smaller learning rate actually converges faster than bigger one). 此外-两者之间的值仍然不能单调(您可以进行训练,使较小的学习速度实际上比较大的学习速度收敛)。 So even though naively we could say that big learning rate is faster training - in general this is not true, furthermore - one cannot answer what learning rate is the fastest one. 因此,即使天真地可以说,高学习率是一种更快的培训-而且,总的来说,这不是正确的-人们无法回答什么是最快的学习率。 You can only use some general heuristics/observations here - you can start with big learning rate, and if results are bad, try reducing it. 您只能在此处使用一些常规的启发式方法/观察法-您可以从较高的学习率入手,如果结果不好,请尝试降低学习率。 But in terms of actual training time guarantees - there are none. 但就实际培训时间保证而言,没有任何保证。

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM