简体繁体 English

Q-learning中的学习曲线

[英]Learning Curve in Q-learning

原文 2022-02-04 09:31:44 2 1 c++/ reinforcement-learning/ q-learning

My question is I wrote the Q-learning algorithm in c++ with epsilon greedy policy now I have to plot the learning curve for the Q-values.我的问题是我在 c++ 中使用 epsilon 贪心策略编写了 Q 学习算法，现在我必须 plot 的 Q 值的学习曲线。 What exactly I should have to plot because I have an 11x5 Q matrix, so should I take one Q value and plot its learning or should I have to take the whole matrix for a learning curve, could you guide me with it.我应该对 plot 究竟有什么，因为我有一个 11x5 Q 矩阵，所以我应该取一个 Q 值和 plot 它的学习还是我必须取整个矩阵作为学习曲线，你能指导我吗？ Thank you谢谢

1 个解决方案

Learning curves in RL are typically plots of returns over time, not Q-losses or anything like this. RL 中的学习曲线通常是随时间变化的回报图，而不是 Q 损失或类似的东西。 So you should run your environment, compute the total reward (aka return) and plot it at a corresponding time.所以你应该运行你的环境，计算总奖励（又名回报）和 plot 它在相应的时间。

Q-learning学习扫雷行为 - Q-learning to learn minesweeping behavior

q学习计算中的大量状态 - The huge amount of states in q-learning calculation

如何在 Q-learning 中计算 MaxQ？ - How do I calculate MaxQ in Q-learning?

Q学习ludo游戏吗？ - Q learning for ludo game?

实现近似（基于特征）q 学习的问题 - Problems with implementing approximate(feature based) q learning

中小型项目的Maven学习曲线和开销？ - Maven learning curve & overhead for small/medium projects?

对于学习曲线和初学者的适合性（HTTP客户端），提升vs POCO - boost vs POCO as for learning curve and suitability for beginners (HTTP client)

C ++图形API，学习曲线小 - linux - C++ Graphic API with a small learning curve - linux

学习继承 - Learning inheritance

什么是具有最简单学习曲线的C ++ GUI构建选项 - VS / Qt / wxWidgets /等？ - What's the C++ GUI building option with the easiest learning curve - VS/Qt/wxWidgets/etc.?

暂无

暂无

声明:本站的技术帖子网页，遵循CC BY-SA 4.0协议，如果您需要转载，请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

相关问题 Q-learning学习扫雷行为 - Q-learning to learn minesweeping behavior q学习计算中的大量状态 - The huge amount of states in q-learning calculation 如何在 Q-learning 中计算 MaxQ？ - How do I calculate MaxQ in Q-learning? Q学习ludo游戏吗？ - Q learning for ludo game? 实现近似（基于特征）q 学习的问题 - Problems with implementing approximate(feature based) q learning 中小型项目的Maven学习曲线和开销？ - Maven learning curve & overhead for small/medium projects? 对于学习曲线和初学者的适合性（HTTP客户端），提升vs POCO - boost vs POCO as for learning curve and suitability for beginners (HTTP client) C ++图形API，学习曲线小 - linux - C++ Graphic API with a small learning curve - linux 学习继承 - Learning inheritance 什么是具有最简单学习曲线的C ++ GUI构建选项 - VS / Qt / wxWidgets /等？ - What's the C++ GUI building option with the easiest learning curve - VS/Qt/wxWidgets/etc.?

相关标签

粤ICP备18138465号 © 2020-2024 STACKOOM.COM