How does the is_slippery parameter affect the reward in Frozenlake Environment?

The Frozenlake environment has a parameter named is_slippery. If it is set to True, the agent moves in the intended direction with probability 1/3 and in each of the two perpendicular directions with probability 1/3. How does this is_slippery parameter affect the reward generated from the environment? Or does it merely deflect the agent from its intended path?

The "is_slippery" parameter determines whether you are using the Frozenlake environment as stochastic (True) or deterministic (False).

The Frozen Lake environment can also be used in deterministic mode: setting is_slippery=False when creating the environment turns off the slippery surface, so the environment always executes the action chosen by the agent.
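To make the distinction concrete, here is a minimal pure-Python sketch of the dynamics described above. It is not the library's implementation: `MAP`, `MOVES`, `step`, `run_plan` and `PLAN` are hypothetical names, and the 4x4 layout and the reward rule (1 for reaching the goal, 0 otherwise) are assumed from the standard Frozen Lake setup. In this sketch the reward rule never reads is_slippery; the flag only changes which tile the agent lands on, so a plan that always collects the reward in deterministic mode collects it only some of the time in slippery mode.

```python
import random

# Assumed standard 4x4 layout: S = start, F = frozen, H = hole, G = goal.
MAP = ["SFFF", "FHFH", "FFFH", "HFFG"]
# Assumed action encoding: 0 = left, 1 = down, 2 = right, 3 = up.
MOVES = {0: (0, -1), 1: (1, 0), 2: (0, 1), 3: (-1, 0)}


def step(pos, action, is_slippery, rng):
    """Return (new_pos, reward, done) for one move on the toy grid."""
    if is_slippery:
        # Intended direction or one of the two perpendicular ones, 1/3 each.
        action = rng.choice([(action - 1) % 4, action, (action + 1) % 4])
    dr, dc = MOVES[action]
    # Moves that would leave the grid keep the agent at the edge.
    row = min(max(pos[0] + dr, 0), len(MAP) - 1)
    col = min(max(pos[1] + dc, 0), len(MAP[0]) - 1)
    tile = MAP[row][col]
    # The reward rule is the same in both modes.
    reward = 1.0 if tile == "G" else 0.0
    return (row, col), reward, tile in "GH"


def run_plan(plan, is_slippery, rng):
    """Follow a fixed action sequence from the start and return the total reward."""
    pos, total = (0, 0), 0.0
    for action in plan:
        pos, reward, done = step(pos, action, is_slippery, rng)
        total += reward
        if done:
            break
    return total


# A hole-free route to the goal when moves are deterministic.
PLAN = [1, 1, 2, 2, 1, 2]
rng = random.Random(0)
deterministic_return = run_plan(PLAN, False, rng)
slippery_returns = [run_plan(PLAN, True, rng) for _ in range(1000)]
slippery_success_rate = sum(slippery_returns) / len(slippery_returns)
print(deterministic_return, slippery_success_rate)
```

So the parameter affects the reward only indirectly: the per-step reward is defined the same way in both modes, but the return an agent actually collects drops when its moves can be deflected.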

https://zoo.cs.yale.edu/classes/cs470/materials/hws/hw7/FrozenLake.html

