简体繁体 English

如何使用强化学习在图中找到不同人的步行路径？

[英]How can I Find Walking Paths for Different People in a Graph With Reinforcement Learning?

原文 2021-07-23 18:47:03 1 1 deep-learning/ artificial-intelligence/ reinforcement-learning/ path-finding/ q-learning

I don't know it is possible or not with reinforcement learning but my question is about finding walking paths for different people in a graph.我不知道强化学习是否可行，但我的问题是在图中为不同的人找到步行路径。 A sample image that I draw to help the problem description is here:我绘制的帮助问题描述的示例图像在这里：

The problem is that: There is a garden with graph representation.问题是：有一个带有图形表示的花园。 The black lines are walking ways on the garden and the reds are vertices, so junction points of ways.黑线是花园里的步行道，红线是顶点，所以是道路的交汇点。 Blues are people and they can walk over ways and they can only change their ways on the vertices(red points).蓝人是人，他们可以走在路上，他们只能在顶点（红点）上改变他们的路。 There are no limit for people, there can be 1 or 10+ people in the garden in same time.人数不限，花园可同时容纳1人或10人以上。 People wants to reach to the green gate with the shortest path but while they do that, they must not meet to each other in the same vertex in the same time.人们想以最短的路径到达绿色大门，但是当他们这样做时，他们不能同时在同一顶点相遇。 For example, person A can be in vertex1 and person B can be in the vertex2, but they both can be in the vertex3 together.例如，A 可以在 vertex1 中，B 可以在 vertex2 中，但他们可以一起在 vertex3 中。 I draw example paths for each people with pink color, so my purpose is finding path for each people in the garden which they reach to the green gate without meeting any of them with each other.我为每个人绘制了粉红色的示例路径，所以我的目的是为花园中的每个人找到他们到达绿色大门的路径，而不会遇到任何人。 It is like a time series problem because each person can walk to a vertex in a unit time and all people's speeds are equals.这就像一个时间序列问题，因为每个人在单位时间内可以走到一个顶点，并且所有人的速度都是相等的。

Is there a similar problem or solution for this?是否有类似的问题或解决方案？ Or what can I do to solve this problem?或者我能做些什么来解决这个问题？

1 个解决方案

Not a solution but for an advice, https://gym.openai.com/ has similar problems and also their solutions.不是解决方案，而是建议， https://gym.openai.com/有类似的问题及其解决方案。 I used once q-learning of basic implementation on https://gym.openai.com/envs/FrozenLake-v0/ this environment but on this environment works with one agent(people).我曾经在https://gym.openai.com/envs/FrozenLake-v0/这个环境上使用过一次基本实现的 q-learning，但在这个环境中与一个代理（人）一起工作。 You might want to check this out.你可能想看看这个。 Furthermore look at ant-colony and particle swarm optimization algorithms.再看看蚁群和粒子群优化算法。

如何获得 integer 作为 output 用于连续动作空间 PPO 强化学习？ - How can I get an integer as output for continuous action space PPO reinforcement learning?

类似观察的深度强化学习，但需要完全不同的动作，如何解决？ - Deep reinforcement learning for similar observations but need totally different actions, how to solve it?

如何将 Drake 与深度强化学习一起使用 - How to use Drake with deep reinforcement learning

如何在 openai-gym、强化学习的 Bipedalwalker-v3 中获得目标 Q 值？ - How do I get Target Q-values in Bipedalwalker-v3 in openai-gym, reinforcement learning?

用Keras模型进行强化学习 - Reinforcement Learning with Keras model

深度强化学习，如何制作控制多台机器的代理 - Deep Reinforcement Learning, how to make an agent that control many machines

深度强化学习 - CartPole 问题 - Deep Reinforcement Learning - CartPole Problem

强化学习，为什么表现崩溃了？ - Reinforcement learning, why the performance collapsed?

收集数据集的强化学习训练 - Reinforcement Learning Training on Collected dataset

为什么 RL 被称为“强化”学习？ - Why is RL called 'reinforcement' learning?

暂无

声明:本站的技术帖子网页，遵循CC BY-SA 4.0协议，如果您需要转载，请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

相关问题 如何获得 integer 作为 output 用于连续动作空间 PPO 强化学习？ - How can I get an integer as output for continuous action space PPO reinforcement learning? 类似观察的深度强化学习，但需要完全不同的动作，如何解决？ - Deep reinforcement learning for similar observations but need totally different actions, how to solve it? 如何将 Drake 与深度强化学习一起使用 - How to use Drake with deep reinforcement learning 如何在 openai-gym、强化学习的 Bipedalwalker-v3 中获得目标 Q 值？ - How do I get Target Q-values in Bipedalwalker-v3 in openai-gym, reinforcement learning? 用Keras模型进行强化学习 - Reinforcement Learning with Keras model 深度强化学习，如何制作控制多台机器的代理 - Deep Reinforcement Learning, how to make an agent that control many machines 深度强化学习 - CartPole 问题 - Deep Reinforcement Learning - CartPole Problem 强化学习，为什么表现崩溃了？ - Reinforcement learning, why the performance collapsed? 收集数据集的强化学习训练 - Reinforcement Learning Training on Collected dataset 为什么 RL 被称为“强化”学习？ - Why is RL called 'reinforcement' learning?

相关标签

如何使用强化学习在图中找到不同人的步行路径？

问题描述

1 个解决方案

解决方案1 0 2021-07-23 21:01:13

解决方案1
0 2021-07-23 21:01:13