简体   繁体   English

如何使用强化学习在图中找到不同人的步行路径?

[英]How can I Find Walking Paths for Different People in a Graph With Reinforcement Learning?

I don't know it is possible or not with reinforcement learning but my question is about finding walking paths for different people in a graph.我不知道强化学习是否可行,但我的问题是在图中为不同的人找到步行路径。 A sample image that I draw to help the problem description is here:我绘制的帮助问题描述的示例图像在这里: 在此处输入图片说明

The problem is that: There is a garden with graph representation.问题是:有一个带有图形表示的花园。 The black lines are walking ways on the garden and the reds are vertices, so junction points of ways.黑线是花园里的步行道,红线是顶点,所以是道路的交汇点。 Blues are people and they can walk over ways and they can only change their ways on the vertices(red points).蓝人是人,他们可以走在路上,他们只能在顶点(红点)上改变他们的路。 There are no limit for people, there can be 1 or 10+ people in the garden in same time.人数不限,花园可同时容纳1人或10人以上。 People wants to reach to the green gate with the shortest path but while they do that, they must not meet to each other in the same vertex in the same time.人们想以最短的路径到达绿色大门,但是当他们这样做时,他们不能同时在同一顶点相遇。 For example, person A can be in vertex1 and person B can be in the vertex2, but they both can be in the vertex3 together.例如,A 可以在 vertex1 中,B 可以在 vertex2 中,但他们可以一起在 vertex3 中。 I draw example paths for each people with pink color, so my purpose is finding path for each people in the garden which they reach to the green gate without meeting any of them with each other.我为每个人绘制了粉红色的示例路径,所以我的目的是为花园中的每个人找到他们到达绿色大门的路径,而不会遇到任何人。 It is like a time series problem because each person can walk to a vertex in a unit time and all people's speeds are equals.这就像一个时间序列问题,因为每个人在单位时间内可以走到一个顶点,并且所有人的速度都是相等的。

Is there a similar problem or solution for this?是否有类似的问题或解决方案? Or what can I do to solve this problem?或者我能做些什么来解决这个问题?

Not a solution but for an advice, https://gym.openai.com/ has similar problems and also their solutions.不是解决方案,而是建议, https://gym.openai.com/有类似的问题及其解决方案。 I used once q-learning of basic implementation on https://gym.openai.com/envs/FrozenLake-v0/ this environment but on this environment works with one agent(people).我曾经在https://gym.openai.com/envs/FrozenLake-v0/这个环境上使用过一次基本实现的 q-learning,但在这个环境中与一个代理(人)一起工作。 You might want to check this out.你可能想看看这个。 Furthermore look at ant-colony and particle swarm optimization algorithms.再看看蚁群和粒子群优化算法。

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM