
Cosmos DB Gremlin Query timeout

I am currently creating a PoC using Cosmos DB Graph. The data itself is around 100k nodes and 630k edges. In one subset of this data (1.7k nodes and 3.8k edges) I am trying to find the shortest path from A to B with Gremlin.

Somehow this is not possible. I get either a query timeout (30 seconds) or a loop error (cannot exceed 32 loops). There must be something wrong, on my side or on the Cosmos side. Can you please help or give a hint? I have already tried a lot of query variants, but the errors are still there. One of the basic queries I tried:

The limits of the Gremlin API service are documented here: https://docs.microsoft.com/en-us/azure/cosmos-db/gremlin-limits

You may need an OLAP engine to process such a large shortest-path query. You could consider Spark and its GraphFrames support to process it. Here is a sample: https://github.com/Azure/azure-cosmosdb-spark/blob/2.4/samples/graphframes/main.scala
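To illustrate what processing the path outside the Gremlin endpoint could look like, here is a minimal Python sketch of an unweighted shortest-path search (breadth-first search) over an edge list held in memory. It assumes the subgraph's edges have already been exported as directed `(source_id, target_id)` pairs; the export step is not shown, and the function name `shortest_path` and the sample ids are hypothetical. It is not the Spark sample linked above, only a small stand-in for the same idea.

```python
from collections import deque


def shortest_path(edges, source, target):
    """Return one shortest path from source to target as a list of ids, or None."""
    # Build an adjacency list from directed (source_id, target_id) pairs.
    adjacency = {}
    for src, dst in edges:
        adjacency.setdefault(src, []).append(dst)

    # parent doubles as the visited set and as the back-pointer map.
    parent = {source: None}
    queue = deque([source])
    while queue:
        vertex = queue.popleft()
        if vertex == target:
            # Walk the back-pointers to rebuild the path, then reverse it.
            path = []
            while vertex is not None:
                path.append(vertex)
                vertex = parent[vertex]
            return path[::-1]
        for neighbor in adjacency.get(vertex, []):
            if neighbor not in parent:
                parent[neighbor] = vertex
                queue.append(neighbor)
    return None  # target is not reachable from source


# Hypothetical sample data: two routes from A to B, one shorter than the other.
sample_edges = [("A", "C"), ("C", "D"), ("D", "B"), ("A", "E"), ("E", "B")]
print(shortest_path(sample_edges, "A", "B"))
```

Each vertex is enqueued at most once, so the search is linear in the number of vertices and edges, and it has no loop-depth limit of its own. Whether a client-side pass like this or a Spark job is the better fit depends on how large the graph is and how often the query runs.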


 