HyperLogLog for TinkerPop: .count() approximation

Is there a solution similar to HyperLogLog for graph databases such as TinkerPop? The .count() step takes forever on a large dataset, but an approximation would be sufficient.

For TinkerPop-enabled graph systems, counting at scale is typically handled by Gremlin OLAP (usually with Spark). Some graphs optimize for things like counts: as a very simple example, TinkerGraph detects a traversal like g.V().count() and bypasses iterating all vertices to count them. Some graphs also provide their own APIs for counts, so it is worth learning a bit about the graph you are using to determine whether such capabilities exist.
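If none of those options fits and an estimate really is enough, one possibility is to keep a HyperLogLog sketch outside the graph and feed it element ids as they are written. This is not a TinkerPop feature, just a rough illustration of the idea from the question: the `HyperLogLog` class, its `p` parameter, and the use of SHA-1 as the hash are all assumptions made for this sketch, and it is not tuned for production use.

```python
import hashlib
import math


class HyperLogLog:
    """Minimal HyperLogLog sketch (illustrative only)."""

    def __init__(self, p: int = 12):
        self.p = p                       # number of bits used to pick a register
        self.m = 1 << p                  # number of registers (4096 by default)
        self.registers = [0] * self.m
        # Bias-correction constant; this formula is valid for m >= 128.
        self.alpha = 0.7213 / (1 + 1.079 / self.m)

    def add(self, item) -> None:
        # Derive a 64-bit hash from the item's string form.
        digest = hashlib.sha1(str(item).encode("utf-8")).digest()
        x = int.from_bytes(digest[:8], "big")
        idx = x >> (64 - self.p)                  # top p bits select a register
        rest = x & ((1 << (64 - self.p)) - 1)     # remaining 64 - p bits
        # Rank is the 1-based position of the leftmost 1-bit in `rest`.
        rank = (64 - self.p) - rest.bit_length() + 1
        if rank > self.registers[idx]:
            self.registers[idx] = rank

    def count(self) -> float:
        raw = self.alpha * self.m * self.m / sum(2.0 ** -r for r in self.registers)
        zeros = self.registers.count(0)
        if raw <= 2.5 * self.m and zeros:
            # Small-range correction: fall back to linear counting.
            return self.m * math.log(self.m / zeros)
        return raw


# Hypothetical usage: call add() with each vertex id at write time,
# then read the estimate instead of running g.V().count().
sketch = HyperLogLog()
for vertex_id in range(1000):
    sketch.add(vertex_id)
estimate = sketch.count()
```

With 4096 registers the expected relative error is roughly 1.6%, and re-adding an id that was already seen leaves the estimate unchanged. A sketch like this cannot account for deletions, so it only suits graphs that mostly grow.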
