Connect Hadoop to Neo4j

I want to import a GML file (GraphML file) from Hadoop into Neo4j, and for that I need to set up a connection between Hadoop and Neo4j. I checked the Neo4j site but didn't find anything that helps: https://neo4j.com/developer/apache-hadoop/ This is what the page says about connecting Hadoop and Neo4j:

In the past there were some approaches that used Hadoop to quickly generate Neo4j datastores directly. While this approach is performant, it is also tightly coupled to the store-format of a certain Neo4j version as it has to duplicate the functionality of writing to split-up store-files. With the parallel neo4j-import tool and APIs introduced in Neo4j 2.2, such a solution is no longer needed. The import facilities scale across a large number of CPUs to maximize import performance.

Can anyone explain what that means? Thank you.

Maybe I'm not understanding exactly what you're trying to accomplish, but if you already have the GraphML file you don't need a connection to Hadoop. Just copy the file to the Neo4j instance or serve it over the web.

GraphML import is available in Neo4j through the apoc.import.graphml procedure. For example:

CALL apoc.import.graphml("graph.gml", {batchSize: 10000, readLabels: true, storeNodeIds: false, defaultRelationshipType: "RELATED"});
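
If you want to script both steps (copy the file out of HDFS, then run the import), something like the following Python sketch might work. It is only an illustration, not a tested setup: it assumes the `hdfs` command-line client is on the PATH, that the official `neo4j` Python driver and the APOC plugin are installed, that file import is enabled for APOC (`apoc.import.file.enabled=true`), and that `local_path` points into the directory Neo4j reads import files from. The helper names, paths, URI, and credentials are all made up for the example.

```python
import subprocess

# Same call as the Cypher statement above, with the file name and the
# config map passed as query parameters.
IMPORT_QUERY = "CALL apoc.import.graphml($file, $config)"


def hdfs_get_command(hdfs_path, local_path):
    """Build the `hdfs dfs -get` command that copies a file out of HDFS."""
    return ["hdfs", "dfs", "-get", hdfs_path, local_path]


def import_params(file_name, batch_size=10000):
    """Build the parameters for IMPORT_QUERY (same options as the example above)."""
    return {
        "file": file_name,
        "config": {
            "batchSize": batch_size,
            "readLabels": True,
            "storeNodeIds": False,
            "defaultRelationshipType": "RELATED",
        },
    }


def copy_and_import(uri, user, password, hdfs_path, local_path, file_name):
    """Copy the GraphML file from HDFS, then ask Neo4j to import it via APOC."""
    # Imported here so the helpers above work without the driver installed.
    from neo4j import GraphDatabase  # third-party: pip install neo4j

    # Step 1: pull the file out of HDFS onto the Neo4j host.
    subprocess.run(hdfs_get_command(hdfs_path, local_path), check=True)

    # Step 2: run the APOC import procedure on the copied file.
    with GraphDatabase.driver(uri, auth=(user, password)) as driver:
        with driver.session() as session:
            session.run(IMPORT_QUERY, import_params(file_name)).consume()
```

A call could then look like `copy_and_import("bolt://localhost:7687", "neo4j", "<password>", "/data/graph.gml", "/var/lib/neo4j/import/graph.gml", "graph.gml")`, where every argument is a placeholder to adapt to your own cluster and Neo4j installation.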
