简体   繁体   English

如何在MapReduce Job中通过python扫描HBase?

[英]How to scan HBase by python in MapReduce Job?

my program of mapreduce jobs is written by python, and I want to get data from HBase as input. 我的mapreduce作业程序是由python编写的,我想从HBase获取数据作为输入。 Is there any way to scan the data in HBase table in MapReduce way, such as the TableMapper/Reducer in the native Java implementation. 有什么方法可以通过MapReduce方式扫描HBase表中的数据,例如本机Java实现中的TableMapper / Reducer。 I learned that HappyBase can provide a easy way to interact with HBase, but it only works in client side. 我了解到HappyBase可以提供一种与HBase进行交互的简便方法,但它仅在客户端有效。

您可以编写一个从HBase获取数据的Pig脚本,然后通过或(如果使用jython则)通过UDF集成python

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM