简体   繁体   English

从Amazon HBase读取数据

[英]read data from amazon hbase

Can anyone suggest me that whether I can read data from amazon hbase using the org.apache.hadoop.conf.Configuration and org.apache.hadoop.hbase.client.HTablePool. 谁能建议我,是否可以使用org.apache.hadoop.conf.Configuration和org.apache.hadoop.hbase.client.HTablePool从Amazon hbase读取数据。

We are migrating to Amazon's EMR framework having hbase running on top of it. 我们正在迁移到具有在其之上运行hbase的Amazon EMR框架。

The present implementation is based on pure Apache hadoop and hbase distributions. 本实现基于纯Apache hadoop和hbase发行版。 I'm trying to verify that no code changes needed even we migrate to amazon's EMR. 我试图验证即使我们迁移到亚马逊的EMR,也无需更改代码。

Please share your thoughts. 请分享您的想法。

While it should not happen, I would expect the problems and changes related to the nature of EC2 and its networking. 尽管它不应该发生,但我希望与EC2及其网络的性质有关的问题和变化。
HBase relay on Regions able to renew their leases in timely manner. HBase中继能够及时更新其租约的区域。 If Region servers are two busy - because of some massive operations over them, they can not do so and get kicked off the cluster. 如果区域服务器是两个繁忙的服务器-由于对其进行了大量操作,则它们将无法运行而无法启动群集。
In amazon performance of the EC2 instances are much less predictable then in dedicated cluster (unless you use cluster instances), so adjusting timeout parameters and/or nature of your loads might be needed to get cluster to work properly 在Amazon中,EC2实例的性能要比专用集群中的性能难以预测(除非您使用集群实例),因此可能需要调整超时参数和/或负载的性质才能使集群正常工作

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM