HBase table as MapReduce input?
What are the pros and cons of using an HBase table as the input to a MapReduce job? How does it affect performance?
Pros: 1. Point lookups are possible, so the job does not have to read the whole data set.
Cons: 1. If the HBase block size is not tuned properly, scanning a very small set of rows can end up scanning the complete underlying data (in the worst case, reading 1% of the rows may mean reading 100% of the data).
Overall, it all depends on how HBase has been tuned for your read/write patterns.
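The "1% read may lead to reading 100% of the data" worst case can be illustrated with a toy model. This is only a sketch, not HBase code: it assumes rows are stored in sorted order and packed into fixed-size blocks, that a block must be read in full to get any row in it, and it ignores caching, indexes and everything else HBase does. The names `blocks_touched`, `read_fraction`, `TOTAL_ROWS` and `ROWS_PER_BLOCK` are made up for the illustration.

```python
import math


def blocks_touched(wanted_rows, rows_per_block):
    """Return the indexes of the blocks that hold at least one wanted row.

    Toy assumption: row i lives in block i // rows_per_block.
    """
    return {row // rows_per_block for row in wanted_rows}


def read_fraction(wanted_rows, total_rows, rows_per_block):
    """Fraction of all blocks that must be read to fetch wanted_rows."""
    total_blocks = math.ceil(total_rows / rows_per_block)
    return len(blocks_touched(wanted_rows, rows_per_block)) / total_blocks


TOTAL_ROWS = 100_000   # hypothetical table size
ROWS_PER_BLOCK = 100   # hypothetical number of rows that fit in one block

# Both selections cover 1% of the rows (1,000 out of 100,000).
contiguous = range(1000)                 # one contiguous key range
scattered = range(0, TOTAL_ROWS, 100)    # every 100th row

contiguous_fraction = read_fraction(contiguous, TOTAL_ROWS, ROWS_PER_BLOCK)
scattered_fraction = read_fraction(scattered, TOTAL_ROWS, ROWS_PER_BLOCK)

# Same scattered selection, but with blocks that hold only 10 rows each.
scattered_small_blocks = read_fraction(scattered, TOTAL_ROWS, 10)

print(f"contiguous 1%, 100 rows/block: {contiguous_fraction:.0%} of blocks read")
print(f"scattered 1%, 100 rows/block:  {scattered_fraction:.0%} of blocks read")
print(f"scattered 1%, 10 rows/block:   {scattered_small_blocks:.0%} of blocks read")
```

In this model the contiguous selection touches 1% of the blocks, while the scattered one touches every block. With smaller blocks the same scattered selection touches only 10% of them. That is the sense in which the cost of a small scan depends on how the block size matches the access pattern.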