简体繁体中英

Hbase mapreduce interaction

原文 2012-11-09 07:39:24 9 1 hadoop/ mapreduce/ hbase

I have an program hbase and mapreduce.

I store data in HDFS, size of this file is : 100G. Now i put this data to Hbase.

I use mapreduce to scan this file lost 5 minutes. But to scan hbase table lost 30 minutes.

How to increase the speed when using hbase and mapreduce ?

Thanks.

1 answers

I am assuming you are having a Single Node HDFS. If you had your 100Gb file in a Multi Node cluster of HDFS, it would have been much faster for both Map Reduce and Hive.

You could try increasing no of mappers and reducers on Map Reduce to gain some performance increase, have a look at this post .

Hive is essentially a Data Warehousing tool built on top of HDFS and every query is underneath is a Map Reduce task itself. So above post would answer this problem also.

HBase with MapReduce

HBase mapreduce: write into HBase in Reducer

Nullpointer exception in HBase MapReduce

hadoop hbase mapreduce combiner

Hbase mapreduce error

HBase table as MapReduce input?

Incremental MapReduce with Hadoop and HBase

MapReduce with put in HBase

MapReduce HBase NullPointerException

Using an HBase table as MapReduce source

暂无

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

Related Question HBase with MapReduce HBase mapreduce: write into HBase in Reducer Nullpointer exception in HBase MapReduce hadoop hbase mapreduce combiner Hbase mapreduce error HBase table as MapReduce input? Incremental MapReduce with Hadoop and HBase MapReduce with put in HBase MapReduce HBase NullPointerException Using an HBase table as MapReduce source

Related Tags

Hbase mapreduce interaction

Question

1 answers

solution1 0 2012-11-09 08:17:04

solution1
0 2012-11-09 08:17:04