简体繁体中英

HBase rowkey which includes timestamp

原文 2015-11-24 07:51:00 7 1 hadoop/ hbase

I would like to whether it is bad to have rowkeys like the following:

username-timestamp

This rows would be read from MapReduce jobs and will be put using java client API. Also, a subset would be selected using STARTROW, ENDROW.

On one side this seems convinient for my usecase since I can scan for specific interval and rows arebmostly subsequent for MR job, while on the other I read that it is good to avoid long rowkeys and hotspoting.

Is there really a problem with this design and how to overcome it?

I'm new to HBase so any help would be great.

1 answers

The general advice is to avoid monotonically increasing row keys. To that purpose, some software tools include a so called "salt" to the row key, which hashes the keys across regions. A discussion can be found here: http://hbase.apache.org/0.94/book/rowkey.design.html . And here: https://phoenix.apache.org/salted.html . You can also look at Apache Trafodion http://trafodion.apache.org/ , which uses row key salting to distribute SQL-like primary keys.

nested Rowkey in Hbase tables

Designing composite rowkey for Hbase

Hbase RowKey design schema

HBase Get values where rowkey in

Generate composite hbase rowkey using Flume Serializer

Using Impala to query salted Hbase rowkey

HBase query when rowkey is not completely known

Inserting filename as rowkey using HBase MapReduce

HBase MapReduce - Splitting a region based on rowkey

Can I use SingleColumnValueFilter on rowkey in HBase?

暂无

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

Related Question nested Rowkey in Hbase tables Designing composite rowkey for Hbase Hbase RowKey design schema HBase Get values where rowkey in Generate composite hbase rowkey using Flume Serializer Using Impala to query salted Hbase rowkey HBase query when rowkey is not completely known Inserting filename as rowkey using HBase MapReduce HBase MapReduce - Splitting a region based on rowkey Can I use SingleColumnValueFilter on rowkey in HBase?

Related Tags

HBase rowkey which includes timestamp

Question

1 answers

solution1 1 ACCPTED 2015-11-24 07:59:36

solution1
1 ACCPTED 2015-11-24 07:59:36