Apache Gora over HBase

We are planning to implement a data warehouse based on HBase.

There are multiple entities that have a parent-child relationship; in other words, we have nesting.

public class Parent {

    private String name;

    private List<Dependent> dependents;
    ...
}

With HBase, we can store these nested entities as a value (serialized as JSON or Avro) in one of the column families.

We are looking at Phoenix for real-time querying, and I came across Apache Gora as one of the options. According to the documentation, Apache Gora supports nested entities, but there are few examples and little documentation.

Does anyone have examples of, or experience with, Apache Gora over HBase?

I have some development notes at http://people.apache.org/~alfonsonishikawa/gora-174.html and http://people.apache.org/~alfonsonishikawa/gora-174-notes.html (sorry if they are not very clear).

First of all, if I am not wrong, Apache Gora is not integrated with Apache Phoenix, so I believe your needs will not be covered.

Anyway, regarding your question about nested entities: Gora serializes the dependent entity and puts it into a column. When you have a List/Array of dependents, as in your example, each dependent entity goes into its own column of a family, and the generated column name is the index. When you have a Map of dependents, each entity also goes into its own column of a family, and the column name is the key of that entity in the map.
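The column layout described above can be pictured with plain Java maps. This is only a sketch: it does not call Gora or HBase, and the class name, the `Dependent` fields, and the `name|age` encoding are all made up for illustration (Gora's real serialization and the byte encoding of the index qualifier will differ).

```java
import java.nio.charset.StandardCharsets;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Illustration only: models "one column per dependent" with in-memory maps.
// Nothing here is Gora or HBase API; every name is hypothetical.
class NestedColumnLayoutSketch {

    // Hypothetical stand-in for the nested entity.
    static final class Dependent {
        final String name;
        final int age;

        Dependent(String name, int age) {
            this.name = name;
            this.age = age;
        }
    }

    // Stand-in serializer (assumption): a toy "name|age" encoding,
    // not the format Gora actually writes.
    static byte[] serialize(Dependent d) {
        return (d.name + "|" + d.age).getBytes(StandardCharsets.UTF_8);
    }

    // List/array case: one column per element, qualifier = element index.
    static Map<String, byte[]> listToColumns(List<Dependent> dependents) {
        Map<String, byte[]> columns = new LinkedHashMap<>();
        for (int i = 0; i < dependents.size(); i++) {
            columns.put(Integer.toString(i), serialize(dependents.get(i)));
        }
        return columns;
    }

    // Map case: one column per entry, qualifier = the map key.
    static Map<String, byte[]> mapToColumns(Map<String, Dependent> dependents) {
        Map<String, byte[]> columns = new LinkedHashMap<>();
        dependents.forEach((key, value) -> columns.put(key, serialize(value)));
        return columns;
    }

    public static void main(String[] args) {
        List<Dependent> asList = List.of(new Dependent("Ann", 30), new Dependent("Bob", 12));

        Map<String, Dependent> asMap = new LinkedHashMap<>();
        asMap.put("daughter", new Dependent("Ann", 30));
        asMap.put("son", new Dependent("Bob", 12));

        System.out.println(listToColumns(asList).keySet()); // prints [0, 1]
        System.out.println(mapToColumns(asMap).keySet());   // prints [daughter, son]
    }
}
```

In both cases the cell value is an opaque byte array; only the qualifier (index or map key) is visible to the store.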

Additionally, the fact that the data is serialized means that you can't query on its content.
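In practice this usually means reading the cells back and filtering on the client after deserializing them. A minimal sketch of that idea, assuming a toy `name|age` cell encoding and hypothetical helper names (none of this is Gora or HBase API):

```java
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Illustration only: the store sees opaque bytes, so a condition on a nested
// field is evaluated on the client after decoding each cell.
class ClientSideFilterSketch {

    // Hypothetical decoder for cells written as "name|age" in UTF-8
    // (an assumed toy format, not Gora's real serialization).
    static int decodeAge(byte[] cell) {
        String text = new String(cell, StandardCharsets.UTF_8);
        return Integer.parseInt(text.substring(text.lastIndexOf('|') + 1));
    }

    // Returns the qualifiers of the columns whose decoded age is >= minAge.
    static List<String> qualifiersWithAgeAtLeast(Map<String, byte[]> columns, int minAge) {
        List<String> matches = new ArrayList<>();
        for (Map.Entry<String, byte[]> column : columns.entrySet()) {
            if (decodeAge(column.getValue()) >= minAge) {
                matches.add(column.getKey());
            }
        }
        return matches;
    }

    public static void main(String[] args) {
        // Columns of one row, as they might come back from a read (qualifier -> bytes).
        Map<String, byte[]> columns = new LinkedHashMap<>();
        columns.put("0", "Ann|30".getBytes(StandardCharsets.UTF_8));
        columns.put("1", "Bob|12".getBytes(StandardCharsets.UTF_8));

        System.out.println(qualifiersWithAgeAtLeast(columns, 18)); // prints [0]
    }
}
```

Every candidate row has to be fetched and decoded before the condition can be checked, which is the cost to weigh if you need real-time queries on nested fields.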

I hope this resolves your doubts, but if you have any other questions, just ask :)
