简体繁体中英

Anyone using DynamoDB and Hive without using EMR?

原文 2012-04-18 20:28:58 2 1 hadoop/ amazon-dynamodb/ elastic-map-reduce

I was reading the below integration of using Hive for querying data on DynamoDB. http://aws.typepad.com/aws/2012/01/aws-howto-using-amazon-elastic-mapreduce-with-dynamodb.html

But as per that link, Hive needs to be setup on top of EMR. But I wanted to know if I can use this integration with the standalone Hadoop cluster I already have instead of using EMR. Has anyone done this? Will there be sync issues between data in DynamoDB and HDFS happen compared to using EMR?

1 answers

To be able to use it on your own cluster, you would need the custom StorageHandler for DynamoDB(it probably involves a custom SerDe as well).

It seems to be no available at the moment, at least not at AWS website.

What you can do is use the JDBC interface , provided by Amazon, to produce the queries from your cluster, but it would still be executed on top of EMR.

Processing logs in Amazon EMR with or without using Hive

Connect to Hive on EMR using Apache Drill Embedded

Not able to connect to hive on AWS EMR using java

EMR - Cannot create external Hive table using jdbcstoragehandler

Why do we use the Hive service principal when using beeline to connect to Hive on a Kerberos enabled EMR cluster?

Can anyone say is it possible to create hive external tables using Java

Hadoop EMR using Python

How can I delete a hive database without using hive terminal?

Connect to Hive from Spark without using “hive-site.xml”

data cleaning in hdfs without using hive

暂无

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

Related Question Processing logs in Amazon EMR with or without using Hive Connect to Hive on EMR using Apache Drill Embedded Not able to connect to hive on AWS EMR using java EMR - Cannot create external Hive table using jdbcstoragehandler Why do we use the Hive service principal when using beeline to connect to Hive on a Kerberos enabled EMR cluster? Can anyone say is it possible to create hive external tables using Java Hadoop EMR using Python How can I delete a hive database without using hive terminal? Connect to Hive from Spark without using “hive-site.xml” data cleaning in hdfs without using hive

Related Tags

Anyone using DynamoDB and Hive without using EMR?

Question

1 answers

solution1 0 2012-04-19 14:45:37

solution1
0 2012-04-19 14:45:37