簡體 English 中英

Spark、HiveContext、ThriftServer - 表持久化

[英]Spark, HiveContext, ThriftServer - Table persistence

原文 2016-03-18 15:36:52 2 2 apache-spark/ hive/ apache-spark-sql

我已經配置了數據 SparkStreaming。 我想為各種目標保留這些數據：

為 Tableau 公開（它需要 thriftServer，而 thriftServer 需要 hiveContext）。
有時我希望能夠更新一些數據。

HiveContext 中的數據保存在哪里？ 在記憶中？ 在本地磁盤上？ 它是由 thriftServer 提供的嗎？

2 個解決方案

您可以選擇使用緩存在內存上的數據

your_hive_context.cacheTable("table_name")

Thrift Server 訪問包含所有表，甚至臨時表的全局上下文。

如果您緩存表 Tableau 將更快地獲得查詢結果，但您必須繼續運行 Spark Batch 應用程序。

我還沒有找到一種在不打開新 HiveContext 的情況下更新某些數據的方法。

您可以通過執行以下yourDataFrame.saveAsTable("YourTableName")將數據幀從 spark 保存到配置單元表： yourDataFrame.saveAsTable("YourTableName")

如果要將數據插入現有表，可以使用： yourDataFrame.writer().mode(SaveMode.Append).saveAsTable("YourTableName")

這會將您的 DataFrame 保存在持久的 Hive 表中。 此表的位置將取決於hive-site.xml中的hive-site.xml 。

默認情況下，如果您在本地進行測試，則該位置將位於您本地磁盤上的位置/user/hive/warehouse/YourTableName

如果您在 Yarn/HDFS 上將 Spark 與 Hive 一起使用，則該表將保存在 HDFS 上由 hive-site.xml 配置文件中的屬性hive.metastore.warehouse.dir定義的位置

希望會有所幫助:)

鎖定來自HiveContext的Hive表

[英]locking hive table from spark HiveContext

來自 Spark hivecontext 的查詢會鎖定 hive 表嗎？

[英]Will query from Spark hivecontext lock the hive table?

Spark SQL-Hivecontext-Hive中從一個表到另一表的數據復制

[英]Spark SQL - Hivecontext - Datacopy from one table to another table in Hive

使用Spark Scala將數據插入HiveContext的Hive表中

[英]Insert data into a Hive table with HiveContext using Spark Scala

Spark HiveContext：插入覆蓋從中讀取的同一表

[英]Spark HiveContext : Insert Overwrite the same table it is read from

使用 spark hivecontext 讀取外部配置單元分區表的問題

[英]Issues with reading external hive partitioned table using spark hivecontext

Spark HiveContext與HbaseContext？

[英]Spark HiveContext vs HbaseContext?

使用thriftserver和beeline錯誤將數據從hdfs加載到spark2.1表中

[英]load data from hdfs into spark2.1 table using thriftserver and beeline error

Bluemix Spark中的HiveContext

[英]HiveContext in Bluemix Spark

使用 HiveContext 的多個 Spark 應用程序

[英]Multiple Spark applications with HiveContext

暫無

暫無

聲明:本站的技術帖子網頁，遵循CC BY-SA 4.0協議，如果您需要轉載，請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.

相關問題 鎖定來自HiveContext的Hive表來自 Spark hivecontext 的查詢會鎖定 hive 表嗎？ Spark SQL-Hivecontext-Hive中從一個表到另一表的數據復制使用Spark Scala將數據插入HiveContext的Hive表中 Spark HiveContext：插入覆蓋從中讀取的同一表使用 spark hivecontext 讀取外部配置單元分區表的問題 Spark HiveContext與HbaseContext？使用thriftserver和beeline錯誤將數據從hdfs加載到spark2.1表中 Bluemix Spark中的HiveContext 使用 HiveContext 的多個 Spark 應用程序

相關標簽

粵ICP備18138465號 © 2020-2024 STACKOOM.COM