
Avoid single file with hive.optimize.sort.dynamic.partition option

I am using Hive.

When I write dynamic partitions with an INSERT query and turn on the hive.optimize.sort.dynamic.partition option ( SET hive.optimize.sort.dynamic.partition=true ), there is always just a single file in each partition.

But if I turn the option off ( SET hive.optimize.sort.dynamic.partition=false ), I get an out-of-memory exception like this:

TaskAttempt 3 failed, info=[Error: Error while running task ( failure ) : attempt_1534502930145_6994_1_01_000008_3:java.lang.RuntimeException: java.lang.OutOfMemoryError: Java heap space
        at org.apache.hadoop.hive.ql.exec.tez.TezProcessor.initializeAndRunProcessor(TezProcessor.java:194)
        at org.apache.hadoop.hive.ql.exec.tez.TezProcessor.run(TezProcessor.java:168)
        at org.apache.tez.runtime.LogicalIOProcessorRuntimeTask.run(LogicalIOProcessorRuntimeTask.java:370)
        at org.apache.tez.runtime.task.TaskRunner2Callable$1.run(TaskRunner2Callable.java:73)
        at org.apache.tez.runtime.task.TaskRunner2Callable$1.run(TaskRunner2Callable.java:61)
        at java.security.AccessController.doPrivileged(Native Method)
        at javax.security.auth.Subject.doAs(Subject.java:422)
        at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1836)
        at org.apache.tez.runtime.task.TaskRunner2Callable.callInternal(TaskRunner2Callable.java:61)
        at org.apache.tez.runtime.task.TaskRunner2Callable.callInternal(TaskRunner2Callable.java:37)
        at org.apache.tez.common.CallableWithNdc.call(CallableWithNdc.java:36)
        at java.util.concurrent.FutureTask.run(FutureTask.java:266)
        at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
        at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
        at java.lang.Thread.run(Thread.java:748)
Caused by: java.lang.OutOfMemoryError: Java heap space
        at org.apache.parquet.column.values.dictionary.IntList.initSlab(IntList.java:90)
        at org.apache.parquet.column.values.dictionary.IntList.<init>(IntList.java:86)
        at org.apache.parquet.column.values.dictionary.DictionaryValuesWriter.<init>(DictionaryValuesWriter.java:93)
        at org.apache.parquet.column.values.dictionary.DictionaryValuesWriter$PlainBinaryDictionaryValuesWriter.<init>(DictionaryValuesWriter.java:229)
        at org.apache.parquet.column.ParquetProperties.dictionaryWriter(ParquetProperties.java:131)
        at org.apache.parquet.column.ParquetProperties.dictWriterWithFallBack(ParquetProperties.java:178)
        at org.apache.parquet.column.ParquetProperties.getValuesWriter(ParquetProperties.java:203)
        at org.apache.parquet.column.impl.ColumnWriterV1.<init>(ColumnWriterV1.java:83)
        at org.apache.parquet.column.impl.ColumnWriteStoreV1.newMemColumn(ColumnWriteStoreV1.java:68)
        at org.apache.parquet.column.impl.ColumnWriteStoreV1.getColumnWriter(ColumnWriteStoreV1.java:56)
        at org.apache.parquet.io.MessageColumnIO$MessageColumnIORecordConsumer.<init>(MessageColumnIO.java:184)
        at org.apache.parquet.io.MessageColumnIO.getRecordWriter(MessageColumnIO.java:376)
        at org.apache.parquet.hadoop.InternalParquetRecordWriter.initStore(InternalParquetRecordWriter.java:109)
        at org.apache.parquet.hadoop.InternalParquetRecordWriter.<init>(InternalParquetRecordWriter.java:99)
        at org.apache.parquet.hadoop.ParquetRecordWriter.<init>(ParquetRecordWriter.java:100)
        at org.apache.parquet.hadoop.ParquetOutputFormat.getRecordWriter(ParquetOutputFormat.java:327)
        at org.apache.parquet.hadoop.ParquetOutputFormat.getRecordWriter(ParquetOutputFormat.java:288)
        at org.apache.hadoop.hive.ql.io.parquet.write.ParquetRecordWriterWrapper.<init>(ParquetRecordWriterWrapper.java:67)
        at org.apache.hadoop.hive.ql.io.parquet.MapredParquetOutputFormat.getParquerRecordWriterWrapper(MapredParquetOutputFormat.java:128)
        at org.apache.hadoop.hive.ql.io.parquet.MapredParquetOutputFormat.getHiveRecordWriter(MapredParquetOutputFormat.java:117)
        at org.apache.hadoop.hive.ql.io.HiveFileFormatUtils.getRecordWriter(HiveFileFormatUtils.java:286)
        at org.apache.hadoop.hive.ql.io.HiveFileFormatUtils.getHiveRecordWriter(HiveFileFormatUtils.java:271)
        at org.apache.hadoop.hive.ql.exec.FileSinkOperator.createBucketForFileIdx(FileSinkOperator.java:619)
        at org.apache.hadoop.hive.ql.exec.FileSinkOperator.createBucketFiles(FileSinkOperator.java:563)
        at org.apache.hadoop.hive.ql.exec.FileSinkOperator.createNewPaths(FileSinkOperator.java:867)
        at org.apache.hadoop.hive.ql.exec.FileSinkOperator.getDynOutPaths(FileSinkOperator.java:975)
        at org.apache.hadoop.hive.ql.exec.FileSinkOperator.process(FileSinkOperator.java:715)
        at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:897)
        at org.apache.hadoop.hive.ql.exec.SelectOperator.process(SelectOperator.java:95)
        at org.apache.hadoop.hive.ql.exec.tez.ReduceRecordSource$GroupIterator.next(ReduceRecordSource.java:356)
        at org.apache.hadoop.hive.ql.exec.tez.ReduceRecordSource.pushRecord(ReduceRecordSource.java:287)
        at org.apache.hadoop.hive.ql.exec.tez.ReduceRecordProcessor.run(ReduceRecordProcessor.java:317)
]], Vertex did not succeed due to OWN_TASK_FAILURE, failedTasks:1 killedTasks:299, Vertex vertex_1534502930145_6994_1_01 [Reducer 2] killed/failed due to:OWN_TASK_FAILURE]Vertex killed, vertexName=Map 1, vertexId=vertex_1534502930145_6994_1_00, diagnostics=[Vertex received Kill while in RUNNING state., Vertex did not succeed due to OTHER_VERTEX_FAILURE, failedTasks:0 killedTasks:27, Vertex vertex_1534502930145_6994_1_00 [Map 1] killed/failed due to:OTHER_VERTEX_FAILURE]DAG did not succeed due to VERTEX_FAILURE. failedVertices:1 killedVertices:1

I guess this exception is raised because a reducer writes to many partitions simultaneously, but I cannot find out how to control that. I followed this article, but it did not help me.

My environment is as follows:

  • AWS EMR 5.12.1
  • Tez as the execution engine
  • Hive version 2.3.2, Tez version 0.8.2
  • HDFS block size of 128 MB
  • About 30 dynamic partitions to write with the INSERT query

Here is my sample query:

SET hive.exec.dynamic.partition.mode=nonstrict;
SET hive.optimize.sort.dynamic.partition=true;
SET hive.exec.reducers.bytes.per.reducer=1048576;
SET mapred.reduce.tasks=300;
FROM raw_data
INSERT OVERWRITE TABLE idw_data
  PARTITION(event_timestamp_date)
  SELECT
    *
  WHERE 
    event_timestamp_date BETWEEN '2018-09-09' AND '2018-10-09' 
DISTRIBUTE BY event_timestamp_date
;

distribute by partition key helps with the OOM problem, but this configuration may cause each reducer to write an entire partition, depending on the hive.exec.reducers.bytes.per.reducer setting, which can default to a very high value such as 1 GB. distribute by partition key may also add an extra reduce stage, and so does hive.optimize.sort.dynamic.partition.

So, to avoid OOM and get the best performance:

  1. Add distribute by partition key at the end of your insert query; this causes the same partition keys to be processed by the same reducer(s). Alternatively, or in addition to this setting, you can use hive.optimize.sort.dynamic.partition=true.
  2. Set hive.exec.reducers.bytes.per.reducer to a value that will trigger more reducers if there is too much data in one partition. Just check the current value of hive.exec.reducers.bytes.per.reducer and reduce or increase it to get the right reducer parallelism. This setting determines how much data a single reducer will process and how many files will be created per partition.

Example (your_table below stands in for the destination table name, which the original answer left out):

set hive.exec.reducers.bytes.per.reducer=33554432;

-- your_table is a placeholder for the destination table
insert overwrite table your_table partition (load_date)
select * from src_table
distribute by load_date;

Also see this answer about controlling the number of mappers and reducers: https://stackoverflow.com/a/42842117/2700344
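
For quick reference, the usual reducer-parallelism settings in Hive can be sketched like this (the values below are purely illustrative, not recommendations):

-- average amount of input data per reducer; a lower value triggers more reducers
SET hive.exec.reducers.bytes.per.reducer=67108864;
-- upper bound on the number of reducers Hive may start
SET hive.exec.reducers.max=300;
-- or force an exact reducer count, overriding Hive's estimate
SET mapred.reduce.tasks=100;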

Finally, I found out what the problem was.

First of all, the execution engine is Tez, so the mapreduce.reduce.memory.mb option does not help; you should use the hive.tez.container.size option instead. When writing dynamic partitions, a reducer opens multiple record writers, so it needs enough memory to write to multiple partitions simultaneously.
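
A minimal sketch of the relevant settings, assuming a 4 GB Tez container (the numbers are illustrative and need to be sized for your cluster):

SET hive.execution.engine=tez;
-- memory, in MB, allocated to each Tez container (the setting that mattered here)
SET hive.tez.container.size=4096;
-- JVM heap of the task inside the container, typically ~80% of the container size
SET hive.tez.java.opts=-Xmx3276m;

A larger heap lets a single reducer keep more Parquet record writers (one per open partition) in memory at the same time.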

If you use the hive.optimize.sort.dynamic.partition option, a global partition sort is performed, and sorting implies reducers. In that case, if there are no other reducer tasks, each partition is handled by exactly one reducer, which is why there is only one file per partition. DISTRIBUTE BY creates more reduce tasks, so it can create more files in each partition, but the same memory problem remains.
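
As an illustration only (this is not from the answers above): one way to spread each partition across several reducers, and therefore several files, is to add a second, hypothetical bucketing expression to DISTRIBUTE BY. The FLOOR(RAND()*4) term and the bucket count of 4 are arbitrary choices for this sketch.

FROM raw_data
INSERT OVERWRITE TABLE idw_data
  PARTITION(event_timestamp_date)
  SELECT
    *
  WHERE
    event_timestamp_date BETWEEN '2018-09-09' AND '2018-10-09'
-- FLOOR(RAND()*4) spreads each partition over up to 4 reducers, so up to 4 files per partition;
-- note that RAND() is non-deterministic, so rows may be redistributed if a failed task is retried
DISTRIBUTE BY event_timestamp_date, FLOOR(RAND()*4)
;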

So the container memory size is very important! Don't forget to use the hive.tez.container.size option to change the Tez container memory size!
