簡體 English 中英

火花容器被紗線殺死

[英]spark container get killed by yarn

原文 2017-07-14 02:59:16 1 1 apache-spark/ yarn

我有一個675GB鑲木地板文件的龐大數據集，具有快速壓縮功能，我必須將其與4個，5個表（大小為10 GB）一起加入。 我有一個500多個節點的群集，每個節點具有128 GB的ram，但是我只能運行最多28 GB的執行程序，否則yarn無法分配內存。 請建議我該如何處理這種情況。 目前，我正在運行pyspark 1.6，並且每個節點僅使用26 Gb ram運行1個執行程序。 但是，如果我在蜂巢中運行整個聯接，則需要花費一些時間，但要完成。 我應該如何有效地使用我的集群並通過這種連接進行處理

謝謝sPradeep

1 個解決方案

您應該嘗試增加spark.sql.shuffle.partitions ，默認情況下為200。此參數控制改組時（例如，在joins，groupBy等期間）的分區（因此是任務）的數量。 嘗試將值設置為5000，看看是否可行。

Yarn Spark HBase-ExecutorLostFailure容器因超出內存限制而被YARN殺死

[英]Yarn Spark HBase - ExecutorLostFailure Container killed by YARN for exceeding memory limits

因超過 Spark Scala 中的 memory 限制而被 YARN 殺死的容器

[英]Container killed by YARN for exceeding memory limits in Spark Scala

紗線容器失效的火花

[英]Spark on Yarn Container Failure

由於超過內存限制而被YARN殺死的容器

[英]Container killed by YARN for exceeding memory limits

由於超過內存限制而被YARN殺死的容器。使用52.6 GB的50 GB物理內存。考慮提升spark.yarn.executor.memoryOverhead

[英]Container killed by YARN for exceeding memory limits. 52.6 GB of 50 GB physical memory used. Consider boosting spark.yarn.executor.memoryOverhead

Spark YARN 配置問題：容器不斷失敗

[英]Spark YARN config issue : Container keep failing

如何解決火花上的紗線容器上漿問題？

[英]How to solve yarn container sizing issue on spark?

在 zeppein 停止后，由 Zeppelin 在 Yarn Cluster 模式下啟動的 Spark（Yarn）應用程序不會被終止

[英]Spark (Yarn) applications started by Zeppelin in Yarn Cluster Mode aren't killed after zeppein is stopped

Kubernetes 上的 Spark：執行程序 pod 被無聲地殺死

[英]Spark on Kubernetes: Executor pods silently get killed

在紗線上使用火花時火花執行器和紗線容器是什么關系

[英]what is the relationship between spark executor and yarn container when using spark on yarn

暫無

暫無

聲明:本站的技術帖子網頁，遵循CC BY-SA 4.0協議，如果您需要轉載，請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.

相關問題 Yarn Spark HBase-ExecutorLostFailure容器因超出內存限制而被YARN殺死因超過 Spark Scala 中的 memory 限制而被 YARN 殺死的容器紗線容器失效的火花由於超過內存限制而被YARN殺死的容器由於超過內存限制而被YARN殺死的容器。使用52.6 GB的50 GB物理內存。考慮提升spark.yarn.executor.memoryOverhead Spark YARN 配置問題：容器不斷失敗如何解決火花上的紗線容器上漿問題？在 zeppein 停止后，由 Zeppelin 在 Yarn Cluster 模式下啟動的 Spark（Yarn）應用程序不會被終止 Kubernetes 上的 Spark：執行程序 pod 被無聲地殺死在紗線上使用火花時火花執行器和紗線容器是什么關系

相關標簽

粵ICP備18138465號 © 2020-2024 STACKOOM.COM