簡體 English 中英

火花執行者失敗了

[英]spark executor lost failure

原文 2015-04-10 16:38:24 0 1 scala/ apache-spark/ out-of-memory/ executor

我正在使用databricks spark cluster（AWS），並測試我的scala實驗。 使用LogisticRegressionWithLBFGS算法訓練10 GB數據時遇到了一些問題。 我遇到問題的代碼塊如下：

import org.apache.spark.mllib.classification.LogisticRegressionWithLBFGS
val algorithm = new LogisticRegressionWithLBFGS()
algorithm.run(training_set)

首先，我有很多執行程序丟失失敗和java內存問題，然后我用更多分區重新分區我的training_set並且內存不足問題已經消失，但仍然得到執行程序丟失失敗。

我的群集共有72個核心和500GB內存。 任何人都能對此有所了解嗎？

1 個解決方案

LBFGS使用密集向量在內部存儲beta（特征權重），一切都在內存中。 因此，無論訓練集中的特征稀疏，特征的總數都是值得注意的。

因此，要解決此問題，用戶應增加執行程序內存或限制訓練集中的功能總數。

Spark：Executor丟失失敗（添加groupBy作業后）

[英]Spark: Executor Lost Failure (After adding groupBy job)

Spark分區執行時間不均勻，執行者頻繁失敗

[英]Spark partitions taking uneven time to execute with frequent executor lost failure

丟失的執行器嘗試在Yarn / hdfs集群中使用Spark / GraphX加載圖

[英]Lost Executor trying to load Graph using Spark/GraphX in Yarn/hdfs Cluster

由於某些未知原因，由於執行器丟失，Spark Job在saveAsHadoopDataset階段失敗

[英]Spark Job fails at saveAsHadoopDataset stage due to Lost Executor due to some unknown reason

Spark 中是否有 Executor Startup 的鈎子？

[英]Is there a hook for Executor Startup in Spark?

Spark執行程序上的並發任務

[英]Concurrent tasks on a Spark executor

Spark應用程序僅使用1個執行程序

[英]Spark application uses only 1 executor

Spark局部變量廣播到執行器

[英]Spark local variable broadcast to executor

簡單的sparksql連接查詢丟失執行程序

[英]Lost executor on simple sparksql join query

Spark-階段中丟失任務

[英]Spark - Lost task in stage

暫無

暫無

聲明:本站的技術帖子網頁，遵循CC BY-SA 4.0協議，如果您需要轉載，請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.

相關問題 Spark：Executor丟失失敗（添加groupBy作業后） Spark分區執行時間不均勻，執行者頻繁失敗丟失的執行器嘗試在Yarn / hdfs集群中使用Spark / GraphX加載圖由於某些未知原因，由於執行器丟失，Spark Job在saveAsHadoopDataset階段失敗 Spark 中是否有 Executor Startup 的鈎子？ Spark執行程序上的並發任務 Spark應用程序僅使用1個執行程序 Spark局部變量廣播到執行器簡單的sparksql連接查詢丟失執行程序 Spark-階段中丟失任務

相關標簽

粵ICP備18138465號 © 2020-2024 STACKOOM.COM