
How is logistic regression parallelized in Spark?

I'd like to understand how logistic regression in the ML library is parallelized. I've tried reading the source code, but I don't understand the process.

Spark uses so-called mini-batch gradient descent to optimize logistic regression:

http://ruder.io/optimizing-gradient-descent/index.html#minibatchgradientdescent

In a nutshell, it works like this:

  1. Select a sample of the data
  2. Compute the gradient on each row of the sample
  3. Aggregate the gradients and update the model weights
  4. Back to step 1
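The loop above can be sketched in plain Python (no Spark) to make the mechanics concrete. This is a minimal illustration of mini-batch gradient descent for logistic regression, not Spark's actual implementation; in Spark, step 2 runs in parallel across partitions and step 3 is a distributed aggregation. All names and parameters here are illustrative.

```python
import math
import random

def minibatch_sgd_logistic(data, labels, lr=0.1, batch_frac=0.5,
                           iters=200, seed=0):
    """Sketch of the mini-batch loop described above (single machine)."""
    rng = random.Random(seed)
    n = len(data)
    n_features = len(data[0])
    w = [0.0] * n_features
    batch_size = max(1, int(batch_frac * n))
    for _ in range(iters):
        # Step 1: select a sample of the data
        batch = rng.sample(range(n), batch_size)
        # Step 2: compute the gradient on each row of the sample
        # Step 3: aggregate the gradients (here: a simple sum)
        grad = [0.0] * n_features
        for i in batch:
            z = sum(wj * xj for wj, xj in zip(w, data[i]))
            p = 1.0 / (1.0 + math.exp(-z))       # sigmoid
            for j in range(n_features):
                grad[j] += (p - labels[i]) * data[i][j]
        # Update the weights with the averaged mini-batch gradient
        for j in range(n_features):
            w[j] -= lr * grad[j] / batch_size
        # Step 4: back to step 1
    return w

def predict_proba(w, x):
    z = sum(wj * xj for wj, xj in zip(w, x))
    return 1.0 / (1.0 + math.exp(-z))
```

In Spark the per-row gradient computations are embarrassingly parallel, which is why only the aggregation in step 3 requires communication between workers.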

The actual optimisation loop in Spark starts at this line: https://github.com/apache/spark/blob/master/mllib/src/main/scala/org/apache/spark/mllib/optimization/GradientDescent.scala#L234
