這個函數 computeSVD 在 Pyspark 中使用 MapReduce

Question

computeSVD()是否使用 map ， reduce 因為它是預定義的函數？

我不知道函數的代碼。

from pyspark.mllib.linalg import Vectors
from pyspark.mllib.linalg.distributed import RowMatrix

rows = sc.parallelize([
    Vectors.sparse(5, {1: 1.0, 3: 7.0}),
    Vectors.dense(2.0, 0.0, 3.0, 4.0, 5.0),
    Vectors.dense(4.0, 0.0, 0.0, 6.0, 7.0)
])

mat = RowMatrix(rows)

# Compute the top 5 singular values and corresponding singular vectors.
svd = mat.computeSVD(5, computeU=True)   <------------- this function
U = svd.U       # The U factor is a RowMatrix.
s = svd.s       # The singular values are stored in a local dense vector.
V = svd.V       # The V factor is a local dense matrix.

Answer 1

確實如此，來自Spark 文檔

此頁面記錄了基於 RDD 的 API（spark.mllib 包）的 MLlib 指南部分。 請參閱基於 DataFrame 的 API（spark.ml 包）的 MLlib 主指南，它現在是 MLlib 的主要 API。

如果你想查看代碼庫，這里是https://github.com/apache/spark/blob/master/mllib/src/main/scala/org/apache/spark/mllib/linalg/distributed/RowMatrix。斯卡拉#L328

這個函數 computeSVD 在 Pyspark 中使用 MapReduce

問題描述

1 個解決方案

解決方案1
1 已采納 2022-04-22 18:47:25

這個函數 computeSVD 在 Pyspark 中使用 MapReduce

問題描述

1 個解決方案

解決方案1 1 已采納 2022-04-22 18:47:25

解決方案1
1 已采納 2022-04-22 18:47:25