Matlab中的梯度下降

Question

我正在學習機器學習課程。 機器學習對我來說是一個很好的領域。 在第一次編程練習中，我在梯度體面算法中遇到了一些困難。 如果有人可以幫助我，我將不勝感激。

這是更新thetas的說明;

“你將在文件gradientDescent.m中實現梯度下降。循環結構已經為你編寫，你只需要在每次迭代中提供θ的更新。

    function [theta, J_history] = gradientDescent(X, y, theta, alpha, num_iters)
    %GRADIENTDESCENT Performs gradient descent to learn theta
    %   theta = GRADIENTDESENT(X, y, theta, alpha, num_iters) updates theta by 
    %   taking num_iters gradient steps with learning rate alpha

   % Initialize some useful values
   m = length(y); % number of training examples
   J_history = zeros(num_iters, 1);

   for iter = 1:num_iters

% ====================== YOUR CODE HERE ======================
% Instructions: Perform a single gradient step on the parameter vector
%               theta. 
%
% Hint: While debugging, it can be useful to print out the values
%       of the cost function (computeCost) and gradient here.
%
    % ============================================================

% Save the cost J in every iteration    
J_history(iter) = computeCost(X, y, theta);

end

end

所以我這樣做是為了同時更新這些內容;

    temp0 = theta(1,1) - (alpha/m)*sum((X*theta-y));
    temp1 = theta(2,1) - (alpha/m)*sum((X*theta-y).*X);
    theta(1,1) = temp0;
    theta(2,1) = temp1;

我運行此代碼時收到錯誤。 有人可以幫我嗎？

Answer 1

我已經解釋了為什么你可以使用矢量化形式：

theta = theta - (alpha/m) * (X' * (X * theta - y)); 或等同物

theta = theta - (alpha/m) * ((X * theta - y)' * X)';

在這個答案。

在下面引用它：

梯度下降算法矩陣版本的說明：

這是用於微調θ值的梯度下降算法：

假設給出以下X，y和θ值：

m =訓練樣本的數量
n =要素數+ 1

這里

m = 5（訓練樣例）
n = 4（特征+ 1）
X = mxn矩陣
y = mx 1向量矩陣
θ= nx 1向量矩陣
x ⁱ是第i ^個訓練示例
x _j是給定訓練示例中的^第 j ^個特征

進一步，

h(x) = ([X] * [θ]) （mx 1我們訓練集的預測值矩陣）
h(x)-y = ([X] * [θ] - [y]) （在我們的預測中mx 1個錯誤矩陣）

機器學習的整個目標是最小化預測中的錯誤。 基於上述推論，我們的誤差矩陣是mx 1向量矩陣，如下所示：

為了計算_θj的新的價值，我們必須得到乘以^第 j ^個訓練集X.這是特征值的所有錯誤（m行）的總和 ，把所有的值E，分別與^第 j ^個功能將它們相乘相應的訓練示例，並將它們全部加在一起。 這將幫助我們在得到_θj的新的（希望更好）值。 對所有j或特征數重復此過程。 在矩陣形式中，這可以寫成：

這可以簡化為：

[E]' x [X]將給出一個行向量矩陣，因為E'是1 xm矩陣，X是mxn矩陣。 但我們對獲得列矩陣很感興趣，因此我們將轉換結果矩陣。

更簡潔，它可以寫成：

同樣的結果也可以寫成：

Answer 2

theta = theta - (alpha/m) * (X' * (X * theta - y));

這是正確的答案

Answer 3

temp0 = theta(1,1) - (alpha/m)*sum((X*theta-y));
temp1 = theta(2,1) - (alpha/m)*sum((X*theta-y).*X(:,2));
theta(1,1) = temp0;
theta(2,1) = temp1;

或者您可以使用以下代碼。 它更簡單。 只有兩個參數theta1和theta2。 但如果存在更多參數，那就更好了。

for i=1:2
    theta(i) = theta(i) - (alpha/m)*sum((X*theta-y).*X(:,i));
end

Answer 4

你得到的Error using .* Matrix dimensions must agree. Error in gradientDescent (line 20) temp1 = theta(2,1) - (alpha/m)*sum((X*theta-y).*X); Error using .* Matrix dimensions must agree. Error in gradientDescent (line 20) temp1 = theta(2,1) - (alpha/m)*sum((X*theta-y).*X); 表示.*不起作用。 因此，在該行之前，添加以下代碼：

size(X*theta-y)
size(X)

如果你想做(X*theta-y).*X ，那么X*theta-y和X應該是相同的大小。 如果不是，則需要檢查算法。

Answer 5

這個問題有一點需要注意：

X = [ones(m, 1), data(:,1)];

所以

theta = theta - (alpha / m) * (X' * (X * theta - y));

和

temp0 = theta(1, 1) - (alpha / m) * sum((X * theta - y));
temp1 = theta(2, 1) - (alpha / m) * sum((X * theta - y) .* X(:, 2));
theta(1, 1) = temp0;
theta(2, 1) = temp1;

兩者都是對的

Matlab中的梯度下降

問題描述

5 個解決方案

解決方案1
17 2017-02-19 18:33:05

在下面引用它：

解決方案2
5 2014-10-06 12:44:22

解決方案3
5 2015-01-28 14:52:20

解決方案4
3 已采納 2014-06-01 22:35:11

解決方案5
1 2017-03-09 07:28:02

Matlab中的梯度下降

問題描述

5 個解決方案

解決方案1 17 2017-02-19 18:33:05

在下面引用它：

解決方案2 5 2014-10-06 12:44:22

解決方案3 5 2015-01-28 14:52:20

解決方案4 3 已采納 2014-06-01 22:35:11

解決方案5 1 2017-03-09 07:28:02

解決方案1
17 2017-02-19 18:33:05

解決方案2
5 2014-10-06 12:44:22

解決方案3
5 2015-01-28 14:52:20

解決方案4
3 已采納 2014-06-01 22:35:11

解決方案5
1 2017-03-09 07:28:02