简体   繁体   English

在Perceptron Learning Model的Python实现中将数组传递给numpy.dot()

[英]Passing an array to numpy.dot() in Python implementation of Perceptron Learning Model

I'm trying to put together a Python implementation of a single-layer Perceptron classifier. 我正在尝试将单层Perceptron分类器的Python实现放在一起。 I've found the example in Sebastian Raschka's book 'Python Machine Learning' very useful, but I have a question about one small part of his implementation. 我发现Sebastian Raschka的书“Python机器学习”中的例子非常有用,但我对他实现的一小部分有疑问。 This is the code: 这是代码:

import numpy as np    
class Perceptron(object):
    """Perceptron classifier.

    Parameters
    ------------
    eta : float
        Learning rate (between 0.0 and 1.0)
    n_iter : int
        Passes over the training dataset.

    Attributes
    -----------
    w_ : 1d-array
        Weights after fitting.
    errors_ : list
        Number of misclassifications in every epoch.

    """
    def __init__(self, eta=0.01, n_iter=10):
        self.eta = eta
        self.n_iter = n_iter

    def fit(self, X, y):
        """Fit training data.

        Parameters
        ----------
        X : {array-like}, shape = [n_samples, n_features]
            Training vectors, where n_samples 
            is the number of samples and
            n_features is the number of features.
        y : array-like, shape = [n_samples]
            Target values.

        Returns
        -------
        self : object

        """
        self.w_ = np.zeros(1 + X.shape[1])
        self.errors_ = []

        for _ in range(self.n_iter):
            errors = 0
            for xi, target in zip(X, y):
                update = self.eta * (target - self.predict(xi))
                self.w_[1:] += update * xi
                self.w_[0] += update
                errors += int(update != 0.0)
            self.errors_.append(errors)
        return self

    def net_input(self, X):
        """Calculate net input"""
        return np.dot(X, self.w_[1:]) + self.w_[0]

    def predict(self, X):
        """Return class label after unit step"""
        return np.where(self.net_input(X) >= 0.0, 1, -1)

The part I can't get my head around is why we define net_input() and predict() to take an array X rather than just a vector. 我无法net_input()的部分是为什么我们定义net_input()predict()来获取数组X而不仅仅是一个向量。 Everything works out, since we're only passing the vector xi to predict() in the fit() function (and so therefore also only passing a vector to net_input() ), but what is the logic behind defining the functions to take an array? 一切都工作正常,因为我们只是在fit()函数中传递向量xipredict() (因此也只是将向量传递给net_input() ),但是定义函数的逻辑是什么呢?阵列? If I understand the model correctly, we are only taking one sample at a time, calculating the dot product of the weights vector and the feature vector associated with the sample, and we never need to pass an entire array to net_input() or predict() . 如果我正确理解模型,我们一次只取一个样本,计算权重向量和与样本相​​关的特征向量的点积,我们永远不需要将整个数组传递给net_input()predict()

Your concern seems to be why is X in net_input and predict defined as an array not a vector (I'm assuming your definitions are what i mentioned in the comment above--really though i would say that there is no distinction in this context)... What gives you the impression that X is an 'array' as opposed to a 'vector'? 您的担忧似乎是为什么net_input中的X和预测被定义为数组而不是向量(我假设您的定义是我在上面的注释中提到的 - 实际上我会说在这种情况下没有区别) ......是什么让你觉得X是'数组'而不是'向量'?

The typing here is determined by what you pass the function, so if you pass it a vector, X is a vector (python uses what's called duck typing). 这里的输入是由你传递函数决定的,所以如果你传递一个向量,X就是一个向量(python使用所谓的duck typing)。 So to answer the question, 'why are net_input and predict defined to take an array as opposed to a vector?'... They're not, they are simply defined to take parameter X, which is whatever type you pass it... 所以回答这个问题,'为什么net_input和predict被定义为采用数组而不是向量?'......它们不是,它们只是被定义为采用参数X,这是你传递它的任何类型。 。

Maybe you are confused by his reuse of the variable name X as a 2d array of training data in the context of fit but as a single sample in the other functions... They may share a name but they are distinct from eachother, being in different scopes. 也许你很困惑他将变量名称X重新用作拟合上下文中的2d训练数据数组,但作为其他函数中的单个样本......他们可能共享一个名称,但它们彼此不同,在于不同的范围。

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM