创建一个神经网络来分类 mnist 数据集，而不使用 keras。错误：通过张量损失时需要磁带

Question

Here is the code.这是代码。 I have done the forward pass but I receive an error anytime I run it and I don't know what the problem is.我已经完成了前向传递，但每当我运行它时都会收到一个错误，我不知道问题是什么。 I first of all create a batch with the features and labels, do the forward pass and try to use keras SGD optimizer.我首先使用特征和标签创建一个批次，进行前向传递并尝试使用 keras SGD 优化器。

This is the error I get:这是我得到的错误：

And this is my code:这是我的代码：

import tensorflow as tf
from tensorflow.keras.utils import to_categorical
import numpy as np
def batches(batch_size, features, labels):
    """
    Create batches of features and labels
    :param batch_size: The batch size
    :param features: List of features
    :param labels: List of labels
    :return: Batches of (Features, Labels)
    """
    assert len(features) == len(labels)
    outout_batches = []
    
    sample_size = len(features)
    features = tf.Variable(features, dtype='float32')
    for start_i in range(0, sample_size, batch_size):
        end_i = start_i + batch_size
        batch = (features[start_i:end_i], labels[start_i:end_i])
        outout_batches.append(batch)
        
    return outout_batches


def get_logits(features, weights, biases):
# network's forward pass, multiply inputs with weight
    return tf.add(tf.matmul(features, weights), biases)


def get_cost(logits, labels):
# returns the cost of the pass
    return tf.reduce_mean(tf.nn.softmax_cross_entropy_with_logits(logits=logits, labels=labels))

def vectorize(features):
# reshapes the features to a vector for input
    return features.reshape(features.shape[0], features.shape[1] * features.shape[2])

(train_x, train_y), (test_x, test_y) = tf.keras.datasets.mnist.load_data()
train_x, test_x = train_x.astype('float32'), test_x.astype('float32')
train_x, test_y = train_x.astype('float32'), test_y.astype('float32')
train_y, test_y = to_categorical(train_y, 10), to_categorical(test_y, 10)
train_x = vectorize(train_x)

n_inputs = 28 * 28
n_classes = 10

weights = tf.Variable(tf.random.normal([n_inputs, n_classes]), dtype='float32', name='weights')
biases = tf.Variable(tf.random.normal([n_classes]), dtype='float32', name='biases')

batch_list = batches(32, train_x, train_y)
for x, y in batch_list:
    logits = get_logits(x, weights, biases)
    cost = tf.reduce_mean(tf.nn.softmax_cross_entropy_with_logits(logits=logits, labels=y))
    opt = tf.keras.optimizers.SGD(learning_rate=0.001)
    optimizer = opt.minimize(loss=cost)

Answer 1

This is because your loss is a Tensor.这是因为您的损失是张量。 In the optimizer.minimize() the loss argument can be a Tensor or callable.在optimizer.minimize()中，损失参数可以是张量或可调用的。 If a callable, loss should take no arguments and return the value to minimize.如果是可调用的，则损失不应采用 arguments 并返回值以最小化。 If loss is a Tensor, the tape argument must be passed .如果loss 是张量，则必须传递 tape 参数。

So the modified code could be like this:所以修改后的代码可能是这样的：

for x, y in batch_list:
  with tf.GradientTape() as tape:
    logits = get_logits(x, weights, biases)
    cost = tf.reduce_mean(tf.nn.softmax_cross_entropy_with_logits(logits=logits, labels=y))
    opt = tf.keras.optimizers.SGD(learning_rate=0.001)
  optimizer = opt.minimize(loss=cost, var_list=[weights,biases], tape=tape)

Answer 2

Hi i made a little change towards your code, not sure if it fits your situation but i would normally do it like this嗨，我对您的代码做了一些更改，不确定它是否适合您的情况，但我通常会这样做

optimizer = tf.keras.optimizers.SGD(learning_rate=0.001)

for x, y in batch_list:
    with tf.GradientTape() as tape:
        logits = get_logits(x, weights, biases)
        cost = tf.reduce_mean(tf.nn.softmax_cross_entropy_with_logits(logits=logits, labels=y))
        
    grads = tape.gradient(loss, [weights, biases])
    optimizer.apply_gradients(zip(grads, [weights, biases]))

创建一个神经网络来分类 mnist 数据集，而不使用 keras。错误：通过张量损失时需要磁带

问题描述

2 个解决方案

解决方案1
1 2021-06-13 15:28:29

解决方案2
1 2021-06-13 15:31:21

创建一个神经网络来分类 mnist 数据集，而不使用 keras。 错误：通过张量损失时需要磁带

问题描述

2 个解决方案

解决方案1 1 2021-06-13 15:28:29

解决方案2 1 2021-06-13 15:31:21

创建一个神经网络来分类 mnist 数据集，而不使用 keras。错误：通过张量损失时需要磁带

解决方案1
1 2021-06-13 15:28:29

解决方案2
1 2021-06-13 15:31:21