CNN model fails to train

Question

I'm trying to train a CNN model to give a prediction of the format

array([ 0., 0., 0., 0., -1.], dtype=float32).

My training data looks like this:

0        [[-1.0, -1.0, -1.0, -1.0, -1.0], [0.0, 0.0, 0....
1        [[0.0, 0.0, 0.0, 0.0, 0.0], [1.0, 0.0, 1.0, 0....
2        [[0.0, 0.0, 0.0, -1.0, -1.0], [0.0, 0.0, 0.0, ...
3        [[-1.0, -1.0, -1.0, -1.0, -1.0], [-1.0, -1.0, ...
4        [[-1.0, -1.0, -1.0, -1.0, -1.0], [0.0, 0.0, 0....
                               ...                        
15484    [[-1.0, -1.0, -1.0, -1.0, -1.0], [0.0, 2.0, 1....
15485    [[-1.0, -1.0, -1.0, -1.0, -1.0], [-1.0, -1.0, ...
15486    [[-1.0, -1.0, -1.0, -1.0, -1.0], [0.0, 2.0, 0....
15487    [[1.0, 0.0, 0.0, 0.0, 0.0], [1.0, 0.0, 0.0, 0....
15488    [[-1.0, -1.0, -1.0, -1.0, -1.0], [-1.0, -1.0, ...

Each row has shape (24, 5) and looks like this:

array([[-1., -1., -1., -1., -1.],
       [ 0.,  0.,  0.,  0.,  0.],
       [ 0.,  3.,  0.,  0.,  1.],
       [ 1.,  0.,  0.,  0.,  0.],
       [ 0.,  0.,  0.,  0., -1.],
       [ 0.,  1.,  2.,  0.,  0.],
       [ 0.,  3.,  0.,  0., -1.],
       [ 0.,  1.,  0.,  0., -1.],
       [ 0.,  1.,  0.,  0., -1.],
       [ 1.,  0.,  0.,  0.,  0.],
       [ 1.,  0.,  0.,  0., -1.],
       [ 1.,  0.,  0.,  0.,  0.],
       [ 0.,  1.,  0.,  0., -1.],
       [ 0.,  0.,  0.,  0.,  0.],
       [ 0.,  1.,  0.,  0., -1.],
       [ 0.,  0.,  0.,  0., -1.],
       [ 0.,  1.,  0.,  0., -1.],
       [ 0.,  1.,  0.,  0.,  0.],
       [ 0.,  1.,  0.,  0., -1.],
       [ 0.,  0.,  0.,  0., -1.],
       [ 0.,  1.,  0.,  0., -1.],
       [ 0.,  0.,  0.,  0.,  0.],
       [ 0.,  0.,  0.,  0., -1.],
       [ 1.,  2.,  0.,  0., -1.]], dtype=float32)

The model I'm using looks like this:

import tensorflow as tf

model = tf.keras.Sequential(layers=[
    # First convolutional block (32 filters)
    tf.keras.layers.Conv2D(32, kernel_size=3, activation=tf.nn.relu, input_shape=(28, 28, 1)),
    tf.keras.layers.BatchNormalization(),
    tf.keras.layers.Conv2D(32, kernel_size=3, activation=tf.nn.relu),
    tf.keras.layers.BatchNormalization(),
    tf.keras.layers.Conv2D(32, kernel_size=5, strides=2, padding='same', activation=tf.nn.relu),
    tf.keras.layers.BatchNormalization(),

    # Second convolutional block (64 filters)
    tf.keras.layers.Conv2D(64, kernel_size=3, activation=tf.nn.relu),
    tf.keras.layers.BatchNormalization(),
    tf.keras.layers.Conv2D(64, kernel_size=3, activation=tf.nn.relu),
    tf.keras.layers.BatchNormalization(),
    tf.keras.layers.Conv2D(64, kernel_size=3, strides=2, padding='same', activation=tf.nn.relu),
    tf.keras.layers.BatchNormalization(),

    # Classifier head
    tf.keras.layers.Flatten(),
    tf.keras.layers.Dense(128, activation=tf.nn.relu),
    tf.keras.layers.BatchNormalization(),
    tf.keras.layers.Dense(1, activation=tf.nn.softmax)])

model.compile(optimizer=tf.keras.optimizers.Adam(0.1),
              loss=tf.keras.losses.CategoricalCrossentropy(),
              metrics=tf.keras.metrics.Accuracy())

I'm new to the field and am currently getting the following error:

ValueError: Failed to convert a NumPy array to a Tensor (Unsupported object type numpy.ndarray).

Can someone tell me what I'm doing wrong here? I called tf.convert_to_tensor() on the training data but I'm still getting the same error.

Adding .astype("float32") doesn't seem to work either.

Answer 1

The tf.convert_to_tensor() function should actually have done it. Your toy-example array, converted like this: a_tensor = tf.convert_to_tensor(a, dtype=tf.int32), would be a valid input for the CNN, as would an array of n such arrays (then with shape=(n, 24, 5)). You could post a minimal working example of your code so we can check the syntax; for now it is hard to tell what went wrong.
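One possible cause, offered as an assumption since the loading code is not shown: if the training data is a pandas Series or an object-dtype NumPy array whose elements are separate (24, 5) arrays, TensorFlow sees an array of Python objects rather than one numeric block, which matches the "Unsupported object type numpy.ndarray" message. A minimal sketch of stacking such rows into a single float32 array follows; the names `nested` and `stack_rows` are hypothetical, and the zero-filled rows only stand in for the real data.

```python
import numpy as np

def stack_rows(nested):
    """Turn a 1-D collection of (24, 5) arrays into one (n, 24, 5, 1) float32 array."""
    # np.stack joins the per-row arrays along a new leading axis.
    stacked = np.stack(list(nested)).astype(np.float32)
    # Conv2D expects a trailing channel axis, so add one of size 1.
    return stacked[..., np.newaxis]

# Hypothetical stand-in for the training column: an object array in which
# every element is its own (24, 5) float32 array.
rows = [np.zeros((24, 5), dtype=np.float32) for _ in range(4)]
nested = np.empty(len(rows), dtype=object)
for i, row in enumerate(rows):
    nested[i] = row

print(nested.dtype, nested.shape)  # object (4,)

x = stack_rows(nested)
print(x.dtype, x.shape)  # float32 (4, 24, 5, 1)
```

If this is the situation, it would also explain why .astype("float32") did not help: an object array of arrays cannot be cast element by element to a float. The stacked array can then be passed to tf.convert_to_tensor() or directly to model.fit().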

Apart from that, two small observations: the input shape in your CNN does not yet match the shape of your data, and the last dense softmax layer does not yet produce your proposed prediction format.
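To illustrate the input-shape observation, here is a small sketch in plain Python that traces the spatial size through the six Conv2D layers of the question using the usual Keras output-size rules ("valid": floor((size - kernel) / stride) + 1, "same": ceil(size / stride)). The helper names `conv_out` and `trace` are hypothetical. It suggests that the layer stack fits a 28 x 28 input, while a 24 x 5 input runs out of width at the fourth convolution.

```python
import math

def conv_out(size, kernel, stride=1, padding="valid"):
    """Output length of one spatial dimension after a convolution."""
    if padding == "same":
        return math.ceil(size / stride)
    return (size - kernel) // stride + 1  # "valid" padding

# (kernel_size, strides, padding) of the six Conv2D layers in the question
LAYERS = [(3, 1, "valid"), (3, 1, "valid"), (5, 2, "same"),
          (3, 1, "valid"), (3, 1, "valid"), (3, 2, "same")]

def trace(height, width):
    """Return the (height, width) after each layer, stopping once a size is < 1."""
    shapes = []
    for kernel, stride, padding in LAYERS:
        height = conv_out(height, kernel, stride, padding)
        width = conv_out(width, kernel, stride, padding)
        shapes.append((height, width))
        if height < 1 or width < 1:
            break  # a real Conv2D layer would raise an error here
    return shapes

print(trace(28, 28))  # [(26, 26), (24, 24), (12, 12), (10, 10), (8, 8), (4, 4)]
print(trace(24, 5))   # [(22, 3), (20, 1), (10, 1), (8, -1)]
```

Under these assumptions, one option would be input_shape=(24, 5, 1) with padding='same' on every convolution so the width of 5 is preserved. For the output, a final layer with 5 units would match the 5-value prediction format; a softmax over a single unit always returns 1.0, so Dense(1, activation=tf.nn.softmax) cannot learn anything.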

Solution 1
0 2022-07-17 18:38:27
