簡體   English   中英

使用 tensorflow 2 進行模型子類化中的 ValueError

[英]ValueError in model subclassing with tensorflow 2

我正在嘗試使用 keras 中的模型子類來實現 WideResnet。 我無法理解我的代碼有什么問題:

class ResidualBlock(layers.Layer):
  def __init__(self, filters, kernel_size, dropout, dropout_percentage, strides=1, **kwargs):
    super(ResidualBlock, self).__init__(**kwargs)
          
    self.conv_1 = layers.Conv2D(filters, (1, 1), strides=strides)
    self.bn_1 = layers.BatchNormalization()
    self.rel_1 = layers.ReLU()
    self.conv_2 = layers.Conv2D(filters, kernel_size, padding="same", strides=strides)
    self.dropout = layers.Dropout(dropout_percentage)
    self.bn_2 = layers.BatchNormalization()
    self.rel_2 = layers.ReLU()
    self.conv_3 = layers.Conv2D(filters, kernel_size, padding="same")
    
    self.add = layers.Add()
    self.dropout = dropout
    self.strides = strides

  def call(self, inputs):
    x = inputs

    if self.strides > 1:
      x = self.conv_1(x)
    res_x = self.bn_1(x)
    res_x = self.rel_1(x)
    res_x = self.conv_2(x)
    if self.dropout:
      res_x = self.dropout(x)
    res_x = self.bn_2(x)
    res_x = self.rel_2(x)
    res_x = self.conv_3(x)
    inputs = self.add([x, res_x])
    return inputs

class WideResidualNetwork(models.Model):
  def __init__(self, input_shape, n_classes, d, k, kernel_size=(3, 3), dropout=False, dropout_percentage=0.3, strides=1, **kwargs):
    
    super(WideResidualNetwork, self).__init__(**kwargs)

    if (d-4)%6 != 0:
      raise ValueError('Please choose a correct depth!')

    self.rel_1 = layers.ReLU()
    self.conv_1 = layers.Conv2D(16, (3, 3), padding='same')
    self.conv_2 = layers.Conv2D(16*k, (1, 1))
    self.dense = layers.Dense(n_classes)

    self.dropout = dropout
    self.dropout_percentage = dropout_percentage
    self.N = int((d - 4) / 6)
    self.k = k
    self.d = d
    self.kernel_size = kernel_size

  def build(self, input_shape):
    self.bn_1 = layers.BatchNormalization(input_shape=input_shape)

  def call(self, inputs):
    x = self.bn_1(inputs)
    x = self.rel_1(x)
    x = self.conv_1(x)
    x = self.conv_2(x)

    for _ in range(self.N):
      x = ResidualBlock(16*self.k, self.kernel_size, self.dropout, self.dropout_percentage)(x)
    
    x = ResidualBlock( 32*self.k, self.kernel_size, self.dropout, self.dropout_percentage, strides=2)(x)

    for _ in range(self.N-1):
      x = ResidualBlock( 32*self.k, self.kernel_size, self.dropout, self.dropout_percentage)(x)

    x = ResidualBlock( 64*self.k, self.kernel_size, self.dropout, self.dropout_percentage, strides=2)(x)
    
    for _ in range(self.N-1):
      x = ResidualBlock( 64*self.k, self.kernel_size, self.dropout, self.dropout_percentage)(x)
    
    x = layers.GlobalAveragePooling2D()(x)
    x = self.dense(x)
    x = layers.Activation("softmax")(x)

    return x

當我嘗試以這種方式擬合模型時:

(x_train, y_train), (x_test, y_test) = tf.keras.datasets.cifar10.load_data()
model = WideResidualNetwork(x_train[0].shape, 10, 28, 1)
x_train, x_test = x_train/255. , x_test/255.
model = WideResidualNetwork(x_train[0].shape, 10, 28, 1)
model.compile(optimizer='adam', loss='sparse_categorical_crossentropy', metrics=['accuracy'])

epochs = 40
batch_size = 64
validation_split = 0.2
h = model.fit(x_train, y_train, epochs=epochs, batch_size=batch_size, validation_split=validation_split)

我收到以下錯誤:

...
 <ipython-input-26-61c1bdb3546c>:31 call  *
        x = ResidualBlock(16*self.k, self.kernel_size, self.dropout, self.dropout_percentage)(x)
    <ipython-input-9-3fea1e77cb6e>:23 call  *
        res_x = self.bn_1(x)
...
ValueError: tf.function-decorated function tried to create variables on non-first call.

所以我不明白問題出在哪里,我也嘗試將初始化移動到構建中,但沒有結果,錯誤仍然存​​在。 可能我的知識有一些空白在此先謝謝您

您正在將 ResidualBlocks、GlobalAveragePooling2D 和 Activation 層初始化到 call 方法中。 嘗試將它們移動到 init 中,就像您對其他層所做的那樣,它不應該給您那個錯誤。

暫無
暫無

聲明:本站的技術帖子網頁,遵循CC BY-SA 4.0協議,如果您需要轉載,請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.

 
粵ICP備18138465號  © 2020-2024 STACKOOM.COM