簡體   English   中英

RGB 自動編碼器輸出空白圖像

[英]RGB Autoencoder outputs blank images

我正在做一項大學作業,作業是制作一個降噪自動編碼器來編碼和解碼帶有噪聲的圖像。 我已經正確地實現了一切,但是當試圖預測它輸出白色圖像時。

圖層:

input_img = keras.Input(shape=(32, 32, 3))
x = layers.Conv2D(32, (3, 3), activation='relu', padding='same')(input_img)
x = layers.MaxPooling2D((2, 2), padding='same')(x)
x = layers.Conv2D(32, (3, 3), activation='relu', padding='same')(x)
x = layers.MaxPooling2D((2, 2), padding='same')(x)
x = layers.Conv2D(32, (3, 3), activation='relu', padding='same')(x)
x = layers.Conv2D(32, (3, 3), activation='relu', padding='same')(x)
encoded = layers.MaxPooling2D((2, 2), padding='same')(x)
encoded = x
x = layers.Conv2DTranspose(32, (3, 3), strides=(2, 2), activation='relu', padding='same')(encoded)
x = layers.Conv2DTranspose(32, (3, 3), strides=(2, 2), activation='relu', padding='same')(x)
decoded = layers.Conv2D(3, (3, 3), activation='relu', padding='same')(x)
autoencoder = keras.Model(input_img, decoded)
s = 20 * 5000
lr_schedule = tf.keras.optimizers.schedules.ExponentialDecay(0.01, s, 0.1)
autoencoder.compile(optimizer=tf.keras.optimizers.SGD(learning_rate=lr_schedule), loss='mean_squared_error',metrics=['accuracy'])
autoencoder.summary()

加載數據:

(x_train, _), (x_test, _) = cifar100.load_data()

x_train = x_train.astype('float32') / 255.
x_test = x_test.astype('float32') / 255.
x_train = np.reshape(x_train, (len(x_train), 32, 32, 3))
x_test = np.reshape(x_test, (len(x_test), 32, 32, 3))
x_train = x_train[:5000]
x_test = x_test[6000:7000]

noise_factor = 0.2
x_train_noisy = x_train + noise_factor * np.random.normal(loc=0.0, scale=1.0, size=x_train.shape)
x_test_noisy = x_test + noise_factor * np.random.normal(loc=0.0, scale=1.0, size=x_test.shape)
x_train_noisy = np.clip(x_train_noisy, 0., 1.)
x_test_noisy = np.clip(x_test_noisy, 0., 1.)

適合並預測:

autoencoder.fit(x_train_noisy, x_train,
                epochs=50,
                batch_size=128,
                shuffle=True)
decoded_imgs = autoencoder.predict(x_test_noisy)

for i in range(5):
    # Display original
    ax = plt.subplot(2, 5, i + 1)
    plt.imshow(x_test_noisy[i])
    plt.gray()
    ax.get_xaxis().set_visible(False)
    ax.get_yaxis().set_visible(False)
    # Display reconstruction
    ax = plt.subplot(2, 5, i + 1 + 5)
    plt.imshow(decoded_imgs[i])
    plt.gray()
    ax.get_xaxis().set_visible(False)
    ax.get_yaxis().set_visible(False)
plt.show()

輸出:

在此處輸入圖像描述

生成的輸出可能是由於最大池化,它通過減少前一個卷積層輸出中的像素數量來降低圖像的維度。請嘗試像下面這樣更改模型。

input_img = keras.Input(shape=(32, 32, 3))
x = layers.Conv2D(16, (3, 3), activation='relu', padding='same')(input_img)
encoded = layers.Conv2D(16, (3, 3), activation='relu', padding='same')(x)

x = layers.Conv2DTranspose(16, (3, 3), strides=(2, 2), activation='relu', padding='same')(encoded)
x = layers.Conv2DTranspose(16, (3, 3), strides=(2, 2), activation='relu', padding='same')(encoded)
x = layers.MaxPooling2D((2, 2), padding='same')(x)
decoded = layers.Conv2D(3, (3, 3), activation='relu', padding='same')(x)

請在此處找到完整的代碼。 謝謝!

暫無
暫無

聲明:本站的技術帖子網頁,遵循CC BY-SA 4.0協議,如果您需要轉載,請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.

 
粵ICP備18138465號  © 2020-2024 STACKOOM.COM