keras中的自定義丟失function？熵。數學/實施問題

Question

我正在使用一篇論文，它實現了一個具有自定義損失 function 的自動編碼器來處理振動信號。

我無法在 keras 上實現它。 他們將“最大相關熵”實現為損失 function 以避免信號背景噪聲問題。

這是描述：

Gaussian kernel 是相關熵中最流行的 Mercer kernel，定義為

其中 r 是 kernel 大小。 然后，可以通過最大化以下 function 來設計新的自動編碼器損失 function：

由於我從未實現自定義損失 function 我在 python 中的數學存在問題。 kernel 用於我需要實現的損失 function。 這就是我所擁有的：

file = np.load('./data/CWRU_48k_load_1_CNN_data.npz')  # Numpy Array 

data = file['data'].reshape(len(file['data']), 1024)
labels = file['labels']
category_labels = np.unique(labels)
labels = pd.Categorical(labels, categories = category_labels).codes

train_data, test_data, train_labels, test_labels = train_test_split(data, labels, test_size = int(data.shape[0]*0.2), random_state = 100, stratify = labels)

train_labels = to_categorical(train_labels)
test_labels = to_categorical(test_labels)

# Data shape. Sample Len: 1024. Outputs/Classifications: 10
print(train_data.shape, train_labels.shape, test_data.shape, test_labels.shape)
#(3680, 1024) (3680, 10) (920, 1024) (920, 10)

act_func = 'relu'
out_func = 'softmax'
k_inic = 'glorot_uniform'  

def create_model(shape=[512, 100], loss_func='mse'):
    model = Sequential()

    for shape_size in shape:
        model.add(Dense(shape_size, activation=act_func, kernel_initializer=k_inic))

    model.add(Dense(10, activation=out_func, kernel_initializer=k_inic))
    model.compile(loss=loss_func, optimizer=keras.optimizers.Adam(), metrics=["accuracy"])
    model.build(input_shape=(None, 1024))

    return model

BATCH_SIZE = 45
EPOCHS = 200
VALIDATION_SPLIT = 0.05

# Design Mercer Kernel
def kernel(x, sigma=1):
    return (1/(K.sqrt(2*np.pi)*sigma))*K.exp((-(x*x)/(2*sigma*sigma)))

# Use Mercer Kernel on Maximum Correntropy for loss function
def correntropy(y_true, y_pred):
    sum_score = 0.0
    for i in range(len(y_true)):
        sum_score = kernel(y_true[i] - y_pred[i])
    sum_score = sum_score/len(y_true)
    return -sum_score

# Create AutoEncoder model with my custom loss function
model = create_model(shape=[512, 100], loss_func=correntropy)
history = model.fit(train_data, train_labels, epochs = EPOCHS, batch_size = BATCH_SIZE, validation_data=(test_data, test_labels), 
                        callbacks = callbacks.callbacks, verbose = 0)

res = model.evaluate(test_data, test_labels, batch_size = BATCH_SIZE, verbose = 0)[1]

但我有這個錯誤：

AttributeError: in user code:

    /home/user/.local/lib/python3.8/site-packages/keras/engine/training.py:853 train_function  *
        return step_function(self, iterator)
    /tmp/ipykernel_95935/2003563015.py:26 correntropy  *
        sum_score = kernel(y_true[i] - y_pred[i])
    /tmp/ipykernel_95935/2239884018.py:20 kernel  *
        return (1/(K.sqrt(2*np.pi)*sigma))*K.exp((-(x*x)/(2*sigma*sigma)))
    /home/user/.local/lib/python3.8/site-packages/tensorflow/python/util/dispatch.py:206 wrapper  **
        return target(*args, **kwargs)
    /home/user/.local/lib/python3.8/site-packages/keras/backend.py:2539 sqrt
        zero = _constant_to_tensor(0., x.dtype.base_dtype)

    AttributeError: 'float' object has no attribute 'dtype'

錯誤似乎出現在kernel上，但我該如何修復以使用張量？

print(y_true)
print(y_pred)
>> Tensor("IteratorGetNext:1", shape=(None, 10), dtype=float32)
>> Tensor("sequential_161/dense_491/Softmax:0", shape=(None, 10), dtype=float32)

Answer 1

我在您的代碼中注意到 3 個主要內容：

您正在組合來自不同包（K，np）的數學函數。 盡可能堅持使用原生 tensorflow 函數（例如 tf.math.reduce_sum）。 有很多東西。 檢查文檔以獲取概述
自定義損失函數應該轉換成 tensorflow 圖形兼容函數，就像把tf.function裝飾器放在它前面一樣簡單。 看這里
循環通常做得不好。 盡可能矢量化你的函數。

總之，我認為這樣的事情應該可以工作（沒有測試過）：

import tensorflow as tf
tf_pi = tf.constant(np.pi, dtype=tf.float32)

@tf.function
def kernel(x, sigma=1):
    return (1 / (tf.sqrt(2 * tf_pi) * sigma)) * tf.exp((-(x * x) / (2 * sigma * sigma)))


@tf.function
def correntropy(y_true, y_pred):
    return -tf.math.reduce_mean(kernel(y_true - y_pred))

keras中的自定義丟失function？熵。數學/實施問題

問題描述

1 個解決方案

解決方案1
0 2022-02-04 20:40:34

keras中的自定義丟失function？ 熵。 數學/實施問題

問題描述

1 個解決方案

解決方案1 0 2022-02-04 20:40:34

keras中的自定義丟失function？熵。數學/實施問題

解決方案1
0 2022-02-04 20:40:34