当我测试为多个类别数据集训练的图像分类模型时，为什么会得到相同的类别预测？

Question

I'm trying to build image classification model on flower image dataset using tf.data with 4 different classes.我正在尝试使用具有 4 个不同类的 tf.data 在花卉图像数据集上构建图像分类模型。 When I test trained model I'm getting same class prediction even for different class images but training goes smoothly with good training accuracy and validation accuracy and also it gives good accuracy on test datasets.当我测试经过训练的模型时，即使对于不同的类图像，我也会得到相同的类预测，但是训练进行得很顺利，具有良好的训练准确度和验证准确度，并且它在测试数据集上也提供了很好的准确度。

My implementation of training and testing pipelines are as below我的训练和测试管道的实现如下

dataset_url = "https://storage.googleapis.com/download.tensorflow.org/example_images/flower_photos.tgz"
data_dir = tf.keras.utils.get_file(origin=dataset_url,
                                   fname='flower_photos',
                                   untar=True)
data_dir = pathlib.Path(data_dir)
data_dir = pathlib.Path(r'C:\Users\Hilary\.keras\datasets\flower_photos')
slide_labels =os.listdir(data_dir)

CLASS_NAMES = slide_labels 
NUM_CLASSES = len(CLASS_NAMES)
num_examples = len(list(data_dir.glob('*/*.jpg')))

def get_label(file_path):
  # convert the path to a list of path components
    parts = tf.strings.split(file_path, os.path.sep)
  # The second to last is the class-directory
    return tf.where(parts[-2] == CLASS_NAMES)[0][0]

def decode_img(img):
  # convert the compressed string to a 3D uint8 tensor
    img = tf.image.decode_jpeg(img, channels=3)
    return img  

def process_path(file_path):
    label = get_label(file_path)
  # load the raw data from the file as a string
    img = tf.io.read_file(file_path)
    img = decode_img(img)
    features = {'image': img, 'label': label}
    return features

list_ds = tf.data.Dataset.list_files(str(data_dir/'*/*'))
ds = list_ds.map(process_path, num_parallel_calls=tf.data.experimental.AUTOTUNE)
print("Total number of images", len(ds))
print("total No of classes: ",NUM_CLASSES)


train_size = int(0.90 * num_examples)
val_size = int(0.09* num_examples)
test_size = int(0.01 * num_examples)

full_dataset = ds.shuffle(reshuffle_each_iteration=False, buffer_size=len(ds))
train_dataset = full_dataset.take(train_size)
test_val_dataset = full_dataset.skip(train_size)
val_dataset = test_val_dataset.take(val_size)
test_dataset = test_val_dataset.skip(val_size)
print("Number of examples on training set is ", len(train_dataset))

When I do inference on individual class image as below当我对单个班级图像进行推断时，如下所示

for img_f in list(paths.list_images(r'C:\Users\hillary\.keras\datasets\flower_photos\sunflowers')):
    img = cv2.imread(img_f)
    test_img = [img]
    # test_img = [np.expand_dims(img, axis=0) for img in test_img]
    test_img = tf.concat(test_img, axis=0)
    test_img = tf.image.resize(test_img, [128, 128])
    test_img = tf.cast(image, tf.float32) / 255.0
    # test_img = tf.expand_dims(image, axis = 0)
    logits = model(test_img)
    y_probabilities = tf.nn.softmax(logits).numpy()[0]
    print(y_probabilities)
    index_max_proba = np.argmax(tf.nn.softmax(logits))
    print(class_labels[index_max_proba])

I get results as我得到的结果为

[1.2498085e-03 1.7629927e-01 8.2240731e-01 3.1031032e-05 1.2520954e-05]
roses
[1.2498085e-03 1.7629927e-01 8.2240731e-01 3.1031032e-05 1.2520954e-05]
roses
[1.2498085e-03 1.7629927e-01 8.2240731e-01 3.1031032e-05 1.2520954e-05]
roses
[1.2498085e-03 1.7629927e-01 8.2240731e-01 3.1031032e-05 1.2520954e-05]
roses
[1.2498085e-03 1.7629927e-01 8.2240731e-01 3.1031032e-05 1.2520954e-05]
roses
[1.2498085e-03 1.7629927e-01 8.2240731e-01 3.1031032e-05 1.2520954e-05]
roses
[1.2498085e-03 1.7629927e-01 8.2240731e-01 3.1031032e-05 1.2520954e-05]
roses
[1.2498085e-03 1.7629927e-01 8.2240731e-01 3.1031032e-05 1.2520954e-05]
roses
[1.2498085e-03 1.7629927e-01 8.2240731e-01 3.1031032e-05 1.2520954e-05]
roses
[1.2498085e-03 1.7629927e-01 8.2240731e-01 3.1031032e-05 1.2520954e-05]
roses
[1.2498085e-03 1.7629927e-01 8.2240731e-01 3.1031032e-05 1.2520954e-05]
roses
[1.2498085e-03 1.7629927e-01 8.2240731e-01 3.1031032e-05 1.2520954e-05]
roses
[1.2498085e-03 1.7629927e-01 8.2240731e-01 3.1031032e-05 1.2520954e-05]
roses
[1.2498085e-03 1.7629927e-01 8.2240731e-01 3.1031032e-05 1.2520954e-05]
roses
[1.2498085e-03 1.7629927e-01 8.2240731e-01 3.1031032e-05 1.2520954e-05]
roses
[1.2498085e-03 1.7629927e-01 8.2240731e-01 3.1031032e-05 1.2520954e-05]
roses
[1.2498085e-03 1.7629927e-01 8.2240731e-01 3.1031032e-05 1.2520954e-05]
roses

which predicts as roses for sunflower class images same with other class images as well它预测向日葵类图像的玫瑰与其他类图像相同

I tested this pipeline for different dataset and models I was getting same results which are single class prediction for different class images ..我针对不同的数据集和模型测试了这条管道，我得到了相同的结果，它们是针对不同类图像的单类预测..

Any help or suggestion to rectify my mistake will be appreciated任何纠正我的错误的帮助或建议将不胜感激

Answer 1

You're loading the file with opencv which loads the image in BGR format, while you load it with tf.io in the original pipeline.您正在使用 opencv 加载文件，该文件以 BGR 格式加载图像，而在原始管道中使用 tf.io 加载它。 Try converting it to RGB with the following code尝试使用以下代码将其转换为 RGB

img = cv2.imread(img_f)
img = cv2.cvtColor(img, cv2.COLOR_BGR2RGB)
test_img = [img]

Answer 2

It's funny that the output are all identical, not just similar.有趣的是，输出都是相同的，而不仅仅是相似。

Why are you dividing the image values by 255 in test but not in training?为什么在测试中将图像值除以 255 而不是在训练中？

当我测试为多个类别数据集训练的图像分类模型时，为什么会得到相同的类别预测？

问题描述

2 个解决方案

解决方案1
0 2022-07-20 20:02:46

解决方案2
0 2022-07-22 20:43:46

当我测试为多个类别数据集训练的图像分类模型时，为什么会得到相同的类别预测？

问题描述

2 个解决方案

解决方案1 0 2022-07-20 20:02:46

解决方案2 0 2022-07-22 20:43:46

解决方案1
0 2022-07-20 20:02:46

解决方案2
0 2022-07-22 20:43:46