在Tensorflow中使用tf.contrib.data.Datasets API时无法处理错误

Question

我已经创建了tfrecords作为我的数据库。 该数据库由9个不同的tfrecord文件组成。 目的是将来自9个数据库的一批样本输入到模型中。 因此，我已将zip函数与TFRecordDataset一起使用。 每个样本都包含一个框架及其功能集。 因此，我需要从每个tfrecord文件中抽取8个样本，这总共提供了72个(features, image) 。 因此，我将图像中的特征单独提取出来，如下面的代码所示。

问题：当我到达第一个时期的末尾时，剩余数据少于72个。结果，第二个时期的数据相加就构成了72个样本的批次。 因此，由于我正在训练递归神经网络，因此这在我的情况下是不可思议的，因此我的状态应保持一致（现在无需讨论）。

因此，我没有使用重复功能，而是尝试实现https://www.tensorflow.org/programmers_guide/datasets中提到的内容。在“处理多个时期”下，即使用try和except进行循环。

# Compute for 100 epochs.
for _ in range(100):
  sess.run(iterator.initializer)
  while True:
    try:
      sess.run(next_element)
    except tf.errors.OutOfRangeError:
      break

  # [Perform end-of-epoch calculations here.]

一旦这样做，我就会遇到另一个问题。 首先，这是我的完整代码：

import tensorflow as tf
import numpy as np
import time
import cv2

num_epoch = 1
batch_size = 8 # This is set to 8 since
num_threads = 9
common = "C:/Users/user/PycharmProjects/AffectiveComputingNew/database/"
filenames = [(common + "train_1_db.tfrecords"), (common + "train_2_db.tfrecords"), (common + "train_3_db.tfrecords"),
     (common + "train_4_db.tfrecords"), (common + "train_5_db.tfrecords"), (common + "train_6_db.tfrecords"),
     (common + "train_7_db.tfrecords"), (common + "train_8_db.tfrecords"), (common + "train_9_db.tfrecords")]

# Transforms a scalar string `example_proto` into a pair of a scalar string and
# a scalar integer, representing an image and its label, respectively.
def _parse_function(example_proto):
    features = {
        'height': tf.FixedLenFeature([], tf.int64),
        'width': tf.FixedLenFeature([], tf.int64),
        'image_raw': tf.FixedLenFeature([], tf.string),
        'features': tf.FixedLenFeature([432], tf.float32)
    }

    parsed_features = tf.parse_single_example(example_proto, features)

    # This is how we create one example, that is, extract one example from the database.
    image = tf.decode_raw(parsed_features['image_raw'], tf.uint8)
    # The height and the weights are used to
    height = tf.cast(parsed_features['height'], tf.int32)
    width = tf.cast(parsed_features['width'], tf.int32)

    # The image is reshaped since when stored as a binary format, it is flattened. Therefore, we need the
    # height and the weight to restore the original image back.
    image = tf.reshape(image, [height, width, 3])

    features = parsed_features['features']

    return features, image

random_features = tf.Variable(tf.zeros([72, 432], tf.float32))
random_images = tf.Variable(tf.zeros([72, 112, 112, 3]))

datasets = []
for _ in filenames:
    datasets.append(tf.contrib.data.TFRecordDataset(_).map(_parse_function))

dataset_ziped = tf.contrib.data.TFRecordDataset.zip((datasets[0], datasets[1], datasets[2], datasets[3],
      datasets[4], datasets[5], datasets[6], datasets[7], datasets[8]))
#dataset = dataset_ziped.repeat(num_epoch)
dataset = dataset_ziped.batch(batch_size)

iterator = dataset.make_initializable_iterator()
next_batch = iterator.get_next() # This has shape: [9, 2]

features = tf.concat((next_batch[0][0], next_batch[1][0], next_batch[2][0], next_batch[3][0],
                      next_batch[4][0], next_batch[5][0], next_batch[6][0], next_batch[7][0],
                      next_batch[8][0]), axis=0)
features = tf.reshape(features, shape=[9, 8, 432]) # where 8 * 9 = 72
features = tf.transpose(features, perm=[1, 0, 2]) # shape becomes: [8, 9, 432]
features = tf.reshape(features, shape=[72, 432]) # Now frames will be: 1st frame from 1st video, second from second video...

images = tf.concat((next_batch[0][1], next_batch[1][1], next_batch[2][1], next_batch[3][1],
                    next_batch[4][1], next_batch[5][1], next_batch[6][1], next_batch[7][1],
                    next_batch[8][1]), axis=0)
images = tf.reshape(images, shape=[9, 8, 112, 112, 3])
images = tf.transpose(images, perm=[1, 0, 2, 3, 4])
images = tf.reshape(images, shape=[72, 112, 112, 3])

init_op = tf.global_variables_initializer()

with tf.Session() as sess:
    # Initialize `iterator` with training data.
    sess.run(init_op)

    for _ in range(num_epoch):
        sess.run(iterator.initializer)

        # This while loop will run indefinitly until the end of the first epoch
        while True:
            try:
                lst = []
                features_np = sess.run([features])[0] # since the output is always: (1, 72, 432)

                for f in features_np:
                    lst.append(f[0])

            except tf.errors.OutOfRangeError:
                print('errorrrrr')

因此，由于样本数量不再是72个，因此在以下行遇到错误：

features = tf.reshape(features, shape=[9, 8, 432]) # where 8 * 9 = 72

因此，我需要一种方法来处理此错误。 我尝试断言如下：

assert_op = tf.Assert(tf.equal(tf.shape(features[0]), batch_size * 9), [features])
with tf.control_dependencies([assert_op])... after features = tf.concat...

而且它没有用。 我按如下方式尝试了tf.cond（但效果不佳）：

tf.cond(tf.equal(tf.shape(features)[0], batch_size * 9),
        lambda: tf.assign(random_features, features),
        lambda: tf.assign(random_features, random_features))

features = tf.reshape(random_features, shape=[9, 8, 432]) # where 8 * 9 = 72
....

总而言之，我需要一种方法来遍历各个时期，而不会交织来自不同迭代的样本，并且同时使用reshape函数（在本例中，批量大小小于72）同时解决了该功能的问题。

任何帮助深表感谢！！

Answer 1

因此，由于最后一个批次被截断，因此我创建了一个临时variable ，每当批次大小等于代码中指定的batch_size时，该variable就会被分配初始批次的值。 因此，一旦当前批处理的大小不再等于batch_size ，我就使用我创建的临时批处理。 以下是解决方案：

import tensorflow as tf
import numpy as np
import time
import cv2

num_epoch = 2
batch_size = 8 # This is set to 8 since
num_threads = 9
common = "C:/Users/user/PycharmProjects/AffectiveComputingNew/database/"
filenames = [(common + "train_1_db.tfrecords"), (common + "train_2_db.tfrecords"), (common + "train_3_db.tfrecords"),
     (common + "train_4_db.tfrecords"), (common + "train_5_db.tfrecords"), (common + "train_6_db.tfrecords"),
     (common + "train_7_db.tfrecords"), (common + "train_8_db.tfrecords"), (common + "train_9_db.tfrecords")]

# Transforms a scalar string `example_proto` into a pair of a scalar string and
# a scalar integer, representing an image and its label, respectively.
def _parse_function(example_proto):
    features = {
        'height': tf.FixedLenFeature([], tf.int64),
        'width': tf.FixedLenFeature([], tf.int64),
        'image_raw': tf.FixedLenFeature([], tf.string),
        'features': tf.FixedLenFeature([432], tf.float32)
    }

    parsed_features = tf.parse_single_example(example_proto, features)

    # This is how we create one example, that is, extract one example from the database.
    image = tf.decode_raw(parsed_features['image_raw'], tf.uint8)
    # The height and the weights are used to
    height = tf.cast(parsed_features['height'], tf.int32)
    width = tf.cast(parsed_features['width'], tf.int32)

    # The image is reshaped since when stored as a binary format, it is flattened. Therefore, we need the
    # height and the weight to restore the original image back.
    image = tf.reshape(image, [height, width, 3])

    features = parsed_features['features']

    return features, image
# Here is the temp var that I will use whenever the return batch from the dataset doesn't have a size of batch_size * 9 mentioned above. 
random_features = tf.Variable(tf.zeros([72, 432], tf.float32))
random_images = tf.Variable(tf.zeros([72, 112, 112, 3], tf.uint8))

datasets = []
for _ in filenames:
    datasets.append(tf.contrib.data.TFRecordDataset(_).map(_parse_function))

dataset_ziped = tf.contrib.data.TFRecordDataset.zip((datasets[0], datasets[1], datasets[2], datasets[3],
      datasets[4], datasets[5], datasets[6], datasets[7], datasets[8]))
dataset = dataset_ziped.batch(batch_size)

iterator = dataset.make_initializable_iterator()
next_batch = iterator.get_next() # This has shape: [9, 2]

features = tf.concat((next_batch[0][0], next_batch[1][0], next_batch[2][0], next_batch[3][0],
                      next_batch[4][0], next_batch[5][0], next_batch[6][0], next_batch[7][0],
                      next_batch[8][0]), axis=0)
images = tf.concat((next_batch[0][1], next_batch[1][1], next_batch[2][1], next_batch[3][1],
                    next_batch[4][1], next_batch[5][1], next_batch[6][1], next_batch[7][1],
                    next_batch[8][1]), axis=0)

def get_features(features, images):
    with tf.control_dependencies([tf.assign(random_features, features), tf.assign(random_images, images)]):
        features = tf.reshape(features, shape=[9, 8, 432]) # where 8 * 9 = 72
        features = tf.transpose(features, perm=[1, 0, 2]) # shape becomes: [8, 9, 432]
        features = tf.reshape(features, shape=[72, 432]) # Now frames will be: 1st frame from 1st video, second from second video...

        images = tf.reshape(images, shape=[9, 8, 112, 112, 3])
        images = tf.transpose(images, perm=[1, 0, 2, 3, 4])
        images = tf.reshape(images, shape=[72, 112, 112, 3])
        return features, images

condition1 = tf.equal(tf.shape(features)[0], batch_size * 9)
condition2 = tf.equal(tf.shape(images)[0], batch_size * 9)

condition = tf.logical_and(condition1, condition2)

features, images = tf.cond(condition,
                           lambda: get_features(features, images),
                           lambda: get_features(random_features, random_images))

init_op = tf.global_variables_initializer()

with tf.Session() as sess:
    # Initialize `iterator` with training data.
    sess.run(init_op)

    for _ in range(num_epoch):
        sess.run(iterator.initializer)

        # This while loop will run indefinitly until the end of the first epoch
        while True:
            try:
                lst = []
                features_np, images_np = sess.run([features, images])

                for f in features_np:
                    lst.append(f[0])

                print(lst)
            except tf.errors.OutOfRangeError:
                print('errorrrrr')
                break

请注意，我总是提到batch_size * 9因为在压缩数据集时，这将创建一个样本数据，其元素是从9个不同的数据集中获取的。 因此，由于我将batch_size分配为8，有72个样本，所以我取8 * 9。

在Tensorflow中使用tf.contrib.data.Datasets API时无法处理错误

问题描述

1 个解决方案

解决方案1
0 已采纳 2017-09-22 22:05:32

在Tensorflow中使用tf.contrib.data.Datasets API时无法处理错误

问题描述

1 个解决方案

解决方案1 0 已采纳 2017-09-22 22:05:32

解决方案1
0 已采纳 2017-09-22 22:05:32