Using Tensorflow Interleave to Improve Performance
I have an input pipeline that is performing poorly, with low CPU, GPU, and disk utilization. I've been reading the TensorFlow "Better performance with the tf.data API" guide and the Dataset docs, but I don't understand what's going on well enough to apply it to my situation. Here's my current setup:
img_files = sorted(tf.io.gfile.glob(...))
imgd = tf.data.FixedLengthRecordDataset(img_files, inrez*inrez)
#POINT1A
imgd = imgd.map(lambda s: tf.reshape(tf.io.decode_raw(s, tf.int8), (inrez,inrez,1)))
imgd = imgd.map(lambda x: tf.cast(x, dtype=tf.float32))
out_files = sorted(tf.io.gfile.glob(...))
outd = tf.data.FixedLengthRecordDataset(out_files, 4, compression_type="GZIP")
#POINT1B
outd = outd.map(lambda s: tf.io.decode_raw(s, tf.float32))
xsrc = tf.data.Dataset.zip((imgd, outd)).batch(batchsize)
xsrc = xsrc.repeat() # indefinitely
#POINT2
xsrc = xsrc.prefetch(buffer_size=tf.data.experimental.AUTOTUNE)
Should I interleave the whole pipeline right at the end (POINT2), before the prefetch? Or interleave imgd and outd separately, after each FixedLengthRecordDataset (POINT1A, POINT1B), and parallelize the maps? (I need to keep imgd and outd synced up!) Also, what is the Dataset.range(rvalue) in the interleave examples for? It seems to be necessary, but it's not obvious what rvalue to use. Is there a better overall plan?
Note that the datasets are very large and do not fit in RAM.
Interleave lets you process each file in a separate logical thread (in parallel), then combine the data from the files into a single dataset. Since your data comes from two corresponding files, you need to be careful to preserve the order.
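As a minimal sketch of that behavior, here is a toy pipeline where in-memory datasets stand in for the record files (the names and data are illustrative, not from the question):

```python
import tensorflow as tf

# Two "file pairs", each identified by a string; each pair yields three records.
pairs = tf.data.Dataset.from_tensor_slices((["a", "b"], ["A", "B"]))

def make_pair_dataset(img_id, out_id):
    # Stand-in for reading one (img, out) file pair: repeat each id three times.
    imgs = tf.data.Dataset.from_tensors(img_id).repeat(3)
    outs = tf.data.Dataset.from_tensors(out_id).repeat(3)
    # Zipping inside the per-pair function keeps img and out records aligned.
    return tf.data.Dataset.zip((imgs, outs))

ds = pairs.interleave(make_pair_dataset, cycle_length=2,
                      num_parallel_calls=tf.data.experimental.AUTOTUNE,
                      deterministic=True)
# Records round-robin between the two file pairs, but each (img, out) tuple
# stays aligned: (a, A), (b, B), (a, A), (b, B), (a, A), (b, B)
```

With `deterministic=True` (the default) and `block_length=1`, the output order is reproducible even though the pairs are read in parallel.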
Here is an example of how you could put the interleave near the end of the pipeline:
img_files = ...  # a dataset of image filename strings
out_files = ...  # a dataset of the matching output filename strings
files = tf.data.Dataset.zip((img_files, out_files))

def parse_img_file(img_file):
    imgd = tf.data.FixedLengthRecordDataset(img_file, inrez*inrez)
    ...

def parse_out_file(out_file):
    ...

def parse_files_fn(img_file, out_file):
    img_file_dataset = parse_img_file(img_file)
    out_file_dataset = parse_out_file(out_file)
    return tf.data.Dataset.zip((img_file_dataset, out_file_dataset))

dataset = files.interleave(parse_files_fn,
                           num_parallel_calls=tf.data.experimental.AUTOTUNE)
dataset = dataset.repeat()
Each thread of the interleave will produce elements from a different pair of (img, out) files, and the elements produced from each pair of files will be interleaved together.
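Putting it all together, here is a runnable toy version of this pattern. The temporary files, their contents, and the tiny `inrez = 2` are fabricated for illustration; only the pipeline shape (zipped filename datasets, per-pair parsing, interleave, batch, prefetch) mirrors the sketch above:

```python
import gzip
import os
import tempfile

import numpy as np
import tensorflow as tf

inrez = 2  # toy image resolution
tmpdir = tempfile.mkdtemp()
img_paths, out_paths = [], []
for i in range(2):
    # Each "image" file holds three inrez*inrez int8 records, all filled with i.
    img_path = os.path.join(tmpdir, "img%d.dat" % i)
    with open(img_path, "wb") as f:
        f.write(np.full(3 * inrez * inrez, i, dtype=np.int8).tobytes())
    img_paths.append(img_path)
    # Each gzipped "out" file holds three matching 4-byte float32 records.
    out_path = os.path.join(tmpdir, "out%d.dat.gz" % i)
    with gzip.open(out_path, "wb") as f:
        f.write(np.full(3, float(i), dtype=np.float32).tobytes())
    out_paths.append(out_path)

files = tf.data.Dataset.zip((tf.data.Dataset.from_tensor_slices(img_paths),
                             tf.data.Dataset.from_tensor_slices(out_paths)))

def parse_files_fn(img_file, out_file):
    imgd = tf.data.FixedLengthRecordDataset(img_file, inrez * inrez)
    imgd = imgd.map(lambda s: tf.cast(
        tf.reshape(tf.io.decode_raw(s, tf.int8), (inrez, inrez, 1)), tf.float32))
    outd = tf.data.FixedLengthRecordDataset(out_file, 4, compression_type="GZIP")
    outd = outd.map(lambda s: tf.io.decode_raw(s, tf.float32))
    # Zip per file pair, so record k of imgd always lines up with record k of outd.
    return tf.data.Dataset.zip((imgd, outd))

dataset = files.interleave(parse_files_fn, cycle_length=2,
                           num_parallel_calls=tf.data.experimental.AUTOTUNE)
dataset = dataset.batch(2).prefetch(tf.data.experimental.AUTOTUNE)
```

Iterating over `dataset` shows that each image stays paired with its label even though records from the two file pairs are interleaved; in the real pipeline you would also add `.repeat()` before the batch/prefetch stage.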