用numpy的genfromtxt讀取每第n行的最快方法

Question

我用numpy的genfromtxt讀取我的數據：

import numpy as np
measurement = np.genfromtxt('measurementProfile2.txt', delimiter=None, dtype=None, skip_header=4, skip_footer=2, usecols=(3,0,2))
rows, columns = np.shape(measurement)
x=np.zeros((rows, 1), dtype=measurement.dtype)
x[:]=394
measurement = np.hstack((measurement, x))
np.savetxt('measurementProfileFormatted.txt',measurement)

這很好用。 但我想在最終的輸出文件中只需要5-th 6-th n-th ）行。 根據numpy.genfromtxt.html ，沒有參數可以做到這一點。 我不想迭代數組。 有沒有推薦的方法來處理這個問題？

Answer 1

為了避免讀取整個數組，您可以將np.genfromtxt與itertools.islice組合以跳過行。 這比讀取整個陣列然后切片要快一些（至少對於我嘗試過的較小陣列）。

例如，這是file.txt的內容：

然后例如：

>>> import itertools
>>> with open('file.txt') as f_in:
        x = np.genfromtxt(itertools.islice(f_in, 0, None, 3), dtype=int)

返回一個數組x與0 ， 3和6的上面的文件的索引的元素：

array([12, 17, 62])

Answer 2

無論如何你必須閱讀整個文件，選擇第n個元素做類似的事情：

>>> a = np.arange(50)
>>> a[::5]
array([ 0,  5, 10, 15, 20, 25, 30, 35, 40, 45])

Answer 3

如果您只想在最終輸出文件中使用特定行，那么為什么不保存那些行而不是保存整個“測量”矩陣：

 output_rows = [5,7,11] np.savetxt('measurementProfileFormatted.txt',measurement[output_rows,:])

用numpy的genfromtxt讀取每第n行的最快方法

問題描述

3 個解決方案

解決方案1
4 已采納 2015-01-15 12:02:16

解決方案2
0 2015-01-15 11:19:19

解決方案3
0 2015-01-15 14:25:42

用numpy的genfromtxt讀取每第n行的最快方法

問題描述

3 個解決方案

解決方案1 4 已采納 2015-01-15 12:02:16

解決方案2 0 2015-01-15 11:19:19

解決方案3 0 2015-01-15 14:25:42

解決方案1
4 已采納 2015-01-15 12:02:16

解決方案2
0 2015-01-15 11:19:19

解決方案3
0 2015-01-15 14:25:42