二维NumPy数组的切片截面

Question

I'm looking to print a cross-section of a numpy array. 我正在寻找打印numpy数组的横截面。 I am looking to pull the first 15 rows and only data from the 5th index column. 我想拉前15行，仅从第5个索引列中提取数据。

import csv as csv
import numpy as np 

csv_file_object = open('train.csv', 'rU')
header = next(csv_file_object)
data = []

for row in csv_file_object:
    data.append(row)

data = np.array(data)

print(data[0:15,5])

When I run the above code, I receive the following error: IndexError: too many indices for array. 当我运行上面的代码时，我收到以下错误：IndexError：数组的索引过多。

Running the code without adding the column filter works as expected. 运行代码而不添加列过滤器按预期工作。 The csv file is a 12x892 (x,y) dataset. csv文件是12x892（x，y）数据集。

Answer 1

The csv_file_object is a file object contains all lines of your csv file without splitting. csv_file_object是一个文件对象，包含csv文件的所有行，但不拆分。 You have to use csv module to read them properly: 您必须使用csv模块正确阅读它们：

with open('train.csv', 'rU') as csv_file_object:
    reader = csv.DictReader(csv_file_object)
    data = np.array([row.values() for row in reader])

But since you want to convert them to a numpy array you better to use genfromtxt function: 但是由于您要将它们转换为numpy数组，因此最好使用genfromtxt函数：

data = np.genfromtxt('train.csv')

You can pass arguments like dtype , delimiter and names (a boolean for keeping the headers). 您可以传递dtype ， delimiter和names类的参数（用于保留标题的布尔值）。

Answer 2

UPDATE: I did not call csv.reader() prior to opening the file 更新：打开文件之前，我没有调用csv.reader（）

import csv as csv
import numpy as np 

csv_file_object = csv.reader(open('train.csv', 'rU'))
header = next(csv_file_object)
data = []

for row in csv_file_object:
    data.append(row)

data = np.array(data)

print(data.shape)
print(data[0:15,5])

二维NumPy数组的切片截面

问题描述

2 个解决方案

解决方案1
0 2016-09-25 22:29:54

解决方案2
0 2016-09-25 23:19:24

二维NumPy数组的切片截面

问题描述

2 个解决方案

解决方案1 0 2016-09-25 22:29:54

解决方案2 0 2016-09-25 23:19:24

解决方案1
0 2016-09-25 22:29:54

解决方案2
0 2016-09-25 23:19:24