如果所有值都在同一列中，如何从csv文件中读取数据？

Question

I have a csv file in the following format: 我有一个csv文件，格式如下：

"age","job","marital","education","default","balance","housing","loan"
58,"management","married","tertiary","no",2143,"yes","no"
44,"technician","single","secondary","no",29,"yes","no"

However, instead of being separated by tabs (different columns), they all lie in the same first column. 但是，它们不是由制表符（不同的列）分隔，而是位于相同的第一列中。 When I try reading this using pandas, the output gives all the values in the same list instead of a list of lists. 当我尝试使用pandas读取它时，输出会在同一列表中提供所有值，而不是列表列表。

My code: 我的代码：

dataframe = pd.read_csv("marketing-data.csv", header = 0, sep= ",")
dataset = dataframe.values
print(dataset)

O/p: O / P：

[[58 'management' 'married' ..., 2143 'yes' 'no']
 [44 'technician' 'single' ..., 29 'yes' 'no']]

What I need: 我需要的：

[[58, 'management', 'married', ..., 2143, 'yes', 'no']
 [44 ,'technician', 'single', ..., 29, 'yes', 'no']]

What is it I am missing? 我错过了什么？

Answer 1

I think you are confused by the print() output which doesn't show commas. 我认为你对print()输出感到困惑，它没有显示逗号。

Demo: 演示：

In [1]: df = pd.read_csv(filename)

Pandas representation: 熊猫代表：

In [2]: df
Out[2]:
   age         job  marital  education default  balance housing loan
0   58  management  married   tertiary      no     2143     yes   no
1   44  technician   single  secondary      no       29     yes   no

Numpy representation: Numpy代表：

In [3]: df.values
Out[3]:
array([[58, 'management', 'married', 'tertiary', 'no', 2143, 'yes', 'no'],
       [44, 'technician', 'single', 'secondary', 'no', 29, 'yes', 'no']], dtype=object)

Numpy string representation (result of print(numpy_array) ): Numpy string表示（ print(numpy_array) ）：

In [4]: print(df.values)
[[58 'management' 'married' 'tertiary' 'no' 2143 'yes' 'no']
 [44 'technician' 'single' 'secondary' 'no' 29 'yes' 'no']]

Conclusion: your CSV file has been parsed correctly. 结论：您的CSV文件已正确解析。

Answer 2

I don't really see a difference between what you want and what you get.. but parsing the csv file with the built in csv module give your desired result 我真的没有看到你想要的和你得到的东西之间的区别......但是使用内置的csv模块解析csv文件可以得到你想要的结果

import csv
with open('file.csv', 'rb') as csvfile:
     spamreader = csv.reader(csvfile, delimiter=',', quotechar='|')
     print list(spamreader)

[ [

['age', 'job', 'marital', 'education', 'default', 'balance', 'housing', 'loan'], ['年龄'，'工作'，'婚姻'，'教育'，'默认'，'平衡'，'住房'，'贷款']，

['58', 'management', 'married', 'tertiary', 'no', '2143', 'yes', 'no'], ['58'，'管理'，'已婚'，'大专'，'不'，'2143'，'是'，'不']，

['44', 'technician', 'single', 'secondary', 'no', '29', 'yes', 'no'] ['44'，'技师'，'单身'，'中学'，'不'，'29'，'是'，'不']

] ]

如果所有值都在同一列中，如何从csv文件中读取数据？

问题描述

2 个解决方案

解决方案1
2 已采纳 2017-08-05 10:49:48

解决方案2
1 2017-08-05 10:27:20

如果所有值都在同一列中，如何从csv文件中读取数据？

问题描述

2 个解决方案

解决方案1 2 已采纳 2017-08-05 10:49:48

解决方案2 1 2017-08-05 10:27:20

解决方案1
2 已采纳 2017-08-05 10:49:48

解决方案2
1 2017-08-05 10:27:20