如何在python中将数据行拆分为以逗号分隔的列

Question

I have a text file in the following format that I am trying to convert into rows and columns: 我有一个以下格式的文本文件，试图将其转换为行和列：

red,red,blue
blue,red,blue 
blue,blue,red

Once the conversion is complete, I want to store the above in a rows variable: 转换完成后，我想将以上内容存储在rows变量中：

row[0] # should return 'red red blue'
row[0][2] # should return 'blue'

So far I have gotten as far as: 到目前为止，我已经做到：

file = open('myfile.txt')
for row in file:
    # do something here

But i'm not sure what to do next.. can someone help? 但是我不确定下一步该怎么办..有人可以帮忙吗？ Thanks in advance! 提前致谢！

Answer 1

Solution without any external modules : 没有任何外部模块的解决方案：

output = []

with open('file.txt', 'r') as reading:
    file_input = reading.read().split('\n')

for row in file_input:
    output.append(row.split(','))

print(output)

Answer 2

1.numpy solution : (because numpy tag) 1. numpy 解决方案 ：（因为numpy标签）

Use numpy.genfromtxt for numpy array: 将numpy.genfromtxt用于numpy数组：

import numpy as np
arr = np.genfromtxt('file.txt',dtype='str',delimiter=',')
print (arr)
[['red' 'red' 'blue']
 ['blue' 'red' 'blue']
 ['blue' 'blue' 'red']]

print (arr[0])
['red' 'red' 'blue']

print (arr[0][2])
blue

2.pandas solution : 2.pandas解决方案 ：

Use read_csv for DataFrame and for select values loc : 将read_csv用于DataFrame和选择值loc ：

import pandas as pd

df = pd.read_csv('file.txt', header=None)
print (df)
      0     1      2
0   red   red   blue
1  blue   red   blue
2  blue  blue    red

#select first row to Series
print (df.loc[0])
0     red
1     red
2    blue
Name: 0, dtype: object

#select value by index and column
print (df.loc[0, 2])
blue

3.pure python solutions : 3.纯python解决方案 ：

If want nested lists use nested list comprehension : 如果要嵌套列表，请使用nested list comprehension ：

data = [[item for item in line.rstrip('\r\n').split(',')] 
         for line in open('file.txt')]
print (data)

[['red', 'red', 'blue'], ['blue', 'red', 'blue'], ['blue', 'blue', 'red']]

Or with module csv : 或与模块csv ：

import csv

reader = csv.reader(open("file.txt"), delimiter=',')
data = [word for word in [row for row in reader]]
print (data)

[['red', 'red', 'blue'], ['blue', 'red', 'blue'], ['blue', 'blue', 'red']]

print (data[0])
['red', 'red', 'blue']

print (data[0][2])
blue

Answer 3

Alternative solution with pandas module which is good for csv files processing: 带有pandas模块的替代解决方案，非常适合csv文件处理：

import pandas as pd

df = pd.read_csv('file.txt', header=None).T

print(df[0].tolist())    # ['red', 'red', 'blue']
print(df[0][2])          # blue

如何在python中将数据行拆分为以逗号分隔的列

问题描述

3 个解决方案

解决方案1
4 已采纳 2017-10-14 18:13:30

解决方案2
3 2017-10-14 18:12:58

解决方案3
1 2017-10-14 18:15:18

如何在python中将数据行拆分为以逗号分隔的列

问题描述

3 个解决方案

解决方案1 4 已采纳 2017-10-14 18:13:30

解决方案2 3 2017-10-14 18:12:58

解决方案3 1 2017-10-14 18:15:18

解决方案1
4 已采纳 2017-10-14 18:13:30

解决方案2
3 2017-10-14 18:12:58

解决方案3
1 2017-10-14 18:15:18