从字符串转换为熊猫数据框

Question

I'm importing a csv file from AWS S3 in AWS Lambdawith below code: 我正在使用以下代码从AWS Lambda中的AWS S3导入一个csv文件：

file = s3.get_object(Bucket = bucket, Key = key)
rows = file['Body'].read().decode('utf-8').splitlines(False)

I'm getting input in below format : 我收到以下格式的输入：

data = "a,b,c,d,\\"x,y\\",e,f" 数据=“ a，b，c，d，\\” x，y \\“，e，f”

and I want output in below format: 我想以以下格式输出：

>>>`>>> df
   0  1  2  3    4  5  6
0  a  b  c  d  x,y  e  f`

i have to split data based on ',' but if some thong is between " " they should remain as it is. 我必须根据'分割数据，但是如果在“”之间有一些皮带，它们应该保持原样。

Or if you have any other solution for import csv file from s3 to lambda and converting in Data Frame, Please suggest 或者，如果您有将sv文件从s3导入到lambda并在数据帧中进行转换的其他解决方案，请提出建议

Answer 1

use csv module 使用csv模块

try this, 尝试这个，

from csv import reader
import pandas as pd
data=["a,b,c,d,\"x,y\",e,f"]
df=pd.DataFrame( list(reader(data)))
print df

Output: 输出：

   0  1  2  3    4  5  6
0  a  b  c  d  x,y  e  f

从字符串转换为熊猫数据框

问题描述

1 个解决方案

解决方案1
2 已采纳 2018-11-13 10:11:42

从字符串转换为熊猫数据框

问题描述

1 个解决方案

解决方案1 2 已采纳 2018-11-13 10:11:42

解决方案1
2 已采纳 2018-11-13 10:11:42