[英]How can I read a pipe-delimited transaction file and generate the sales reports
我有一个以竖线分隔的交易文件。 我想从此数据中确定以下内容。
Pedro|groceries|apple|1.42
Nitin|tobacco|cigarettes|15.00
Susie|groceries|cereal|5.50
Susie|groceries|milk|4.75
Susie|tobacco|cigarettes|15.00
Susie|fuel|gasoline|44.90
Pedro|fuel|propane|9.60
这可以通过pandas.DataFrame.groupby
:
import pandas as pd
from io import StringIO
mystr = """Pedro|groceries|apple|1.42
Nitin|tobacco|cigarettes|15.00
Susie|groceries|cereal|5.50
Susie|groceries|milk|4.75
Susie|tobacco|cigarettes|15.00
Susie|fuel|gasoline|44.90
Pedro|fuel|propane|9.60"""
df = pd.read_csv(StringIO(mystr), header=None, sep='|',
names=['Name', 'Category', 'Product', 'Sales'])
# Report 1
rep1 = df.groupby('Name')['Sales'].sum()
# Name
# Nitin 15.00
# Pedro 11.02
# Susie 70.15
# Name: Sales, dtype: float64
# Report 2
rep2 = df.groupby(['Name', 'Category'])['Sales'].sum()
# Name Category
# Nitin tobacco 15.00
# Pedro fuel 9.60
# groceries 1.42
# Susie fuel 44.90
# groceries 10.25
# tobacco 15.00
# Name: Sales, dtype: float64
声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.