[英]group and divide values in python
What I want to make records that have the same "NROCUENTA", create a column where the result of the first "SALDO" divided by the number of records in that group 我要制作具有相同“ NROCUENTA”的记录,请创建一个列,其中第一个“ SALDO”的结果除以该组中的记录数
import pandas as pd
import csv, sys
try:
file_encoding = 'utf8'
input_fd = open('DAT_210.del', encoding=file_encoding)
df = pd.read_csv(input_fd, sep = ' ', quotechar='"', error_bad_lines=False)
result=df.groupby('NROCUENTA').apply(
lambda x: ................................
)
except csv.Error as e:
sys.exit('file {}, line {}: {}'.format("datahist.del", reader.line_num, e))
resutl2=result.to_csv('result001.csv',mode = 'w', index=False )
SALDO=FIRST(SALDO)/COUNT(NROCUENTA) SALDO =第一(SALDO)/ COUNT(NROCUENTA)
DATA 数据
"NROCUENTA" "SALDO"
"210-1-388" 159.20
"210-1-388" 159.20
"210-1-1219" 0.93
"210-1-11657" 0.06
"210-1-11657" 0.06
"210-1-11657" 0.06
RESULT 结果
"210-1-388" 79.6
"210-1-388" 79.6
"210-1-1219" 0.93
"210-1-11657" 0.02
"210-1-11657" 0.02
"210-1-11657" 0.02
TRIED 试过
I was trying with the dfply
library, but it throws errors at me and I decided to do it with pandas 我正在尝试使用
dfply
库,但它向我抛出了错误,因此我决定使用pandas
IIUC, you need transform
with count
and divide it by SALDO
columns. IIUC,您需要使用
count
transform
并将其除以SALDO
列。 I assign result to column AVG_SALDO
我将结果分配给
AVG_SALDO
列
df['AVG_SALDO'] = df['SALDO'] / df.groupby('NROCUENTA').SALDO.transform('count')
Out[1112]:
NROCUENTA SALDO AVG_SALDO
0 210-1-388 159.20 79.60
1 210-1-388 159.20 79.60
2 210-1-1219 0.93 0.93
3 210-1-11657 0.06 0.02
4 210-1-11657 0.06 0.02
5 210-1-11657 0.06 0.02
声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.