[英]How to calculate count and percentage in groupby in Python
分組后我有以下輸出
Publisher.groupby('Category')['Title'].count()
Category
Coding 5
Hacking 7
Java 1
JavaScript 5
LEGO 43
Linux 7
Networking 5
Others 123
Python 8
R 2
Ruby 4
Scripting 4
Statistics 2
Web 3
在上面的輸出中,我也想要百分比,即第一行5*100/219
等等。 我正在做以下
Publisher.groupby('Category')['Title'].agg({'Count':'count','Percentage':lambda x:x/x.sum()})
但它給了我一個錯誤。 請幫忙
我認為你可以使用:
P = Publisher.groupby('Category')['Title'].count().reset_index()
P['Percentage'] = 100 * P['Title'] / P['Title'].sum()
樣品:
Publisher = pd.DataFrame({'Category':['a','a','s'],
'Title':[4,5,6]})
print (Publisher)
Category Title
0 a 4
1 a 5
2 s 6
P = Publisher.groupby('Category')['Title'].count().reset_index()
P['Percentage'] = 100 * P['Title'] / P['Title'].sum()
print (P)
Category Title Percentage
0 a 2 66.666667
1 s 1 33.333333
df = pd.DataFrame({'Category':['a','a','s'],
'Title':[4,5,6]})
df=df.groupby('Category')['Title'].count().rename("percentage").transform(lambda x: x/x.sum())
df.reset_index()
#output in dataframe type
Category percentage
0 a 0.666667
1 s 0.333333
#please let me know if it doesn't solve your current problem
聲明:本站的技術帖子網頁,遵循CC BY-SA 4.0協議,如果您需要轉載,請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.