Pandas: unique values across two columns?

Question

I am new to pandas. I have two DataFrames related to a two-player game.

DF1: matches  # match information

match_num   winner_id   loser_id   points
270         201504      201595     28
271         201514      201426     19
272         201697      211901     21
273         201620      211539     30
274         214981      203564.    10

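For match 270, players 201504 (winner) and 201595 (loser) are each credited with 28 points.

I need to find which player(s) earned the highest total points.

I am currently using a hash map to solve this: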

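from collections import defaultdict

# Accumulate each player's total points across all matches
hmap = defaultdict(int)
for index, row in matches_df.iterrows():
    hmap[row["winner_id"]] += row["points"]  # winner gets the match points
    hmap[row["loser_id"]] += row["points"]   # loser gets the same points
# Player id with the highest total (the first one found if there is a tie)
max_key = max(hmap, key=hmap.get)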

Can this be solved the pandas way, in an SQL-like style?

Answer 1

Use melt to stack the two id columns, then groupby:

(df[['winner_id','loser_id','points']]
   .melt('points', value_name='id')
   .groupby('id')['points'].sum()
)

Output:

id
201426.0    19
201504.0    28
201514.0    19
201595.0    28
201620.0    30
201697.0    21
203564.0    10
211539.0    30
211901.0    21
214981.0    10
Name: points, dtype: int64
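
The result above gives the total per player but stops short of picking the winner the question asks for. A minimal sketch of that last step follows, assuming the sample rows from the question with integer ids (the real data appears to hold floats). The names long_df, totals, best_total and top_players are illustrative and do not come from the original post.

```python
import pandas as pd

# Sample data copied from the question; ids are kept as integers here (assumption).
matches_df = pd.DataFrame({
    "match_num": [270, 271, 272, 273, 274],
    "winner_id": [201504, 201514, 201697, 201620, 214981],
    "loser_id": [201595, 201426, 211901, 211539, 203564],
    "points": [28, 19, 21, 30, 10],
})

# Stack winner_id and loser_id into one "id" column: one row per (player, match).
long_df = matches_df[["winner_id", "loser_id", "points"]].melt(
    id_vars="points", value_name="id"
)

# Total points per player.
totals = long_df.groupby("id")["points"].sum()

# Highest total, and every player who reaches it (ties are possible).
best_total = totals.max()
top_players = totals[totals == best_total].index.tolist()

print(best_total)
print(top_players)
```

totals.idxmax() would also work, but it returns only the first id with the maximum, so comparing against totals.max() is the safer choice when two players can tie, as the winner and loser of the same match do in this sample.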

Solution 1
1 Accepted 2021-10-07 03:41:33
