pandas groupby使用字典值，应用sum

Question

I have a defaultdict: 我有一个defaultdict：

dd = defaultdict(list,
        {'Tech': ['AAPL','GOOGL'],
         'Disc': ['AMZN', 'NKE']  }

and a dataframe that looks like this: 和一个如下所示的数据框：

         AAPL AMZN GOOGL NKE
1/1/10   100  200  500   200
1/2/10   100  200  500   200
1/310    100  200  500   200

and the output I'd like is to SUM the dataframe based on the values of the dictionary, with the keys as the columns: 我想要的输出是根据字典的值对数据帧进行求和，并将键作为列：

         TECH DISC 
1/1/10   600  400 
1/2/10   600  400  
1/3/10   600  400

The pandas groupby documentation says it does this if you pass a dictionary but all I end up with is an empty df using this code: pandas groupby文档说，如果你传递一个字典，它会这样做，但我最终得到的是使用此代码的空df：

df.groupby(by=dd).sum()   ##returns empty df

Answer 1

Create the dict in the right way , you can using by with axis=1 以正确的方式创建dict ，您可以使用by axis=1

# map each company to industry
dd_rev = {w: k for k, v in dd.items() for w in v}
# {'AAPL': 'Tech', 'GOOGL': 'Tech', 'AMZN': 'Disc', 'NKE': 'Disc'}

# group along columns
df.groupby(by=dd_rev,axis=1).sum() 

Out[160]: 
        Disc  Tech
1/1/10   400   600
1/2/10   400   600
1/310    400   600

Answer 2

you can create a new dataframe using the defaultdict and dictionary comprehension in 1 line 您可以使用1行中的defaultdict和字典理解创建新的数据框

pd.DataFrame({x: df[dd[x]].sum(axis=1) for x in dd})
# output:

        Disc  Tech
1/1/10   400   600
1/2/10   400   600
1/310    400   600

pandas groupby使用字典值，应用sum

问题描述

2 个解决方案

解决方案1
4 已采纳 2018-06-11 01:26:35

解决方案2
1 2018-06-11 01:33:30

pandas groupby使用字典值，应用sum

问题描述

2 个解决方案

解决方案1 4 已采纳 2018-06-11 01:26:35

解决方案2 1 2018-06-11 01:33:30

解决方案1
4 已采纳 2018-06-11 01:26:35

解决方案2
1 2018-06-11 01:33:30