Pandas column MultiIndex: subtracting columns from each other

Question

I have a pandas DataFrame.

Constructor:

import pandas as pd

c = pd.MultiIndex.from_product([['AAPL','AMZN'],['price','custom']])
i = pd.date_range(start='2017-01-01',end='2017-01-6')
df1 = pd.DataFrame(index=i,columns=c)

df1.loc[:,('AAPL','price')] = list(range(51,57))
df1.loc[:,('AMZN','price')] = list(range(101,107))
df1.loc[:,('AAPL','custom')] = list(range(1,7))
df1.loc[:,('AMZN','custom')] = list(range(17,23))
df1.index.set_names('Dates',inplace=True)
df1.sort_index(axis=1,level=0,inplace=True) # needed for pd.IndexSlice[]

df1

This produces (I couldn't figure out how to format the Jupyter Notebook output):

             AAPL         AMZN      
           custom price custom price
Dates                               
2017-01-01      1    51     17   101
2017-01-02      2    52     18   102
2017-01-03      3    53     19   103
2017-01-04      4    54     20   104
2017-01-05      5    55     21   105
2017-01-06      6    56     22   106

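Question: how do I create a third column on the second level of the MultiIndex that holds the difference between price and custom? It should be computed separately for each top-level column, i.e. once for AAPL and once for AMZN.

Attempted solutions:

I tried using pd.IndexSlice in two ways, and both gave me all NaNs: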

df1.loc[:,pd.IndexSlice[:,'price']].sub(df1.loc[:,pd.IndexSlice[:,'custom']])
df1.loc[:,pd.IndexSlice[:,'price']] - df1.loc[:,pd.IndexSlice[:,'custom']]

Both return:

             AAPL         AMZN      
           custom price custom price
Dates                               
2017-01-01    NaN   NaN    NaN   NaN
2017-01-02    NaN   NaN    NaN   NaN
2017-01-03    NaN   NaN    NaN   NaN
2017-01-04    NaN   NaN    NaN   NaN
2017-01-05    NaN   NaN    NaN   NaN
2017-01-06    NaN   NaN    NaN   NaN

How can I add the difference as a third column?

Thanks.

Answer 1

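You could subtract the underlying values instead, so that pandas does not try to align the 'price' and 'custom' column labels: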

df1.loc[:, pd.IndexSlice[:, 'price']] - df1.loc[:,pd.IndexSlice[:,'custom']].values

To add the result to the original, you can use pd.concat:

In [221]: df2 = (df1.loc[:, pd.IndexSlice[:, 'price']] - df1.loc[:,pd.IndexSlice[:,'custom']].values)\
                            .rename(columns={'price' : 'new'})

In [222]: pd.concat([df1, df2], axis=1)
Out[222]: 
             AAPL         AMZN       AAPL AMZN
           custom price custom price  new  new
Dates                                         
2017-01-01      1    51     17   101   50   84
2017-01-02      2    52     18   102   50   84
2017-01-03      3    53     19   103   50   84
2017-01-04      4    54     20   104   50   84
2017-01-05      5    55     21   105   50   84
2017-01-06      6    56     22   106   50   84
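
The NaNs in the question come from label alignment: the 'price' slice and the 'custom' slice share no column labels, so pandas builds the union of the columns and has nothing to subtract in any of them. The following is a minimal, self-contained sketch of that behaviour, assuming the same data as the question; the names `dates`, `price`, `custom`, `aligned` and `by_position` are illustrative and not part of the original post.

```python
import pandas as pd

# Rebuild the question's frame directly from data (illustrative construction).
dates = pd.date_range(start='2017-01-01', end='2017-01-06', name='Dates')
df1 = pd.DataFrame(
    {
        ('AAPL', 'custom'): list(range(1, 7)),
        ('AAPL', 'price'): list(range(51, 57)),
        ('AMZN', 'custom'): list(range(17, 23)),
        ('AMZN', 'price'): list(range(101, 107)),
    },
    index=dates,
)

price = df1.loc[:, pd.IndexSlice[:, 'price']]
custom = df1.loc[:, pd.IndexSlice[:, 'custom']]

# Label-aligned subtraction: (AAPL, price) never matches (AAPL, custom),
# so the result has all four columns and every cell is NaN.
aligned = price - custom

# Position-based subtraction: .values strips the labels from the right-hand
# operand, so column 0 is subtracted from column 0 and column 1 from column 1.
by_position = price - custom.values

print(aligned.isna().all().all())
print(by_position)
```

Because `.values` matches columns purely by position, this relies on both slices listing the tickers in the same order, which holds here since the columns are sorted.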

Answer 2

You can subtract by values, then rename, and finally join the result back onto the original:

a = df1.loc[:,pd.IndexSlice[:,'price']].sub(df1.loc[:,pd.IndexSlice[:,'custom']].values, axis=1)\
       .rename(columns={'price':'sub'})
df1 = df1.join(a).sort_index(axis=1)
print (df1)
             AAPL             AMZN          
           custom price sub custom price sub
Dates                                       
2017-01-01      1    51  50     17   101  84
2017-01-02      2    52  50     18   102  84
2017-01-03      3    53  50     19   103  84
2017-01-04      4    54  50     20   104  84
2017-01-05      5    55  50     21   105  84
2017-01-06      6    56  50     22   106  84
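
As a possible alternative that keeps label alignment instead of bypassing it, the second level can be dropped with `DataFrame.xs` before subtracting, so both operands end up with plain ticker columns. This is only a sketch under the question's data, not the method used in either answer; the names `dates`, `diff` and `result` are illustrative, and 'sub' simply mirrors the column name used above.

```python
import pandas as pd

# Rebuild the question's frame directly from data (illustrative construction).
dates = pd.date_range(start='2017-01-01', end='2017-01-06', name='Dates')
df1 = pd.DataFrame(
    {
        ('AAPL', 'custom'): list(range(1, 7)),
        ('AAPL', 'price'): list(range(51, 57)),
        ('AMZN', 'custom'): list(range(17, 23)),
        ('AMZN', 'price'): list(range(101, 107)),
    },
    index=dates,
)

# Each cross-section drops the second level, leaving columns AAPL and AMZN,
# so the subtraction aligns on the ticker name rather than on position.
diff = df1.xs('price', axis=1, level=1) - df1.xs('custom', axis=1, level=1)

# Restore two-level columns of the form (ticker, 'sub').
diff.columns = pd.MultiIndex.from_product([diff.columns, ['sub']])

# Join on the shared date index and sort so each ticker's columns are adjacent.
result = df1.join(diff).sort_index(axis=1)
print(result)
```

Aligning on ticker names means the result does not depend on the column order of the two slices, at the cost of rebuilding the column MultiIndex by hand.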
