Pandas: replace values in MultiIndex rows

Question

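I have a MultiIndex DataFrame and I can't figure out how to modify the values in the row index.

In this example, I want to set c = 1 in the row where the "a" index level is 4: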

import pandas as pd
import numpy as np

df = pd.DataFrame({('colA', 'x1'): {(1, np.nan, 0): np.nan, (4, np.nan, 0): np.nan},
('colA', 'x2'): {(1, np.nan, 0): np.nan, (4, np.nan, 0): np.nan},
('colA', 'x3'): {(1, np.nan, 0): np.nan, (4, np.nan, 0): np.nan},
('colA', 'x4'): {(1, np.nan, 0): np.nan, (4, np.nan, 0): np.nan}})

df.index.set_names(['a', 'b', 'c'], inplace=True)

print(df)


        colA
          x1  x2  x3  x4
a b   c
1 NaN 0  NaN NaN NaN NaN
4 NaN 0  NaN NaN NaN NaN

Desired output:

        colA
          x1  x2  x3  x4
a b   c
1 NaN 0  NaN NaN NaN NaN
4 NaN 1  NaN NaN NaN NaN

Any help is appreciated.

Answer 1

Assume we start with df from the question.

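# move the index levels a, b, c into ordinary columns
x = df.reset_index()
# set c = 1 on the rows where a == 4
x.loc[x[x.a == 4].index, 'c'] = 1
# turn a, b, c back into the row MultiIndex
x = x.set_index(['a', 'b', 'c'])
print(x)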

        colA            
          x1  x2  x3  x4
a b   c                 
1 NaN 0  NaN NaN NaN NaN
4 NaN 1  NaN NaN NaN NaN
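
The same result can probably be reached without the reset_index round trip by rebuilding the index tuples directly. This is not part of the original answer, only a minimal sketch: it assumes the level order (a, b, c) from the question, and it builds the frame explicitly (idx and cols are names introduced here) so that it runs on its own.

```python
import numpy as np
import pandas as pd

# Same shape as the question's frame, built explicitly for this sketch
idx = pd.MultiIndex.from_tuples([(1, np.nan, 0), (4, np.nan, 0)], names=['a', 'b', 'c'])
cols = pd.MultiIndex.from_product([['colA'], ['x1', 'x2', 'x3', 'x4']])
df = pd.DataFrame(np.nan, index=idx, columns=cols)

# Iterating a MultiIndex yields one tuple per row; swap in c = 1 where a == 4
new_tuples = [(a, b, 1 if a == 4 else c) for a, b, c in df.index]
df.index = pd.MultiIndex.from_tuples(new_tuples, names=df.index.names)

print(df.index.get_level_values('c').tolist())
```

Because only the index is replaced, the data columns are left untouched. For a large frame the Python-level loop over tuples may be slower than the column-based approaches.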

Answer 2

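Solution

Detach the index, process it, then put it back together with the data.

Logic

Detach the index and process it as a dataframe
Prepare a MultiIndex from the processed values
Then do one of the following two things:
- Combine the data and the MultiIndex into a new dataframe (Method-1)
- Replace the index of the original dataframe (Method-2)

Code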

# separate the index and process it as a dataframe
names = ['a', 'b', 'c']  # same as df.index.names
# dfd = pd.DataFrame(df.to_records())
dfd = df.index.to_frame().reset_index(drop=True)
dfd.loc[dfd['a'] == 4, ['c']] = 1

# prepare the index for the original dataframe: df
# note: .values casts the mixed int/NaN columns to float, hence 1.0 and 4.0 in the output
index = pd.MultiIndex.from_tuples([tuple(x) for x in dfd.loc[:, names].values], names=names)

## Method-1
# create a new dataframe with the updated index
dfn = pd.DataFrame(df.values, index=index, columns=df.columns)
# dfn --> new dataframe

## Method-2
# replace the index of the original dataframe df
# (set_index returns a new dataframe, so assign the result back)
df = df.set_index(index)

Output:

            colA            
              x1  x2  x3  x4
a   b   c                   
1.0 NaN 0.0  NaN NaN NaN NaN
4.0 NaN 1.0  NaN NaN NaN NaN
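
The float labels in this output (1.0, 4.0) come from passing the index columns through .values. If integer labels matter, one possible variant is to build the index with pd.MultiIndex.from_frame, which works column by column. This is a sketch rather than the answerer's code; frame, idx and cols are names introduced here, and the dummy frame is rebuilt explicitly so the block runs on its own.

```python
import numpy as np
import pandas as pd

# Dummy frame with the same index and columns as in the question
idx = pd.MultiIndex.from_tuples([(1, np.nan, 0), (4, np.nan, 0)], names=['a', 'b', 'c'])
cols = pd.MultiIndex.from_product([['colA'], ['x1', 'x2', 'x3', 'x4']])
df = pd.DataFrame(np.nan, index=idx, columns=cols)

# Detach the index as a plain dataframe with columns a, b, c
frame = df.index.to_frame(index=False)

# Process it: set c = 1 where a == 4
frame.loc[frame['a'] == 4, 'c'] = 1

# Put it back; level names are taken from the column names
df.index = pd.MultiIndex.from_frame(frame)

print(df)
```

The steps are the same as in the logic above (detach, process, reattach); only the way the MultiIndex is rebuilt differs.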

Dummy data

import pandas as pd
import numpy as np

df = pd.DataFrame({('colA', 'x1'): {(1, np.nan, 0): np.nan, (4, np.nan, 0): np.nan},
('colA', 'x2'): {(1, np.nan, 0): np.nan, (4, np.nan, 0): np.nan},
('colA', 'x3'): {(1, np.nan, 0): np.nan, (4, np.nan, 0): np.nan},
('colA', 'x4'): {(1, np.nan, 0): np.nan, (4, np.nan, 0): np.nan}})

df.index.set_names(['a', 'b', 'c'], inplace=True)
