"在以值為條件的新熊貓列中加入列名"

Question

我有以下數據集：

data = {'Environment': ['0', '0', '0'],
        'Health': ['1', '0', '1'],
            'Labor': ['1', '1', '1'],
             }

df = pd.DataFrame(data, columns=['Environment', 'Health', 'Labor'])

我想創建一個新列 df['Keyword'] ，其值是值 > 0 的列名的連接。

預期結果：

data = {'Environment': ['0', '0', '0'],
            'Health': ['1', '0', '1'],
                'Labor': ['1', '1', '1'],
                     'Keyword': ['Health, Labor', 'Labor', 'Health, Labor']}
    
df_test = pd.DataFrame(data, columns=['Environment', 'Health', 'Labor', 'Keyword']) 
df_test
df = pd.DataFrame(data, columns=['Environment', 'Health', 'Labor'])

我該怎么做？

Answer 1

帶有.apply()<\/code>的其他版本：

df['Keyword'] = df.apply(lambda x: ', '.join(b for a, b in zip(x, x.index) if a=='1'),axis=1)
print(df)

Answer 2

另一種使用mask<\/code>和stack<\/code>的方法，然后使用 groupby 來聚合項目。

stack<\/code>默認會丟棄 na 值。

df['keyword'] = df.mask(
               df.lt(1)).stack().reset_index(1)\
                        .groupby(level=0)["level_1"].agg(list)

print(df)

   Environment  Health  Labor          keyword
0            0       1      1  [Health, Labor]
1            0       0      1          [Labor]
2            0       1      1  [Health, Labor]

Answer 3

樣本數據值中的第一個問題是字符串，所以如果想要比較更多用途：

df = df.astype(float).astype(int)

"在以值為條件的新熊貓列中加入列名"

問題描述

3 個解決方案

解決方案1
2 2020-09-09 08:33:15

解決方案2
2 2020-09-09 08:39:12

解決方案3
1 已采納 2020-09-09 08:31:39

"在以值為條件的新熊貓列中加入列名"

問題描述

3 個解決方案

解決方案1 2 2020-09-09 08:33:15

解決方案2 2 2020-09-09 08:39:12

解決方案3 1 已采納 2020-09-09 08:31:39

解決方案1
2 2020-09-09 08:33:15

解決方案2
2 2020-09-09 08:39:12

解決方案3
1 已采納 2020-09-09 08:31:39