遍歷 Pandas dataframe 的列並創建新變量

Question

我無法弄清楚如何迭代 pandas dataframe 中的變量並對每個變量執行相同的算術 function。

我有一個 dataframe df ，其中包含三個數值變量x1 、 x2和x3 。 我想通過將每個變量乘以 2 來創建三個新變量。這就是我正在做的事情：

existing = ['x1','x2','x3']
new = ['y1','y2','y3']

for i in existing:
    for j in new:
        df[j] = df[i]*2

上面的代碼實際上是在 dataframe 中創建了三個新變量y1 、 y2和y3 。 但是y1和y2的值被y3的值覆蓋，並且所有三個變量都具有相同的值，對應於y3的值。 我不確定我錯過了什么。

非常感謝任何指導/建議。 謝謝。

Answer 1

您在這里循環了 9 次 - 每列 3 次，每次迭代都會覆蓋前一個。

你可能想要類似的東西

for e, n in zip(existing,new):
    df[n] = df[e]*2

Answer 2

我會做一些更通用的事情

#existing = ['x1','x2','x3']
exisiting = df.columns
new = existing.replace('x','y') 
#maybe you need map+lambda/for for each existing string

for (ind_existing, ind_new) in zip(existing,new):
    df[new[ind_new]] = df[existing[ind_existing]]*2 
#maybe there is more elegant way by using pandas assign function

Answer 3

您可以將原始 DataFrame 與具有雙倍值的列連接起來：

cols_to_double = ['x0', 'x1', 'x2']
new_cols = list(df.columns) + [c.replace('x', 'y') for c in cols_to_double]

df = pd.concat([df, 2 * df[cols_to_double]], axis=1, copy=True)
df.columns = new_cols

因此，如果您的輸入df Dataframe 是：

   x0  x1  x2  other0  other1
0   0   1   2       3       4
1   0   1   2       3       4
2   0   1   2       3       4
3   0   1   2       3       4
4   0   1   2       3       4

執行前幾行后，您將獲得：

   x0  x1  x2  other0  other1  y0  y1  y2
0   0   1   2       3       4   0   2   4
1   0   1   2       3       4   0   2   4
2   0   1   2       3       4   0   2   4
3   0   1   2       3       4   0   2   4
4   0   1   2       3       4   0   2   4

這里是創建df的代碼：

import pandas as pd
import numpy as np

df = pd.DataFrame(
    data=np.column_stack([np.full((5,), i) for i in range(5)]),
    columns=[f'x{i}' for i in range(3)] + [f'other{i}' for i in range(2)]
)

遍歷 Pandas dataframe 的列並創建新變量

問題描述

3 個解決方案

解決方案1
2 已采納 2021-01-16 14:38:22

解決方案2
0 2021-01-16 14:40:07

解決方案3
0 2021-01-16 15:54:17

遍歷 Pandas dataframe 的列並創建新變量

問題描述

3 個解決方案

解決方案1 2 已采納 2021-01-16 14:38:22

解決方案2 0 2021-01-16 14:40:07

解決方案3 0 2021-01-16 15:54:17

解決方案1
2 已采納 2021-01-16 14:38:22

解決方案2
0 2021-01-16 14:40:07

解決方案3
0 2021-01-16 15:54:17