從列中減去每列的平均值並返回它

Question

我有一個包含許多列的數據集。 我必須創建一個 function 獲取每列的平均值並從列中的每一行中減去它，然后返回減去這些平均值的數據集。 我在這里發現了一個類似的問題並應用了答案，但我不斷收到錯誤消息。 這是我的代碼：

def exercise1(df):
    df1 = DataFrame(df)
    df2 = df1 - df1.mean()
    return df2

exercise1(data)

# Where data is the a csv file regarding salaries in the San Francisco area.

我收到以下錯誤

TypeError: ufunc 'subtract' did not contain a loop with signature matching types dtype('<U32') dtype('<U32') dtype('<U32')

我無法弄清楚我做錯了什么

Answer 1

您可以使用try-except在列上執行 for 循環：

def exercise1(df):
    df1 = df.copy()
    for col in df1.columns:
        try:     # if we can compute the mean then substract
            df1[col] -= df1[col].mean()
        except:  # otherwise just ignore the column
            pass

    return df1

Answer 2

df.mean() 生成 pandas 系列數據類型，其中僅包含原始 DataFrame 中的數字列。

means = df.mean()

您可以使用以下方法獲取該系列的索引值：

means.index

用它來切片你原來的 DataFrame 並減去平均值

df2 = df[means.index] - means

Answer 3

您需要指定要從中減去的列：

df = {'values1': [1,2,3], 'values2': [4,5,6]}
def exercise1(df):
    df1 = pd.DataFrame(df)
    df2 = df1['values2'] - df1['values2'].mean()
    return df2

print(exercise1(df))

從列中減去每列的平均值並返回它

問題描述

3 個解決方案

解決方案1
1 已采納 2019-10-21 19:45:51

解決方案2
1 2019-10-21 20:10:21

解決方案3
0 2019-10-21 19:54:26

從列中減去每列的平均值並返回它

問題描述

3 個解決方案

解決方案1 1 已采納 2019-10-21 19:45:51

解決方案2 1 2019-10-21 20:10:21

解決方案3 0 2019-10-21 19:54:26

解決方案1
1 已采納 2019-10-21 19:45:51

解決方案2
1 2019-10-21 20:10:21

解決方案3
0 2019-10-21 19:54:26