如何使用 Numpy 通过来自另一个 DataFrame 的因子来缩放 DataFrame 中的列

Question

To Scale each columns (A, B, C) in a DataFrame df:要缩放 DataFrame df 中的每一列（A、B、C）：

l1 = [1,2,3]
l2 = [4,5,6]
l3 = [7,8,9]

df = pd.DataFrame([z for z in zip(l1,l2,l3)], columns= ['A', 'B', 'C'])

with scaling factors in a DataFrame scaling:在 DataFrame 缩放中使用缩放因子：

scaling = pd.DataFrame(dict(id=['B', 'A','C'], scaling = [0.2, 0.3, 0.4]))

using Numpy:使用 Numpy：

df = pd.DataFrame(np.array(df)*np.array(scaling['scaling']), columns=df.columns)

How to obtain right factors from scaling with the corresponding id ['B', 'A','C'] using Numpy?如何使用 Numpy 从对应的 id ['B', 'A','C'] 缩放中获得正确的因子？

I expected to have the following result with print(df)我希望使用 print(df) 得到以下结果

   A    B    C
0  0.3  0.8  2.8
1  0.6  1.0  3.2
2  0.9  1.2  3.6

Answer 1

Try something like:尝试类似：

import pandas as pd

l1 = [1, 2, 3]
l2 = [4, 5, 6]
l3 = [7, 8, 9]

df = pd.DataFrame([z for z in zip(l1, l2, l3)], columns=['A', 'B', 'C'])

scaling = pd.DataFrame(dict(id=['B', 'A', 'C'], scaling=[0.2, 0.3, 0.4]))

# Get Scaling Into a more Usable Format
scaling = scaling.set_index('id').reindex(df.columns).to_numpy().reshape(1, -1)

# Perform scaling
scaled_df = df * scaling
print(scaled_df)

The goal is to just get scaling into a shape that can be easily applied to the DataFrame scaling .目标只是将scaling为可以轻松应用于 DataFrame scaling的形状。 Once scaling is in the right shape and order:一旦缩放处于正确的形状和顺序：

   scaling
A      0.3
B      0.2
C      0.4

[[0.3 0.2 0.4]]

It can just be multiplied by the df :它可以乘以df ：

     A    B    C
0  0.3  0.8  2.8
1  0.6  1.0  3.2
2  0.9  1.2  3.6

如何使用 Numpy 通过来自另一个 DataFrame 的因子来缩放 DataFrame 中的列

问题描述

1 个解决方案

解决方案1
0 2021-05-10 01:51:57

如何使用 Numpy 通过来自另一个 DataFrame 的因子来缩放 DataFrame 中的列

问题描述

1 个解决方案

解决方案1 0 2021-05-10 01:51:57

解决方案1
0 2021-05-10 01:51:57