从多个数据框创建列

Question

I need to create some new columns based on the value of a dataframe filed and a look up dataframe with some rates. 我需要根据提交的dataframe的值和具有一定比率的查找dataframe创建一些新列。

Having df1 as 具有df1作为

    zone    hh      hhind
0   14      112.0   3.4
1   15      5.0     4.4
2   16      0.0     1.0

and a look_up df as 和一个look_up df为

    ind     per1    per2    per3    per4
0   1.0     1.000   0.000   0.000   0.000
24  3.4     0.145   0.233   0.165   0.457
34  4.4     0.060   0.114   0.075   0.751

how can i update df1.hh1 by multiplying the look_up.per1 based on df1.hhind and lookup.ind 我怎样才能更新df1.hh1由乘以look_up.per1基于df1.hhind和lookup.ind

    zone    hh      hhind  hh1
 0  14      112.0   3.4    16.240
 1  15      5.0     4.4    0.300
 2  16      0.0     1.0    0.000

at the moment im getting the result by merging the tables and the doing the arithmetic. 目前，我通过合并表格和进行算术来获得结果。

r = pd.merge(df1, look_up, left_on="hhind", right_on="ind")
r["hh1"] = r.hh *r.per1

i'd like to know if there is a more straight way to accomplish this by not merging the tables? 我想知道是否有一种更直接的方式来完成此工作，而不合并表？

Answer 1

You could first set hhind and ind as the index axis of df1 and look_up dataframes respectively. 您可以首先将hhind和ind分别设置为df1和look_up数据帧的索引轴。 Then, multiply corresponding elements in hh and per1 element-wise. 然后，将相应的元素分别乘以hh和per1逐个元素。

Map these results to the column hhind and assign these to a new column later as shown: 将这些结果映射到后面的列，然后将它们分配给新列，如下所示：

mapper = df1.set_index('hhind')['hh'].mul(look_up.set_index('ind')['per1'])
df1.assign(hh1=df1['hhind'].map(mapper))

Answer 2

另一个解决方案：

df1['hh1'] = (df1['hhind'].map(lambda x: look_up[look_up["ind"]==x]["per1"])) * df1['hh']

从多个数据框创建列

问题描述

2 个解决方案

解决方案1
1 2017-02-15 16:59:21

解决方案2
1 2017-02-15 17:06:12

从多个数据框创建列

问题描述

2 个解决方案

解决方案1 1 2017-02-15 16:59:21

解决方案2 1 2017-02-15 17:06:12

解决方案1
1 2017-02-15 16:59:21

解决方案2
1 2017-02-15 17:06:12