How to compare two columns and get the mean of a third column for all matching pairs in a Python pandas DataFrame?

Question

I have the following table named Rides:

start_id	end_id	eta
A	B	5
B	C	4
A	C	6
A	B	5
B	A	3
C	A	3
B	C	6
C	A	5
A	B	8

From the Rides table, I want to create a new table that looks like this:

start_id	end_id	mean_eta
A	B	6 ((5+5+8)/3)
B	C	5 ((4+6)/2)
A	C	6
B	A	3
C	A	4 ((3+5)/2)

So the mean_eta of the first row is 6: there are three matching rides between start_id = "A" and end_id = "B", with eta values 5, 5, and 8, so mean_eta = (5+5+8)/3 = 6. How should I do it? Please help.

Answer 1

Group by the two ID columns with groupby and take the mean of eta for each group. Code below:

df.groupby(['start_id','end_id'])['eta'].agg('mean').to_frame('eta-mean')
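For reference, here is a minimal, self-contained sketch of the same idea run on the data from the question. The variable name `rides` is an assumption made for illustration. The `sort=False`, `as_index=False`, and named-aggregation choices are also assumptions of this sketch rather than part of the answer above; they keep the pairs in first-appearance order and produce a flat table with a `mean_eta` column, as in the desired output.

```python
import pandas as pd

# Rebuild the Rides table from the question (the name `rides` is illustrative).
rides = pd.DataFrame({
    "start_id": ["A", "B", "A", "A", "B", "C", "B", "C", "A"],
    "end_id":   ["B", "C", "C", "B", "A", "A", "C", "A", "B"],
    "eta":      [5, 4, 6, 5, 3, 3, 6, 5, 8],
})

# Group rows that share the same (start_id, end_id) pair and average eta.
# sort=False keeps groups in order of first appearance;
# as_index=False returns start_id and end_id as ordinary columns.
mean_eta = (
    rides.groupby(["start_id", "end_id"], sort=False, as_index=False)
         .agg(mean_eta=("eta", "mean"))
)

print(mean_eta)
```

Without `sort=False`, groupby sorts the group keys, so the rows would come out ordered by start_id and then end_id instead of in the order shown in the question.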

Solution 1
3 Accepted 2021-03-04 20:08:54
