如何使用dplyr根据组选择数据框列的第一个值？

Question

I have 2 frames of data which I joined using the left_join() function.我有 2 帧数据，我使用left_join()函数加入了left_join()数据。 Then, I grouped the data by Group using the group_by() function.然后，我使用group_by()函数按组对数据进行分组。 Using the mutate() function I want to create a column to repeatedly display the first value of column V2 according to the sort group.使用mutate()函数我想创建一个列，根据排序组重复显示列 V2 的第一个值。

In MWE the first value of V2 for Group 1 is 5 and for Group 2 it is 7.5.在 MWE 中，第 1 组V2的第一个值是 5，而第 2 组是 7.5。 However, the code I wrote for this is selecting the first value from column V2 and repeating for both groups without separating as I want.但是，我为此编写的代码是从 V2 列中选择第一个值并为两个组重复，而不按我的需要分开。

Note: it is simple because it seems to copy column V2 but this selection of the first value is necessary for me to do other calculations.注意：这很简单，因为它似乎复制了V2列，但是选择第一个值对我进行其他计算是必要的。

Any tips?有小费吗？

library(dplyr)

Group <- c(1, 2)
V1 <- c(10, 20, 30)
V2 <- c(5, 7.5)

df1 <- expand.grid(V1 = V1,
                   Group = Group) 

df2 <- data.frame(Group, V2)

df <- df1 %>%
  left_join(df2) %>%
  group_by(Group) %>%
  mutate(first = first(.$V2))

V1 V1	Group团体	V2 V2	first第一的	The `first` column I want我想要的`first`列
10 10	1 1	5.0 5.0	5 5	5.0 5.0
20 20	1 1	5.0 5.0	5 5	5.0 5.0
30 30	1 1	5.0 5.0	5 5	5.0 5.0
10 10	2 2	7.5 7.5	5 5	7.5 7.5
20 20	2 2	7.5 7.5	5 5	7.5 7.5
30 30	2 2	7.5 7.5	5 5	7.5 7.5

Answer 1

Remove the .$ and it will work as .$ get the entire column breaking the group attribute and thus the first will be the first row value of the entire column删除.$ ，它将作为.$获取整列打破组属性，因此第first将是整列的第一行值

library(dplyr)
df1 %>%
  left_join(df2) %>%
  group_by(Group) %>%
  mutate(first = first(V2))

如何使用dplyr根据组选择数据框列的第一个值？

问题描述

1 个解决方案

解决方案1
2 已采纳 2021-07-16 17:21:39

如何使用dplyr根据组选择数据框列的第一个值？

问题描述

1 个解决方案

解决方案1 2 已采纳 2021-07-16 17:21:39

解决方案1
2 已采纳 2021-07-16 17:21:39