R dplyr: summarise one column's value based on the index of a function (such as max) of another column

Question

I have a data frame like the one below and want the output shown under "Desired Output" at the end. Instead, I get the NA output shown under "What I get". Is there a way to do this with dplyr?

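library(dplyr)

x <- c(1234, 1234, 1234, 5678, 5678)
# note: the literal 01294 is read as the number 1294
y <- c(95138, 30004, 90038, 01294, 15914)
z <- c('2014-01-20', '2014-10-30', '2015-04-12', '2010-2-28', '2015-01-01')
df <- data.frame(x, y, z)
df$z <- as.Date(df$z)
# attempt: take the y value on the latest date z within each x
df %>% group_by(x) %>% summarise(y = y[max(z)])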

What I get:
     x  y
1 1234 NA
2 5678 NA

Desired Output:
     x     y 
1 1234 90038
2 5678 15914

Answer 1

You can use which.max to get the numeric index of the maximum of z, and use that index to subset y. max alone returns the maximum value of z (a Date), not its position. Used as an index, that Date is treated as its underlying day count since 1970-01-01, which is far beyond the length of y, so the result is NA.

df %>%
    group_by(x) %>%
    summarise(y= y[which.max(z)])
#     x     y
#1 1234 90038
#2 5678 15914
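
The difference between max and which.max can be seen outside dplyr as well. The following is a minimal base R sketch using only the rows of the 1234 group from the question; the variable names idx_wrong, wrong, idx_right and right are made up for illustration.

```r
# Rows of the x == 1234 group from the question
z <- as.Date(c("2014-01-20", "2014-10-30", "2015-04-12"))
y <- c(95138, 30004, 90038)

# max(z) is a Date; as an index it acts as its day count since 1970-01-01
idx_wrong <- as.numeric(max(z))
wrong <- y[max(z)]         # index is far beyond length(y), so this is NA

# which.max(z) is the position of the latest date
idx_right <- which.max(z)
right <- y[idx_right]      # the y value on the latest date

print(wrong)
print(right)
```

Here wrong is NA because the index points well past the three elements of y, while right picks the element at the position of the latest date.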

Answer 2

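Use filter and max in dplyr. Note that this keeps every column of the matching rows (x, y and z), and returns more than one row per group if several rows share the maximum z.

df %>% group_by(x) %>% filter(z == max(z))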
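
If the filter approach should produce exactly the two columns of the desired output, one possible follow-up is to drop the grouping and select x and y. This is only a sketch, assuming dplyr is installed and that there are no tied maximum dates within a group; the name result is made up for illustration.

```r
library(dplyr)

# Data from the question
df <- data.frame(
  x = c(1234, 1234, 1234, 5678, 5678),
  y = c(95138, 30004, 90038, 1294, 15914),
  z = as.Date(c("2014-01-20", "2014-10-30", "2015-04-12",
                "2010-02-28", "2015-01-01"))
)

result <- df %>%
  group_by(x) %>%
  filter(z == max(z)) %>%  # keep the row(s) with the latest date per group
  ungroup() %>%
  select(x, y)             # drop z to match the desired output

print(result)
```

With tied dates, filter would return several rows for that group, whereas the which.max version in Answer 1 always returns the first match only.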
