基于公共值组合 R 中的数据框行

Question

Given a data frame:给定一个数据框：

    > df <- data.frame( L=c('a','b','b'), t0=c(1,10,20), t1=c(9,19,39))
    > df
      L t0 t1
    1 a  1  9
    2 b 10 19
    3 b 20 39

    I want:
    > df
        L t0 t1
      1 a  1  9
      2 b 10 39

The identical values for df$L equals "b" signify that the start (t0) of the first instance of 'b' should be the new 't0' value and the new 't1' value of the last instance of (contiguous) 'b' should be the new 't1' value. df$L 的相同值等于 "b" 表示 'b' 第一个实例的开始 (t0) 应该是新的 't0' 值和 (contiguous) ' 最后一个实例的新 't1' 值b' 应该是新的 't1' 值。 In effect, if t0 and t1 are times, then I want to merge the time durations of adjacent rows that have the same value for 'L'.实际上，如果 t0 和 t1 是时间，那么我想合并具有相同“L”值的相邻行的持续时间。

Answer 1

After grouping by 'L', summarise to take the first value of 't0' and last value of 't1' (or min and max )通过“L”分组后， summarise采取first “T0”和的值last的“T1”的值（或min和max ）

df %>%
   group_by(L) %>%
    summarise(t0 = first(t0), t1 = last(t1))
# A tibble: 2 x 3
#  L        t0    t1
#  <fct> <dbl> <dbl>
#1 a         1     9
#2 b        10    39

Based on the OP's comments, if we are also grouping by adjacent similar elements in 'L', use rleid根据 OP 的评论，如果我们还按“L”中相邻的相似元素进行分组，请使用rleid

library(data.table)
df1 %>% 
    group_by(grp = rleid(L), L) %>%
    summarise(t0 = first(t0), t1 = last(t1))

data数据

df1 <- data.frame( L=c('a','b','b','a','b','b'), 
        t0=c(1,10,20,40,60,70), t1=c(9,19,39,49,69,79))

Answer 2

You can split by L and return the range .您可以按L split并返回range 。

df <- do.call(rbind, lapply(split(df[-1], df[1]), range))
df
#  [,1] [,2]
#a    1    9
#b   10   39

df <- data.frame(L=rownames(df), t0=df[,1], t1=df[,2])
df
#  L t0 t1
#a a  1  9
#b b 10 39

Answer 3

Maybe you can try aggreate and merge也许你可以尝试aggreate和merge

res <- merge(aggregate(t0 ~ L,df,min),aggregate(t1 ~ L,df,max))

such that以至于

> res
  L t0 t1
1 a  1  9
2 b 10 39

Answer 4

Using data.table :使用data.table ：

library(data.table)
setDT(df)
df[, .(t0 = t0[1], t1 = t1[.N]), by = L]

#    L t0 t1
# 1: a  1  9
# 2: b 10 39

基于公共值组合 R 中的数据框行

问题描述

4 个解决方案

解决方案1
4 已采纳 2019-12-04 05:37:30

data数据

解决方案2
3 2019-12-04 08:12:39

解决方案3
0 2019-12-04 08:34:38

解决方案4
0 2019-12-04 08:38:21

基于公共值组合 R 中的数据框行

问题描述

4 个解决方案

解决方案1 4 已采纳 2019-12-04 05:37:30

data数据

解决方案2 3 2019-12-04 08:12:39

解决方案3 0 2019-12-04 08:34:38

解决方案4 0 2019-12-04 08:38:21

解决方案1
4 已采纳 2019-12-04 05:37:30

解决方案2
3 2019-12-04 08:12:39

解决方案3
0 2019-12-04 08:34:38

解决方案4
0 2019-12-04 08:38:21