简体   繁体   English

根据条件分配行程编号

[英]Assign trip number based on condition

I have a time series data. 我有一个时间序列数据。 I would like to group and number rows when column "soak" > 3600. The first row when soak > 3600 is numbered as 1, and the consecutive rows are numbered as 1 too until another row met the condition of soak > 3600. Then that row and consequent rows are numbered as 2 until the third occurrence of soak > 3600. 当列“ soak”> 3600时,我想对行进行分组和编号。当soak> 3600时,第一行被编号为1,连续的行也被编号为1,直到另一行满足soak> 3600的条件。行和随后的行编号为2,直到第三次浸泡> 3600。

A small sample of my data and the code I tried is also provided. 还提供了我的数据和我尝试过的代码的一小部分。

My code did the count, but seems using the ave() gave me some decimal numbers... Is there a way to output integer? 我的代码进行了计数,但似乎使用ave()给了我一些十进制数字...有没有办法输出整数?

starts <- structure(list(datetime = structure(c(1440578907, 1440579205, 
1440579832, 1440579885, 1440579926, 1440579977, 1440580044, 1440580106, 
1440580195, 1440580256, 1440580366, 1440580410, 1440580476, 1440580529, 
1440580931, 1440580966, 1440587753, 1440587913, 1440587933, 1440587954
), class = c("POSIXct", "POSIXt"), tzone = ""), soak = c(NA, 
70L, 578L, 21L, 2L, 41L, 14L, 16L, 32L, 9L, 45L, 20L, 51L, 25L, 
364L, 4L, 6764L, 20L, 4L, 5L)), row.names = c(NA, -20L), class = c("data.table", 
"data.frame"), .internal.selfref = <pointer: 0x000000000a4d1ef0>)

starts$trip <- with(starts, ave(tdiff, cumsum(replace(soka, NA, 10000) > 3600)))

Using dplyr 使用dplyr

library(dplyr)
starts %>% mutate(trip = cumsum(replace(soak, is.na(soak), 1) > 3600))

And with base R 与基数R

starts$trip = with(starts, ave(soak, FUN=function(x) cumsum(replace(x, is.na(x), 1) > 3600)))

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM